compared test scores: Topics by Science.gov

Sample records for compared test scores

Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

ERIC Educational Resources Information Center

King, Molly Elizabeth

2016-01-01

The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…
Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

ERIC Educational Resources Information Center

Yao, Lihua

2012-01-01

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
A Nonparametric Framework for Comparing Trends and Gaps across Tests

ERIC Educational Resources Information Center

Ho, Andrew Dean

2009-01-01

Problems of scale typically arise when comparing test score trends, gaps, and gap trends across different tests. To overcome some of these difficulties, test score distributions on the same score scale can be represented by nonparametric graphs or statistics that are invariant under monotone scale transformations. This article motivates and then…
Disaggregated Effects of Device on Score Comparability

ERIC Educational Resources Information Center

Davis, Laurie; Morrison, Kristin; Kong, Xiaojing; McBride, Yuanyuan

2017-01-01

The use of tablets for large-scale testing programs has transitioned from concept to reality for many state testing programs. This study extended previous research on score comparability between tablets and computers with high school students to compare score distributions across devices for reading, math, and science and to evaluate device…
Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

1980-01-01

The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)
34 CFR 668.144 - Application for test approval.

Code of Federal Regulations, 2010 CFR

2010-07-01

... the comparability of scores on the current test to scores on the previous test, and data from validity... explanation of the methodology and procedures for measuring the reliability of the test; (ii) Evidence that different forms of the test, including, if applicable, short forms, are comparable in reliability; (iii...
A Comparison of Standardized Achievement Test Scores on Right and Left Brain Dominant Fourth-Grade Students.

ERIC Educational Resources Information Center

Bell, Michael L.; Roubinek, Darrell L.

1989-01-01

Compares fourth-graders' subtest scores on the Stanford Achievement Test (SAT), the Iowa Test of Basic Skills (ITBS), and the Metropolitan Achievement Test (MAT). Finds right-brain dominant students scored better on four SAT subtests, and left-brain dominant students scored better on four ITBS subtests and two MAT subtests. (NH)
The Validity of ITBS Reading Comprehension Test Scores for Learning Disabled and Non Learning Disabled Students under Extended-Time Conditions.

ERIC Educational Resources Information Center

Huesman, Ronald L., Jr.; Frisbie, David A.

This study investigated the effect of extended-time limits in terms of performance levels and score comparability for reading comprehension scores on the Iowa Tests of Basic Skills (ITBS). The first part of the study compared the average reading comprehension scores on the ITBS of 61 sixth-graders with learning disabilities and 397 non learning…
Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

ERIC Educational Resources Information Center

Meijer, Rob R.

2004-01-01

Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…
Examining the Validity of GED[R] Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

ERIC Educational Resources Information Center

George-Ezzelle, Carol E.; Skaggs, Gary

2004-01-01

Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…
Do Gains in Test Scores Explain Labor Market Outcomes?

ERIC Educational Resources Information Center

Rose, Heather

2006-01-01

Using data from the National Education Longitudinal Study of 1988, this article investigates whether students who made relatively large test score gains during high school had larger earnings 7 years after high school compared to students whose scores improved little. In models that control for pre-high school test scores, family background, and…
The Truth about Scores Children Achieve on Tests.

ERIC Educational Resources Information Center

Brown, Jonathan R.

1989-01-01

The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
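The record above does not reproduce the calculation itself. As a minimal sketch, assuming the usual classical test theory formula SEm = SD × sqrt(1 − reliability), the following shows how an SEm and a one-SEm band around an observed score might be computed. The function names and the numbers are illustrative assumptions, not taken from the cited article.

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """Classical test theory SEm: SD * sqrt(1 - reliability)."""
    if sd < 0:
        raise ValueError("sd must be non-negative")
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed: float, sem: float, z: float = 1.0) -> tuple:
    """Band of +/- z SEm around an observed score."""
    return (observed - z * sem, observed + z * sem)


# Made-up example values: scale SD of 15 and reliability of 0.91.
sem = standard_error_of_measurement(sd=15.0, reliability=0.91)
low, high = score_band(observed=100.0, sem=sem)
print(f"SEm = {sem:.2f}; band = {low:.1f} to {high:.1f}")
```

With these assumed values the SEm is about 4.5 score points, so an observed score of 100 would be reported with a band of roughly 95.5 to 104.5.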
Evaluation of interactive teaching for undergraduate medical students using a classroom interactive response system in India.

PubMed

Datta, Rakesh; Datta, Karuna; Venkatesh, M D

2015-07-01

The classical didactic lecture has been the cornerstone of theoretical undergraduate medical education. Its efficacy, however, declines because of limited interaction and the short attention span of students. It is hypothesized that the interactive response pad obviates some of these drawbacks. The aim of this study was to evaluate the effectiveness of an interactive response system by comparing it with conventional classroom teaching. A prospective comparative longitudinal study was conducted on 192 students who were exposed to either conventional or interactive teaching over 20 classes. Pre-test, post-test and retention test (after 8-12 weeks) scores were collated and statistically analysed. An independent observer measured the number of student interactions in each class. Pre-test scores from both groups were similar (p = 0.71). Post-test scores improved significantly over pre-test scores with both methods (p < 0.001). The interactive post-test score was better than the conventional post-test score (p < 0.001) by 8-10% (difference of means 9.24%, 95% CI 8.2%-10.3%). The interactive retention test score was better than the conventional retention test score (p < 0.001) by 15-18% (difference of means 16.64%, 95% CI 15.0%-18.2%). There were 51 participative events in the interactive group vs 25 in the conventional group. The Interactive Response Pad method was efficacious in teaching. Students taught with the interactive method were likely to score 8-10% higher (statistically significant) immediately after class and 15-18% higher (statistically significant) after 8-12 weeks. The number of student-teacher interactions increases when using the interactive response pads.
Graphical method for comparative statistical study of vaccine potency tests.

PubMed

Pay, T W; Hingley, P J

1984-03-01

Producers and consumers are interested in some of the intrinsic characteristics of vaccine potency assays for the comparative evaluation of suitable experimental design. A graphical method is developed which represents the precision of test results, the sensitivity of such results to changes in dosage, and the relevance of the results in the way they reflect the protection afforded in the host species. The graphs can be constructed from Producer's scores and Consumer's scores on each of the scales of test score, antigen dose and probability of protection against disease. A method for calculating these scores is suggested and illustrated for single and multiple component vaccines, for tests which do or do not employ a standard reference preparation, and for tests which employ quantitative or quantal systems of scoring.
Kernel Equating Under the Non-Equivalent Groups With Covariates Design

PubMed Central

Bränberg, Kenny

2015-01-01

When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests. PMID:29881012
Kernel Equating Under the Non-Equivalent Groups With Covariates Design.

PubMed

Wiberg, Marie; Bränberg, Kenny

2015-07-01

When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests.
Washback to Learning Outcomes: A Comparative Study of IELTS Preparation and University Pre-Sessional Language Courses

ERIC Educational Resources Information Center

Green, Anthony

2007-01-01

This study investigated whether dedicated test preparation classes gave learners an advantage in improving their writing test scores. Score gains following instruction on a measure of academic writing skills--the International English Language Testing System (IELTS) academic writing test--were compared across language courses of three types; all…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP

ERIC Educational Resources Information Center

Chudowsky, Naomi; Chudowsky, Victor

2010-01-01

This report compares state math and reading proficiency scores in grades 4 and 8 to National Assessment of Educational Progress (NAEP) basic scores for the period of 2005 to 2009. The study found that scores on state tests and NAEP have increased in most states with sufficient data. Also included with the report are profiles for the 23 states that…
Decreasing scoring errors on Wechsler Scale Vocabulary, Comprehension, and Similarities subtests: a preliminary study.

PubMed

Linger, Michele L; Ray, Glen E; Zachar, Peter; Underhill, Andrea T; LoBello, Steven G

2007-10-01

Studies of graduate students learning to administer the Wechsler scales have generally shown that training is not associated with the development of scoring proficiency. Many studies report on the reduction of aggregated administration and scoring errors, a strategy that does not highlight the reduction of errors on subtests identified as most prone to error. This study evaluated the development of scoring proficiency specifically on the Wechsler (WISC-IV and WAIS-III) Vocabulary, Comprehension, and Similarities subtests during training by comparing a set of 'early test administrations' to 'later test administrations.' Twelve graduate students enrolled in an intelligence-testing course participated in the study. Scoring errors (e.g., incorrect point assignment) were evaluated on the students' actual practice administration test protocols. Errors on all three subtests declined significantly when scoring errors on 'early' sets of Wechsler scales were compared to those made on 'later' sets. However, correcting these subtest scoring errors did not cause significant changes in subtest scaled scores. Implications for clinical instruction and future research are discussed.
Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study.

PubMed

Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I

2015-11-01

Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
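The primary outcome in the record above is a Pearson correlation between two sets of scores from the same students. The sketch below shows how such a coefficient can be computed from paired scores. The data are made up for illustration and are not the study's data, and `pearson_r` is a hypothetical helper name.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation for two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when a sequence is constant")
    return sxy / math.sqrt(sxx * syy)


# Made-up paired scores (one pair per examinee).
written_scores = [1, 2, 3, 4, 5]
simulation_scores = [2, 4, 5, 4, 5]
r = pearson_r(written_scores, simulation_scores)
print(f"r = {r:.2f}")  # about 0.77 for these made-up values
```

A correlation describes how well the two rankings of examinees agree; it says nothing about whether one set of scores is higher on average, which is why the record reports the mean difference separately.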

Nurses caring for ENT patients in a district general hospital without a dedicated ENT ward score significantly less in a test of knowledge than nurses caring for ENT patients in a dedicated ENT ward in a comparable district general hospital.

PubMed

Foxton, C R; Black, D; Muhlschlegel, J; Jardine, A

2014-12-01

To assess whether there is a difference in ENT knowledge amongst nurses caring for patients on a dedicated ENT ward and nurses caring for ENT patients in a similar hospital without a dedicated ENT ward. A test of theoretical knowledge of ENT nursing care was devised and administered to nurses working on a dedicated ENT ward and then to nurses working on generic non-subspecialist wards regularly caring for ENT patients in a hospital without a dedicated ENT ward. The test scores were then compared. A single specialist ENT/Maxillo-Facial/Ophthalmology ward in hospital A and 3 generic surgical wards in hospital B. Both hospitals are comparable district general hospitals in the south west of England. Nursing staff working in hospital A and hospital B on the relevant wards were approached during the working day. 11 nurses on ward 1, 10 nurses on ward 2, 11 nurses on ward 3 and 10 nurses on ward 4 (the dedicated ENT ward). Each individual test score was used to generate an average score per ward and these scores were compared to see if there was a significant difference. The average score out of 10 on ward 1 was 6.8 (+/-1.6). The average score on ward 2 was 4.8 (+/-1.6). The average score on ward 3 was 5.5 (+/-2.1). The average score on ward 4, which is the dedicated ENT ward, was 9.7 (+/-0.5). The differences in average test score between the dedicated ENT ward and all of the other wards are statistically significant. Nurses working on a dedicated ENT ward have a higher average score in a test of knowledge than nurses working on generic surgical wards. This difference is statistically significant and persists despite banding or training. © 2014 John Wiley & Sons Ltd.
Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the 110-item Stanford Mental…
Effects of Student Population Density on Academic Achievement in Georgia Elementary Schools.

ERIC Educational Resources Information Center

Swift, Diane O'Rourke

The purpose of this study was to determine the relationship between school density and achievement test scores. The study utilized a bipolar sample in order to include schools whose achievement scores were at the top and bottom of the population spectrum when considering Iowa Tests of Basic Skills (ITBS) scores. Based on comparing test scores and…
Automated Essay Scoring versus Human Scoring: A Comparative Study

ERIC Educational Resources Information Center

Wang, Jinhao; Brown, Michelle Stallone

2007-01-01

The current research was conducted to investigate the validity of automated essay scoring (AES) by comparing group mean scores assigned by an AES tool, IntelliMetric [TM] and human raters. Data collection included administering the Texas version of the WriterPlacer "Plus" test and obtaining scores assigned by IntelliMetric [TM] and by…
Do Neurocognitive SCAT3 Baseline Test Scores Differ Between Footballers (Soccer) Living With and Without Disability? A Cross-Sectional Study.

PubMed

Weiler, Richard; van Mechelen, Willem; Fuller, Colin; Ahmed, Osman Hassan; Verhagen, Evert

2018-01-01

To determine if baseline Sport Concussion Assessment Tool, third Edition (SCAT3) scores differ between athletes with and without disability. Cross-sectional comparison of preseason baseline SCAT3 scores for a range of England international footballers. Team doctors and physiotherapists supporting England football teams recorded players' SCAT3 baseline tests from August 1, 2013 to July 31, 2014. A convenience sample of 249 England footballers, of whom 185 were players without disability (male: 119; female: 66) and 64 were players with disability (male learning disability: 17; male cerebral palsy: 28; male blind: 10; female deaf: 9). Between-group comparisons of median SCAT3 total and section scores were made using the nonparametric Mann-Whitney-Wilcoxon rank-sum test. All footballers with disability scored higher symptom severity scores compared with male players without disability. Male footballers with learning disability demonstrated no significant difference in the total number of symptoms, but recorded significantly lower scores on immediate memory and delayed recall compared with male players without disability. Male blind footballers scored significantly higher for total concentration and delayed recall, and male footballers with cerebral palsy scored significantly higher on balance testing and immediate memory, when compared with male players without disability. Female footballers with deafness scored significantly higher for total concentration and balance testing than female footballers without disability. This study suggests that significant differences exist between SCAT3 baseline section scores for footballers with and without disability. Concussion consensus guidelines should recognize these differences and produce guidelines that are specific for the growing number of athletes living with disability.
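The record above compares groups with the Mann-Whitney-Wilcoxon rank-sum test. As a rough sketch of the statistic behind that test, the code below computes the U statistic by counting, over all cross-group pairs, how often a score in one group exceeds a score in the other (ties count one half). The scores are made up and are not SCAT3 data; `mann_whitney_u` is a hypothetical helper, and the p-value step is omitted.

```python
from itertools import product


def mann_whitney_u(a, b):
    """U statistic for sample a: pairs (x, y) with x > y; ties count 0.5."""
    return sum(
        1.0 if x > y else 0.5 if x == y else 0.0
        for x, y in product(a, b)
    )


# Made-up scores for two independent groups.
group_a = [3, 5, 8, 9]
group_b = [1, 2, 5, 6, 7]

u_a = mann_whitney_u(group_a, group_b)
u_b = mann_whitney_u(group_b, group_a)
n_pairs = len(group_a) * len(group_b)

# The two U values always sum to the number of cross-group pairs.
print(u_a, u_b, n_pairs)
# U / n_pairs estimates the chance that a random member of group_a
# outscores a random member of group_b.
print(u_a / n_pairs)
```

Because U depends only on the ordering of the scores, it does not assume that the scores are normally distributed, which suits bounded or skewed section scores.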
Does Matching Quality Matter in Mode Comparison Studies?

ERIC Educational Resources Information Center

Zeng, Ji; Yin, Ping; Shedden, Kerby A.

2015-01-01

This article provides a brief overview and comparison of three matching approaches in forming comparable groups for a study comparing test administration modes (i.e., computer-based tests [CBT] and paper-and-pencil tests [PPT]): (a) a propensity score matching approach proposed in this article, (b) the propensity score matching approach used by…
An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

ERIC Educational Resources Information Center

Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

2013-01-01

Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Enamel Hypomineralization in Children With Clefts and the Relationship to Treatment: A Cross-sectional Retrospective Study.

PubMed

Allam, Eman; Ghoneima, Ahmed; Tholpady, Sunil S; Kula, Katherine

2018-06-19

The aim of this study was to determine whether molar incisor hypomineralization (MIH) is greater in patients with cleft lip and palate (CLP) who underwent primary alveolar grafting (PAG) as compared with CLP waiting for secondary alveolar grafting (SAG) and with controls. A retrospective analysis of intraoral photographs of 13 CLP patients who underwent a PAG, 28 CLP prior to SAG, and 60 controls without CLP was performed. Mantel-Haenszel χ² tests were used to compare the 3 groups for differences in MIH scores, and Wilcoxon rank sum tests were used to compare the groups for differences in average MIH scores. A 5% significance level was used for all tests. Molar incisor hypomineralization scores were significantly higher for the PAG and SAG groups compared with the control group (P < 0.001). The PAG group had significantly higher incisor MIH (P = 0.016) compared with the SAG group. Molar incisor hypomineralization average scores were significantly higher for the 2 graft groups compared with the controls (P < 0.0001). The PAG group had significantly higher average MIH score and average MIH score for incisors compared with the SAG group (P = 0.03). Cleft lip and palate patients have significantly greater MIH compared with controls, and CLP patients with PAGs have significantly greater MIH in the incisor region compared with CLP patients with SAGs, indicating that subjects with PAGs have more severely affected dentition.
The Effects of Coaching on Standardized Admission Examinations. Staff Memorandum of the Boston Regional Office of the Federal Trade Commission.

ERIC Educational Resources Information Center

Federal Trade Commission, Washington, DC. Bureau of Consumer Protection.

A non-experimental design was used to determine if scores of students enrolled in specified major coaching schools were significantly higher than scores of comparable uncoached groups. Score increases at two Scholastic Aptitude Test (SAT) coaching schools and Law School Admission Test (LSAT) schools were compared. Over 1,400 SAT examinees and…
Psychometry and Pescatori projective test in coloproctological patients.

PubMed

Caetano, Ana Célia; Oliveira, Dinis; Gomes, Zaida; Mesquita, Edgar; Rolanda, Carla

2017-01-01

Psychological assessment is not commonly performed nor easily accepted by coloproctological patients. Our aim was to evaluate the psychological component of coloproctological disorders using uncommon tools. The 21-Item Depression Anxiety and Stress Scale and the Pescatori projective test were applied to coloproctological outpatients of the Gastroenterology Department of our hospital as well as to healthy volunteers. Seventy patients (median age 47 years, 22 male) divided in 4 groups (functional constipation, constipated irritable bowel syndrome, benign anorectal disease and perianal Crohn's disease) and 52 healthy volunteers (age 45 years, 18 male) completed the tests. Proctological patients showed higher scores of depression (P<0.001), anxiety (P<0.001), and stress (P<0.001) compared to healthy participants. Compared to the control group, patients with functional constipation, irritable bowel syndrome and perianal Crohn's disease maintained the highest scores in all subscales (P<0.05), while patients with benign anorectal disease only had higher anxiety and stress (P<0.001) scores. The patients also showed lower scores in the Pescatori projective test (P=0.012). A weak association between the projective test and the depression subscale was found (P=0.05). Proctological patients had higher scores of depression, anxiety and stress and lower scores in the Pescatori projective test compared to healthy controls.
Psychometry and Pescatori projective test in coloproctological patients

PubMed Central

Caetano, Ana Célia; Oliveira, Dinis; Gomes, Zaida; Mesquita, Edgar; Rolanda, Carla

2017-01-01

Background: Psychological assessment is not commonly performed nor easily accepted by coloproctological patients. Our aim was to evaluate the psychological component of coloproctological disorders using uncommon tools. Methods: The 21-Item Depression Anxiety and Stress Scale and the Pescatori projective test were applied to coloproctological outpatients of the Gastroenterology Department of our hospital as well as to healthy volunteers. Results: Seventy patients (median age 47 years, 22 male) divided in 4 groups (functional constipation, constipated irritable bowel syndrome, benign anorectal disease and perianal Crohn’s disease) and 52 healthy volunteers (age 45 years, 18 male) completed the tests. Proctological patients showed higher scores of depression (P<0.001), anxiety (P<0.001), and stress (P<0.001) compared to healthy participants. Compared to the control group, patients with functional constipation, irritable bowel syndrome and perianal Crohn’s disease maintained the highest scores in all subscales (P<0.05), while patients with benign anorectal disease only had higher anxiety and stress (P<0.001) scores. The patients also showed lower scores in the Pescatori projective test (P=0.012). A weak association between the projective test and the depression subscale was found (P=0.05). Conclusion: Proctological patients had higher scores of depression, anxiety and stress and lower scores in the Pescatori projective test compared to healthy controls. PMID:28655980
Comparability of IQ Scores on Five Widely Used Intelligence Tests

ERIC Educational Resources Information Center

Hieronymus, A. N.; Stroud, James B.

1969-01-01

Attempts to fill research gap on testing by obtaining comparisons of deviation scores, at grade levels four, seven, and ten, from the California Test of Mental Maturity, Henmon-Nelson Tests, and Lorge-Thorndike Intelligence tests. Results tabulated. (CJ)
A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

ERIC Educational Resources Information Center

Lee, Guemin; Park, In-Yong

2012-01-01

Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
Findings from the 2012 West Virginia Online Writing Scoring Comparability Study

ERIC Educational Resources Information Center

Hixson, Nate; Rhudy, Vaughn

2013-01-01

Student responses to the West Virginia Educational Standards Test (WESTEST) 2 Online Writing Assessment are scored by a computer-scoring engine. The scoring method is not widely understood among educators, and there exists a misperception that it is not comparable to hand scoring. To address these issues, the West Virginia Department of Education…
Exploration of Analysis Methods for Diagnostic Imaging Tests: Problems with ROC AUC and Confidence Scores in CT Colonography

PubMed Central

Mallett, Susan; Halligan, Steve; Collins, Gary S.; Altman, Doug G.

2014-01-01

Background: Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. Methods: In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Results: Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. Conclusions: The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests. PMID:25353643
Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

PubMed

Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

2014-01-01

Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.
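As an illustration of the two approaches contrasted in the abstract above, here is a minimal Python sketch that computes sensitivity and specificity from binary calls and an empirical (rank-based) AUC from confidence scores. The function names and the toy data are hypothetical and are not taken from the study; the AUC shown is the nonparametric Mann-Whitney estimate, not the fitted ROC curves whose problems the authors describe.

```python
def sensitivity_specificity(calls, truth):
    """Sensitivity and specificity of binary calls against a reference standard."""
    tp = sum(1 for c, t in zip(calls, truth) if c and t)
    fn = sum(1 for c, t in zip(calls, truth) if not c and t)
    tn = sum(1 for c, t in zip(calls, truth) if not c and not t)
    fp = sum(1 for c, t in zip(calls, truth) if c and not t)
    return tp / (tp + fn), tn / (tn + fp)


def empirical_auc(scores, truth):
    """Probability that a positive case outscores a negative case (ties count half)."""
    pos = [s for s, t in zip(scores, truth) if t]
    neg = [s for s, t in zip(scores, truth) if not t]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical reader: a confidence score of 0 means "no polyp reported".
truth = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [90, 80, 0, 0, 0, 0, 0, 0, 70, 0]
calls = [s > 0 for s in scores]

sens, spec = sensitivity_specificity(calls, truth)
auc = empirical_auc(scores, truth)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} AUC={auc:.3f}")
```

In this toy data most cases share a score of zero, so the AUC is driven largely by ties and by the single false positive, which is one way to see why zero scores and a few false positives can dominate an ROC summary.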
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats.

PubMed

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions.
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats

PubMed Central

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Background: Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. Objective: To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. Methods: This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. Results: 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Conclusions: Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions. PMID: 27900181
Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

ERIC Educational Resources Information Center

Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

2014-01-01

Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…
Analysis of 2009-10 WCPSS SAT Scores. Measuring Up. E&R Report No. 10.25

ERIC Educational Resources Information Center

Holdzkom, David; Gilleland, Kevin

2010-01-01

Wake County Public School System (WCPSS) students continue to fare well on the SAT test as compared with students in the state and nation. While there was a decline in average test scores in 2009-10 as compared with the prior year, the posted scores continue a trend of measurable improvement over time. Over the past 20 years, the average SAT…

Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

PubMed

Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

2014-01-01

The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p < .001) and increased specificity (43.8%), but did not increase sensitivity, which remained high (85.4%). A simple 6-point scoring system for the Clock Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.
A Quantitative Study Analyzing Predictive Factors That Affect Achievement on Florida's Algebra I End-of-Course Exam (EOC)

ERIC Educational Resources Information Center

Holley, Hope D.

2017-01-01

Despite research that high-stakes tests do not improve knowledge, Florida requires students to pass an Algebra I End-of-Course exam (EOC) to earn a high school diploma. Test passing scores are determined by a raw score to t-score to scale score analysis. This method ultimately results as a comparative test model where students' passage is…
Language of administration and neuropsychological test performance in neurologically intact Hispanic American bilingual adults.

PubMed

Gasquoine, Philip Gerard; Croyle, Kristin L; Cavazos-Gonzalez, Cynthia; Sandoval, Omar

2007-11-01

This study compared the performance of Hispanic American bilingual adults on Spanish and English language versions of a neuropsychological test battery. Language achievement test scores were used to divide 36 bilingual, neurologically intact, Hispanic Americans from south Texas into Spanish-dominant, balanced, and English-dominant bilingual groups. They were administered the eight subtests of the Bateria Neuropsicologica and the Matrix Reasoning subtest of the WAIS-III in Spanish and English. Half the participants were tested in Spanish first. Balanced bilinguals showed no significant differences in test scores between Spanish and English language administrations. Spanish and/or English dominant bilinguals showed significant effects of language of administration on tests with higher language compared to visual perceptual weighting (Woodcock-Munoz Language Survey-Revised, Letter Fluency, Story Memory, and Stroop Color and Word Test). Scores on tests with higher visual-perceptual weighting (Matrix Reasoning, Figure Memory, Wisconsin Card Sorting Test, and Spatial Span), were not significantly affected by language of administration, nor were scores on the Spanish/California Verbal Learning Test, and Digit Span. A problem was encountered in comparing false positive rates in each language, as Spanish norms fell below English norms, resulting in a much higher false positive rate in English across all bilingual groupings. Use of a comparison standard (picture vocabulary score) reduced false positive rates in both languages, but the higher false positive rate in English persisted.
Evaluating the Stability of Test Score Means for the "TOEIC"® Speaking and Writing Tests. Research Report. ETS RR-17-50

ERIC Educational Resources Information Center

Qu, Yanxuan; Huo, Yan; Chan, Eric; Shotts, Matthew

2017-01-01

For educational tests, it is critical to maintain consistency of score scales and to understand the sources of variation in score means over time. This practice helps to ensure that interpretations about test takers' abilities are comparable from one administration (or one form) to another. This study examines the consistency of reported scores…
The Use of the Peabody Individual Achievement Test and the Woodcock Reading Mastery Tests in the Diagnosis of a Learning Disability in Reading: A Caveat.

ERIC Educational Resources Information Center

Caskey, William E., Jr.

1985-01-01

Using a counterbalanced order, 34 learning disabled students were given the Peabody Individual Achievement Test and the Woodcock Reading Mastery Tests. When scores in reading recognition and comprehension subtests were compared, Peabody scores were higher, indicating that fewer students were certifiable as learning disabled by Peabody scores than…
The Impact of Inclusion and Resource Instruction on Standardized Test Scores of Special Education Students

ERIC Educational Resources Information Center

Derico, Vontrice L.

2017-01-01

The purpose of the proposed quasi-experimental quantitative study was to determine if students who were taught in the inclusive setting yielded higher standardized test scores compared to students who were taught in the resource setting. The researcher analyzed the standardized test scores, in the areas of Language Arts, Reading, and Mathematics…
Self-Monitoring Assessments for Educational Accountability Systems

ERIC Educational Resources Information Center

Koretz, Daniel; Beguin, Anton

2010-01-01

Test-based accountability is now the cornerstone of U.S. education policy, and it is becoming more important in many other nations as well. Educators sometimes respond to test-based accountability in ways that produce score inflation. In the past, score inflation has usually been evaluated by comparing trends in scores on a high-stakes test to…
Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

ERIC Educational Resources Information Center

Andersson, Björn

2016-01-01

In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
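The percentile-matching idea described above can be sketched in a few lines of Python. This is a deliberately simplified observed-score version using midpoint percentile ranks and linear interpolation; it does not implement the polytomous IRT model or the asymptotic standard errors that the article addresses, and the function names and sample data are hypothetical.

```python
from bisect import bisect_left


def percentile_rank(scores, x):
    """Midpoint percentile rank of score x within a list of observed scores."""
    below = sum(1 for s in scores if s < x)
    at = sum(1 for s in scores if s == x)
    return 100.0 * (below + 0.5 * at) / len(scores)


def equate(x, form_x, form_y):
    """Map score x on form X to the form Y score with the same percentile rank."""
    p = percentile_rank(form_x, x)
    ys = sorted(set(form_y))
    ranks = [percentile_rank(form_y, y) for y in ys]  # strictly increasing
    if p <= ranks[0]:
        return float(ys[0])
    if p >= ranks[-1]:
        return float(ys[-1])
    i = bisect_left(ranks, p)  # ranks[i - 1] < p <= ranks[i]
    frac = (p - ranks[i - 1]) / (ranks[i] - ranks[i - 1])
    return ys[i - 1] + frac * (ys[i] - ys[i - 1])


# Hypothetical data: form Y is form X shifted up by 10 points.
form_x = [1, 2, 3, 4, 5]
form_y = [11, 12, 13, 14, 15]
print(equate(3, form_x, form_y))
```

Scores outside the range covered by form Y are clamped to its lowest or highest observed score, which is a simplification chosen for this sketch.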
The diagnostic value and accuracy of conjunctival impression cytology, dry eye symptomatology, and routine tear function tests in computer users.

PubMed

Bhargava, Rahul; Kumar, Prachi; Kaur, Avinash; Kumar, Manjushri; Mishra, Anurag

2014-07-01

To compare the diagnostic value and accuracy of dry eye scoring system (DESS), conjunctival impression cytology (CIC), tear film breakup time (TBUT), and Schirmer's test in computer users. A case-control study was done at two referral eye centers. Eyes of 344 computer users were compared to 371 eyes of age and sex matched controls. Dry eye questionnaire (DESS) was administered to both groups and they further underwent measurement of TBUT, Schirmer's, and CIC. Correlation analysis was performed between DESS, CIC, TBUT, and Schirmer's test scores. A Pearson's coefficient of the linear expression (R²) of 0.5 or more was considered statistically significant. The mean age in cases (26.05 ± 4.06 years) was comparable to controls (25.67 ± 3.65 years) (P = 0.465). The mean symptom score in computer users was significantly higher as compared to controls (P < 0.001). Mean TBUT, Schirmer's test values, and goblet cell density were significantly reduced in computer users (P < 0.001). TBUT, Schirmer's, and CIC were abnormal in 48.5%, 29.1%, and 38.4% symptomatic computer users respectively as compared to 8%, 6.7%, and 7.3% symptomatic controls respectively. On correlation analysis, there was a significant (inverse) association of dry eye symptoms (DESS) with TBUT and CIC scores (R² > 0.5), in contrast to Schirmer's scores (R² < 0.5). Duration of computer usage had a significant effect on dry eye symptoms severity, TBUT, and CIC scores as compared to Schirmer's test. DESS should be used in combination with TBUT and CIC for dry eye evaluation in computer users.
Speech-discrimination scores modeled as a binomial variable.

PubMed

Thornton, A R; Raffin, M J

1978-09-01

Many studies have reported variability data for tests of speech discrimination, and the disparate results of these studies have not been given a simple explanation. Arguments over the relative merits of 25- vs 50-word tests have ignored the basic mathematical properties inherent in the use of percentage scores. The present study models performance on clinical tests of speech discrimination as a binomial variable. A binomial model was developed, and some of its characteristics were tested against data from 4120 scores obtained on the CID Auditory Test W-22. A table for determining significant deviations between scores was generated and compared to observed differences in half-list scores for the W-22 tests. Good agreement was found between predicted and observed values. Implications of the binomial characteristics of speech-discrimination scores are discussed.
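To make the binomial argument concrete, the following Python sketch computes the standard deviation of a percentage score when each of n test words is treated as an independent trial with the same probability of a correct response. This is the textbook binomial result only; it is not the authors' table of significant deviations, and the function name and example values are assumptions for illustration.

```python
import math


def percent_score_sd(p_correct, n_words):
    """Standard deviation, in percentage points, of a score on an n-word list
    when each word is an independent trial with success probability p_correct."""
    return 100.0 * math.sqrt(p_correct * (1.0 - p_correct) / n_words)


# Hypothetical listener with a true score of 80%.
for n in (25, 50):
    print(n, round(percent_score_sd(0.8, n), 2))
```

Under these assumptions, doubling the list from 25 to 50 words shrinks the standard deviation by a factor of the square root of 2, and variability is largest for true scores near 50% and smallest near 0% or 100%.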
Routine blood tests to predict liver fibrosis in chronic hepatitis C.

PubMed

Hsieh, Yung-Yu; Tung, Shui-Yi; Lee, Kamfai; Wu, Cheng-Shyong; Wei, Kuo-Liang; Shen, Chien-Heng; Chang, Te-Sheng; Lin, Yi-Hsiung

2012-02-28

To verify the usefulness of FibroQ for predicting fibrosis in patients with chronic hepatitis C, compared with other noninvasive tests. This retrospective cohort study included 237 consecutive patients with chronic hepatitis C who had undergone percutaneous liver biopsy before treatment. FibroQ, aspartate aminotransferase (AST)/alanine aminotransferase ratio (AAR), AST to platelet ratio index, cirrhosis discriminant score, age-platelet index (API), Pohl score, FIB-4 index, and Lok's model were calculated and compared. FibroQ, FIB-4, AAR, API and Lok's model results increased significantly as fibrosis advanced (analysis of variance test: P < 0.001). FibroQ trended to be superior in predicting significant fibrosis score in chronic hepatitis C compared with other noninvasive tests. FibroQ is a simple and useful test for predicting significant fibrosis in patients with chronic hepatitis C.
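The abstract names several indices without giving formulas. As a sketch, three of them can be computed in Python from their commonly cited definitions: AAR = AST/ALT, APRI = (AST / upper limit of normal AST) / platelet count × 100, and FIB-4 = (age × AST) / (platelet count × √ALT). These definitions come from the general literature rather than from this record, the units are assumed (AST and ALT in U/L, platelets in 10^9/L), and FibroQ and the other scores are omitted because their formulas are not stated here.

```python
import math


def aar(ast, alt):
    """AST/ALT ratio."""
    return ast / alt


def apri(ast, ast_upper_limit, platelets):
    """AST to platelet ratio index; platelets assumed in 10^9/L."""
    return (ast / ast_upper_limit) / platelets * 100.0


def fib4(age_years, ast, alt, platelets):
    """FIB-4 index; platelets assumed in 10^9/L."""
    return (age_years * ast) / (platelets * math.sqrt(alt))


# Hypothetical patient values, chosen only to make the arithmetic easy to follow.
print(aar(80, 64), apri(80, 40, 100), fib4(50, 80, 64, 100))
```

The example values are not clinical data and no cutoffs are implied.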
Cognitive test scores in male adolescent cigarette smokers compared to non-smokers: a population-based study.

PubMed

Weiser, Mark; Zarka, Salman; Werbeloff, Nomi; Kravitz, Efrat; Lubin, Gad

2010-02-01

Although previous studies indicate that people with lower intelligence quotient (IQ) scores are more likely to become cigarette smokers, IQ scores of siblings discordant for smoking and of adolescents who began smoking between ages 18-21 years have not been studied systematically. Each year a random sample of Israeli military recruits complete a smoking questionnaire. Cognitive functioning is assessed by the military using standardized tests equivalent to IQ. Of 20 221 18-year-old males, 28.5% reported smoking at least one cigarette a day (smokers). An unadjusted comparison found that smokers scored 0.41 effect sizes (ES, P < 0.001) lower than non-smokers; adjusted analyses remained significant (adjusted ES = 0.27, P < 0.001). Adolescents smoking one to five, six to 10, 11-20 and 21+ cigarettes/day had cognitive test scores 0.14, 0.22, 0.33 and 0.5 adjusted ES poorer than those of non-smokers (P < 0.001). Adolescents who did not smoke by age 18, and then began to smoke between ages 18-21 had lower cognitive test scores compared to never-smokers (adjusted ES = 0.14, P < 0.001). An analysis of brothers discordant for smoking found that smoking brothers had lower cognitive scores than non-smoking brothers (adjusted ES = 0.27; P = 0.014). Controlled analyses from this large population-based cohort of male adolescents indicate that IQ scores are lower in male adolescents who smoke compared to non-smokers and in brothers who smoke compared to their non-smoking brothers. The IQs of adolescents who began smoking between ages 18-21 are lower than those of non-smokers. Adolescents with poorer IQ scores might be targeted for programmes designed to prevent smoking.
Carprofen provides better post-operative analgesia than tramadol in dogs after enucleation: A randomized, masked clinical trial

PubMed Central

Delgado, Cherlene; Bentley, Ellison; Hetzel, Scott; Smith, Lesley J

2015-01-01

Objective: To compare analgesia provided by carprofen or tramadol in dogs after enucleation. Design: Randomized, masked trial. Animals: Forty-three dogs. Procedures: Client-owned dogs admitted for routine enucleation were randomly assigned to receive either carprofen or tramadol orally 2 hours prior to surgery and 12 hours after the first dose. Dogs were scored for pain at baseline, and postoperatively at 0.25, 0.5, 1, 2, 4, 6, 8, 24, and 30 hours after extubation. Dogs received identical premedication and inhalation anesthesia regimens, including premedication with hydromorphone. If the total pain score was ≥9, if there was a score ≥3 in any one category, or if the visual analog scale score (VAS) was ≥35 combined with a palpation score of >0, rescue analgesia (hydromorphone) was administered and treatment failure was recorded. Characteristics between groups were compared with a Student’s t-test and Fisher’s exact test. The incidence of rescue was compared between groups using a log rank test. Pain scores and VAS scores between groups were compared using repeated measures ANOVA. Results: There was no difference in age (p=0.493), gender (p=0.366) or baseline pain scores (p=0.288) between groups. Significantly more dogs receiving tramadol required rescue analgesia (6/21) compared to dogs receiving carprofen (1/22; p=0.035). Pain and VAS scores decreased linearly over time (p=0.038, p<0.001, respectively). There were no significant differences in pain (p=0.915) or VAS scores (p=0.372) between groups at any time point (dogs were excluded from analysis after rescue). Conclusions and Clinical Relevance: This study suggests that carprofen, with opioid premedication, provides more effective post-operative analgesia than tramadol in dogs undergoing enucleation. PMID: 25459482
Comparing State SAT Scores: Problems, Biases, and Corrections.

ERIC Educational Resources Information Center

Gohmann, Stephen F.

1988-01-01

One method to correct for selection bias in comparing Scholastic Aptitude Test (SAT) scores among states is presented, which is a modification of J. J. Heckman's Selection Bias Correction (1976, 1979). Empirical results suggest that sample selection bias is present in SAT score regressions. (SLD)
An Evaluation of Kernel Equating: Parallel Equating with Classical Methods in the SAT Subject Tests[TM] Program. Research Report. ETS RR-09-06

ERIC Educational Resources Information Center

Grant, Mary C.; Zhang, Lilly; Damiano, Michele

2009-01-01

This study investigated kernel equating methods by comparing these methods to operational equatings for two tests in the SAT Subject Tests[TM] program. GENASYS (ETS, 2007) was used for all equating methods and scaled score kernel equating results were compared to Tucker, Levine observed score, chained linear, and chained equipercentile equating…
Linking U.S. School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

ERIC Educational Resources Information Center

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D.

2017-01-01

There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

ERIC Educational Resources Information Center

Cascallar, Alicia S.; Dorans, Neil J.

2005-01-01

This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…
Substitution of California Verbal Learning Test, second edition for Verbal Paired Associates on the Wechsler Memory Scale, fourth edition.

PubMed

Miller, Justin B; Axelrod, Bradley N; Rapport, Lisa J; Hanks, Robin A; Bashem, Jesse R; Schutte, Christian

2012-01-01

Two common measures used to evaluate verbal learning and memory are the Verbal Paired Associates (VPA) subtest from the Wechsler Memory Scales (WMS) and the second edition of the California Verbal Learning Test (CVLT-II). For the fourth edition of the WMS, scores from the CVLT-II can be substituted for VPA; the present study sought to examine the validity of the substitution. For each substitution, paired-samples t tests were conducted between original VPA scaled scores and scaled scores obtained from the CVLT-II substitution to evaluate comparability. Similar comparisons were made at the index score level. At the index score level, substitution resulted in significantly lower scores for the AMI (p = .03; r = .13) but not for the IMI (p = .29) or DMI (p = .09). For the subtest scores, substituted scaled scores for VPA were not significantly different from original scores for the immediate recall condition (p = .20) but were significantly lower at delayed recall (p = .01). These findings offer partial support for the substitution. For both the immediate and delayed conditions, the substitution produced generally lower subtest scores compared to original VPA subtest scores.
The feasibility of automated eye tracking with the Early Childhood Vigilance Test of attention in younger HIV-exposed Ugandan children.

PubMed

Boivin, Michael J; Weiss, Jonathan; Chhaya, Ronak; Seffren, Victoria; Awadu, Jorem; Sikorskii, Alla; Giordani, Bruno

2017-07-01

Tobii eye tracking was compared with webcam-based observer scoring on an animation viewing measure of attention (Early Childhood Vigilance Test; ECVT) to evaluate the feasibility of automating measurement and scoring. Outcomes from both scoring approaches were compared with the Mullen Scales of Early Learning (MSEL), Color-Object Association Test (COAT), and Behavior Rating Inventory of Executive Function for preschool children (BRIEF-P). A total of 44 children 44 to 65 months of age were evaluated with the ECVT, COAT, MSEL, and BRIEF-P. Tobii X2-30 portable infrared cameras were programmed to monitor pupil direction during the ECVT 6-min animation and compared with observer-based PROCODER webcam scoring. Children watched 78% of the cartoon (Tobii) compared with 67% (webcam scoring), although the 2 measures were highly correlated (r = .90, p = .001). It is possible for 2 such measures to be highly correlated even if one is consistently higher than the other (Bergemann et al., 2012). Both ECVT Tobii and webcam ECVT measures significantly correlated with COAT immediate recall (r = .37, p = .02 vs. r = .38, p = .01, respectively) and total recall (r = .33, p = .06 vs. r = .42, p = .005) measures. However, neither the Tobii eye tracking nor PROCODER webcam ECVT measures of attention correlated with MSEL composite cognitive performance or BRIEF-P global executive composite. ECVT scoring using Tobii eye tracking is feasible with at-risk very young African children and consistent with webcam-based scoring approaches in their correspondence to one another and other neurocognitive performance-based measures. By automating measurement and scoring, eye tracking technologies can improve the efficiency and help better standardize ECVT testing of attention in younger children. This holds promise for other neurodevelopmental tests where eye movements, tracking, and gaze length can provide important behavioral markers of neuropsychological and neurodevelopmental processes associated with such tests. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Intelligence is in the eye of the beholder: investigating repeated IQ measurements in forensic psychiatry.

PubMed

Habets, Petra; Jeandarme, Inge; Uzieblo, Kasia; Oei, Karel; Bogaerts, Stefan

2015-05-01

A stable assessment of cognition is of paramount importance for forensic psychiatric patients (FPP). The purpose of this study was to compare repeated measures of IQ scores in FPPs with and without intellectual disability. Repeated measurements of IQ scores in FPPs (n = 176) were collected. Differences between tests were computed, and each IQ score was categorized. Additionally, t-tests and regression analyses were performed. Differences of 10 points or more were found in 66% of the cases comparing WAIS-III with RAVEN scores. Fisher's exact test revealed differences between two WAIS-III scores and the WAIS categories. The WAIS-III did not predict other IQs (WAIS or RAVEN) in participants with intellectual disability. This study showed that stability or interchangeability of scores is lacking, especially in individuals with intellectual disability. Caution in interpreting IQ scores is therefore recommended, and the use of the unitary concept of IQ should be discouraged. © 2014 John Wiley & Sons Ltd.

Association Between Low IQ Scores and Early Mortality in Men and Women: Evidence From a Population-Based Cohort Study.

PubMed

Maenner, Matthew J; Greenberg, Jan S; Mailick, Marsha R

2015-05-01

Lower (versus higher) IQ scores have been shown to increase the risk of early mortality, however, the underlying mechanisms are poorly understood and previous studies underrepresent individuals with intellectual disability (ID) and women. This study followed one third of all senior-year students (approximately aged 17) attending public high school in Wisconsin, U.S. in 1957 (n = 10,317) until 2011. Men and women with the lowest IQ test scores (i.e., IQ scores ≤ 85) had increased rates of mortality compared to people with the highest IQ test scores, particularly for cardiovascular disease. Importantly, when educational attainment was held constant, people with lower IQ test scores did not have higher mortality by age 70 than people with higher IQ test scores. Individuals with lower IQ test scores likely experience multiple disadvantages throughout life that contribute to increased risk of early mortality.
Validity and Reliability of Baseline Testing in a Standardized Environment.

PubMed

Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

2017-08-11

The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred sixty-one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
COPD assessment test and severity of airflow limitation in patients with asthma, COPD, and asthma-COPD overlap syndrome.

PubMed

Kurashima, Kazuyoshi; Takaku, Yotaro; Ohta, Chie; Takayanagi, Noboru; Yanagisawa, Tsutomu; Sugita, Yutaka

2016-01-01

The COPD assessment test (CAT) consists of eight nonspecific scores of quality of life. The aim of this study was to compare the health-related quality of life and severity of airflow limitation in patients with asthma, COPD, and asthma-COPD overlap syndrome (ACOS) using the CAT. We examined CAT and lung functions in 138 patients with asthma, 99 patients with COPD, 51 patients with ACOS, and 44 patients with chronic cough as a control. The CAT score was recorded in all subjects, and the asthma control test was also administered to patients with asthma and ACOS. The CAT scores were compared, and the relationships between the scores and lung function parameters were analyzed. The total CAT scores and scores for cough, phlegm, and dyspnea were higher in patients with ACOS than in patients with asthma and COPD. The total CAT scores were correlated with the percent predicted forced expiratory volume in 1 second only in patients with COPD. The total CAT scores and dyspnea scores adjusted by the percent predicted forced expiratory volume in 1 second were higher in patients with ACOS than in patients with COPD and asthma. The CAT scores and asthma control test scores were more closely correlated in patients with ACOS than in patients with asthma. Patients with ACOS have a higher disease impact and a dyspnea sensation disproportionate to the severity of airflow limitation.
Correlation or Limits of Agreement? Applying the Bland-Altman Approach to the Comparison of Cognitive Screening Instruments.

PubMed

Larner, A J

2016-01-01

Calculation of correlation coefficients is often undertaken as a way of comparing different cognitive screening instruments (CSIs). However, test scores may correlate but not agree, and high correlation may mask lack of agreement between scores. The aim of this study was to use the methodology of Bland and Altman to calculate limits of agreement between the scores of selected CSIs and contrast the findings with Pearson's product moment correlation coefficients between the test scores of the same instruments. Datasets from three pragmatic diagnostic accuracy studies which examined the Mini-Mental State Examination (MMSE) vs. the Montreal Cognitive Assessment (MoCA), the MMSE vs. the Mini-Addenbrooke's Cognitive Examination (M-ACE), and the M-ACE vs. the MoCA were analysed to calculate correlation coefficients and limits of agreement between test scores. Although test scores were highly correlated (all >0.8), calculated limits of agreement were broad (all >10 points), and in one case, MMSE vs. M-ACE, were >15 points. Correlation is not agreement. Highly correlated test scores may conceal broad limits of agreement, consistent with the different emphases of different tests with respect to the cognitive domains examined. Routine incorporation of limits of agreement into diagnostic accuracy studies which compare different tests merits consideration, to enable clinicians to judge whether or not their agreement is close. © 2016 S. Karger AG, Basel.
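The contrast between correlation and agreement described in this abstract can be illustrated with a minimal Python sketch. It assumes the conventional Bland-Altman definition (mean difference plus or minus 1.96 standard deviations of the paired differences); the function names and the paired scores are made up for illustration and are not taken from the study.

```python
from math import sqrt
from statistics import mean, stdev


def pearson_r(a, b):
    """Pearson product-moment correlation between two paired score lists."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((y - mb) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def limits_of_agreement(a, b):
    """Return (bias, lower, upper): mean paired difference and the
    conventional 95% limits of agreement (bias +/- 1.96 * SD of differences)."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two paired lists with at least two scores each")
    diffs = [x - y for x, y in zip(a, b)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical scores: test_b is an exact linear function of test_a, so the
# two are perfectly correlated, yet individual scores differ by several points.
test_a = [10, 14, 18, 22, 26, 30]
test_b = [0.5 * x + 12 for x in test_a]

r = pearson_r(test_a, test_b)
bias, lower, upper = limits_of_agreement(test_a, test_b)
print(f"r = {r:.3f}, bias = {bias:.2f}, limits = [{lower:.2f}, {upper:.2f}]")
```

In this toy data set the correlation is perfect while the limits of agreement span more than 10 points, which is the pattern the abstract warns about.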
A comparison of simulation-based education versus lecture-based instruction for toxicology training in emergency medicine residents.

PubMed

Maddry, Joseph K; Varney, Shawn M; Sessions, Daniel; Heard, Kennon; Thaxton, Robert E; Ganem, Victoria J; Zarzabal, Lee A; Bebarta, Vikhyat S

2014-12-01

Simulation-based teaching (SIM) is a common method for medical education. SIM exposes residents to uncommon scenarios that require critical, timely actions. SIM may be a valuable training method for critically ill poisoned patients whose diagnosis and treatment depend on key clinical findings. Our objective was to compare medical simulation (SIM) to traditional lecture-based instruction (LEC) for training emergency medicine (EM) residents in the acute management of critically ill poisoned patients. EM residents completed two pre-intervention questionnaires: (1) a 24-item multiple-choice test of four toxicological emergencies and (2) a questionnaire using a five-point Likert scale to rate the residents' comfort level in diagnosing and treating patients with specific toxicological emergencies. After completing the pre-intervention questionnaires, residents were randomized to SIM or LEC instruction. Two toxicologists and three EM physicians presented four toxicology topics to both groups in four 20-min sessions. One group was in the simulation center, and the other in a lecture hall. Each group then repeated the multiple-choice test and questionnaire immediately after instruction and again at 3 months after training. Answers were not discussed. The primary outcome was comparison of immediate mean post-intervention test scores and final scores 3 months later between SIM and LEC groups. Test score outcomes between groups were compared at each time point (pre-test, post-instruction, 3-month follow-up) using Wilcoxon rank sum test. Data were summarized by descriptive statistics. Continuous variables were characterized by means (SD) and tested using t tests or Wilcoxon rank sum. Categorical variables were summarized by frequencies (%) and compared between training groups with chi-square or Fisher's exact test. Thirty-two EM residents completed pre- and post-intervention tests and comfort questionnaires on the study day. 
Both groups had higher post-intervention mean test scores (p < 0.001), but the LEC group showed a greater improvement compared to the SIM group (5.6 [2.3] points vs. 3.6 [2.4], p = 0.02). At the 3-month follow-up, 24 (75 %) tests and questionnaires were completed. There was no improvement in 3-month mean test scores in either group compared to immediate post-test scores. The SIM group had higher final mean test scores than the LEC group (16.6 [3.1] vs. 13.3 [2.2], p = 0.009). SIM and LEC groups reported similar diagnosis and treatment comfort level scores at baseline and improved equally after instruction. At 3 months, there was no difference between groups in comfort level scores for diagnosis or treatment. Lecture-based teaching was more effective than simulation-based instruction immediately after intervention. At 3 months, the SIM group showed greater retention than the LEC group. Resident comfort levels for diagnosis and treatment were similar regardless of the type of education.
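The between-group comparison named in this abstract, the Wilcoxon rank sum test, can be sketched in plain Python as follows. This is only an illustration under stated assumptions: it uses the large-sample normal approximation without tie or continuity corrections, the function names are hypothetical, and the sample values are invented rather than taken from the study.

```python
import math


def midranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return ranks


def rank_sum_test(x, y):
    """Return (U, z, two-sided p) for samples x and y.

    Assumption: normal approximation with no tie or continuity correction,
    so the p-value is rough for small or heavily tied samples.
    """
    n1, n2 = len(x), len(y)
    ranks = midranks(list(x) + list(y))
    w = sum(ranks[:n1])                # rank sum of the first sample
    u = w - n1 * (n1 + 1) / 2          # Mann-Whitney U for the first sample
    mu = n1 * n2 / 2
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)
    z = (u - mu) / sigma
    p = math.erfc(abs(z) / math.sqrt(2))
    return u, z, p


# Invented improvement scores for two small groups.
group_a = [1, 2, 3]
group_b = [4, 5, 6]
u, z, p = rank_sum_test(group_a, group_b)
print(f"U = {u}, z = {z:.3f}, p = {p:.3f}")
```

For real analyses an exact or tie-corrected implementation from a statistics package would be the safer choice; the sketch only shows how the ranks drive the statistic.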
European Society of Cardiology-Recommended Coronary Artery Disease Consortium Pretest Probability Scores More Accurately Predict Obstructive Coronary Disease and Cardiovascular Events Than the Diamond and Forrester Score: The Partners Registry.

PubMed

Bittencourt, Marcio Sommer; Hulten, Edward; Polonsky, Tamar S; Hoffman, Udo; Nasir, Khurram; Abbara, Suhny; Di Carli, Marcelo; Blankstein, Ron

2016-07-19

The most appropriate score for evaluating the pretest probability of obstructive coronary artery disease (CAD) is unknown. We sought to compare the Diamond-Forrester (DF) score with the 2 CAD consortium scores recently recommended by the European Society of Cardiology. We included 2274 consecutive patients (age, 56±13 years; 57% male) without prior CAD referred for coronary computed tomographic angiography. Computed tomographic angiography findings were used to determine the presence or absence of obstructive CAD (≥50% stenosis). We compared the DF score with the 2 CAD consortium scores with respect to their ability to predict obstructive CAD and the potential implications of these scores on the downstream use of testing for CAD, as recommended by current guidelines. The DF score did not satisfactorily fit the data and resulted in a significant overestimation of the prevalence of obstructive CAD (P<0.001); the CAD consortium basic score had no significant lack of fit; and the CAD consortium clinical score provided adequate goodness of fit (P=0.39). The DF score had a lower discrimination for obstructive CAD, with an area under the receiver-operating characteristics curve of 0.713 versus 0.752 and 0.791 for the CAD consortium models (P<0.001 for both). Consequently, the use of the DF score was associated with fewer individuals being categorized as requiring no additional testing (8.3%) compared with the CAD consortium models (24.6% and 30.0%; P<0.001). The proportion of individuals with a high pretest probability was 18% with the DF and only 1.1% with the CAD consortium scores (P<0.001). Conclusions: Among contemporary patients referred for noninvasive testing, the DF risk score overestimates the risk of obstructive CAD. On the other hand, the CAD consortium scores offered improved goodness of fit and discrimination; thus, their use could decrease the need for noninvasive or invasive testing while increasing the yield of such tests. 
© 2016 American Heart Association, Inc.
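The discrimination measure reported in this abstract, the area under the receiver-operating characteristic curve, has a simple pairwise interpretation that a short Python sketch can show. The function name and the probabilities below are hypothetical and only illustrate the calculation; they are not the registry data.

```python
def roc_auc(scores_with_disease, scores_without_disease):
    """Area under the ROC curve computed as the fraction of
    (diseased, non-diseased) pairs in which the diseased case receives the
    higher score; tied pairs count as half."""
    if not scores_with_disease or not scores_without_disease:
        raise ValueError("both groups must contain at least one score")
    wins = 0.0
    for pos in scores_with_disease:
        for neg in scores_without_disease:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(scores_with_disease) * len(scores_without_disease))


# Invented pretest probabilities for patients with and without obstructive CAD.
with_cad = [0.9, 0.8, 0.4]
without_cad = [0.7, 0.3, 0.2]
print(f"AUC = {roc_auc(with_cad, without_cad):.3f}")
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which values such as 0.713 and 0.791 are read.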
Establishing the Validity of TOEIC Bridge™ Test Scores for Students in Colombia, Chile, and Ecuador. Research Report. ETS RR-08-58

ERIC Educational Resources Information Center

Sinharay, Sandip; Feng, Ying; Saldivia, Luis; Powers, Donald E.; Ginuta, Anthony; Simpson, Annabelle; Weng, Vincent

2008-01-01

The validity of TOEIC Bridge™ scores as a measure of English language skill was examined from the standpoint of a unified concept of test validity. In this study, more than 6,000 test takers in 3 Latin American countries (Chile, Colombia, and Ecuador) took 1 form of the TOEIC Bridge test, and their scores were compared to additional information…
A Direct Comparison of Real-World and Virtual Navigation Performance in Chronic Stroke Patients.

PubMed

Claessen, Michiel H G; Visser-Meily, Johanna M A; de Rooij, Nicolien K; Postma, Albert; van der Ham, Ineke J M

2016-04-01

An increasing number of studies have presented evidence that various patient groups with acquired brain injury suffer from navigation problems in daily life. This skill is, however, scarcely addressed in current clinical neuropsychological practice and suitable diagnostic instruments are lacking. Real-world navigation tests are limited by geographical location and associated with practical constraints. It was, therefore, investigated whether virtual navigation might serve as a useful alternative. To investigate the convergent validity of virtual navigation testing, performance on the Virtual Tubingen test was compared to that on an analogous real-world navigation test in 68 chronic stroke patients. The same eight subtasks, addressing route and survey knowledge aspects, were assessed in both tests. In addition, navigation performance of stroke patients was compared to that of 44 healthy controls. A correlation analysis showed moderate overlap (r = .535) between composite scores of overall real-world and virtual navigation performance in stroke patients. Route knowledge composite scores correlated somewhat more strongly (r = .523) than survey knowledge composite scores (r = .442). When comparing group performances, patients obtained lower scores than controls on seven subtasks. Whereas the real-world test was found to be easier than its virtual counterpart, no significant interaction effects were found between group and environment. Given moderate overlap of the total scores between the two navigation tests, we conclude that virtual testing of navigation ability is a valid alternative to navigation tests that rely on real-world route exposure.
Developmental Eye Movement (DEM) Test Norms for Mandarin Chinese-Speaking Chinese Children.

PubMed

Xie, Yachun; Shi, Chunmei; Tong, Meiling; Zhang, Min; Li, Tingting; Xu, Yaqin; Guo, Xirong; Hong, Qin; Chi, Xia

2016-01-01

The Developmental Eye Movement (DEM) test is commonly used as a clinical visual-verbal ocular motor assessment tool to screen and diagnose reading problems at the onset. No established norm exists for using the DEM test with Mandarin Chinese-speaking Chinese children. This study aims to establish the normative values of the DEM test for the Mandarin Chinese-speaking population in China; it also aims to compare the values with three other published norms for English-, Spanish-, and Cantonese-speaking Chinese children. A random stratified sampling method was used to recruit children from eight kindergartens and eight primary schools in the main urban and suburban areas of Nanjing. A total of 1,425 Mandarin Chinese-speaking children aged 5 to 12 years took the DEM test in Mandarin Chinese. A digital recorder was used to record the process. All of the subjects completed a symptomatology survey, and their DEM scores were determined by a trained tester. The scores were computed using the formula in the DEM manual, except that the "vertical scores" were adjusted by taking the vertical errors into consideration. The results were compared with the three other published norms. In our subjects, a general decrease with age was observed for the four eye movement indexes: vertical score, adjusted horizontal score, ratio, and total error. For both the vertical and adjusted horizontal scores, the Mandarin Chinese-speaking children completed the tests much more quickly than the norms for English- and Spanish-speaking children. However, the same group completed the test slightly more slowly than the norms for Cantonese-speaking children. The differences in the means were significant (P<0.001) in all age groups. For several ages, the scores obtained in this study were significantly different from the reported scores of Cantonese-speaking Chinese children (P<0.005). 
Compared with English-speaking children, only the vertical score of the 6-year-old group, the vertical-horizontal time ratio of the 8-year-old group and the errors of 9-year-old group had no significant difference (P>0.05); compared with Spanish-speaking children, the scores were statistically significant (P<0.001) for the total error scores of the age groups, except the 6-, 9-, 10-, and 11-year-old age groups (P>0.05). DEM norms may be affected by differences in language, cultural, and educational systems among various ethnicities. The norms of the DEM test are proposed for use with Mandarin Chinese-speaking children in Nanjing and will be proposed for children throughout China.
Academic performance in adolescents born after ART-a nationwide registry-based cohort study.

PubMed

Spangmose, A L; Malchau, S S; Schmidt, L; Vassard, D; Rasmussen, S; Loft, A; Forman, J; Pinborg, A

2017-02-01

Is academic performance in adolescents aged 15-16 years and conceived after ART, measured as test scores in ninth grade, comparable to that for spontaneously conceived (SC) adolescents? ART singletons had a significantly lower mean test score in the adjusted analysis when compared with SC singletons, yet the differences were small and probably not of clinical relevance. Previous studies have shown similar intelligence quotient (IQ) levels in ART and SC children, but only a few have been on adolescents. Academic performance measured with standardized national tests has not previously been explored in a complete national cohort of adolescents conceived after ART. A Danish national registry-based cohort including all 4766 ART adolescents (n = 2836 singletons and n = 1930 twins) born in 1995-1998 were compared with two SC control cohorts: a randomly selected singleton population (n = 5660) and all twins (n = 7064) born from 1995 to 1998 in Denmark. Nine children who died during the follow-up period were excluded from the study. Mean test scores on a 7-point-marking scale from -3 to 12 were compared, and adjustments were made for relevant reproductive and socio-demographic covariates including occupational and educational level of the parents. The crude mean test score was higher in both ART singletons and ART twins compared with SC adolescents. The crude mean differences were +0.41 (95% CI 0.30-0.53) and +0.45 (95% CI 0.28-0.62) between ART and SC singletons and between ART and SC twins, respectively. However, the adjusted mean overall test score was significantly lower for ART singletons compared with SC singletons (adjusted mean difference -0.15 (95% CI -0.29-(-0.02))). For comparison, the adjusted mean difference was +2.05 (95% CI 1.82-2.28) between the highest and the lowest parental educational level, suggesting that the effect of ART is weak compared with the conventional predictors. 
The adjusted analyses showed significantly lower mean test scores in mathematics and physics/chemistry for ART singletons compared with SC singletons. Comparing ART twins with SC twins yielded no difference in academic performance in the adjusted analyses. Similar crude and adjusted overall mean test scores were found when comparing ART singletons and ART twins. Missing data on educational test scores occurred in 6.6% of adolescents aged 15-16 years for the birth cohorts 1995-1997, where all of the children according to their age should have passed the ninth grade exam at the time of data retrieval. As sensitivity analyses yielded no significant difference in the adjusted risk of having missing test scores between any of the groups, it is unlikely that this should bias our results. Adjustment for body mass index and smoking during pregnancy was not possible. As our results are based on national data, our findings can be applied to other populations. The findings of this paper suggest that a possible small negative effect of parental subfertility or ART treatment is counterbalanced by the higher educational level in the ART parents. The Danish Medical Association in Copenhagen (KMS) funded this study with a scholarship grant. None of the authors had any competing interests. 704676. © The Author 2017. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Fold assessment for comparative protein structure modeling.

PubMed

Melo, Francisco; Sali, Andrej

2007-11-01

Accurate and automated assessment of both geometrical errors and incompleteness of comparative protein structure models is necessary for an adequate use of the models. Here, we describe a composite score for discriminating between models with the correct and incorrect fold. To find an accurate composite score, we designed and applied a genetic algorithm method that searched for a most informative subset of 21 input model features as well as their optimized nonlinear transformation into the composite score. The 21 input features included various statistical potential scores, stereochemistry quality descriptors, sequence alignment scores, geometrical descriptors, and measures of protein packing. The optimized composite score was found to depend on (1) a statistical potential z-score for residue accessibilities and distances, (2) model compactness, and (3) percentage sequence identity of the alignment used to build the model. The accuracy of the composite score was compared with the accuracy of assessment by single and combined features as well as by other commonly used assessment methods. The testing set was representative of models produced by automated comparative modeling on a genomic scale. The composite score performed better than any other tested score in terms of the maximum correct classification rate (i.e., 3.3% false positives and 2.5% false negatives) as well as the sensitivity and specificity across the whole range of thresholds. The composite score was implemented in our program MODELLER-8 and was used to assess models in the MODBASE database that contains comparative models for domains in approximately 1.3 million protein sequences.
The Impact of Scholastic Instrumental Music and Scholastic Chess Study on the Standardized Test Scores of Students in Grades Three, Four, and Five

ERIC Educational Resources Information Center

Martinez, Edwin E.

2012-01-01

This study examines the impact of instrumental music study and group chess lessons on the standardized test scores of suburban elementary public school students (grades three through five) in Levittown, New York. The study divides the students into the following groups and compares the standardized test scores of each: a) instrumental music…
ACT Test Preparation Course and Its Impact on Students' College- and Career-Readiness

ERIC Educational Resources Information Center

Parrott, Timothy Nolan

2012-01-01

This study examined the effectiveness of an ACT intervention course developed for high school juniors at Anderson County High School during the 2011-2012 school year. This study compared the ACT composite test scores of the treatment group to the ACT composite test scores of the control group by using their PLAN scores as a baseline, to determine…
Construct-related validity of the TOCS measures: comparison of intelligibility and speaking rate scores in children with and without speech disorders.

PubMed

Hodge, Megan M; Gotzke, Carrie L

2014-01-01

This study evaluated construct-related validity of the Test of Children's Speech (TOCS). Intelligibility scores obtained using open-set word identification tasks (orthographic transcription) for the TOCS word and sentence tests and rate scores for the TOCS sentence test (words per minute or WPM and intelligible words per minute or IWPM) were compared for a group of 15 adults (18-30 years of age) with normal speech production and three groups of children: 48 3-6 year-olds with typical speech development and neurological histories (TDS), 48 3-6 year-olds with a speech sound disorder of unknown origin and no identified neurological impairment (SSD-UNK), and 22 3-10 year-olds with dysarthria and cerebral palsy (DYS). As expected, mean intelligibility scores and rates increased with age in the TDS group. However, word test intelligibility, WPM and IWPM scores for the 6 year-olds in the TDS group were significantly lower than those for the adults. The DYS group had significantly lower word and sentence test intelligibility and WPM and IWPM scores than the TDS and SSD-UNK groups. Compared to the TDS group, the SSD-UNK group also had significantly lower intelligibility scores for the word and sentence tests, and significantly lower IWPM, but not WPM scores on the sentence test. The results support the construct-related validity of TOCS as a tool for obtaining intelligibility and rate scores that are sensitive to group differences in 3-6 year-old children, with and without speech sound disorders, and to 3+ year-old children with speech disorders, with and without dysarthria. Readers will describe the word and sentence intelligibility and speaking rate performance of children with typically developing speech at age levels of 3, 4, 5 and 6 years, as measured by the Test of Children's Speech, and how these compare with adult speakers and two groups of children with speech disorders. 
They will also recognize what measures on this test differentiate children with speech sound disorders of unknown origin from children with cerebral palsy and dysarthria. Copyright © 2014 Elsevier Inc. All rights reserved.
Feasibility of remote administration of the Fundamentals of Laparoscopic Surgery (FLS) skills test.

PubMed

Okrainec, Allan; Vassiliou, Melina; Kapoor, Andrew; Pitzul, Kristen; Henao, Oscar; Kaneva, Pepa; Jackson, Timothy; Ritter, E Matt

2013-11-01

Fundamentals of Laparoscopic Surgery (FLS) certification testing currently is offered at accredited test centers or at select surgical conferences. Maintaining these test centers requires considerable investment in human and financial resources. Additionally, it can be challenging for individuals outside North America to become FLS certified. The objective of this pilot study was to assess the feasibility of remotely administering and scoring the FLS examination using live videoconferencing compared with standard onsite testing. This parallel mixed-methods study used both FLS scoring data and participant feedback to determine the barriers to feasibility of remote proctoring for the FLS examination. Participants were tested at two accredited FLS testing centers. An official FLS proctor administered and scored the FLS exam remotely while another onsite proctor provided a live score of participants' performance. Participant feedback was collected during testing. Interrater reliabilities of onsite and remote FLS scoring data were compared using intraclass correlation coefficients (ICCs). Participant feedback was analyzed using modified grounded theory to identify themes for barriers to feasibility. The scores of the remote and onsite proctors showed excellent interrater reliability in the total FLS (ICC 0.995, CI [0.985-0.998]). Several barriers led to critical errors in remote scoring, but most were accompanied by a solution incorporated into the study protocol. The most common barrier was the chain of custody for exam accessories. The results of this pilot study suggest that remote administration of the FLS has the potential to decrease costs without altering test-taker scores or exam validity. Further research is required to validate protocols for remote and onsite proctors and to direct execution of these protocols in a controlled environment identical to current FLS test administration.
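Interrater reliability of the kind reported here can be illustrated with a small Python sketch of an intraclass correlation coefficient. The abstract does not state which ICC form was used, so the sketch assumes the two-way random-effects, absolute-agreement, single-rater form often written ICC(2,1); the function name and the ratings are invented for illustration.

```python
def icc_2_1(ratings):
    """ICC(2,1) from a table with one row per examinee and one column per rater.

    Assumed form: two-way random effects, absolute agreement, single rater,
    built from the usual two-way ANOVA mean squares.
    """
    n = len(ratings)        # number of examinees
    k = len(ratings[0])     # number of raters
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need a rectangular table with at least 2 rows and 2 columns")

    grand = sum(sum(row) for row in ratings) / (n * k)
    row_means = [sum(row) / k for row in ratings]
    col_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((x - grand) ** 2 for row in ratings for x in row)
    ss_error = ss_total - ss_rows - ss_cols

    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Invented scores: the second rater is always 2 points higher than the first.
scores = [[10, 12], [20, 22], [30, 32]]
print(f"ICC(2,1) = {icc_2_1(scores):.3f}")
```

Because this form measures absolute agreement, a constant offset between raters lowers the coefficient slightly even though the two raters rank every examinee identically.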
The Economic Effects of Cognitive and Educational Differences Among Low-Ability and Blue-Collar Origin Men: A Comparative Analysis.

ERIC Educational Resources Information Center

Olneck, Michael R.

This study used five data sets to investigate the effects of measured cognitive skills on educational attainment, and the effects of cognitive skills and educational attainment on occupational status and earning among men with low test scores, as compared to men with high test scores, and among men with blue-collar fathers, as compared to men with…
Transforming Biology Assessment with Machine Learning: Automated Scoring of Written Evolutionary Explanations

NASA Astrophysics Data System (ADS)

Nehm, Ross H.; Ha, Minsu; Mayfield, Elijah

2012-02-01

This study explored the use of machine learning to automatically evaluate the accuracy of students' written explanations of evolutionary change. Performance of the Summarization Integrated Development Environment (SIDE) program was compared to human expert scoring using a corpus of 2,260 evolutionary explanations written by 565 undergraduate students in response to two different evolution instruments (the EGALT-F and EGALT-P) that contained prompts that differed in various surface features (such as species and traits). We tested human-SIDE scoring correspondence under a series of different training and testing conditions, using Kappa inter-rater agreement values of greater than 0.80 as a performance benchmark. In addition, we examined the effects of response length on scoring success; that is, whether SIDE scoring models functioned with comparable success on short and long responses. We found that SIDE performance was most effective when scoring models were built and tested at the individual item level and that performance degraded when suites of items or entire instruments were used to build and test scoring models. Overall, SIDE was found to be a powerful and cost-effective tool for assessing student knowledge and performance in a complex science domain.
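The Kappa benchmark mentioned in this abstract refers to chance-corrected agreement between two raters. A minimal Python sketch of Cohen's kappa follows; the function name and the human and machine labels are hypothetical and do not come from the study's corpus.

```python
from collections import Counter


def cohens_kappa(rater_1, rater_2):
    """Cohen's kappa: observed agreement corrected for the agreement expected
    by chance from each rater's label frequencies."""
    if len(rater_1) != len(rater_2) or not rater_1:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(rater_1)
    observed = sum(a == b for a, b in zip(rater_1, rater_2)) / n
    counts_1, counts_2 = Counter(rater_1), Counter(rater_2)
    labels = set(counts_1) | set(counts_2)
    expected = sum(counts_1[label] * counts_2[label] for label in labels) / (n * n)
    if expected == 1.0:
        # Both raters used a single identical label; kappa is undefined,
        # and returning 1.0 here is a convention chosen for this sketch.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Invented binary scores (1 = key concept present, 0 = absent) for ten responses.
human = [1, 1, 0, 0, 1, 0, 1, 1, 0, 0]
machine = [1, 1, 0, 0, 1, 0, 1, 0, 0, 0]
print(f"kappa = {cohens_kappa(human, machine):.2f}")
```

Here the raters agree on 9 of 10 responses, but because half of that agreement would be expected by chance, kappa comes out at about 0.80, right at the benchmark rather than above it.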
Verbal Serial List Learning in Mild Cognitive Impairment: A Profile Analysis of Interference, Forgetting, and Errors

PubMed Central

Libon, David J.; Bondi, Mark W.; Price, Catherine C.; Lamar, Melissa; Eppig, Joel; Wambach, Denene M.; Nieves, Christine; Delano-Wood, Lisa; Giovannetti, Tania; Lippa, Carol; Kabasakalian, Anahid; Cosentino, Stephanie; Swenson, Rod; Penney, Dana L.

2012-01-01

Using cluster analysis, Libon et al. (2010) found three verbal serial list-learning profiles involving delay memory test performance in patients with mild cognitive impairment (MCI). Amnesic MCI (aMCI) patients presented with low scores on delay free recall and recognition tests; mixed MCI (mxMCI) patients scored higher on recognition compared to delay free recall tests; and dysexecutive MCI (dMCI) patients generated relatively intact scores on both delay test conditions. The aim of the current research was to further characterize memory impairment in MCI by examining forgetting/savings, interference from a competing word list, intrusion errors/perseverations, intrusion word frequency, and recognition foils in these three statistically determined MCI groups compared to normal control (NC) participants. The aMCI patients exhibited little savings, generated more highly prototypic intrusion errors, and displayed indiscriminate responding to delayed recognition foils. The mxMCI patients exhibited higher saving scores, fewer and less prototypic intrusion errors, and selectively endorsed recognition foils from the interference list. dMCI patients also selectively endorsed recognition foils from the interference list but performed similarly compared to NC participants. These data suggest the existence of distinct memory impairments in MCI and caution against the routine use of a single memory test score to operationally define MCI. PMID:21880171
Auditing for Score Inflation Using Self-Monitoring Assessments: Findings from Three Pilot Studies

ERIC Educational Resources Information Center

Koretz, Daniel; Jennings, Jennifer L.; Ng, Hui Leng; Yu, Carol; Braslow, David; Langi, Meredith

2016-01-01

Test-based accountability often produces score inflation. Most studies have evaluated inflation by comparing trends on a high-stakes test and a lower stakes audit test. However, Koretz and Beguin (2010) noted weaknesses of audit tests and suggested self-monitoring assessments (SMAs), which incorporate audit items into high-stakes tests. This…
A Comparison of the Performance of Graduate and Undergraduate School Applicants on the Test of Written English. TOEFL Research Reports Report 50.

ERIC Educational Resources Information Center

Zwick, Rebecca; Thayer, Dorothy T.

The performance of graduate and undergraduate school applicants on the Test of Written English (TWE) was compared for each of 66 data sets, dating from 1988 to 1993. The analyses compared the average TWE score for graduates and undergraduates after matching examinees on the total score on the Test of English as a Foreign Language (TOEFL). The main…

Automated smartphone audiometry: Validation of a word recognition test app.

PubMed

Dewyer, Nicholas A; Jiradejvong, Patpong; Henderson Sabes, Jennifer; Limb, Charles J

2018-03-01

Develop and validate an automated smartphone word recognition test. Cross-sectional case-control diagnostic test comparison. An automated word recognition test was developed as an app for a smartphone with earphones. English-speaking adults with recent audiograms and various levels of hearing loss were recruited from an audiology clinic and were administered the smartphone word recognition test. Word recognition scores determined by the smartphone app and the gold standard speech audiometry test performed by an audiologist were compared. Test scores for 37 ears were analyzed. Word recognition scores determined by the smartphone app and audiologist testing were in agreement, with 86% of the data points within a clinically acceptable margin of error and a linear correlation value between test scores of 0.89. The WordRec automated smartphone app accurately determines word recognition scores. 3b. Laryngoscope, 128:707-712, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
Universality, correlations, and rankings in the Brazilian universities national admission examinations

NASA Astrophysics Data System (ADS)

da Silva, Roberto; Lamb, Luis C.; Barbosa, Marcia C.

2016-09-01

We analyze the scores obtained by students who have taken the ENEM, the Brazilian High School National Examination, which is used in the admission process at Brazilian universities. The average high school scores from different disciplines are compared through the Pearson correlation coefficient. The results show a very large correlation between the performance in the different school subjects. Even though the students' scores in the ENEM form a Gaussian due to the standardization, we show that the high schools' scores form a bimodal distribution that cannot be used to evaluate and compare students' performance over time. We also show that this high school distribution reflects the correlation between school performance and the economic level (based on the average family income) of the students. The ENEM scores are compared with a Brazilian non-standardized exam, the entrance examination from the Universidade Federal do Rio Grande do Sul. The analysis of the performance of the same individuals in both tests shows that the two tests not only select different abilities, but also lead to the admission of different sets of individuals. Our results indicate that standardized tests might be an interesting tool to compare performance of individuals over the years, but not of institutions.
A Strategy for Replacing Sum Scoring

ERIC Educational Resources Information Center

Ramsay, James O.; Wiberg, Marie

2017-01-01

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
Developmental Eye Movement (DEM) Test Norms for Mandarin Chinese-Speaking Chinese Children

PubMed Central

Tong, Meiling; Zhang, Min; Li, Tingting; Xu, Yaqin; Guo, Xirong; Hong, Qin; Chi, Xia

2016-01-01

The Developmental Eye Movement (DEM) test is commonly used as a clinical visual-verbal ocular motor assessment tool to screen and diagnose reading problems at the onset. No established norm exists for using the DEM test with Mandarin Chinese-speaking Chinese children. This study aims to establish the normative values of the DEM test for the Mandarin Chinese-speaking population in China; it also aims to compare the values with three other published norms for English-, Spanish-, and Cantonese-speaking Chinese children. A random stratified sampling method was used to recruit children from eight kindergartens and eight primary schools in the main urban and suburban areas of Nanjing. A total of 1,425 Mandarin Chinese-speaking children aged 5 to 12 years took the DEM test in Mandarin Chinese. A digital recorder was used to record the process. All of the subjects completed a symptomatology survey, and their DEM scores were determined by a trained tester. The scores were computed using the formula in the DEM manual, except that the “vertical scores” were adjusted by taking the vertical errors into consideration. The results were compared with the three other published norms. In our subjects, a general decrease with age was observed for the four eye movement indexes: vertical score, adjusted horizontal score, ratio, and total error. For both the vertical and adjusted horizontal scores, the Mandarin Chinese-speaking children completed the tests much more quickly than the norms for English- and Spanish-speaking children. However, the same group completed the test slightly more slowly than the norms for Cantonese-speaking children. The differences in the means were significant (P<0.001) in all age groups. For several ages, the scores obtained in this study were significantly different from the reported scores of Cantonese-speaking Chinese children (P<0.005). 
Compared with English-speaking children, only the vertical score of the 6-year-old group, the vertical-horizontal time ratio of the 8-year-old group and the errors of 9-year-old group had no significant difference (P>0.05); compared with Spanish-speaking children, the scores were statistically significant (P<0.001) for the total error scores of the age groups, except the 6-, 9-, 10-, and 11-year-old age groups (P>0.05). DEM norms may be affected by differences in language, cultural, and educational systems among various ethnicities. The norms of the DEM test are proposed for use with Mandarin Chinese-speaking children in Nanjing and will be proposed for children throughout China. PMID:26881754
Affecting College English Placement Scores: College Readiness Remediation for High-School Seniors

ERIC Educational Resources Information Center

Olsen Rowland, Joyce Kay

2011-01-01

The purpose of the quantitative ex post facto study was to compare the English Placement Test (EPT) scores of students before and after the Expository Reading and Writing Curriculum (ERWC) remediation efforts had been employed and to determine the effectiveness of the ERWC in raising EPT scores. Using a Wilcoxon signed rank test, the researcher…
Animal source foods have a positive impact on the primary school test scores of Kenyan schoolchildren in a cluster-randomised, controlled feeding intervention trial.

PubMed

Hulett, Judie L; Weiss, Robert E; Bwibo, Nimrod O; Galal, Osman M; Drorbaugh, Natalie; Neumann, Charlotte G

2014-03-14

Micronutrient deficiencies and suboptimal energy intake are widespread in rural Kenya, with detrimental effects on child growth and development. Sporadic school feeding programmes rarely include animal source foods (ASF). In the present study, a cluster-randomised feeding trial was undertaken to determine the impact of snacks containing ASF on district-wide, end-term standardised school test scores and nutrient intake. A total of twelve primary schools were randomly assigned to one of three isoenergetic feeding groups (a local plant-based stew (githeri) with meat, githeri plus whole milk or githeri with added oil) or a control group receiving no intervention feeding. After the initial term that served as baseline, children were fed at school for five consecutive terms over two school years from 1999 to 2001. Longitudinal analysis was used controlling for average energy intake, school attendance, and baseline socio-economic status, age, sex and maternal literacy. Children in the Meat group showed significantly greater improvements in test scores than those in all the other groups, and the Milk group showed significantly greater improvements in test scores than the Plain Githeri (githeri+oil) and Control groups. Compared with the Control group, the Meat group showed significant improvements in test scores in Arithmetic, English, Kiembu, Kiswahili and Geography. The Milk group showed significant improvements compared with the Control group in test scores in English, Kiswahili, Geography and Science. Folate, Fe, available Fe, energy per body weight, vitamin B₁₂, Zn and riboflavin intake were significant contributors to the change in test scores. The greater improvements in test scores of children receiving ASF indicate improved academic performance, which can result in greater academic achievement.
The Effect of English Language on Multiple Choice Question Scores of Thai Medical Students.

PubMed

Phisalprapa, Pochamana; Muangkaew, Wayuda; Assanasen, Jintana; Kunavisarut, Tada; Thongngarm, Torpong; Ruchutrakool, Theera; Kobwanthanakun, Surapon; Dejsomritrutai, Wanchai

2016-04-01

Universities in Thailand are preparing for Thailand's integration into the ASEAN Economic Community (AEC) by increasing the number of tests in English language. English language is not the native language of Thailand. Differences in English language proficiency may affect scores among test-takers, even when subject knowledge among test-takers is comparable, and may falsely represent the knowledge level of the test-taker. To study the impact of English language multiple choice test questions on test scores of medical students. The final examination of fourth-year medical students completing internal medicine rotation contains 120 multiple choice questions (MCQ). The languages used on the test are Thai and English at a ratio of 3:1. Individual scores of tests taken in both languages were collected and the effect of English language on MCQ was analyzed. Individual MCQ scores were then compared with individual student English language proficiency and student grade point average (GPA). Two hundred ninety-five fourth-year medical students were enrolled. The mean percentages of MCQ scores in Thai and English were significantly different (65.0 ± 8.4 and 56.5 ± 12.4, respectively, p < 0.001). The correlation between MCQ scores in Thai and English was fair (Spearman's correlation coefficient = 0.41, p < 0.001). Of 295 students, only 73 (24.7%) students scored higher when being tested in English than in Thai language. Students were classified into six grade categories (A, B+, B, C+, C, and D+), which cumulatively measured total internal medicine rotation performance score plus final examination score. MCQ scores from Thai language examination were more closely correlated with total course grades than were the scores from English language examination (Spearman's correlation coefficient = 0.73 (p < 0.001) and 0.53 (p < 0.001), respectively). 
The gap difference between MCQ scores in both languages was higher in borderline students than in the excellent student group (11.2 ± 11.2 and 7.1 ± 8.2, respectively, p < 0.001). Overall, average student English proficiency score was very high, at 3.71 ± 0.35 from a total of 4.00. Mean student GPA was 3.40 ± 0.33 from a possible 4.00. English language MCQ examination scores were more highly associated with GPA than with English language proficiency. The use of English language multiple choice question test may decrease scores of the fourth-year internal medicine post-rotation final examination, especially those of borderline students.
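Spearman's correlation coefficient, used throughout this abstract, is the Pearson correlation computed on ranks rather than raw scores. A minimal sketch follows, assuming tied scores receive the mean of the ranks they span; the score lists are hypothetical and are not the study's data.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1 to n; tied values share the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def spearman_rho(xs, ys):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / sqrt(var_x * var_y)


# Hypothetical percentage scores for the same five students in each language.
thai_scores = [65, 70, 58, 80, 62]
english_scores = [55, 72, 50, 60, 66]
print(spearman_rho(thai_scores, english_scores))
```

Because it depends only on rank order, the coefficient is unaffected by the English-language scores being lower overall; it measures whether students keep the same relative standing in both languages.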
The Effects of Routing and Scoring within a Computer Adaptive Multi-Stage Framework

ERIC Educational Resources Information Center

Dallas, Andrew

2014-01-01

This dissertation examined the overall effects of routing and scoring within a computer adaptive multi-stage framework (ca-MST). Testing in a ca-MST environment has become extremely popular in the testing industry. Testing companies enjoy its efficiency benefits as compared to traditionally linear testing and its quality-control features over…
Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

PubMed

Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

2001-01-01

Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.
Applying team-based learning of diagnostics for undergraduate students: assessing teaching effectiveness by a randomized controlled trial study

PubMed Central

Zeng, Rui; Xiang, Lian-rui; Zeng, Jing; Zuo, Chuan

2017-01-01

Background We aimed to introduce team-based learning (TBL) as one of the teaching methods for diagnostics and to compare its teaching effectiveness with that of the traditional teaching methods. Methods We conducted a randomized controlled trial on diagnostics teaching involving 111 third-year medical undergraduates, using TBL as the experimental intervention, compared with lecture-based learning as the control, for teaching the two topics of symptomatology. Individual Readiness Assurance Test (IRAT)-baseline and Group Readiness Assurance Test (GRAT) were performed in members of each TBL subgroup. The scores in Individual Terminal Test 1 (ITT1) immediately after class and Individual Terminal Test 2 (ITT2) 1 week later were compared between the two groups. The questionnaire and interview were also implemented to survey the attitude of students and teachers toward TBL. Results There was no significant difference between the two groups in ITT1 (19.85±4.20 vs 19.70±4.61), while the score of the TBL group was significantly higher than that of the control group in ITT2 (19.15±3.93 vs 17.46±4.65). In the TBL group, the scores of the two terminal tests after the teaching intervention were significantly higher than the baseline test score of individuals. IRAT-baseline, ITT1, and ITT2 scores of students at different academic levels in the TBL teaching exhibited significant differences, but the ITT1-IRAT-baseline and ITT2-IRAT-baseline indicated no significant differences among the three subgroups. Conclusion Our TBL in symptomatology approach was highly accepted by students in the improvement of interest and self-directed learning and resulted in an increase in knowledge acquirements, which significantly improved short-term test scores compared with lecture-based learning. TBL is regarded as an effective teaching method worthy of promoting. PMID:28331383
College Math Assessment: SAT Scores vs. College Math Placement Scores

ERIC Educational Resources Information Center

Foley-Peres, Kathleen; Poirier, Dawn

2008-01-01

Many colleges and universities use SAT math scores or math placement tests to place students in the appropriate math course. This study compares the use of math placement scores and SAT scores for 188 freshman students. The students' grades and faculty observations were analyzed to determine if the SAT scores and/or college math assessment scores…
Assessing mNIS+7Ionis and international neurologists' proficiency in a familial amyloidotic polyneuropathy trial.

PubMed

Dyck, Peter J; Kincaid, John C; Dyck, P James B; Chaudhry, Vinay; Goyal, Namita A; Alves, Christina; Salhi, Hayet; Wiesman, Janice F; Labeyrie, Celine; Robinson-Papp, Jessica; Cardoso, Márcio; Laura, Matilde; Ruzhansky, Katherine; Cortese, Andrea; Brannagan, Thomas H; Khoury, Julie; Khella, Sami; Waddington-Cruz, Márcia; Ferreira, João; Wang, Annabel K; Pinto, Marcus V; Ayache, Samar S; Benson, Merrill D; Berk, John L; Coelho, Teresa; Polydefkis, Michael; Gorevic, Peter; Adams, David H; Plante-Bordeneuve, Violaine; Whelan, Carol; Merlini, Giampaolo; Heitner, Stephen; Drachman, Brian M; Conceição, Isabel; Klein, Christopher J; Gertz, Morie A; Ackermann, Elizabeth J; Hughes, Steven G; Mauermann, Michelle L; Bergemann, Rito; Lodermeier, Karen A; Davies, Jenny L; Carter, Rickey E; Litchy, William J

2017-11-01

Polyneuropathy signs (Neuropathy Impairment Score, NIS), neurophysiologic tests (m+7Ionis), disability, and health scores were assessed in baseline evaluations of 100 patients entered into an oligonucleotide familial amyloidotic polyneuropathy (FAP) trial. We assessed: (1) Proficiency of grading neurologic signs and correlation with neurophysiologic tests, and (2) clinometric performance of modified NIS+7 neurophysiologic tests (mNIS+7Ionis) and its subscores and correlation with disability and health scores. The mNIS+7Ionis sensitively detected, characterized, and broadly scaled diverse polyneuropathy impairments. Polyneuropathy signs (NIS and subscores) correlated with neurophysiology tests, disability, and health scores. Smart Somatotopic Quantitative Sensation Testing of heat as pain 5 provided a needed measure of small fiber involvement not adequately assessed by other tests. Specially trained neurologists accurately assessed neuropathy signs as compared to referenced neurophysiologic tests. The score, mNIS+7Ionis, broadly detected, characterized, and scaled polyneuropathy abnormality in FAP, which correlated with disability and health scores. Muscle Nerve 56: 901-911, 2017. © 2017 Wiley Periodicals, Inc.
Spanish Multicenter Normative Studies (NEURONORMA Project): norms for Boston naming test and token test.

PubMed

Peña-Casanova, Jordi; Quiñones-Ubeda, Sonia; Gramunt-Fombuena, Nina; Aguilar, Miquel; Casas, Laura; Molinuevo, José Luis; Robles, Alfredo; Rodríguez, Dolores; Barquero, María Sagrario; Antúnez, Carmen; Martínez-Parra, Carlos; Frank-García, Anna; Fernández, Manuel; Molano, Ana; Alfonso, Verónica; Sol, Josep M; Blesa, Rafael

2009-06-01

As part of the Spanish Multicenter Normative Studies (NEURONORMA project), we provide age- and education-adjusted norms for the Boston naming test and Token test. The sample consists of 340 and 348 participants, respectively, who are cognitively normal, community-dwelling, and ranging in age from 50 to 94 years. Tables are provided to convert raw scores to age-adjusted scaled scores. These were further converted into education-adjusted scaled scores by applying regression-based adjustments. Age and education affected the scores of both tests, but sex was found to be unrelated to naming and verbal comprehension efficiency. Our norms should provide clinically useful data for evaluating elderly Spaniards. The normative data presented here were obtained from the same study sample as all the other NEURONORMA norms and the same statistical procedures for data analyses were applied. These co-normed data allow clinicians to compare scores from one test with all tests.
Effect of education and language on baseline concussion screening tests in professional baseball players.

PubMed

Jones, Nathaniel S; Walter, Kevin D; Caplinger, Roger; Wright, Daniel; Raasch, William G; Young, Craig

2014-07-01

The purpose of the present study was to investigate the possible effects of sociocultural influences, specifically pertaining to language and education, on baseline neuropsychological concussion testing as obtained via immediate postconcussion assessment and cognitive testing (ImPACT) of players from a professional baseball team. A retrospective chart review. Baseline testing of a professional baseball organization. Four hundred five professional baseball players. Age, languages spoken, hometown country location (United States/Canada vs overseas), and years of education. The 5 ImPACT composite scores (verbal memory, visual memory, visual motor speed, reaction time, impulse control) and ImPACT total symptom score from the initial baseline testing. The result of t tests revealed significant differences (P < 0.05) when comparing native English to native Spanish speakers in many scores. Even when corrected for education, the significant differences (P < 0.05) remained in some scores. Sociocultural differences may result in differences in computer-based neuropsychological testing scores.
A Note on the Use of the Hiskey-Nebraska Test of Learning Aptitude with Deaf Children.

ERIC Educational Resources Information Center

Watson, Betty U.; Goldgar, David E.

1985-01-01

Comparing distribution of scores on the Hiskey-Nebraska Test of Learning Aptitude (H-NTLA) with those from the Wechsler Performance Scales for 71 hearing impaired Ss revealed a correlation of .85. However, the H-NTLA yielded more Ss with extreme scores. Findings stress the need for caution in interpreting extreme H-NTLA scores. (CL)
Comparison of Lecture-Based Learning vs Discussion-Based Learning in Undergraduate Medical Students.

PubMed

Zhao, Beiqun; Potter, Donald D

2016-01-01

To compare lecture-based learning (LBL) and discussion-based learning (DBL) by assessing immediate and long-term knowledge retention and application of practical knowledge in third- and fourth-year medical students. A prospective, randomized control trial was designed to study the effects of DBL. Medical students were randomly assigned to intervention (DBL) or control (LBL) groups. Both the groups were instructed regarding the management of gastroschisis. The control group received a PowerPoint presentation, whereas the intervention group was guided only by an objectives list and a gastroschisis model. Students were evaluated using a multiple-choice pretest (Pre-Test MC) immediately before the teaching session, a posttest (Post-Test MC) following the session, and a follow-up test (Follow-Up MC) at 3 months. A practical examination (PE), which tested simple skills and management decisions, was administered at the end of the clerkship (Initial PE) and at 3 months after clerkship (Follow-Up PE). Students were also given a self-evaluation immediately following the Post-Test MC to gauge satisfaction and comfort level in the management of gastroschisis. University of Iowa Hospitals and Clinics and the Carver College of Medicine, Iowa City, IA. A total of 49 third- and fourth-year medical students who were enrolled in the general surgery clerkship were eligible for this study. Enrollment into the study was completely voluntary. Of the 49 eligible students, 36 students agreed to participate in the study, and 27 completed the study. Mean scores for the Pre-Test MC, Post-Test MC, and Follow-Up MC were similar between the control and intervention groups. In the control group, the Post-Test MC scores were significantly greater than Pre-Test MC scores (8.92 ± 0.79 vs 4.00 ± 1.04, p < 0.0001), whereas the Follow-Up MC scores were significantly lower than Post-Test MC scores (7.17 ± 1.75 vs 8.92 ± 0.79, p = 0.005). 
In the control group, the Follow-Up MC scores were significantly greater than Pre-Test MC scores (7.17 ± 1.75 vs 4.00 ± 1.04, p < 0.0001). Analysis of variance for all control group MC examinations had a p < 0.0001. In the intervention group, the Post-Test MC scores were significantly greater than Pre-Test MC scores (8.33 ± 1.23 vs 4.60 ± 1.55, p < 0.0001), whereas the Follow-Up MC scores were significantly lower than Post-Test MC scores (7.13 ± 1.77 vs 8.33 ± 1.23, p = 0.04). In the intervention group, the Follow-Up MC scores were significantly greater than Pre-Test MC scores (7.13 ± 1.77 vs 4.60 ± 1.55, p = 0.0002). Analysis of variance for all intervention group MC examinations had a p < 0.0001. Mean scores for the Initial PE were significantly higher for the intervention group compared with the control group's score (7.47 ± 1.68 vs 5.25 ± 2.34, p = 0.008). Mean scores for the Follow-Up PE were significantly higher for the intervention group compared with the control group's score (7.87 ± 1.77 vs 5.83 ± 2.04, p = 0.005). A comparison of Initial PE vs Follow-Up PE was not significant in either group. Students in the intervention group were more comfortable in the immediate management of gastroschisis and placement of a silo and felt that the educational experience was more worthwhile than students in the control group did. After a single instructional session, there was a significant difference in the students' scores between the control and the intervention groups on both administrations of the PEs. There were no significant differences between the 2 groups in any administration of the MC examinations. This seems to suggest that DBL may lead to better practical knowledge and potentially improved long-term knowledge retention when compared with LBL. Students in the DBL group also felt more comfortable with the management of gastroschisis and were more satisfied with the educational session. Copyright © 2015 Association of Program Directors in Surgery. 
Published by Elsevier Inc. All rights reserved.
Role of binding entropy in the refinement of protein-ligand docking predictions: analysis based on the use of 11 scoring functions.

PubMed

Ruvinsky, Anatoly M

2007-06-01

We present results of testing the ability of eleven popular scoring functions to predict native docked positions using a recently developed method (Ruvinsky and Kozintsev, J Comput Chem 2005, 26, 1089) for estimating the entropy contributions of relative motions to protein-ligand binding affinity. The method is based on the integration of the configurational integral over clusters obtained from multiple docked positions. We use a test set of 100 PDB protein-ligand complexes and ensembles of 101 docked positions generated by Wang et al. (J Med Chem 2003, 46, 2287) for each ligand in the test set. To test the suggested method we compared the averaged root-mean-square deviations (RMSD) of the top-scored ligand docked positions, accounting and not accounting for entropy contributions, relative to the experimentally determined positions. We demonstrate that the method increases docking accuracy by 10-21% when used in conjunction with the AutoDock scoring function, by 2-25% with G-Score, by 7-41% with D-Score, by 0-8% with LigScore, by 1-6% with PLP, by 0-12% with LUDI, by 2-8% with F-Score, by 7-29% with ChemScore, by 0-9% with X-Score, by 2-19% with PMF, and by 1-7% with DrugScore. We also compared the performance of the suggested method with the method based on ranking by cluster occupancy only. We analyze how the choice of a clustering-RMSD and a low bound of dense clusters affects the docking accuracy of the scoring methods. We derive optimal intervals of the clustering-RMSD for 11 scoring functions.
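The accuracy measure in this abstract, the RMSD between a docked ligand position and the experimentally determined one, can be sketched as below. The sketch assumes both poses list the same atoms in the same order and share the receptor's coordinate frame, so no superposition is applied; it also ignores ligand symmetry. The coordinates are invented for illustration, and the abstract does not describe the authors' exact procedure.

```python
from math import sqrt


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two matched lists of (x, y, z) atoms.

    Assumes the same atom ordering and the same coordinate frame in both
    lists (no alignment step, no symmetry correction).
    """
    if len(coords_a) != len(coords_b):
        raise ValueError("poses must contain the same number of atoms")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return sqrt(squared / len(coords_a))


# Hypothetical three-atom ligand: experimental pose and one docked pose.
experimental = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (1.5, 1.5, 0.0)]
docked = [(0.5, 0.0, 0.0), (2.0, 0.0, 0.0), (2.0, 1.5, 0.0)]
print(rmsd(experimental, docked))
```

Averaging this quantity over the top-scored pose of every complex in a test set gives the kind of figure that is compared with and without the entropy term.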
The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

PubMed

Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

2017-10-01

Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. 
Our results suggest a statistically significant difference in ImPACT composite scores between individuals who report strenuous exercise prior to baseline testing compared with those who do not. Since return-to-play decision making often involves documentation of return to neurocognitive baseline, the baseline test scores must be valid and accurate. As a result, we recommend standardization of baseline testing such that no strenuous exercise takes place 3 hours prior to test administration.
Monitoring the Performance of Human and Automated Scores for Spoken Responses

ERIC Educational Resources Information Center

Wang, Zhen; Zechner, Klaus; Sun, Yu

2018-01-01

As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, as compared to human raters, across several dimensions, for example, on individual items or based on subgroups of test takers. In addition, there is a need in testing organizations to establish…

The Effects of Item by Item Feedback Given during an Ability Test.

ERIC Educational Resources Information Center

Whetton, C.; Childs, R.

1981-01-01

Answer-until-correct (AUC) is a procedure for providing feedback during a multiple-choice test, giving an increased range of scores. The performance of secondary students on a verbal ability test using AUC procedures was compared with a group using conventional instructions. AUC scores considerably enhanced reliability but not validity.…
Evaluation of an Innovative Digital Assessment Tool in Dental Anatomy.

PubMed

Lam, Matt T; Kwon, So Ran; Qian, Fang; Denehy, Gerald E

2015-05-01

The E4D Compare software is an innovative tool that provides immediate feedback to students' projects and competencies. It should provide consistent scores even when different scanners are used which may have inherent subtle differences in calibration. This study aimed to evaluate potential discrepancies in evaluation using the E4D Compare software based on four different NEVO scanners in dental anatomy projects. Additionally, correlation between digital and visual scores was evaluated. Thirty-five projects of maxillary left central incisors were evaluated. Among these, thirty wax-ups were performed by four operators and five consisted of standard dentoform teeth. Five scores were obtained for each project: one from an instructor who visually graded the project and four from different NEVO scanners. A faculty member involved in teaching the dental anatomy course blindly scored the 35 projects. One operator scanned all projects on four NEVO scanners (D4D Technologies, Richardson, TX, USA). The images were aligned to the gold standard, and tolerance set at 0.3 mm to generate a score. The score reflected percentage match between the project and the gold standard. One-way ANOVA with repeated measures was used to determine whether there was a significant difference in scores among the four NEVO scanners. Paired-sample t-test was used to detect any difference between visual scores and the average scores of the four NEVO scanners. Pearson's correlation test was used to assess the relationship between visual and average scores of NEVO scanners. There was no significant difference in mean scores among four different NEVO scanners [F(3, 102) = 2.27, p = 0.0852; one-way ANOVA with repeated measures]. Moreover, the data provided strong evidence that a significant difference existed between visual and digital scores (p = 0.0217; paired-sample t-test). Mean visual scores were significantly lower than digital scores (72.4 vs 75.1). 
Pearson's correlation coefficient of 0.85 indicated a strong correlation between visual and digital scores (p < 0.0001). The E4D Compare software provides consistent scores even when different scanners are used and correlates well with visual scores. The use of innovative digital assessment tools in dental education is promising with the E4D Compare software correlating well with visual scores and providing consistent scores even when different scanners are used.
Gender determines cortisol and alpha-amylase responses to acute physical and psychosocial stress in patients with borderline personality disorder.

PubMed

Inoue, Ayako; Oshita, Harumi; Maruyama, Yoshihiro; Tanaka, Yoshihiro; Ishitobi, Yoshinobu; Kawano, Aimi; Ikeda, Rie; Ando, Tomoko; Aizawa, Saeko; Masuda, Koji; Higuma, Haruka; Kanehisa, Masayuki; Ninomiya, Taiga; Akiyoshi, Jotaro

2015-07-30

Borderline personality disorder (BPD) is characterized by affective instability, unstable relationships, and identity disturbance. We measured salivary alpha-amylase (sAA) and salivary cortisol levels in all participants during exposure to the Trier Social Stress Test (TSST) and an electric stimulation stress. Seventy-two BPD patients were compared with 377 age- and gender-matched controls. The State and Trait versions of the Spielberger Anxiety Inventory test (STAI-S and STAI-T, respectively), the Profile of Mood State (POMS) tests, the Beck Depression Inventory (BDI), and the Depression and Anxiety Cognition Scale (DACS) were administered to participants before electrical stimulation. Following TSST exposure, salivary cortisol levels significantly decreased in female patients and significantly increased in male patients compared with controls. POMS tension-anxiety, depression-dejection, anger-hostility, fatigue, and confusion scores were significantly increased in BPD patients compared with controls. In contrast, vigor scores were significantly decreased in BPD patients relative to controls. Furthermore, STAI-T and STAI-S anxiety scores and BDI scores were significantly increased in BPD patients compared with controls. DACS scores were significantly increased in BPD patients compared with controls. Different stressors (e.g., psychological or physical) induced different responses in the HPA and SAM systems in female or male BPD patients. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

PubMed

Cecilio-Fernandes, Dario; Medema, Harro; Collares, Carlos Fernando; Schuwirth, Lambert; Cohen-Schotanus, Janke; Tio, René A

2017-11-09

Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods. A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis. The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition. The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.
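The two scoring rules compared in this record can be illustrated with a short sketch. The abstract does not state the penalty used in the progress tests, so the conventional correction-for-guessing rule is assumed here: one point per correct answer, minus 1/(k - 1) per wrong answer on a k-option item, and zero for an omitted or "don't know" response. The function names and the sample response pattern are hypothetical.

```python
from fractions import Fraction


def number_right_score(n_right):
    """Number-right scoring: one point per correct answer, no penalty."""
    return Fraction(n_right)


def formula_score(n_right, n_wrong, n_options):
    """Formula scoring with the conventional guessing penalty (assumed rule).

    Each wrong answer costs 1 / (n_options - 1); omitted items cost nothing.
    """
    if n_options < 2:
        raise ValueError("an item needs at least two response options")
    return Fraction(n_right) - Fraction(n_wrong, n_options - 1)


# Hypothetical examinee: 200 four-option items, 120 right, 60 wrong, 20 omitted.
nr = number_right_score(120)
fs = formula_score(120, 60, 4)
print(float(nr), float(fs))  # 120.0 100.0
```

Under this assumed rule the same response pattern yields 120 points with number-right scoring and 100 points with formula scoring, which shows why the two conditions can reward guessing behavior differently.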
Evaluation of visual acuity measurements after autorefraction vs manual refraction in eyes with and without diabetic macular edema.

PubMed

Sun, Jennifer K; Qin, Haijing; Aiello, Lloyd Paul; Melia, Michele; Beck, Roy W; Andreoli, Christopher M; Edwards, Paul A; Glassman, Adam R; Pavlica, Michael R

2012-04-01

To compare visual acuity (VA) scores after autorefraction vs manual refraction in eyes of patients with diabetes mellitus and a wide range of VAs. The letter score from the Electronic Visual Acuity (EVA) test from the electronic Early Treatment Diabetic Retinopathy Study was measured after autorefraction (AR-EVA score) and after manual refraction (MR-EVA score), which is the research protocol of the Diabetic Retinopathy Clinical Research Network. Testing order was randomized, study participants and VA examiners were masked to refraction source, and a second EVA test using an identical supplemental manual refraction (MR-EVAsuppl score) was performed to determine test-retest variability. In 878 eyes of 456 study participants, the median MR-EVA score was 74 (Snellen equivalent, approximately 20/32). The spherical equivalent was often similar for manual refraction and autorefraction (median difference, 0.00; 5th-95th percentile range, -1.75 to 1.13 diopters). However, on average, the MR-EVA scores were slightly better than the AR-EVA scores, across the entire VA range. Furthermore, the variability between the AR-EVA scores and the MR-EVA scores was substantially greater than the test-retest variability of the MR-EVA scores (P < .001). The variability of differences was highly dependent on the autorefractor model. Across a wide range of VAs at multiple sites using a variety of autorefractors, VA measurements tend to be worse with autorefraction than manual refraction. Differences between individual autorefractor models were identified. However, even among autorefractor models that compare most favorably with manual refraction, VA variability between autorefraction and manual refraction is higher than the test-retest variability of manual refraction. The results suggest that, with current instruments, autorefraction is not an acceptable substitute for manual refraction for most clinical trials with primary outcomes dependent on best-corrected VA.
Comparing the MMPI-2 Scale Scores of Parents Involved in Parental Competency and Child Custody Assessments

ERIC Educational Resources Information Center

Resendes, John; Lecci, Len

2012-01-01

MMPI-2 scores from a parent competency sample (N = 136 parents) are compared with a previously published data set of MMPI-2 scores for child custody litigants (N = 508 parents; Bathurst et al., 1997). Independent samples t tests yielded significant and in some cases substantial differences on the standard MMPI-2 clinical scales (especially Scales…
Rater Comparability Scoring and Equating: Does Choice of Target Population Weights Matter in This Context?

ERIC Educational Resources Information Center

Puhan, Gautam

2013-01-01

When a constructed-response test form is reused, raw scores from the two administrations of the form may not be comparable. The solution to this problem requires a rescoring, at the current administration, of examinee responses from the previous administration. The scores from this "rescoring" can be used as an anchor for equating. In…
Can a smartphone app improve medical trainees' knowledge of antibiotics?

PubMed

Fralick, Michael; Haj, Reem; Hirpara, Dhruvin; Wong, Karen; Muller, Matthew; Matukas, Larissa; Bartlett, John; Leung, Elizabeth; Taggart, Linda

2017-11-30

To determine whether a smartphone app, containing local bacterial resistance patterns (antibiogram) and treatment guidelines, improved knowledge of prescribing antimicrobials among medical trainees. We conducted a prospective, controlled, pre-post study of medical trainees with access to a smartphone app (app group) containing our hospital's antibiogram and treatment guidelines compared to those without access (control group). Participants completed a survey which included a knowledge assessment test (score range, 0 [lowest possible score] to 12 [highest possible score]) at the start of the study and four weeks later. The primary outcome was change in mean knowledge assessment test scores between week 0 and week 4. Change in knowledge assessment test scores in the app group was compared to the change in scores in the control group using multivariable linear regression. Sixty-two residents and senior medical students participated in the study. In a multivariable analysis controlling for sex and prior knowledge, app use was associated with a 1.1 point (95% CI: 0.10, 2.1) [β = 1.08, t(1) = 2.08, p = 0.04] higher change in knowledge score compared to the change in knowledge scores in the control group. Among those in the app group, 88% found it easy to navigate, 85% found it useful, and about one-quarter used it daily. An antibiogram and treatment algorithm app increased knowledge of prescribing antimicrobials in the context of local antibiotic resistance patterns. These findings reinforce the notion that smartphone apps can be a useful and innovative means of delivering medical education.
Clinical score and rapid antigen detection test to guide antibiotic use for sore throats: randomised controlled trial of PRISM (primary care streptococcal management).

PubMed

Little, Paul; Hobbs, F D Richard; Moore, Michael; Mant, David; Williamson, Ian; McNulty, Cliodna; Cheng, Ying Edith; Leydon, Geraldine; McManus, Richard; Kelly, Joanne; Barnett, Jane; Glasziou, Paul; Mullee, Mark

2013-10-10

To determine the effect of clinical scores that predict streptococcal infection or rapid streptococcal antigen detection tests compared with delayed antibiotic prescribing. Open adaptive pragmatic parallel group randomised controlled trial. Primary care in United Kingdom. Patients aged ≥ 3 with acute sore throat. An internet programme randomised patients to targeted antibiotic use according to: delayed antibiotics (the comparator group for analyses), clinical score, or antigen test used according to clinical score. During the trial a preliminary streptococcal score (score 1, n=1129) was replaced by a more consistent score (score 2, n=631; features: fever during previous 24 hours; purulence; attends rapidly (within three days after onset of symptoms); inflamed tonsils; no cough/coryza (acronym FeverPAIN)). Symptom severity reported by patients on a 7 point Likert scale (mean severity of sore throat/difficulty swallowing for days two to four after the consultation (primary outcome)), duration of symptoms, use of antibiotics. For score 1 there were no significant differences between groups. For score 2, symptom severity was documented in 80% (168/207 (81%) in delayed antibiotics group; 168/211 (80%) in clinical score group; 166/213 (78%) in antigen test group). Reported severity of symptoms was lower in the clinical score group (-0.33, 95% confidence interval -0.64 to -0.02; P=0.04), equivalent to one in three rating sore throat a slight versus moderate problem, with a similar reduction for the antigen test group (-0.30, -0.61 to -0.00; P=0.05). Symptoms rated moderately bad or worse resolved significantly faster in the clinical score group (hazard ratio 1.30, 95% confidence interval 1.03 to 1.63) but not the antigen test group (1.11, 0.88 to 1.40). In the delayed antibiotics group, 75/164 (46%) used antibiotics.
Use of antibiotics in the clinical score group (60/161) was 29% lower (adjusted risk ratio 0.71, 95% confidence interval 0.50 to 0.95; P=0.02) and in the antigen test group (58/164) was 27% lower (0.73, 0.52 to 0.98; P=0.03). There were no significant differences in complications or reconsultations. Targeted use of antibiotics for acute sore throat with a clinical score improves reported symptoms and reduces antibiotic use. Antigen tests used according to a clinical score provide similar benefits but with no clear advantages over a clinical score alone. ISRCTN32027234.
Comparative Efficacy of a Soft Toothbrush with Tapered-tip Bristles and an ADA Reference Toothbrush on Established Gingivitis and Supragingival Plaque over a 12-Week Period.

PubMed

Gallob, John; Petrone, Dolores M; Mateo, Luis R; Chaknis, Patricia; Morrison, Boyce M; Williams, Malcolm; Panagakos, Foti

2016-06-01

Evaluation of the efficacy of a soft toothbrush with tapered-tip bristles (Test Toothbrush) and an ADA reference soft toothbrush (ADA Toothbrush) on established gingivitis and supragingival plaque over a 12-week period. This randomized, single-center, examiner-blind, two-cell, parallel clinical research study assessed plaque removal by the comparison of pre- to post-brushing after a single use, and again after six and 12 weeks' use, using the Quigley-Hein Plaque Index, Turesky Modification. The study also assessed gingivitis after six weeks and 12 weeks using the Löe & Silness Gingival Index. Adult male and female subjects from the Central New Jersey, USA area refrained from all oral hygiene procedures for 24 hours. They reported to the study site after refraining from eating, drinking, and smoking for four hours. Subjects had the study procedure explained to them both orally and by written instructions. Subjects then gave written consent to participate before entry into the study. Following an examination for plaque (pre-brushing) and gingivitis (baseline), the subjects were randomized into two balanced groups, each group assigned to one of the two study toothbrushes. Subjects were instructed to brush their teeth for one minute under supervision with their assigned toothbrush and a commercially available fluoride toothpaste (Colgate© Cavity Protection Toothpaste), after which they were again evaluated for plaque (post-brushing). Subjects were dismissed from the study site with their assigned toothbrush and toothpaste, and instructed to brush twice daily at home for the next 12 weeks. The subjects were instructed to brush for one minute during each tooth brushing. The subjects reported to the study site after six weeks and 12 weeks of product use, at which time they were evaluated for plaque and gingivitis. Seventy-one (71) subjects complied with the protocol and completed the clinical study.
Compared to the ADA Toothbrush, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 71.1% in whole mouth plaque index scores, 43.8% in plaque severity index scores, and 81.3% in interproximal sites plaque scores after a single tooth brushing. After six weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 700% in whole mouth gingival index scores, 700% in gingivitis severity index scores, and 400% in interproximal sites gingival scores compared to the ADA Toothbrush. Also after six weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 188.9% in whole mouth plaque index scores, 165% in plaque severity index scores, and 203% in interproximal sites plaque scores compared to the ADA Toothbrush. After 12 weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 266.7% in whole mouth gingival index scores, 300% in gingivitis severity index scores, and 250% in interproximal sites gingival scores compared to the ADA Toothbrush. Also after 12 weeks' use, the Test Toothbrush provided statistically significantly (p < 0.05) greater reductions of 158.1% in whole mouth plaque index scores, 143.5% in plaque severity index scores, and 145.4% in interproximal sites plaque scores compared to the ADA Toothbrush. This study demonstrated that a soft toothbrush with tapered-tip bristles provided a significantly greater reduction in supragingival plaque after a single tooth brushing, as well as after six and 12 weeks of twice-daily use, compared to the ADA Toothbrush. After six and 12 weeks of twice-daily use, it also provided a significantly greater reduction in gingivitis as compared to the ADA Toothbrush.
Science Teacher Efficacy and Outcome Expectancy as Predictors of Students' End-of-Instruction (EOI) Biology I Test Scores

ERIC Educational Resources Information Center

Angle, Julie; Moseley, Christine

2009-01-01

The purpose of this study was to compare teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the statewide End-of-Instruction (EOI) Biology I test met or exceeded the state academic proficiency level (Proficient Group) to teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the…
The Effect of Mobility on Texas Assessment of Knowledge and Skills Test Scores

ERIC Educational Resources Information Center

Alvarez, Ray

2006-01-01

This research studies the effects of mobility on the high-stakes test scores of a Title I South Central Texas school district. The study involved 10, 5th-grade elementary feeder school populations graduating to the 6th grade in 3 middle schools. The researcher compared the 1st administration scores of the Texas Assessment of Knowledge and Skills…
Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

ERIC Educational Resources Information Center

Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill

2014-01-01

The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…
Animated video vs pamphlet: comparing the success of educating parents about proper antibiotic use.

PubMed

Schnellinger, Mark; Finkelstein, Marsha; Thygeson, Megan V; Vander Velden, Heidi; Karpas, Anna; Madhok, Manu

2010-05-01

The objective was to create an animated video to teach parents about the appropriate use of antibiotics and to compare their knowledge to parents who were provided with the American Academy of Pediatrics pamphlet. We hypothesized that the video format would result in improved comprehension and retention. This prospective randomized, controlled trial was conducted in an urban pediatric emergency department. Parent subjects were randomly assigned to a control group, a pamphlet group, and a video group and completed a survey at 3 time points. Analysis included the nonparametric matched Friedman test, Kruskal-Wallis test, and the Mann-Whitney U test. A 2-sided P value of < .05 was required for significance, and a Bonferroni-corrected P value of < .017 was required for paired comparisons. Postintervention survey scores improved significantly in the pamphlet and video groups compared with baseline. The video group's follow-up scores were not significantly different from the postintervention-survey scores (P = .32). The pamphlet-group scores at follow-up were significantly lower than the postintervention-survey scores (P = .002). The control group's scores were similar at all 3 time periods. The pamphlet group had significantly better scores than the control group after the intervention (P < .001). The video-group scores exceeded the control-group scores at all 3 time periods. An animated video is highly effective for educating parents about the appropriate use of antibiotics in the emergency department setting and results in long-term knowledge retention. The results of this study provide a foundation to further evaluate the use of animated video in additional populations.
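The corrected threshold quoted in this record can be reproduced with a small worked example. The abstract does not say how many paired comparisons were corrected for; the value of .017 is consistent with dividing .05 by three pairwise comparisons among the three groups, and that reading is an assumption. The helper names below are hypothetical.

```python
def bonferroni_threshold(alpha, n_comparisons):
    """Per-comparison significance threshold under a Bonferroni correction."""
    if n_comparisons < 1:
        raise ValueError("need at least one comparison")
    return alpha / n_comparisons


def is_significant(p_value, alpha=0.05, n_comparisons=3):
    """True if p_value falls below the Bonferroni-corrected threshold."""
    return p_value < bonferroni_threshold(alpha, n_comparisons)


# Assumed: three pairwise comparisons at an overall alpha of .05.
threshold = bonferroni_threshold(0.05, 3)
print(round(threshold, 3))    # 0.017
print(is_significant(0.002))  # True  (pamphlet group, follow-up vs postintervention)
print(is_significant(0.32))   # False (video group, follow-up vs postintervention)
```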
Comparing the MMPI-2 scale scores of parents involved in parental competency and child custody assessments.

PubMed

Resendes, John; Lecci, Len

2012-12-01

MMPI-2 scores from a parent competency sample (N = 136 parents) are compared with a previously published data set of MMPI-2 scores for child custody litigants (N = 508 parents; Bathurst et al., 1997). Independent samples t tests yielded significant and in some cases substantial differences on the standard MMPI-2 clinical scales (especially Scales 4, 8, 2, and 0), with the competency sample obtaining higher clinical scores as well as higher scores on F, FB, VRIN, TRIN, and L, but lower scores on K, relative to the custody sample. Despite the higher scores in the competency sample, MMPI-2 mean scores did not exceed the clinical cutoff (T > 65). Moreover, the present competency sample essentially replicates the MMPI-2 scores of a previously published competency sample, suggesting that the present findings are representative of that population. The present findings suggest that separate reference groups be used when conducting child custody vs. parental competency evaluations, as these appear to be distinct populations despite there being similarities in the testing circumstances.
Major bleeding and intracranial hemorrhage risk prediction in patients with atrial fibrillation: Attention to modifiable bleeding risk factors or use of a bleeding risk stratification score? A nationwide cohort study.

PubMed

Chao, Tze-Fan; Lip, Gregory Y H; Lin, Yenn-Jiang; Chang, Shih-Lin; Lo, Li-Wei; Hu, Yu-Feng; Tuan, Ta-Chuan; Liao, Jo-Nan; Chung, Fa-Po; Chen, Tzeng-Ji; Chen, Shih-Ann

2018-03-01

While modifiable bleeding risks should be addressed in all patients with atrial fibrillation (AF), use of a bleeding risk score enables clinicians to 'flag up' those at risk of bleeding for more regular patient contact reviews. We compared a risk assessment strategy for major bleeding and intracranial hemorrhage (ICH) based on modifiable bleeding risk factors (referred to as a 'MBR factors' score) against established bleeding risk stratification scores (HEMORR2HAGES, HAS-BLED, ATRIA, ORBIT). A nationwide cohort study of 40,450 AF patients who received warfarin for stroke prevention was performed. The clinical endpoints included ICH and major bleeding. Bleeding scores were compared using receiver operating characteristic (ROC) curves (areas under the ROC curves [AUCs], or c-index) and the net reclassification index (NRI). During a follow-up of 4.60 ± 3.62 years, 1581 (3.91%) patients sustained ICH and 6889 (17.03%) patients sustained major bleeding events. All tested bleeding risk scores at baseline were higher in those sustaining major bleeds. When compared to no ICH, patients sustaining ICH had higher baseline HEMORR2HAGES (p=0.003), HAS-BLED (p<0.001) and MBR factors score (p=0.013) but not ATRIA and ORBIT scores. When HAS-BLED was compared to other bleeding scores, its c-index was significantly higher than those of the MBR factors (p<0.001) and ORBIT (p=0.05) scores for major bleeding. The c-index for the MBR factors score was significantly lower than those of all other scores (DeLong test, all p<0.001). When NRI was performed, HAS-BLED outperformed all other bleeding risk scores for major bleeding (all p<0.001). C-indexes for ATRIA and ORBIT scores suggested no significant prediction for ICH. All contemporary bleeding risk scores had modest predictive value for predicting major bleeding but the best predictive value and NRI was found for the HAS-BLED score.
Simply depending on modifiable bleeding risk factors had suboptimal predictive value for the prediction of major bleeding in AF patients, when compared to the HAS-BLED score. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
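For readers unfamiliar with the c-index used to compare the bleeding scores in this record, the following is a minimal sketch of the statistic for a binary outcome: the proportion of (event, non-event) patient pairs in which the patient with the event had the higher score, with ties counted as one half. It ignores follow-up time and censoring, does not implement the DeLong test or the NRI, and is not the study's analysis code. The function name and the toy data are hypothetical.

```python
def c_index(scores, events):
    """Concordance (area under the ROC curve) for a binary outcome.

    scores: risk score per patient; events: 1 if the outcome occurred, else 0.
    """
    cases = [s for s, e in zip(scores, events) if e]
    controls = [s for s, e in zip(scores, events) if not e]
    if not cases or not controls:
        raise ValueError("need at least one event and one non-event")
    concordant = 0.0
    for case_score in cases:
        for control_score in controls:
            if case_score > control_score:
                concordant += 1.0   # event patient ranked higher: concordant pair
            elif case_score == control_score:
                concordant += 0.5   # tie: counted as half
    return concordant / (len(cases) * len(controls))


# Toy data (hypothetical): six patients, three of whom had a bleeding event.
toy_scores = [1, 2, 2, 3, 4, 5]
toy_events = [0, 0, 1, 0, 1, 1]
print(round(c_index(toy_scores, toy_events), 3))  # 0.833
```

A value of 0.5 corresponds to no discrimination and 1.0 to perfect separation of patients with and without the event.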
Relationship of Attention Deficit Hyperactivity Disorder and Postconcussion Recovery in Youth Athletes.

PubMed

Mautner, Kenneth; Sussman, Walter I; Axtman, Matthew; Al-Farsi, Yahya; Al-Adawi, Samir

2015-07-01

To investigate whether attention deficit hyperactivity disorder (ADHD) influences postconcussion recovery, as measured by computerized neurocognitive testing. This is a retrospective case control study. Computer laboratories across 10 high schools in the greater Atlanta, Georgia area. Immediate postconcussion assessment and cognitive testing (ImPACT) scores of 70 athletes with a self-reported diagnosis of ADHD and who sustained a sport-related concussion were compared with a randomly selected age-matched control group. Immediate postconcussion assessment and cognitive testing scores over a 5-year interval were reviewed for inclusion. Postconcussion recovery was defined as a return to equivalent baseline neurocognitive score on the ImPACT battery, and a concussion symptom score of ≤7. Athletes with ADHD had on average a longer time to recovery when compared with the control group (16.5 days compared with 13.5 days), although not statistically significant. The number of previous concussions did not have any effect on the rate of recovery in the ADHD or the control group. In addition, baseline neurocognitive testing did not statistically differ between the 2 groups, except in verbal memory. Although not statistically significant, youth athletes with ADHD took on average 3 days longer to return to baseline neurocognitive testing compared with a control group without ADHD. Youth athletes with ADHD may have a marginally prolonged recovery as indexed by neurocognitive testing and should be considered when prognosticating time to recovery in this subset of student athletes.
The Score-Boosting Game.

ERIC Educational Resources Information Center

Popham, W. James

2000-01-01

Teachers everywhere are playing the score-boosting game to raise scores on mandated standardized achievement tests, although five nationally recognized assessments compare student performance instead of measuring classroom learning. Since curriculum standards are often vague and misaligned with assessments, teachers sprinkle instruction with…
The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

ERIC Educational Resources Information Center

Retnawati, Heri

2015-01-01

This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Difficulties Using Standardized Tests to Identify the Receptive Expressive Gap in Bilingual Children's Vocabularies.

PubMed

Gibson, Todd A; Oller, D Kimbrough; Jarmulowicz, Linda

2018-03-01

Receptive standardized vocabulary scores have been found to be much higher than expressive standardized vocabulary scores in children with Spanish as L1, learning L2 (English) in school (Gibson et al., 2012). Here we present evidence suggesting the receptive-expressive gap may be harder to evaluate than previously thought because widely used standardized tests may not offer comparable normed scores. Furthermore, monolingual Spanish-speaking children tested in Mexico and monolingual English-speaking children in the US showed other, yet different, statistically significant discrepancies between receptive and expressive scores. Results suggest comparisons across widely used standardized tests in attempts to assess a receptive-expressive gap are precarious.

Collaborative Testing Improves Performance but Not Content Retention in a Large-Enrollment Introductory Biology Class

PubMed Central

Leight, Hayley; Saunders, Cheston; Calkins, Robin; Withers, Michelle

2012-01-01

Collaborative testing has been shown to improve performance but not always content retention. In this study, we investigated whether collaborative testing could improve both performance and content retention in a large, introductory biology course. Students were semirandomly divided into two groups based on their performances on exam 1. Each group contained equal numbers of students scoring in each grade category (“A”–“F”) on exam 1. All students completed each of the four exams of the semester as individuals. For exam 2, one group took the exam a second time in small groups immediately following the individually administered test. The other group followed this same format for exam 3. Individual and group exam scores were compared to determine differences in performance. All but exam 1 contained a subset of cumulative questions from the previous exam. Performances on the cumulative questions for exams 3 and 4 were compared for the two groups to determine whether there were significant differences in content retention. Even though group test scores were significantly higher than individual test scores, students who participated in collaborative testing performed no differently on cumulative questions than students who took the previous exam as individuals. PMID:23222835
Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

ERIC Educational Resources Information Center

Liu, Yuming; Schulz, E. Matthew; Yu, Lei

2008-01-01

A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…
Comparison of Standardized Test Scores from Traditional Classrooms and Those Using Problem-Based Learning

ERIC Educational Resources Information Center

Needham, Martha Elaine

2010-01-01

This research compares differences between standardized test scores in problem-based learning (PBL) classrooms and a traditional classroom for 6th grade students using a mixed-method, quasi-experimental and qualitative design. The research shows that problem-based learning is as effective as traditional teaching methods on standardized tests. The…
The Effects of Using Selected Metacognitive Strategies on ACT Mathematics Sub-Test Scores

ERIC Educational Resources Information Center

LeMay, Jeffrey W.

2016-01-01

This quasi-experimental post-test only control group designed quantitative study examined whether or not members of an experimental group of participants who utilized two metacognitive strategy training regimens experienced a significant increase in their ACT mathematics sub-test scores compared to a group of students who did not utilize either of…
Comparing Latent Distributions.

ERIC Educational Resources Information Center

Andersen, Erling B.

1980-01-01

The problem of comparing the latent abilities of groups of individuals (as opposed to their observable test scores) is considered. Tests of equality of means, variances, and longitudinal applications are discussed. (JKS)
Academic performance in adolescence after inguinal hernia repair in infancy: a nationwide cohort study.

PubMed

Hansen, Tom G; Pedersen, Jacob K; Henneberg, Steen W; Pedersen, Dorthe A; Murray, Jeffrey C; Morton, Neil S; Christensen, Kaare

2011-05-01

Although animal studies have indicated that general anesthetics may result in widespread apoptotic neurodegeneration and neurocognitive impairment in the developing brain, results from human studies are scarce. We investigated the association between exposure to surgery and anesthesia for inguinal hernia repair in infancy and subsequent academic performance. Using Danish birth cohorts from 1986-1990, we compared the academic performance of all children who had undergone inguinal hernia repair in infancy to a randomly selected, age-matched 5% population sample. Primary analysis compared average test scores at ninth grade adjusting for sex, birth weight, and paternal and maternal age and education. Secondary analysis compared the proportions of children not attaining test scores between the two groups. From 1986-1990 in Denmark, 2,689 children underwent inguinal hernia repair in infancy. A randomly selected, age-matched 5% population sample consists of 14,575 individuals. Although the exposure group performed worse than the control group (average score 0.26 lower; 95% CI, 0.21-0.31), after adjusting for known confounders, no statistically significant difference (-0.04; 95% CI, -0.09 to 0.01) between the exposure and control groups could be demonstrated. However, the odds ratio for test score nonattainment associated with inguinal hernia repair was 1.18 (95% CI, 1.04-1.35). Excluding from analyses children with other congenital malformations, the difference in mean test scores remained nearly unchanged (0.05; 95% CI, 0.00-0.11). In addition, the increased proportion of test score nonattainment within the exposure group was attenuated (odds ratio = 1.13; 95% CI, 0.98-1.31). In the ethnically and socioeconomically homogeneous Danish population, we found no evidence that a single, relatively brief anesthetic exposure in connection with hernia repair in infancy reduced academic performance at age 15 or 16 yr after adjusting for known confounding factors. 
However, the higher test score nonattainment rate among the hernia group could suggest that a subgroup of these children are developmentally disadvantaged compared with the background population.
A comparison of WISC-IV and SB-5 intelligence scores in adolescents with autism spectrum disorder.

PubMed

Baum, Katherine T; Shear, Paula K; Howe, Steven R; Bishop, Somer L

2015-08-01

In autism spectrum disorders, results of cognitive testing inform clinical care, theories of neurodevelopment, and research design. The Wechsler Intelligence Scale for Children and the Stanford-Binet are commonly used in autism spectrum disorder evaluations and scores from these tests have been shown to be highly correlated in typically developing populations. However, they have not been compared in individuals with autism spectrum disorder, whose core symptoms can make testing challenging, potentially compromising test reliability. We used a within-subjects research design to evaluate the convergent validity between the Wechsler Intelligence Scale for Children, 4th ed., and Stanford-Binet, 5th ed., in 40 youth (ages 10-16 years) with autism spectrum disorder. Corresponding intelligence scores were highly correlated (r = 0.78 to 0.88), but full-scale intelligence quotient (IQ) scores (t(38) = -2.27, p = 0.03, d = -0.16) and verbal IQ scores (t(36) = 2.23, p = 0.03; d = 0.19) differed between the two tests. Most participants obtained higher full-scale IQ scores on the Stanford-Binet, 5th ed., compared to Wechsler Intelligence Scale for Children, 4th ed., with 14% scoring more than one standard deviation higher. In contrast, verbal indices were higher on the Wechsler Intelligence Scale for Children, 4th ed. Verbal-nonverbal discrepancy classifications were only consistent for 60% of the sample. Comparisons of IQ test scores in autism spectrum disorder and other special groups are important, as it cannot necessarily be assumed that convergent validity findings in typically developing children and adolescents hold true across all pediatric populations. © The Author(s) 2014.
Testing for Bias against Female Test Takers of the Graduate Management Admissions Test and Potential Impact on Admissions to Graduate Programs in Business.

ERIC Educational Resources Information Center

Wright, Robert E.; Bachrach, Daniel G.

2003-01-01

Graduate Management Admission Test (GMAT) scores and grade point average in graduate core courses were compared for 190 male and 144 female business administration students. No significant differences in course performance were found, but males had been admitted with significantly higher GMAT scores, suggesting a bias against women. (Contains 27…
Comparing PETS and GEPT in China and Taiwan

ERIC Educational Resources Information Center

Wu, Mei

2012-01-01

This paper compares the Public English Test System (PETS) administered in mainland China and the General English Proficiency Test (GEPT) administered in Taiwan, from the aspects of test levels, test contents and scoring weight. Compared with the PETS, the GEPT is found to value the English productive skills more, and have a greater ability to…
Childhood overweight and academic performance: national study of kindergartners and first-graders.

PubMed

Datar, Ashlesha; Sturm, Roland; Magnabosco, Jennifer L

2004-01-01

To examine the association between children's overweight status in kindergarten and their academic achievement in kindergarten and first grade. The data analyzed consisted of 11,192 first time kindergartners from the Early Childhood Longitudinal Study, a nationally representative sample of kindergartners in the U.S. in 1998. Multivariate regression techniques were used to estimate the independent association of overweight status with children's math and reading standardized test scores in kindergarten and grade 1. We controlled for socioeconomic status, parent-child interaction, birth weight, physical activity, and television watching. Overweight children had significantly lower math and reading test scores compared with non-overweight children in kindergarten. Both groups were gaining similarly on math and reading test scores, resulting in significantly lower test scores among overweight children at the end of grade 1. However, these differences, except for boys' math scores at baseline (difference = 1.22 points, p = 0.001), became insignificant after including socioeconomic and behavioral variables, indicating that overweight is a marker but not a causal factor. Race/ethnicity and mother's education were stronger predictors of test score gains or levels than overweight status. Significant differences in test scores by overweight status at the beginning of kindergarten and the end of grade 1 can be explained by other individual characteristics, including parental education and the home environment. However, overweight is more easily observable by other students compared with socioeconomic characteristics, and its significant (unadjusted) association with worse academic performance can contribute to the stigma of overweight as early as the first years of elementary school.
Online pre-race education improves test scores for volunteers at a marathon.

PubMed

Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

2017-09-01

This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5, df = 53, p < 0.001). While subjects with all levels of volunteer experience demonstrated improvement, only change among first time marathon volunteers was significantly different from the others. Subjects reporting all degree/certification levels demonstrated improvement, but no difference in improvement was found between degree/certification levels. In this follow-up to the previous year's pilot study, online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.
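The post-pre comparison reported above is a paired test, so its t statistic can be approximated from the summary figures alone. The sketch below assumes that "3.8 ± 2.7" is the mean and standard deviation of the 54 paired differences (the abstract does not say so explicitly); the function name is illustrative.

```python
import math


def paired_t_from_summary(mean_diff, sd_diff, n):
    """Return (t, df) for a paired comparison given summary statistics.

    mean_diff and sd_diff are the mean and standard deviation of the
    within-subject differences; n is the number of pairs.
    """
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error, n - 1


# Figures quoted in the abstract; treating 2.7 as the SD of the
# differences is an assumption.
t_stat, df = paired_t_from_summary(mean_diff=3.8, sd_diff=2.7, n=54)
print(f"t = {t_stat:.2f}, df = {df}")
```

With these rounded inputs the statistic comes out near 10.3, slightly below the reported 10.5, which is consistent with the abstract's mean and SD having been rounded.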
Quality indicators to compare accredited independent pharmacies and accredited chain pharmacies in Thailand.

PubMed

Arkaravichien, Wiwat; Wongpratat, Apichaya; Lertsinudom, Sunee

2016-08-01

Background Quality indicators determine the quality of actual practice in reference to standard criteria. The Community Pharmacy Association (Thailand), with technical support from the International Pharmaceutical Federation, developed a tool for quality assessment and quality improvement at community pharmacies. This tool has passed validity and reliability tests, but has not yet had feasibility testing. Objective (1) To test whether this quality tool could be used in routine settings. (2) To compare quality scores between accredited independent and accredited chain pharmacies. Setting Accredited independent pharmacies and accredited chain pharmacies in the north eastern region of Thailand. Methods A cross sectional study was conducted in 34 accredited independent pharmacies and accredited chain pharmacies. Quality scores were assessed by observation and by interviewing the responsible pharmacists. Data were collected and analyzed by independent t-test and Mann-Whitney U test as appropriate. Results were plotted by histogram and spider chart. Main outcome measure Domain's assessable scores, possible maximum scores, mean and median of measured scores. Results Domain's assessable scores were close to domain's possible maximum scores. This meant that most indicators could be assessed in most pharmacies. The spider chart revealed that measured scores in the personnel, drug inventory and stocking, and patient satisfaction and health promotion domains of chain pharmacies were significantly higher than those of independent pharmacies (p < 0.05). There was no statistical difference between independent pharmacies and chain pharmacies in the premise and facility or dispensing and patient care domains. Conclusion Quality indicators developed by the Community Pharmacy Association (Thailand) could be used to assess quality of practice in pharmacies in routine settings. It is revealed that the quality scores of chain pharmacies were higher than those of independent pharmacies.
Asymptomatic population reference values for three knee patient-reported outcomes measures: evaluation of an electronic data collection system and implications for future international, multi-centre cohort studies.

PubMed

McLean, James M; Brumby-Rendell, Oscar; Lisle, Ryan; Brazier, Jacob; Dunn, Kieran; Gill, Tiffany; Hill, Catherine L; Mandziak, Daniel; Leith, Jordan

2018-05-01

The aim was to assess whether the Knee Society Score, Oxford Knee Score (OKS) and Knee Injury and Osteoarthritis Outcome Score (KOOS) were comparable in asymptomatic, healthy individuals of different age, gender and ethnicity, across two remote continents. The purpose of this study was to establish normal population values for these scores using an electronic data collection system. There is no difference in clinical knee scores in an asymptomatic population when comparing age, gender and ethnicity, across two remote continents. 312 Australian and 314 Canadian citizens, aged 18-94 years, with no active knee pain, injury or pathology in the ipsilateral knee corresponding to their dominant arm, were evaluated. A knee examination was performed and participants completed an electronically administered questionnaire covering the subjective components of the knee scores. The cohorts were age- and gender-matched. Chi-square tests, Fisher's exact test and Poisson regression models were used where appropriate, to investigate the association between knee scores, age, gender, ethnicity and nationality. There was a significant inverse relationship between age and all assessment tools. OKS recorded a significant difference between genders, with females scoring 1% lower on average. There was no significant difference between international cohorts when comparing all assessment tools. An electronic, multi-centre data collection system can be effectively utilized to assess remote international cohorts. Differences in gender, age, ethnicity and nationality should be taken into consideration when using knee scores to compare to pathological patient scores. This study has established an electronic, normal control group for future studies using the Knee Society, Oxford, and KOOS knee scores. Diagnostic Level II.
Impact of a standardized test package on exit examination scores and NCLEX-RN outcomes.

PubMed

Homard, Catherine M

2013-03-01

The purpose of this ex post facto correlational study was to compare exit examination scores and NCLEX-RN® pass rates of baccalaureate nursing students who differed in level of participation in a standardized test package. Three cohort groups emerged as a standardized test package was introduced: (a) students who did not participate in a standardized test package; (b) students with two semesters of a standardized test package; and (c) students with four semesters of a standardized test package. Benner's novice-to-expert theory framed the study in the belief that students best acquire knowledge and skills through practice and reflection. Students participating in four semesters of a standardized test package demonstrated higher exit examination scores and NCLEX-RN pass rates compared with students who did not participate in this package. This study's results could inform nurse educators about strategies to facilitate nursing student success on exit examinations and the NCLEX-RN. Copyright 2013, SLACK Incorporated.
Prediction of success in FAA air traffic control field training as a function of selection and screening test performance.

DOT National Transportation Integrated Search

1989-05-01

This study compared correlations between Office of Personnel Management (OPM) selection test scores for Air Traffic Control Specialists (ATCSs) and scores from the FAA Academy's second-stage screening program with measures of field training performan...
Comparison of National Board of Chiropractic Examiners part I examination scores between tutors and tutees at a chiropractic college

PubMed Central

Kenya, Amilliah W.; Hart, John F.; Vuyiya, Charles K.

2016-01-01

Objective: This study compared National Board of Chiropractic Examiners part I test scores between students who did and did not serve as tutors on the subject matter. Methods: Students who had a prior grade point average of 3.45 or above on a 4.0 scale just before taking part I of the board exams were eligible to participate. A 2-sample t-test was used to ascertain the difference in the mean scores on part I between the tutor group (n = 28) and nontutor (n = 29) group. Results: Scores were higher in all subjects for the tutor group compared to the nontutor group and the differences were statistically significant (p < .01) with large effect sizes. Conclusion: The tutors in this study performed better on part I of the board examination compared to nontutors, suggesting that tutoring results in an academic benefit for tutors themselves. PMID:26998665
Reliability of the Community Balance and Mobility Scale (CB&M) in high-functioning school-aged children and adolescents who have an acquired brain injury.

PubMed

Wright, F Virginia; Ryan, Jennifer; Brewer, Kelly

2010-01-01

To examine inter-rater, intra-rater and test-re-test reliability of the Community Balance and Mobility Scale (CB&M) and compare reliability in live vs videotape rating contexts for children with acquired brain injury (ABI). Repeated measures design. Seven physiotherapists (PTs) were trained as assessors. The primary assessor administered and scored baseline CB&M while the second assessor observed and scored independently (inter-rater reliability). Re-assessment occurred 3-10 days later by primary assessor (test-re-test reliability). Assessments were videotaped. There were 32 participants with ABI (mean age = 14 years 1 month (SD = 2 years 1 month)). Baseline mean scores were 67.4% (18.2) and 66.7% (18.3) for primary and second assessor, respectively. Primary assessors' re-test mean score was 69.3%. Inter-rater reliability ICC was 0.93 (95% confidence interval (CI) = 0.87-0.97). Test-re-test ICC was 0.90 (95%CI = 0.81-0.95) and Bland-Altman plot indicated greatest test-re-test differences for mid-range CB&M scores. Minimum detectable change (MDC₉₀) was 13.5% points. The CB&M showed excellent reliability in youth. Reliability was comparable for live and videotape rating approaches, meaning that the easier and less expensive live-rating can be recommended. Future work should focus on evaluation of responsiveness to change in rehabilitation centre and community intervention contexts.
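The abstract reports the MDC₉₀ without showing how it was derived. One common convention computes it from the standard error of measurement, SEM = SD × sqrt(1 − ICC), as MDC₉₀ = 1.645 × sqrt(2) × SEM. The sketch below assumes that convention and plugs in the baseline SD (18.2) and test-re-test ICC (0.90) quoted above; whether the authors used exactly these inputs is an assumption.

```python
import math


def minimal_detectable_change(sd, icc, z=1.645):
    """Minimal detectable change from a score SD and a reliability ICC.

    Uses the conventional formula MDC = z * sqrt(2) * SEM, where
    SEM = sd * sqrt(1 - icc). z = 1.645 corresponds to 90% confidence.
    """
    sem = sd * math.sqrt(1.0 - icc)
    return z * math.sqrt(2.0) * sem


# Inputs taken from the abstract; pairing them this way is an assumption.
mdc90 = minimal_detectable_change(sd=18.2, icc=0.90)
print(f"MDC90 = {mdc90:.1f} percentage points")
```

Under these assumptions the result is about 13.4 percentage points, close to the 13.5 reported; the small gap is plausibly due to rounding of the published SD and ICC.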
Descriptive Statistics for Modern Test Score Distributions: Skewness, Kurtosis, Discreteness, and Ceiling Effects.

PubMed

Ho, Andrew D; Yu, Carol C

2015-06-01

Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micerri similarly showed that the normality assumption is met rarely in educational and psychological practice. In this article, the authors extend these previous analyses to state-level educational test score distributions that are an increasingly common target of high-stakes analysis and interpretation. Among 504 scale-score and raw-score distributions from state testing programs from recent years, nonnormal distributions are common and are often associated with particular state programs. The authors explain how scaling procedures from item response theory lead to nonnormal distributions as well as unusual patterns of discreteness. The authors recommend that distributional descriptive statistics be calculated routinely to inform model selection for large-scale test score data, and they illustrate consequences of nonnormality using sensitivity studies that compare baseline results to those from normalized score scales.
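As a rough illustration of the descriptive statistics named in the title, the sketch below computes moment-based skewness, excess kurtosis, and the share of examinees at the maximum score. The population-moment formulas and the tiny score list are assumptions made for illustration; the abstract does not specify the authors' estimators or data.

```python
from statistics import fmean


def skewness_and_excess_kurtosis(scores):
    """Moment-based skewness and excess kurtosis (population formulas)."""
    n = len(scores)
    mean = fmean(scores)
    m2 = sum((x - mean) ** 2 for x in scores) / n
    m3 = sum((x - mean) ** 3 for x in scores) / n
    m4 = sum((x - mean) ** 4 for x in scores) / n
    return m3 / m2 ** 1.5, m4 / m2 ** 2 - 3.0


def ceiling_share(scores, max_score):
    """Proportion of scores sitting at the maximum obtainable score."""
    return sum(1 for x in scores if x == max_score) / len(scores)


# Hypothetical scores on a 50-point test with a pile-up at the ceiling.
hypothetical_scores = [30, 38, 42, 45, 47, 48, 50, 50, 50, 50]
skew, kurt = skewness_and_excess_kurtosis(hypothetical_scores)
print(f"skewness = {skew:.2f}, excess kurtosis = {kurt:.2f}")
print(f"share at ceiling = {ceiling_share(hypothetical_scores, 50):.0%}")
```

A ceiling effect of this kind typically shows up as negative skewness together with a visible spike at the top score.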
A comparison of low IQ scores from the Reynolds Intellectual Assessment Scales and the Wechsler Adult Intelligence Scale-Third Edition.

PubMed

Umphress, Thomas B

2008-06-01

Twenty people with suspected intellectual disability took the Reynolds Intellectual Assessment Scales (RIAS; C. R. Reynolds & R. W. Kamphaus, 1998) and the Wechsler Adult Intelligence Scale-3rd Edition (WAIS-III; D. Wechsler, 1997) to see if the 2 IQ tests produced comparable results. A t test showed that the RIAS Composite Intelligence Index scores were significantly higher than WAIS-III Full Scale IQ scores at the alpha level of .01. There was a significant difference between the RIAS Nonverbal Intelligence and WAIS-III Performance Scale, but there was no significant difference between the RIAS Verbal Intelligence Index and the WAIS-III Verbal Scale IQ. The results raise questions concerning test selection for diagnosing intellectual disability and the use of the correlation statistic for comparing intelligence tests.
A Corresponding Study of Water Quality Evaluation of the Pasquotank Watershed in Northeastern North Carolina

NASA Astrophysics Data System (ADS)

Stevenson, J.; Walthall, S.; McKenzie, R.; Dixon, R.

2015-12-01

The Pasquotank River Watershed covers 450 sq miles in the Coastal Plain of NE North Carolina. It flows from the Great Dismal Swamp at the VA/NC border into the Albemarle Sound. The watershed provides a transition between spawning grounds and waters of the Albemarle Sound. Forested swamp wetlands border much of the waterways. Increased agricultural and urban development has greatly affected water quality during recent years. Tests were completed along the tributaries and the river itself, adding to the previous data from 2011, 2013, and 2014. Streams tested were the Newbegun Creek, Knobbs Creek, Areneuse Creek, Mill Dam Creek, and Sawyers Creek. These streams cover a large area of the watershed and provide a wide variety of shore development from swampland and farmland to industrial development. Samples were tested for pH, salinity, total dissolved solids, and conductivity. Air/water temperature, dissolved oxygen, wind speed/direction, and turbidity/clarity measurements were taken in the field. The results were placed into an online database and correlated to the location of the sample using Google Maps®. Analysis tools were developed to compare the data from all years. Excel spreadsheets were developed to look more closely at individual points and tests for each point. This database was connected to a data visualization page utilizing Google Maps®. The results show variations for the individual water quality scores, but the overall water quality score for all the tested water sources remained at a comparable level from previous years. Mill Dam Creek rose above the previous three scores of 48 (2011), 47 (2013), and 49 (2014) and achieved a medium water quality score of 57. Areneuse Creek improved in water quality with a medium water quality score of 60. Sawyers Creek became the lowest scoring waterway tested at 35. Knobbs Creek decreased from previous years with a water quality score of 42.
For a fourth consecutive testing year, Newbegun Creek fell within the medium water quality range with a score of 65. Pasquotank River rose from the previous testing year's score of 35 but still remained within the bad water quality range with a score of 45. The Lower Pasquotank remained the highest scoring tributary for a second consecutive year with a score of 85. Team included authors plus Ricky Dixon and Raveen McKenzie of MVSU.

Local Linear Observed-Score Equating

ERIC Educational Resources Information Center

Wiberg, Marie; van der Linden, Wim J.

2011-01-01

Two methods of local linear observed-score equating for use with anchor-test and single-group designs are introduced. In an empirical study, the two methods were compared with the current traditional linear methods for observed-score equating. As a criterion, the bias in the equated scores relative to true equating based on Lord's (1980)…
Predicting dementia using socio-demographic characteristics and the Free and Cued Selective Reminding Test in the general population.

PubMed

Mura, Thibault; Baramova, Marieta; Gabelle, Audrey; Artero, Sylvaine; Dartigues, Jean-François; Amieva, Hélène; Berr, Claudine

2017-03-23

Our study aimed to determine whether the consideration of socio-demographic features improves the prediction of Alzheimer's dementia (AD) at 5 years when using the Free and Cued Selective Reminding Test (FCSRT) in the general older population. Our analyses focused on 2558 subjects from the prospective Three-City Study, a cohort of community-dwelling individuals aged 65 years and over, with FCSRT scores. Four "residual scores" and "risk scores" were built that included the FCSRT scores and socio-demographic variables. The predictive performance of crude, residual and risk scores was analyzed by comparing the areas under the ROC curve (AUC). In total, 1750 subjects were seen 5 years after completing the FCSRT. AD was diagnosed in 116 of them. Compared with the crude free-recall score, the predictive performances of the residual score and of the risk score were not significantly improved (AUC: 0.83 vs 0.82 and 0.88 vs 0.89 respectively). Using socio-demographic features in addition to the FCSRT does not improve its predictive performance for dementia or AD.
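The comparison above rests on the area under the ROC curve. A minimal sketch of how an AUC can be computed is given below, using its rank interpretation: the probability that a randomly chosen case receives a higher risk score than a randomly chosen non-case, with ties counted as one half. The risk scores are hypothetical, and the sketch assumes higher values mean higher risk; a recall score, where lower values indicate higher risk, would need to be negated first.

```python
def auc(case_scores, noncase_scores):
    """Area under the ROC curve via pairwise comparison of scores.

    Counts the fraction of (case, non-case) pairs in which the case has
    the higher score, with ties counted as one half.
    """
    wins = sum(
        1.0 if c > n else 0.5 if c == n else 0.0
        for c in case_scores
        for n in noncase_scores
    )
    return wins / (len(case_scores) * len(noncase_scores))


# Hypothetical risk scores, not taken from the study.
cases = [0.9, 0.7, 0.6]
noncases = [0.8, 0.4, 0.3, 0.2]
print(f"AUC = {auc(cases, noncases):.3f}")
```

The pairwise form is quadratic in sample size, which is fine for illustration; a rank-based computation would be preferable for a cohort of several thousand subjects.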
Timed activity performance in persons with upper limb amputation: A preliminary study.

PubMed

Resnik, Linda; Borgia, Mathew; Acluche, Frantzy

55 subjects with upper limb amputation were administered the T-MAP twice within one week. To develop a timed measure of activity performance for persons with upper limb amputation (T-MAP); examine the measure's internal consistency, test-retest reliability and validity; and compare scores by prosthesis use. Measures of activity performance for persons with upper limb amputation are needed. The time required to perform daily activities is a meaningful metric that has implications for participation in life roles. Internal consistency and test-retest reliability were evaluated. Construct validity was examined by comparing scores by amputation level. Exploratory analyses compared sub-group scores, and examined correlations with other measures. Scale alpha was 0.77, ICC was 0.93. Timed scores differed by amputation level. Subjects using a prosthesis took longer to perform all tasks. T-MAP was not correlated with other measures of dexterity or activity, but was correlated with pain for non-prosthesis users. The timed scale had adequate internal consistency and excellent test-retest reliability. Analyses support reliability and construct validity of the T-MAP. 2c "outcomes" research. Published by Elsevier Inc.
Diabetes and Cognitive Decline in Older Adults: The Ginkgo Evaluation of Memory Study.

PubMed

Palta, Priya; Carlson, Michelle C; Crum, Rosa M; Colantuoni, Elizabeth; Sharrett, A Richey; Yasar, Sevil; Nahin, Richard L; DeKosky, Steven T; Snitz, Beth; Lopez, Oscar; Williamson, Jeff D; Furberg, Curt D; Rapp, Stephen R; Golden, Sherita Hill

2017-12-12

Previous studies have shown that individuals with diabetes exhibit accelerated cognitive decline. However, methodological limitations have limited the quality of this evidence. Heterogeneity in study design, cognitive test administration, and methods of analysis of cognitive data have made it difficult to synthesize and translate findings to practice. We analyzed longitudinal data from the Ginkgo Evaluation of Memory Study to test our hypothesis that older adults with diabetes have greater test-specific and domain-specific cognitive declines compared to older adults without diabetes. Tests of memory, visuo-spatial construction, language, psychomotor speed, and executive function were administered. Test scores were standardized to z-scores and averaged to yield domain scores. Linear random effects models were used to compare baseline differences and changes over time in test and domain scores among individuals with and without diabetes. Among the 3,069 adults, aged 72-96 years, 9.3% reported diabetes. Over a median follow-up of 6.1 years, participants with diabetes exhibited greater baseline differences in a test of executive function (trail making test, Part B) and greater declines in a test of language (phonemic verbal fluency). For the composite cognitive domain scores, participants with diabetes exhibited lower baseline executive function and global cognition domain scores, but no significant differences in the rate of decline. Identifying cognitive domains most affected by diabetes can lead to targeted risk modification, possibly in the form of lifestyle interventions such as diet and physical activity, which we know to be beneficial for improving vascular risk factors, such as diabetes, and therefore may reduce the risk of executive dysfunction and possible dementia. © The Author 2017. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
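The scoring step described above (standardize each test to z-scores, then average within a domain) can be sketched as follows. The test names and raw scores are hypothetical, and the use of the population standard deviation and of the analysis sample as the reference distribution are assumptions; the abstract does not state either choice. Timed tests on which higher values mean worse performance would also need their sign flipped before averaging.

```python
from statistics import fmean, pstdev


def z_scores(raw_scores):
    """Standardize raw scores to mean 0 and SD 1 (population SD)."""
    mean = fmean(raw_scores)
    sd = pstdev(raw_scores)
    return [(x - mean) / sd for x in raw_scores]


def domain_scores(tests):
    """Average the z-scores of several tests into one score per person.

    tests maps a test name to a list of raw scores, one per participant,
    with participants in the same order in every list.
    """
    standardized = [z_scores(raw) for raw in tests.values()]
    return [fmean(per_person) for per_person in zip(*standardized)]


# Hypothetical raw scores for three participants on two memory tests.
memory_tests = {
    "hypothetical_recall": [10, 12, 14],
    "hypothetical_recognition": [30, 20, 25],
}
memory_domain = domain_scores(memory_tests)
print([round(score, 2) for score in memory_domain])
```

Averaging z-scores puts tests with different raw scales on an equal footing, which is what allows a single domain score to be compared between groups.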
EXPLORATION OF SCORE AGREEMENT ON A MODIFIED UPPER QUARTER Y-BALANCE TEST KIT AS COMPARED TO THE UPPER QUARTER Y-BALANCE TEST.

PubMed

Cramer, Josh; Quintero, Miguel; Rhinehart, Alex; Rutherford, Caitlin; Nasypany, Alan; May, James; Baker, Russell T

2017-02-01

Physical performance measures (PPMs) such as The Star Excursion Balance Test (SEBT) and the Y-Balance Test (YBT) are functional movement tests used to assess participants' dynamic balance, which can be a vital component in physical exams to identify predisposing factors for risk of injury. The YBT is a functional assessment tool for the upper and lower body. It evolved from the SEBT, which has been previously used in research as a lower body functional assessment. It is comprised of fewer movement directions, which help limit fatigue. The YBT kit is a commercialized tool, which may pose barriers for clinicians with limited budgets and/or strict approval process for purchasing capital items in their clinics, especially healthcare providers in the secondary school setting. The cost may also pose a barrier for researchers with limited budgets. A less expensive, easy to make kit, may provide clinicians an opportunity to integrate functional testing into their evaluation or research. The purpose of this pilot study was to describe a cost efficient method to gather participants' upper quarter YBT (UQYBT) measurements and examine the inter- and intra-rater score agreement between this method and the commercial YBT measurements. A convenience sample of 20 physically active participants volunteered to participate in a comparison study of the Upper Quarter Y-Balance Test (UQYBT) using the commercialized kit and the Modified Upper Quarter Y-Balance Test kit (mUQYBT) made with three cloth tape measures, athletic tape, a goniometer and three 2x4x8 wood blocks. A Pearson Product Moment correlation and Bland-Altman analyses were used to examine the relationship between intra-rater scores comparing the UQYBT and mUQYBT. Inter-rater scores were analyzed using intraclass correlation coefficients (ICC) (2,1) and Bland-Altman analyses. All Pearson Product Moment r-values for intra-rater scores were greater than .96 and statistically significant at p<0.05.
Coefficients of determination suggest that the mUQYBT scores account for approximately 92% of the UQYBT composite score when analyzing intra-rater comparisons. Bland-Altman plots suggest moderate agreement between the two tests with a potential bias towards higher composite scores in the mUQYBT. Inter-rater ICC scores were all greater than .98, while Bland-Altman plot analyses suggest moderate agreement between the raters. The mUQYBT produced similar results in both inter- and intra-rater measurements when compared to the commercialized YBT kit and offers a cost-effective alternative for assessing upper quarter PPMs for clinicians with limited budgets. 2b.
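Two quantities in this abstract are simple to reproduce in outline. The coefficient of determination is the square of the Pearson r, so r = .96 gives roughly 92% shared variance. A Bland-Altman analysis summarizes agreement as the mean difference (bias) and limits of agreement at bias ± 1.96 × SD of the differences. The sketch below uses hypothetical reach scores, not study data, and the function name is illustrative.

```python
from statistics import fmean, stdev


def bland_altman(method_a, method_b):
    """Return (bias, lower limit, upper limit) of agreement.

    bias is the mean of the paired differences a - b; the limits are
    bias +/- 1.96 sample standard deviations of those differences.
    """
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = fmean(diffs)
    half_width = 1.96 * stdev(diffs)
    return bias, bias - half_width, bias + half_width


# Coefficient of determination for the smallest reported correlation.
r = 0.96
print(f"r^2 = {r ** 2:.4f}")

# Hypothetical composite scores from the modified and commercial kits.
modified_kit = [88.0, 92.5, 79.0, 85.5, 90.0]
commercial_kit = [87.0, 91.0, 79.5, 84.0, 88.5]
bias, lower, upper = bland_altman(modified_kit, commercial_kit)
print(f"bias = {bias:.2f}, limits of agreement = [{lower:.2f}, {upper:.2f}]")
```

A positive bias in this layout would correspond to the modified kit yielding higher composite scores, which is the direction of bias the abstract describes.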
Effect of a Lower Extremity Preventive Training Program on Physical Performance Scores in Military Recruits.

PubMed

Peck, Karen Y; DiStefano, Lindsay J; Marshall, Stephen W; Padua, Darin A; Beutler, Anthony I; de la Motte, Sarah J; Frank, Barnett S; Martinez, Jessica C; Cameron, Kenneth L

2017-11-01

Peck, KY, DiStefano, LJ, Marshall, SW, Padua, DA, Beutler, AI, de la Motte, SJ, Frank, BS, Martinez, JC, and Cameron, KL. Effect of a lower extremity preventive training program on physical performance scores in military recruits. J Strength Cond Res 31(11): 3146-3157, 2017-Exercise-based preventive training programs are designed to improve movement patterns associated with lower extremity injury risk; however, the impact of these programs on general physical fitness has not been evaluated. The purpose of this study was to compare fitness scores between participants in a preventive training program and a control group. One thousand sixty-eight freshmen from a U.S. Service Academy were cluster-randomized into either the intervention or control group during 6 weeks of summer training. The intervention group performed a preventive training program, specifically the Dynamic Integrated Movement Enhancement (DIME), which is designed to improve lower extremity movement patterns. The control group performed the Army Preparation Drill (PD), a warm-up designed to prepare soldiers for training. Main outcome measures were the Army Physical Fitness Test (APFT) raw and scaled (for age and sex) scores. Independent t tests were used to assess between-group differences. Multivariable logistic regression models were used to control for the influence of confounding variables. Dynamic Integrated Movement Enhancement group participants completed the APFT 2-mile run 20 seconds faster compared with the PD group (p < 0.001), which corresponded with significantly higher scaled scores (p < 0.001). Army Physical Fitness Test push-up scores were significantly higher in the DIME group (p = 0.041), but there were no significant differences in APFT sit-up scores. The DIME group had significantly higher total APFT scores compared with the PD group (p < 0.001). Similar results were observed in multivariable models after controlling for sex and body mass index (BMI). 
Committing time to the implementation of a preventive training program does not appear to negatively affect fitness test scores.
Can a smartphone app improve medical trainees’ knowledge of antibiotics?

PubMed Central

Haj, Reem; Hirpara, Dhruvin; Wong, Karen; Muller, Matthew; Matukas, Larissa; Bartlett, John; Leung, Elizabeth; Taggart, Linda

2017-01-01

Objectives To determine whether a smartphone app, containing local bacterial resistance patterns (antibiogram) and treatment guidelines, improved knowledge of prescribing antimicrobials among medical trainees. Methods We conducted a prospective, controlled, pre-post study of medical trainees with access to a smartphone app (app group) containing our hospital’s antibiogram and treatment guidelines compared to those without access (control group). Participants completed a survey which included a knowledge assessment test (score range, 0 [lowest possible score] to 12 [highest possible score]) at the start of the study and four weeks later. The primary outcome was change in mean knowledge assessment test scores between week 0 and week 4. Change in knowledge assessment test scores in the app group was compared to the difference in scores in the control group using multivariable linear regression. Results Sixty-two residents and senior medical students participated in the study. In a multivariable analysis controlling for sex and prior knowledge, app use was associated with a 1.1 point (95% CI: 0.10, 2.1) [β = 1.08, t(1) = 2.08, p = 0.04] higher change in knowledge score compared to the change in knowledge scores in the control group. Among those in the app group, 88% found it easy to navigate, 85% found it useful, and about one-quarter used it daily. Conclusions An antibiogram and treatment algorithm app increased knowledge of prescribing antimicrobials in the context of local antibiotic resistance patterns. These findings reinforce the notion that smartphone apps can be a useful and innovative means of delivering medical education. PMID:29200402
The Michigan Context and Performance Report Card: Public Elementary & Middle Schools, 2013

ERIC Educational Resources Information Center

Spalding, Audrey

2013-01-01

The Michigan Context and Performance Report Card measures school performance by adjusting standardized test scores to account for student background. Comparing schools using unadjusted test scores ignores the significant relationship between academic performance and student socioeconomic background--a dynamic outside a school's control. The…
The Michigan Public High School Context and Performance Report Card

ERIC Educational Resources Information Center

Van Beek, Michael; Bowen, Daniel; Mills, Jonathan

2012-01-01

Assessing a high school's effectiveness is not straightforward. Comparing a school's standardized test scores to those of other schools is one approach to measuring effectiveness, but a major objection to this method is that students' test scores tend to be related to students' "socioeconomic" status--family household income, for…
Assessing Wildlife Habitat Value of New England Salt Marshes: II. Model Testing and Validation

EPA Science Inventory

We test a previously described model to assess the wildlife habitat value of New England salt marshes by comparing modeled habitat values and scores with bird abundance and species richness at sixteen salt marshes in Narragansett Bay, Rhode Island USA. Assessment scores ranged f...
Reliability Estimation When a Test Is Split into Two Parts of Unknown Effective Length.

ERIC Educational Resources Information Center

Feldt, Leonard S.

2002-01-01

Considers the situation in which content or administrative considerations limit the way in which a test can be partitioned to estimate the internal consistency reliability of the total test score. Demonstrates that a single-valued estimate of the total score reliability is possible only if an assumption is made about the comparative size of the…
The Use of Quality Control and Data Mining Techniques for Monitoring Scaled Scores: An Overview. Research Report. ETS RR-12-20

ERIC Educational Resources Information Center

von Davier, Alina A.

2012-01-01

Maintaining comparability of test scores is a major challenge faced by testing programs that have almost continuous administrations. Among the potential problems are scale drift and rapid accumulation of errors. Many standard quality control techniques for testing programs, which can effectively detect and address scale drift for small numbers of…
Effects of reading-oriented tasks on students' reading comprehension of geometry proof

NASA Astrophysics Data System (ADS)

Yang, Kai-Lin; Lin, Fou-Lai

2012-06-01

This study compared the effects of reading-oriented tasks and writing-oriented tasks on students' reading comprehension of geometry proof (RCGP). The reading-oriented tasks were designed with reading strategies and the idea of problem posing. The writing-oriented tasks were consistent with usual proof instruction for writing a proof and applying it. Twenty-two classes of ninth-grade students (N = 683), aged 14 to 15 years, and 12 mathematics teachers participated in this quasi-experimental classroom study. While the experimental group was instructed to read and discuss the reading tasks in two 45-minute lessons, the control group was instructed to prove and apply the same propositions. The generalised estimating equation (GEE) method was used to compare the scores of the post-test and the delayed post-test with the pre-test scores as covariates. Results showed that the total scores of the delayed post-test of the experimental group were significantly higher than those of the control group. Furthermore, the scores of the experimental group on all facets of reading comprehension except the application facet were significantly higher than those of the control group for both the post-test and delayed post-test.
Comparing perceived and test-based knowledge of cancer risk and prevention among Hispanic and African Americans: an example of community participatory research.

PubMed

Jones, Loretta; Bazargan, Mohsen; Lucas-Wright, Anna; Vadgama, Jaydutt V; Vargas, Roberto; Smith, James; Otoukesh, Salman; Maxwell, Annette E

2013-01-01

Most theoretical formulations acknowledge that knowledge and awareness of cancer screening and prevention recommendations significantly influence health behaviors. This study compares perceived knowledge of cancer prevention and screening with test-based knowledge in a community sample. We also examine demographic variables and self-reported cancer screening and prevention behaviors as correlates of both knowledge scores, and consider whether cancer related knowledge can be accurately assessed using just a few, simple questions in a short and easy-to-complete survey. We used a community-partnered participatory research approach to develop our study aims and a survey. The study sample was composed of 180 predominantly African American and Hispanic community individuals who participated in a full-day cancer prevention and screening promotion conference in South Los Angeles, California, in July 2011. Participants completed a self-administered survey in English or Spanish at the beginning of the conference. Our data indicate that perceived and test-based knowledge scores are only moderately correlated. The perceived knowledge score shows a stronger association with demographic characteristics and other cancer related variables than the test-based score. Thirteen of the twenty variables examined in our study showed a statistically significant correlation with the perceived knowledge score; however, only four variables demonstrated a statistically significant correlation with the test-based knowledge score. Perceived knowledge of cancer prevention and screening was assessed with fewer items than test-based knowledge. Thus, using this assessment could potentially reduce respondent burden. However, our data demonstrate that perceived and test-based knowledge are separate constructs.
The sensitivity and specificity of using a computer aided diagnosis program for automatically scoring chest X-rays of presumptive TB patients compared with Xpert MTB/RIF in Lusaka Zambia.

PubMed

Muyoyeta, Monde; Maduskar, Pragnya; Moyo, Maureen; Kasese, Nkatya; Milimo, Deborah; Spooner, Rosanna; Kapata, Nathan; Hogeweg, Laurens; van Ginneken, Bram; Ayles, Helen

2014-01-01

To determine the sensitivity and specificity of a Computer Aided Diagnosis (CAD) program for scoring chest x-rays (CXRs) of presumptive tuberculosis (TB) patients compared to Xpert MTB/RIF (Xpert). Consecutive presumptive TB patients with a cough of any duration were offered digital CXR, and opt out HIV testing. CXRs were electronically scored as normal (CAD score ≤ 60) or abnormal (CAD score > 60) using a CAD program. All patients regardless of CAD score were requested to submit a spot sputum sample for testing with Xpert and a spot and morning sample for testing with LED Fluorescence Microscopy (FM). Of 350 patients with evaluable data, 291 (83.1%) had an abnormal CXR score by CAD. The sensitivity, specificity, positive predictive value (PPV) and negative predictive value (NPV) of CXR compared to Xpert were 100% (95%CI 96.2-100), 23.2% (95%CI 18.2-28.9), 33.0% (95%CI 27.6-38.7) and 100% (95%CI 93.9-100), respectively. The area under the receiver operator curve (AUC) for CAD was 0.71 (95%CI 0.66-0.77). CXR abnormality correlated with smear grade (r = 0.30, p<0.0001) and with Xpert CT (r = 0.37, p<0.0001). To our knowledge this is the first time that a CAD program for TB has been successfully tested in a real world setting. The study shows that the CAD program had high sensitivity but low specificity and PPV. The use of CAD with digital CXR has the potential to increase the use and availability of chest radiography in screening for TB where trained human resources are scarce.
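The accuracy figures in this abstract can be reproduced from a 2×2 table. The sketch below is illustrative only: the four cell counts are not given in the abstract and are back-calculated here from the reported totals (350 patients, 291 abnormal by CAD) and percentages, and the function names are hypothetical. The second helper uses the closed form of the exact (Clopper-Pearson) lower limit that applies when every observation is a success; whether the authors used that interval method is an assumption.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Return sensitivity, specificity, PPV and NPV as fractions."""
    return {
        "sensitivity": tp / (tp + fn),  # true positives among reference-positive
        "specificity": tn / (tn + fp),  # true negatives among reference-negative
        "ppv": tp / (tp + fp),          # true positives among index-test-positive
        "npv": tn / (tn + fn),          # true negatives among index-test-negative
    }


def exact_lower_limit_all_successes(n, alpha=0.05):
    """Exact two-sided lower confidence limit when all n trials succeed."""
    return (alpha / 2) ** (1 / n)


# Assumed counts, reconstructed from the reported percentages (not in the source):
# 291 abnormal CXRs = 96 Xpert-positive + 195 Xpert-negative;
# 59 normal CXRs = 0 Xpert-positive + 59 Xpert-negative.
metrics = diagnostic_metrics(tp=96, fp=195, fn=0, tn=59)
for name, value in metrics.items():
    print(f"{name}: {100 * value:.1f}%")

sens_lower = exact_lower_limit_all_successes(96)  # 96 of 96 detected
npv_lower = exact_lower_limit_all_successes(59)   # 59 of 59 correctly negative
print(f"sensitivity lower limit: {100 * sens_lower:.1f}%")
print(f"NPV lower limit: {100 * npv_lower:.1f}%")
```

Under these assumed counts the four metrics and the two lower limits match the reported values after rounding. That makes the reconstruction plausible, but it does not confirm the actual table.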
A randomized controlled trial of high-fidelity simulation versus lecture-based education in preclinical medical students.

PubMed

Alluri, Ram Kiran; Tsing, Pamela; Lee, Edward; Napolitano, Jason

2016-01-01

The purpose of this study was to compare the efficacy of simulation versus lecture-based education among preclinical medical students. Twenty medical students participated in this randomized, controlled crossover study. Students were randomized to four groups. Each group received two simulations and two lectures covering four different topics. Students were administered a pre-test, post-test and delayed post-test. The mean percentage of questions answered correctly on each test was calculated. The mean of each student's change in score across the three tests was used to compare simulation- versus lecture-based education. Students in both the simulation and lecture groups demonstrated improvement between the pre-test and post-test (p < 0.05). Students in the simulation group demonstrated improvement between the immediate post-test and delayed post-test (p < 0.05), while students in the lecture group did not demonstrate improvement (p > 0.05). When comparing interventions, the change in score between the pre-test and post-test was similar among both the groups (p > 0.05). The change in score between the post-test and delayed post-test was greater in the simulation group (p < 0.05). High-fidelity simulation may serve as a viable didactic platform for preclinical medical education. Our study demonstrated equivalent immediate knowledge gain and superior long-term knowledge retention in comparison to lectures.
Clinical outcomes for patients finished with the SureSmile™ method compared with conventional fixed orthodontic therapy.

PubMed

Alford, Timothy J; Roberts, W Eugene; Hartsfield, James K; Eckert, George J; Snyder, Ronald J

2011-05-01

Utilize American Board of Orthodontics (ABO) cast/radiographic evaluation (CRE) to compare a series of 63 consecutive patients, finished with manual wire bending (conventional) treatment, vs a subsequent series of 69 consecutive patients, finished by the same orthodontist using the SureSmile™ (SS) method. Records of 132 nonextraction patients were scored by a calibrated examiner blinded to treatment mode. Age and discrepancy index (DI) between groups were compared by t-tests. A chi-square test was used to compare for differences in sex and whether the patient was treated using braces only (no orthopedic correction). Analysis of covariance tested for differences in CRE outcomes and treatment times, with sex and DI included as covariates. A logarithmic transformation of CRE outcomes and treatment times was used because their distributions were skewed. Significance was defined as P < .05. Compared with conventional finishing, SS patients had significantly lower DI scores, less treatment time (∼7 months), and better CRE scores for first-order alignment-rotation and interproximal space closure; however, second-order root angulation (RA) was inferior. SS patients were treated in less time to better CRE scores for first-order rotation (AR) and interproximal space closure (IC) but on the average, malocclusions were less complex and second order root alignment was inferior, compared with patients finished with manual wire bending.
Clinical outcomes for patients finished with the SureSmile™ method compared with conventional fixed orthodontic therapy

PubMed Central

Alford, Timothy J.; Roberts, W. Eugene; Hartsfield, James K.; Eckert, George J.; Snyder, Ronald J.

2016-01-01

Objective Utilize American Board of Orthodontics (ABO) cast/radiographic evaluation (CRE) to compare a series of 63 consecutive patients, finished with manual wire bending (conventional) treatment, vs a subsequent series of 69 consecutive patients, finished by the same orthodontist using the SureSmile™ (SS) method. Materials and Methods Records of 132 nonextraction patients were scored by a calibrated examiner blinded to treatment mode. Age and discrepancy index (DI) between groups were compared by t-tests. A chi-square test was used to compare for differences in sex and whether the patient was treated using braces only (no orthopedic correction). Analysis of covariance tested for differences in CRE outcomes and treatment times, with sex and DI included as covariates. A logarithmic transformation of CRE outcomes and treatment times was used because their distributions were skewed. Significance was defined as P < .05. Results Compared with conventional finishing, SS patients had significantly lower DI scores, less treatment time (~7 months), and better CRE scores for first-order alignment-rotation and interproximal space closure; however, second-order root angulation (RA) was inferior. Conclusion SS patients were treated in less time to better CRE scores for first-order rotation (AR) and interproximal space closure (IC) but on the average, malocclusions were less complex and second order root alignment was inferior, compared with patients finished with manual wire bending. PMID:21261488
Microtia and Social Media: Patient Versus Physician Perspective of Quality of Information.

PubMed

Sepehripour, Sarvnaz; McDermott, Ann Louise; Lloyd, Mark Sheldon

2017-05-01

Previous research demonstrates that patients seek high-quality information on the World Wide Web, especially in rare conditions such as microtia. Social media has overtaken other sources of patient information but quality remains untested. This study quantifies the quality of information for patients with Microtia on social media compared with nonsocial media websites and compares physician and patient scoring on quality using the DISCERN tool. In phase 1, quality of the top 100 websites featuring information "Microtia" was ranked according to quality score and position on Google showing the position of social media websites among other nonsocial media websites. Phase 2 involved independent scoring of websites on microtia compared with a patient group with microtia to test whether physicians score differently to patients with t test comparison. Social media websites account for 2% of the scored websites with health providers linking to social media. Social media websites were among the highest ranked on Google. No correlation was found between the quality of information and Google rank. Social media scored higher than nonsocial media websites regarding quality of information on microtia. No significant difference existed between physician and patient quality of information scores on social media and nonsocial media websites (p 1.033). Physicians and patients objectively score microtia websites alike. Social media websites have higher use despite being few in number compared with nonsocial media websites. Physicians providing links to social media on information websites on rare conditions such as microtia are engaging in current information-seeking trends.
[Effects of temporal lobe epilepsy and idiopathic epilepsy on cognitive function and emotion in children].

PubMed

Yang, Xiao-Yan; Long, Li-Li; Xiao, Bo

2016-07-01

To investigate the effects of temporal lobe epilepsy and idiopathic epilepsy on cognitive function and emotion in children and the risk factors for cognitive impairment. A retrospective analysis was performed for the clinical data of 38 children with temporal lobe epilepsy and 40 children with idiopathic epilepsy. The controls were 42 healthy children. All subjects received the following neuropsychological tests: Montreal Cognitive Assessment (MoCA) scale, verbal fluency test, digit span test, block design test, Social Anxiety Scale for Children (SASC), and Depression Self-rating Scale for Children (DSRSC). Compared with the control group, the temporal lobe epilepsy and idiopathic epilepsy groups showed significantly lower scores of MoCA, verbal fluency, digit span, and block design (P<0.05) and significantly higher scores on SASC and DSRSC (P<0.05). Compared with the idiopathic epilepsy group, the temporal lobe epilepsy group showed significantly lower scores of MoCA, verbal fluency, digit span, and block design (P<0.05) and significantly higher scores on SASC and DSRSC (P<0.05). In the temporal lobe epilepsy group, MoCA score was negatively correlated with SASC score, DSRSC score, and seizure frequency (r=-0.571, -0.529, and -0.545 respectively; P<0.01). In the idiopathic epilepsy group, MoCA score was also negatively correlated with SASC score, DSRSC score, and seizure frequency (r=-0.542, -0.487, and -0.555 respectively; P<0.01). Children with temporal lobe epilepsy and idiopathic epilepsy show impaired whole cognition, verbal fluency, memory, and executive function and have anxiety and depression, which are more significant in children with temporal lobe epilepsy. High levels of anxiety, depression, and seizure frequency are risk factors for impaired cognitive function.

The role of three-dimensional printed models of skull in anatomy education: a randomized controlled trail.

PubMed

Chen, Shi; Pan, Zhouxian; Wu, Yanyan; Gu, Zhaoqi; Li, Man; Liang, Ze; Zhu, Huijuan; Yao, Yong; Shui, Wuyang; Shen, Zhen; Zhao, Jun; Pan, Hui

2017-04-03

Three-dimensional (3D) printed models represent educational tools of high quality compared with traditional teaching aids. Colored skull models were produced by 3D printing technology. A randomized controlled trial (RCT) was conducted to compare the learning efficiency of 3D printed skulls with that of cadaveric skulls and atlas. Seventy-nine medical students, who had never studied anatomy, were randomized into three groups by drawing lots, using 3D printed skulls, cadaveric skulls, and atlas, respectively, to study the anatomical structures in the skull through an introductory lecture and small group discussions. All students completed identical tests, which were composed of a theory test and a lab test, before and after a lecture. Pre-test scores showed no differences between the three groups. In the post-test, the 3D group was better than the other two groups in total score (cadaver: 29.5 [IQR: 25-33], 3D: 31.5 [IQR: 29-36], atlas: 27.75 [IQR: 24.125-32]; p = 0.044) and scores of lab test (cadaver: 14 [IQR: 10.5-18], 3D: 16.5 [IQR: 14.375-21.625], atlas: 14.5 [IQR: 10-18.125]; p = 0.049). Scores involving theory test, however, showed no difference between the three groups. In this RCT, an inexpensive, precise and rapidly-produced skull model had advantages in assisting anatomy study, especially in structure recognition, compared with traditional education materials.
Factors contributing to speech perception scores in long-term pediatric cochlear implant users.

PubMed

Davidson, Lisa S; Geers, Ann E; Blamey, Peter J; Tobey, Emily A; Brenner, Christine A

2011-02-01

The objectives of this report are to (1) describe the speech perception abilities of long-term pediatric cochlear implant (CI) recipients by comparing scores obtained at elementary school (CI-E, 8 to 9 yrs) with scores obtained at high school (CI-HS, 15 to 18 yrs); (2) evaluate speech perception abilities in demanding listening conditions (i.e., noise and lower intensity levels) at adolescence; and (3) examine the relation of speech perception scores to speech and language development over this longitudinal timeframe. All 112 teenagers were part of a previous nationwide study of 8- and 9-yr-olds (N = 181) who received a CI between 2 and 5 yrs of age. The test battery included (1) the Lexical Neighborhood Test (LNT; hard and easy word lists); (2) the Bamford Kowal Bench sentence test; (3) the Children's Auditory-Visual Enhancement Test; (4) the Test of Auditory Comprehension of Language at CI-E; (5) the Peabody Picture Vocabulary Test at CI-HS; and (6) the McGarr sentences (consonants correct) at CI-E and CI-HS. CI-HS speech perception was measured in both optimal and demanding listening conditions (i.e., background noise and low-intensity level). Speech perception scores were compared based on age at test, lexical difficulty of stimuli, listening environment (optimal and demanding), input mode (visual and auditory-visual), and language age. All group mean scores significantly increased with age across the two test sessions. Scores of adolescents significantly decreased in demanding listening conditions. The effect of lexical difficulty on the LNT scores, as evidenced by the difference in performance between easy versus hard lists, increased with age and decreased for adolescents in challenging listening conditions. 
Calculated curves for percent correct speech perception scores (LNT and Bamford Kowal Bench) and consonants correct on the McGarr sentences plotted against age-equivalent language scores on the Test of Auditory Comprehension of Language and Peabody Picture Vocabulary Test achieved asymptote at similar ages, around 10 to 11 yrs. On average, children receiving CIs between 2 and 5 yrs of age exhibited significant improvement on tests of speech perception, lipreading, speech production, and language skills measured between primary grades and adolescence. Evidence suggests that improvement in speech perception scores with age reflects increased spoken language level up to a language age of about 10 yrs. Speech perception performance significantly decreased with softer stimulus intensity level and with introduction of background noise. Upgrades to newer speech processing strategies and greater use of frequency-modulated systems may be beneficial for ameliorating performance under these demanding listening conditions.
Correlation of Patient-Reported Outcomes Measurement Information System (PROMIS) scores with legacy patient-reported outcome scores in patients undergoing rotator cuff repair.

PubMed

Patterson, Brendan M; Orvets, Nathan D; Aleem, Alexander W; Keener, Jay D; Calfee, Ryan P; Nixon, Devon C; Chamberlain, Aaron M

2018-06-01

The Patient-Reported Outcomes Measurement Information System (PROMIS) is being used to assess outcomes in many patient populations despite limited validation. The purpose of this study was to investigate the relationship between American Shoulder and Elbow Surgeons (ASES) and Simple Shoulder Test (SST) scores and PROMIS Physical Function (PF) and Upper Extremity (UE) function scores collected preoperatively in patients undergoing rotator cuff repair. This cross-sectional study analyzed 164 consecutive patients undergoing arthroscopic rotator cuff repair. Study inclusion required preoperative completion of the ASES and SST evaluations, as well as the PROMIS PF, UE, and Pain Interference computerized adaptive tests. Descriptive statistics were produced, and Pearson correlation coefficients were calculated between each of the outcome measures. Average PROMIS UE scores indicated greater impairment than PROMIS PF scores (34 vs 44). Three percent of patients reached the PROMIS UE ceiling score of 56. PROMIS PF scores demonstrated a weak correlation with ASES scores (r = 0.43, P < .001) and a moderate correlation with SST scores (r = 0.51, P < .001). PROMIS UE scores demonstrated a moderate correlation with both ASES scores (r = 0.59, P < .001) and SST scores (r = 0.62, P < .001). PROMIS Pain Interference scores demonstrated weak negative correlations with both ASES scores (r = -0.43, P < .001) and SST scores (r = -0.41, P < .001). Patients answered fewer questions on average using the PROMIS PF and UE instruments as compared with the ASES and SST instruments. PROMIS UE scores indicate greater impairment and demonstrate a stronger correlation with the legacy shoulder scores than PROMIS PF scores in patients with symptomatic rotator cuff tears. PROMIS computerized adaptive tests allow for more efficient patient-reported outcome data collection compared with traditional outcome scores. Copyright © 2018 Journal of Shoulder and Elbow Surgery Board of Trustees. 
Published by Elsevier Inc. All rights reserved.
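One way to read the correlation coefficients reported in this abstract is through the squared coefficient, which gives the proportion of variance two measures share under a linear relationship. The short sketch below applies that standard identity to the reported r values; the dictionary labels are illustrative and not taken from the source.

```python
# Pearson correlations reported in the abstract (legacy score vs PROMIS score).
reported_r = {
    "PF vs ASES": 0.43,
    "PF vs SST": 0.51,
    "UE vs ASES": 0.59,
    "UE vs SST": 0.62,
}

# r squared is the share of variance explained by a linear relationship.
shared_variance = {pair: r ** 2 for pair, r in reported_r.items()}

for pair, r2 in shared_variance.items():
    print(f"{pair}: r = {reported_r[pair]:.2f}, r^2 = {r2:.2f}")
```

Read this way, even the strongest reported correlation (UE vs SST) leaves well over half of the variance unshared between the two instruments.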
Race, Socioeconomic Status, and Implicit Bias: Implications for Closing the Achievement Gap

NASA Astrophysics Data System (ADS)

Schlosser, Elizabeth Auretta Cox

This study assessed the relationship between race, socioeconomic status, age and the race implicit bias held by middle and high school science teachers in Mobile and Baldwin County Public School Systems. Seventy-nine participants were administered the race Implicit Association Test (race IAT), created by Greenwald, A. G., Nosek, B. A., & Banaji, M. R., (2003) and a demographic survey. Quantitative analysis in this study used analysis of variance (ANOVA) and t-tests. An ANOVA was performed comparing the race IAT scores of African American science teachers and their Caucasian counterparts. A statistically significant difference was found (F = .4.56, p = .01). An ANOVA was also performed using the race IAT scores comparing the age of the participants; the analysis yielded no statistical difference based on age. A t-test was performed comparing the race IAT scores of African American teachers who taught at either Title I or non-Title I schools; no statistical difference was found between groups (t = -17.985, p < .001). A t-test was also performed comparing the race IAT scores of Caucasian teachers who taught at either Title I or non-Title I schools; a statistically significant difference was found between groups (t = 2.44, p > .001). This research examines the implications of the achievement gap among African American and Caucasian students in science.
Neurocognitive functions of pediatric kidney transplant recipients.

PubMed

Molnar-Varga, Marta; Novak, Marta; Szabo, Attila J; Kelen, Kata; Streja, Elani; Remport, Adam; Mucsi, Istvan; Molnar, Miklos Z; Reusz, Gyorgy

2016-09-01

End-stage renal disease (ESRD) in children is associated with impaired neurocognitive function and development. However, data on factors associated with neurocognitive dysfunctions in children with kidney transplants are limited. We conducted a cross-sectional analysis comparing cognitive functions (using the Woodcock-Johnson International Edition, WJIE) in 35 kidney transplant and 35 healthy control children. Data on laboratory measurements, comorbidities, and social characteristics were collected. Transplant children had significantly worse scores on the intelligence quotient (IQ) test compared with controls [Full Scale IQ score 85 (26) vs 107 (10), p <0.001]. Lower maternal education level was significantly associated with lower WJIE cognitive test scores; however, no association was found between laboratory values and WJIE scores. Among children with kidney transplants, those with medical comorbid conditions had significantly lower Verbal Ability and Full Scale IQ scores. Earlier age of dialysis onset and a longer total time on dialysis (>9 months) were associated with lower test scores. Age-standardized duration of hospitalization was inversely correlated with IQ (r = -0.46, p <0.01) and was an independent significant predictor (Beta = -0.38, p = 0.02) of IQ scores in transplanted children. Child kidney transplant recipients have neurocognitive function impairments that are associated with markers of socioeconomic status (SES) and factors related to disease severity.
Nursing workload in public and private intensive care units

PubMed Central

Nogueira, Lilia de Souza; Koike, Karina Mitie; Sardinha, Débora Souza; Padilha, Katia Grillo; de Sousa, Regina Marcia Cardoso

2013-01-01

Objective This study sought to compare patients at public and private intensive care units according to the nursing workload and interventions provided. Methods This retrospective, comparative cohort study included 600 patients admitted to 4 intensive care units in São Paulo. The nursing workload and interventions were assessed using the Nursing Activities Score during the first and last 24 hours of the patient's stay at the intensive care unit. Pearson's chi-square test, Fisher's exact test, the Mann-Whitney test, and Student's t test were used to compare the patient groups. Results The average Nursing Activities Score upon admission to the intensive care unit was 61.9, with a score of 52.8 upon discharge. Significant differences were found among the patients at public and private intensive care units relative to the average Nursing Activities Score upon admission, as well as for 12 out of 23 nursing interventions performed during the first 24 hours of stay at the intensive care units. The patients at the public intensive care units exhibited a higher average score and overall more frequent nursing interventions, with the exception of those involved in the "care of drains", "mobilization and positioning", and "intravenous hyperalimentation". The groups also differed with regard to the evolution of the Nursing Activities Score among the total case series as well as the groups of survivors from the time of admission to discharge from the intensive care unit. Conclusion Patients admitted to public and private intensive care units exhibit differences in their nursing care demands, which may help managers with nursing manpower planning. PMID:24213086
IRT Equating of the MCAT. MCAT Monograph.

ERIC Educational Resources Information Center

Hendrickson, Amy B.; Kolen, Michael J.

This study compared various equating models and procedures for a sample of data from the Medical College Admission Test(MCAT), considering how item response theory (IRT) equating results compare with classical equipercentile results and how the results based on use of various IRT models, observed score versus true score, direct versus linked…
Health related quality of life in patients with neuroendocrine tumors compared with the general Norwegian population.

PubMed

Haugland, Trude; Vatn, Morten H; Veenstra, Marijke; Wahl, Astrid Klopstad; Natvig, Gerd Karin

2009-08-01

Health related quality of life (HRQoL) was characterized among patients with neuroendocrine tumor (NET) and compared with the general Norwegian population. A cross sectional, comparative design was chosen, and the samples comprised 196 NET patients and 5,258 individuals from the general Norwegian population. We used Chi-square cross tab calculations to evaluate sociodemographic characteristics, T-tests for independent samples and Analysis of Variance (ANOVA) in order to compare HRQoL (SF-36) scores across a range of background variables. Furthermore, T-tests were used to analyze differences in HRQoL scores between the samples. NET patients demonstrated significantly lower scores on all HRQoL subscales when compared with the general population, with the lowest values on general health, physical limitation and vitality. Individuals above 70 years reported lower scores on physical functioning and physical limitations compared with those who were younger. Individuals with higher levels of education reported increased physical functioning compared with those with less education, and full-time or part-time workers described higher physical functioning and fewer physical limitations compared with those who were retired. All SF-36 HRQoL scores were significantly lower among the NET patients when compared with the general population. Assistance from health personnel to NET patients should focus on those domains.
Effect of ice massage on lower extremity functional performance and weight discrimination ability in collegiate footballers.

PubMed

Sharma, Geeta; Noohu, Majumi Mohamad

2014-09-01

Cryotherapy, in the form of ice massage, is used to reduce inflammation after acute musculoskeletal injury or trauma. The potential negative effects of ice massage on proprioception are unknown, despite equivocal evidence supporting its effectiveness. The purpose of the study was to test the influence of cooling on weight discrimination ability and hence the performance in footballers. The study was of same subject experimental design (pretest-posttest design). Thirty male collegiate football players, whose mean age was 21.07 years, participated in the study. The participants were assessed for two functional performance tests, single leg hop test and crossed over hop test, and weight discrimination ability before and after ice massage for 5 minutes on the hamstrings muscle tendon. Pre cooling scores of Single Leg Hop Test of the dominant leg in the subjects were 166.65 (± 10.16) cm and post cooling scores of the dominant leg were 167.25 (± 11.77) cm. Pre cooling scores of Crossed Over Hop Test of the dominant leg in the subjects were 174.14 (± 8.60) cm and post cooling scores of the dominant leg were 174.45 (± 9.28) cm. Pre cooling scores of Weight Discrimination Differential Threshold of the dominant leg in the subjects were 1.625 (± 1.179) kg compared with post cooling scores of the dominant leg of 1.85 (± 1.91) kg. Pre cooling scores of single leg hop and crossed over hop test of the dominant leg in the subjects compared with post cooling scores of the dominant leg showed no significant differences, and the weight discrimination ability (weight discrimination differential threshold) also showed no significant difference. All the values are reported as mean ± SD. This study provides additional evidence that proprioceptive acuity in the hamstring muscles (biceps femoris) remains largely unaffected after ice application to the hamstrings tendon (biceps femoris).
Speech perception and communication ability over the telephone by Mandarin-speaking children with cochlear implants.

PubMed

Wu, Che-Ming; Liu, Tien-Chen; Wang, Nan-Mai; Chao, Wei-Chieh

2013-08-01

(1) To understand speech perception and communication ability through real telephone calls by Mandarin-speaking children with cochlear implants and compare them to live-voice perception, (2) to report the general condition of telephone use of this population, and (3) to investigate the factors that correlate with telephone speech perception performance. Fifty-six children with over 4 years of implant use (aged 6.8-13.6 years, mean duration 8.0 years) took three speech perception tests administered using telephone and live voice to examine sentence, monosyllabic-word and Mandarin tone perception. The children also filled out a questionnaire survey investigating everyday telephone use. The Wilcoxon signed-rank test was used to compare the scores between live-voice and telephone tests, and Pearson's test to examine the correlation between them. The mean scores were 86.4%, 69.8% and 70.5% respectively for sentence, word and tone recognition over the telephone. The corresponding live-voice mean scores were 94.3%, 84.0% and 70.8%. The Wilcoxon signed-rank test showed that the sentence and word scores were significantly different between the telephone and live-voice tests, while the tone recognition scores were not, indicating that tone perception was degraded less by telephone transmission than word and sentence perception. Spearman's test showed that chronological age and duration of implant use were weakly correlated with the perception test scores. The questionnaire survey showed 78% of the children could initiate phone calls and 59% could use the telephone 2 years after implantation. Implanted children are potentially capable of using the telephone 2 years after implantation, and communication ability over the telephone becomes satisfactory 4 years after implantation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Statistical Assessment of Estimated Transformations in Observed-Score Equating

ERIC Educational Resources Information Center

Wiberg, Marie; González, Jorge

2016-01-01

Equating methods make use of an appropriate transformation function to map the scores of one test form into the scale of another so that scores are comparable and can be used interchangeably. The equating literature shows that the ways of judging the success of an equating (i.e., the score transformation) might differ depending on the adopted…
A Comparison between Linear IRT Observed-Score Equating and Levine Observed-Score Equating under the Generalized Kernel Equating Framework

ERIC Educational Resources Information Center

Chen, Haiwen

2012-01-01

In this article, linear item response theory (IRT) observed-score equating is compared under a generalized kernel equating framework with Levine observed-score equating for nonequivalent groups with anchor test design. Interestingly, these two equating methods are closely related despite being based on different methodologies. Specifically, when…
Translation and validation of the Dutch new Knee Society Scoring System ©.

PubMed

Van Der Straeten, Catherine; Witvrouw, Erik; Willems, Tine; Bellemans, Johan; Victor, Jan

2013-11-01

A new version of The Knee Society Knee Scoring System(©) (KSS) has recently been developed. Before this scale can be used in non-English-speaking populations, it has to be translated and validated for a particular population. We evaluated the construct and content validity, the test-retest reliability, and the internal consistency of the Dutch version of the New Knee Society KSS. A Dutch translation was performed using a forward-backward translation protocol. We tested the construct validity of the Dutch New KSS by comparing it with the Dutch versions of the WOMAC, Knee Injury and Osteoarthritis Outcome Score (KOOS), and SF-12 scores in 137 patients undergoing total knee arthroplasty (TKA). Content validity was assessed by comparing pre- and postoperative scores and by checking floor and ceiling effects. To evaluate test-retest reliability and consistency, 47 patients completed the questionnaire a second time with a mean of 8 days interval (range, 2-20 days) between tests. Construct validity was demonstrated because the Dutch New KSS correlated well with the Dutch WOMAC (r = -0.751; p < 0.001), Dutch KOOS (r = -0.723; p < 0.001), and Dutch SF-12 (r = 0.569; p < 0.001). There was a significant difference between pre- and postoperative scores (p < 0.001) in line with the other scores. Test-retest reliability proved excellent with an intraclass correlation coefficient between 0.73 and 0.92 depending on the domain tested. Consistency as indicated by Cronbach's alpha ranging from 0.84 to 0.96 was good to excellent. As demonstrated by the validation procedure, the Dutch New KSS is an excellent instrument to evaluate TKA outcome in Dutch-speaking patients.
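As an illustration of the internal-consistency statistic reported in the record above, the following is a minimal Python sketch of Cronbach's alpha using the standard textbook formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy responses are hypothetical and are not taken from the study; the authors' actual computation is not described in the abstract.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Illustrative helper (hypothetical name); uses sample variances throughout.
    """
    if len(responses) < 2:
        raise ValueError("need at least two respondents")
    k = len(responses[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Transpose so that each tuple holds one item's scores across respondents.
    items = list(zip(*responses))
    item_variance_sum = sum(variance(col) for col in items)
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


if __name__ == "__main__":
    # Hypothetical questionnaire data: 5 respondents x 4 items.
    toy_responses = [
        [3, 4, 3, 4],
        [2, 2, 3, 2],
        [5, 4, 5, 5],
        [1, 2, 1, 2],
        [4, 4, 4, 3],
    ]
    print(round(cronbach_alpha(toy_responses), 3))
```

Values closer to 1 indicate that the items vary together; by the usual rule of thumb, the reported range of 0.84 to 0.96 would be read as good to excellent consistency, as the abstract states.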
Validation of a Paper and Pencil Test Battery for the Diagnosis of Minimal Hepatic Encephalopathy in Korea.

PubMed

Jeong, Jae Yoon; Jun, Dae Won; Bai, Daiseg; Kim, Ji Yean; Sohn, Joo Hyun; Ahn, Sang Bong; Kim, Sang Gyune; Kim, Tae Yeob; Kim, Hyoung Su; Jeong, Soung Won; Cho, Yong Kyun; Song, Do Seon; Kim, Hee Yeon; Jung, Young Kul; Yoon, Eileen L

2017-09-01

The aim of this study was to validate a new paper and pencil test battery to diagnose minimal hepatic encephalopathy (MHE) in Korea. The new paper and pencil test battery was composed of the number connection test-A (NCT-A), number connection test-B (NCT-B), digit span test (DST), and symbol digit modality test (SDMT). The norm of the new test was based on 315 healthy individuals between the ages of 20 and 70 years old. Another 63 participants, comprising healthy subjects (n = 31) and cirrhosis patients (n = 32), were included as a validation cohort. All participants completed the new paper and pencil test, a critical flicker frequency (CFF) test and a computerized cognitive function test (visual continuous performance test [CPT]). The scores on the NCT-A and NCT-B increased but those of DST and SDMT decreased according to age. Twelve of the cirrhotic patients (37.5%) were diagnosed with MHE based on the new paper and pencil test battery. The total score of the paper and pencil test battery showed good positive correlation with the CFF (r = 0.551, P < 0.001) and the computerized cognitive function test. Also, this score was lower in patients with MHE compared to those without MHE (P < 0.001). Scores on the CFF (32.0 vs. 28.7 Hz, P = 0.028) and the computer-based cognitive test decreased significantly in patients with MHE compared to those without MHE. Test-retest reliability was comparable. In conclusion, the new paper and pencil test battery including NCT-A, NCT-B, DST, and SDMT showed good correlation with neuropsychological tests. This new paper and pencil test battery could help to discriminate patients with impaired cognitive function in cirrhosis (registered at Clinical Research Information Service [CRIS], https://cris.nih.go.kr/cris, KCT0000955). © 2017 The Korean Academy of Medical Sciences.
Does It Matter if You "Kill" the Patient or Order Too Many Tests? Scoring Alternatives for a Test of Clinical Reasoning Skill

ERIC Educational Resources Information Center

Childs, Ruth A.; Dunn, Jennifer L.; van Barneveld, Christina; Jaciw, Andrew P.

2007-01-01

This study compares five scoring approaches for a test of clinical reasoning skills. All of the approaches incorporate information about the correct item responses selected and the errors, such as selecting too many responses or selecting a response that is inappropriate and/or harmful to the patient. The approaches are combinations of theoretical…
A Study To Determine the Effects of School Athletic Programs on the CTBS Test Percentiles of Students.

ERIC Educational Resources Information Center

Fleenor, Paula

This study was conducted to determine the positive or negative relationship between school athletic program participation and the academic achievement of students in the 4th through 11th grades as measured by the California Tests of Basic Skills (CTBS). CTBS tests are taken in grades 4 and 11 and their scores are compared to scores from students…
A Discussion and Comparison of Selected Methods for Determining Cutoff Scores for Proficiency and Placement Tests. Placement and Proficiency Testing Report No. 6.

ERIC Educational Resources Information Center

Klein, Anna C.; Whitney, Douglas R.

Procedures and related issues involved in the application of trait-treatment interaction (TTI) to institutional research, in general, and to placement and proficiency testing, in particular, are discussed and illustrated. Traditional methods for choosing cut-off scores are compared and proposals for evaluating the results in the TTI framework are…
Does arthroscopic rotator cuff repair improve patients' activity levels?

PubMed

Baumgarten, Keith M; Chang, Peter S; Dannenbring, Tasha M; Foley, Elaine K

2018-06-04

Rotator cuff repair decreases pain, improves range of motion, and increases strength. Whether these improvements translate to an improvement in a patient's activity level postoperatively remains unknown. The Shoulder Activity Level is a valid and reliable outcomes survey that can be used to measure a patient's shoulder-specific activity level. Currently, there are no studies that examine the effect of rotator cuff repair on shoulder activity level. Preoperative patient-determined outcomes scores collected prospectively on patients undergoing rotator cuff repair were compared with postoperative scores at a minimum of 2 years. These scores included the Shoulder Activity Level, Western Ontario Rotator Cuff Index, American Shoulder and Elbow Surgeons Standardized Shoulder Assessment Form, Single Assessment Numeric Evaluation, and simple shoulder test. Inclusion criteria were patients undergoing arthroscopic rotator cuff repair. Included were 281 shoulders from 273 patients with a mean follow-up of 3.7 years. The postoperative median Western Ontario Rotator Cuff Index (42 vs. 94), American Shoulder and Elbow Surgeons (41 vs. 95), Single Assessment Numeric Evaluation (30 vs. 95), and simple shoulder test (4 vs. 11) scores were statistically significantly improved compared with preoperative scores (P < .0001). The postoperative median Shoulder Activity Level score decreased compared with the preoperative score (12 vs. 11; P < .0001). Patients reported a statistically significant deterioration of their Shoulder Activity Level score after rotator cuff repair compared with their preoperative scores, although disease-specific and joint-specific quality of life scores all showed statistically significant improvement. This study suggests that patients generally have (1) significant improvements in their quality of life and (2) small deteriorations in activity level after arthroscopic rotator cuff repair. Copyright © 2018 Journal of Shoulder and Elbow Surgery Board of Trustees. Published by Elsevier Inc. All rights reserved.
Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

ERIC Educational Resources Information Center

Minor, Elizabeth Covay

2016-01-01

Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…
Evaluating Gifted Identification Practice: Aptitude Testing and Linguistically Diverse Learners

ERIC Educational Resources Information Center

Matthews, Michael S.; Kirsch, Lauri

2011-01-01

The authors examined individually administered IQ scores from an entire K-5 population (N = 432) of Limited English Proficient students referred for gifted program eligibility determination in a single large urban district in the southeastern United States. Of 8 IQ tests compared, only 1, the Stanford-Binet V, had scores appreciably lower than…

Levels of Critical Thinking of Secondary Agriculture Students.

ERIC Educational Resources Information Center

Rollins, Timothy J.

1990-01-01

A total of 668 Iowa secondary agriculture students completed the Cornell Critical Thinking Test Level X. These scores and data from the Iowa Tests of Educational Development (ITED) revealed levels of proficiency comparable to other high school populations. The best indicator of critical thinking score was the ITED subtest Reading Total. (SK)
International Test Score Comparisons and Educational Policy: A Review of the Critiques

ERIC Educational Resources Information Center

Carnoy, Martin

2015-01-01

Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…
Comparing Standard Deviation Effects across Contexts

ERIC Educational Resources Information Center

Ost, Ben; Gangopadhyaya, Anuj; Schiman, Jeffrey C.

2017-01-01

Studies using test scores as the dependent variable often report point estimates in student standard deviation units. We note that a standard deviation is not a standard unit of measurement since the distribution of test scores can vary across contexts. As such, researchers should be cautious when interpreting differences in the numerical size of…
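The point made in the record above can be illustrated with a small Python sketch: the same raw-score effect converts to very different standard-deviation units when the spread of scores differs between contexts. The function name, the 5-point effect, and both score lists are hypothetical and do not come from the article.

```python
from statistics import stdev


def effect_in_sd_units(raw_effect, scores):
    """Express a raw-score effect in units of the sample SD of `scores`.

    Hypothetical helper for illustration only.
    """
    sd = stdev(scores)
    if sd == 0:
        raise ValueError("scores have zero spread")
    return raw_effect / sd


if __name__ == "__main__":
    raw_effect = 5.0  # assumed gain of 5 raw points in both contexts
    wide_context = [40, 50, 60, 70, 80]    # hypothetical, widely spread scores
    narrow_context = [55, 58, 60, 62, 65]  # hypothetical, tightly clustered scores
    print(effect_in_sd_units(raw_effect, wide_context))
    print(effect_in_sd_units(raw_effect, narrow_context))
```

Under these assumed numbers the identical 5-point gain looks several times larger in the tightly clustered context, which is why effect sizes in SD units are hard to compare across settings without knowing the underlying score distributions.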
Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores.

PubMed

Hicks, Andrew L; Handcock, Mark S; Sastry, Narayan; Pebley, Anne R

2018-02-01

Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure.
Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores

PubMed Central

Hicks, Andrew L.; Handcock, Mark S.; Sastry, Narayan

2018-01-01

Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure. PMID:29192386
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

PubMed

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
Validation of a novel epicutaneous delivery system for patch testing of house dust mite-hypersensitive dogs.

PubMed

Olivry, Thierry; Linder, Keith E; Paps, Judy S; Bizikova, Petra; Dunston, Stan; Donne, Nathalie; Mondoulet, Lucie

2012-12-01

Patch tests with allergens are used for the evaluation of cellular hypersensitivity to food and environmental allergens in dogs and humans with atopic dermatitis. Viaskin is a novel allergen epicutaneous delivery system that enhances epidermal allergen capture by immune cells. To compare the use of Viaskin and Finn chamber patch tests in dogs hypersensitive to mite allergens. Empty control or Dermatophagoides farinae house dust mite-containing Viaskin or Finn chamber patches were applied to the thoracic skin of six mite-hypersensitive Maltese-beagle crossbred atopic dogs. Lesions were graded 49 and 72 h after patch test application, and skin biopsies were collected after 72 h. Overall microscopic inflammation, eosinophil and T-lymphocyte infiltrations were scored. Positive macroscopic patch test reactions developed at five of six Viaskin application sites and four of six Finn chamber application sites. Median microscopic epidermal and dermal inflammation, as well as eosinophil and CD3 T-lymphocyte dermal scores were always higher in biopsies collected at Viaskin than at Finn chamber sites. Microscopic inflammation scores were significantly higher after mite allergen-containing Viaskin compared with empty patches, but this was not the case for mite-containing Finn chambers compared with control chambers. Scores obtained using Viaskin were not significantly different from those obtained using Finn chambers. Macroscopic and microscopic scores were significantly correlated. In mite-allergic dogs, Viaskin epicutaneous delivery systems appear to induce stronger allergen-specific inflammation than currently used Finn chamber patch tests. Consequently, Viaskin patches might offer a better alternative for screening cellular hypersensitivity to food and environmental allergens. © 2012 The Authors. Veterinary Dermatology © 2012 ESVD and ACVD.
The UCSF screening exam effectively screens cognitive and behavioral impairment in patients with ALS.

PubMed

Murphy, Jennifer; Ahmed, Fizaa; Lomen-Hoerth, Catherine

2015-03-01

The University of California San Francisco (UCSF) Screening Battery provides clinicians with a uniquely tailored tool to measure ALS patients' cognitive and behavioral changes, adjusting for dysarthria and hand weakness. The battery consists of the ALS-CBS (1), Written Fluency Test (2), and a new revision of the Frontal Behavior Inventory (FBI-ALS) (3). The validity of each component was tested by comparing results with a gold standard neuropsychological exam (GNE). Consensus criteria-based GNE diagnoses (4) were assigned (n = 24) and concurrent validity was tested for each screening exam component. Results showed that each of the four cognitive and behavioral screening test components were significantly associated with diagnoses confirmed by GNE. GNE diagnoses were significantly associated with FBI-ALS negative score, written S-words score, and ALS-CBS cognitive score. The total FBI-ALS score and C-words tests were less predictive of GNE-diagnosed impairment. In conclusion, the UCSF Cognitive Screening Battery demonstrates good external validity compared with GNE in this modest sample, encouraging its use in larger investigations. These data suggest that this battery may provide an effective screen to identify ALS patients who will then benefit from a full examination to confirm their diagnosis.
[Development and validation of the Visual Analogue Scale (VAS) Spine Score].

PubMed

Knop, C; Oeser, M; Bastian, L; Lange, U; Zdichavsky, M; Blauth, M

2001-06-01

The aim of the study was the development and validation of a new subjective rating scale for assessment of outcome in patients with thoracolumbar fractures and fracture dislocations. The VAS spine score consists of 19 score items, using 100-mm visual analogue scales. The items are answered by the patients independently of rater assessment. To measure the analogue scales and calculate the score, a computer-aided system was evolved consisting of self-developed software and digitizer board. The overall score is the mean of all items answered with values between 0 and 100. The individual score loss is calculated as the difference between the preinjury score and at follow-up with values between 0 and 100. The VAS spine score was tested for reliability with a group of 136 healthy volunteers. We performed a test-retest study with an interval of 24 h. For statistical analysis of the validity, we prospectively followed a group of 53 patients with the new outcome score. We chose patients with injuries of the thoracolumbar spine, all having been operatively treated by combined posterior-anterior stabilization and fusion between 1994 and 1996. In the reference group, the average test score was 91.95 (58-100) and 92.10 (58-100) at retest. The mean individual difference between test and retest scored 1.037 (0-8). A high reliability was proved by a strong correlation with a coefficient of 0.976 (p < 0.001). A high internal consistency of the VAS spine score was shown by a Cronbach-alpha of 0.9117. The mean score for the preinjury status of the patients was comparable to the reference group, amounting to 89.60 (21-100). The mean score at the time of implant removal was significantly (p < 0.001) decreased to 58.25 (13-97). Until the time of follow-up a significant (p < 0.001) increase was noted, and the group scored 66.08 (15-100) at follow-up. This was a significant (p < 0.001) difference compared with the preinjury status. The individual score loss averaged 24.1 (0-80). 
In the patient group we also noted a Cronbach-alpha > 0.95, indicating a high internal consistency. With the VAS spine score the authors have inaugurated a new tool for outcome measurement in the treatment of patients with thoracolumbar injuries. The study has proved the score to be both reliable and valid. The application of the score is helpful in analyzing the subjective outcome, and the results can be correlated with objective measures. The score is a useful tool for comparative clinical studies, addressing the outcome after different methods of treatment.
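The scoring rule described in the record above (overall score as the mean of all answered items on a 0-100 scale, and score loss as the preinjury score minus the follow-up score) can be sketched in Python as follows. This is an illustrative reading of the abstract, not the authors' software: the function names are hypothetical, unanswered items are assumed to be recorded as None, and no clamping of negative losses is applied because the abstract does not say how such cases are handled.

```python
def vas_overall_score(item_values):
    """Mean of all answered items, each measured on a 0-100 mm scale.

    `item_values` may contain None for unanswered items (an assumption).
    """
    answered = [v for v in item_values if v is not None]
    if not answered:
        raise ValueError("no items were answered")
    for value in answered:
        if not 0 <= value <= 100:
            raise ValueError("item values must lie between 0 and 100")
    return sum(answered) / len(answered)


def vas_score_loss(preinjury_score, follow_up_score):
    """Individual score loss: preinjury score minus follow-up score."""
    return preinjury_score - follow_up_score


if __name__ == "__main__":
    # Hypothetical patient: three answered items and one skipped item.
    preinjury = vas_overall_score([95, 90, None, 85])
    follow_up = vas_overall_score([70, 60, None, 65])
    print(preinjury, follow_up, vas_score_loss(preinjury, follow_up))
```

Averaging only the answered items keeps the score on the same 0-100 scale even when a patient skips some of the 19 items.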
PERFORMANCE OF TWO DIFFERENT CLINICAL SCORING SYSTEMS IN DIAGNOSING DISTAL SENSORY POLYNEUROPATHY IN PATIENTS WITH TYPE-2 DIABETES.

PubMed

Khan, Fehmeda Farrukh; Numan, Ahsan; Khawaja, Khadija Irfan; Atif, Ali; Fatima, Aziz; Masud, Faisal

2015-01-01

Early diagnosis of distal peripheral neuropathy (DSPN), among the commonest diabetes complications, helps prevent significant morbidity. Clinical parameters are useful for detection, but subjectivity and lack of operator proficiency often result in inaccuracies. Comparative diagnostic accuracy of Diabetic Neuropathy Symptom (DNS) score and Diabetic Neuropathy Examination (DNE) score in detecting DSPN confirmed by nerve conduction studies (NCS) has not been evaluated. This study compares the performance of these scores in predicting the presence of electrophysiologically proven DSPN. The objective of this study was to compare the diagnostic accuracy of DNS and DNE scores in detecting NCS proven DSPN in type-2 diabetics, and to determine the frequency of sub-clinical DSPN among type-2 diabetics. In this cross-sectional study the DNS score and DNE score were determined in 110 diagnosed type-2 diabetic patients. NCS were carried out and amplitudes, velocities and latencies of sensory and motor nerves in lower limb were recorded. Comparison between the two clinical diagnostic modalities and NCS using Pearson's chi square test showed a significant association between NCS and DNE scores (p-value = .003, specificity 93%). The DNS score performed poorly in comparison (p-value = .068, specificity 77%). When the two scores were taken in combination the specificity in diagnosing DSPN was greater (p-value = .018, specificity 96%) than either alone. 33% of patients had subclinical neuropathy. DNE score alone and in combination with DNS score is reliable in predicting DSPN and is more specific than DNS score in evaluating DSPN. Both tests lack sensitivity. Patients without any evidence of clinical neuropathy manifest abnormalities on NCS.
Association of fall history with the Timed Up and Go test score and the dual task cost: A cross-sectional study among independent community-dwelling older adults.

PubMed

Asai, Tsuyoshi; Oshima, Kensuke; Fukumoto, Yoshihiro; Yonezawa, Yuri; Matsuo, Asuka; Misu, Shogo

2018-05-21

To investigate the associations between fall history and the Timed Up and Go (TUG) test (single-TUG test), TUG test while counting aloud backwards from 100 (dual-TUG test) and the dual-task cost (DTC) among independent community-dwelling older adults. This cross-sectional study included 537 older adults who lived independently in the community. Data on fall history in the previous year were obtained by self-administered questionnaire. The single- and dual-TUG tests were carried out, and the DTC value was computed from these results. Associations between fall history and these TUG-related values were analyzed using multivariate logistic regression models. The participants were divided into fall risk groups using the cut-off values of those significantly associated with falling, and the odds ratios (OR) were computed. Slower single-TUG test scores and lower DTC values were significantly associated with fall history after adjusting for potential confounders (single-TUG test score: OR 1.133, 95% CI 1.029-1.249; DTC value: OR 0.984, 95% CI 0.968-0.998). Older adults with slower single-TUG test scores and lower DTC values reported a fall history more often than those in other categories (OR compared with the lower-risk single-TUG and lower-risk DTC groups: 3.474, 95% CI 1.881-6.570). Slower single-TUG test scores and lower DTC values are associated with fall history among independent community-dwelling older adults. To some extent, dual task performance might provide added value for fall assessment, compared with administering the TUG test alone. Geriatr Gerontol Int 2018; ••: ••-••. © 2018 Japan Geriatrics Society.
HIV-associated cognitive disorders in perinatally infected children and adolescents: a novel composite cognitive domains score.

PubMed

Phillips, Nicole J; Hoare, Jacqueline; Stein, Dan J; Myer, Landon; Zar, Heather J; Thomas, Kevin G F

2018-04-22

Accurate assessment of HIV-associated cognitive disorders in perinatally infected children and adolescents is challenging. Assessments of general intellectual functioning, or global cognition, may not provide information regarding domain-specific strengths and weaknesses, and may therefore fail to detect impaired trajectories of development within particular cognitive domains. We compare the efficacy of global cognitive scores to that of composite cognitive domain scores in detecting cognitive disorders in a sample of perinatally HIV-infected children, and a demographically matched HIV negative control group, drawn from the Cape Town Adolescent Antiretroviral Cohort (CTAAC) study. All children were administered a comprehensive neuropsychological test battery. Using data from that test battery, we created ten separate composite cognitive domains: general intellectual functioning, attention, working memory, visual memory, verbal memory, language, visual spatial ability, motor coordination, processing speed and executive function. Within each domain, each test bore a high level of association with each of the other tests in that domain (Cronbach's α ≥ .70 for all domains). We found that composite domain scores calculated on whole-sample data were significantly higher than those calculated using control-sample data. Our comparison of a global cognitive score to composite domain scores suggested that the latter provided more detailed information (regarding strengths, weaknesses, areas of impairment), and when compared to global scores, were more sensitive in detecting HIV-associated cognitive disorders, and were able to distinguish HIV-infected patients from uninfected controls. Hence, we recommend using this method of composite cognitive domains scores, rather than global aggregate scores, when assessing cognitive function in paediatric HIV. This method provides a convenient and relatively accurate assessment that might help with cross-cultural and cross-region comparisons as researchers try to detect cognitive impairment patterns in HIV-infected children and adolescents globally.
Reliability and validity of the Assessment of Daily Activity Performance (ADAP) in community-dwelling older women.

PubMed

de Vreede, Paul L; Samson, Monique M; van Meeteren, Nico L; Duursma, Sijmen A; Verhaar, Harald J

2006-08-01

The Assessment of Daily Activity Performance (ADAP) test was developed, and modeled after the Continuous-scale Physical Functional Performance (CS-PFP) test, to provide a quantitative assessment of older adults' physical functional performance. The aim of this study was to determine the intra-examiner reliability and construct validity of the ADAP in a community-living older population, and to identify the importance of tester experience. Forty-three community-dwelling, older women (mean age 75 yr +/-4.3) were randomized to the test-retest reliability study (n=19) or validation study (n=24). The intra-examiner reliability of an experienced (tester 1) and an inexperienced tester (tester 2) was assessed by comparing test and retest scores of 19 participants. Construct validity was assessed by comparing the ADAP scores of 24 participants with self-perceived function by the SF-36 Health Survey, muscle function tests, and the Timed Up and Go test (TUG). Tester 1 had good consistency and reliability scores (mean difference between test and retest scores (DIF), -1.05+/-1.99; 95% confidence interval (CI), -2.58 to 0.48; Cronbach's alpha (alpha) range, 0.83 to 0.98; intraclass correlation (ICC) range, 0.75 to 0.96; Limits of Agreement (LoA), -2.58 to 4.95). Tester 2 had lower reliability scores (DIF, -2.45+/-4.36; 95% CI, -5.56 to 0.67; alpha range, 0.53 to 0.94; ICC range, 0.36 to 0.90; LoA, -6.09 to 10.99), with a systematic difference between test and retest scores for the ADAP domain lower-body strength (-3.81; 95% CI, -6.09 to -1.54). ADAP correlated with SF-36 Physical Functioning scale (r=0.67), TUG test (r=-0.91) and with isometric knee extensor strength (r=0.80). The ADAP test is a reliable and valid instrument. Our results suggest that testers should practise using the test, to improve reliability, before applying it to clinical settings.
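The record above reports a mean test-retest difference and Limits of Agreement (LoA) without stating how they were computed. A minimal Python sketch follows, assuming the conventional Bland-Altman definition (mean difference plus or minus 1.96 times the standard deviation of the differences); the function name and the toy scores are hypothetical, and the study's own procedure may differ.

```python
from statistics import mean, stdev


def limits_of_agreement(test_scores, retest_scores):
    """Return (mean difference, lower limit, upper limit) for paired scores.

    Assumes the conventional Bland-Altman limits: mean diff +/- 1.96 * SD.
    """
    if len(test_scores) != len(retest_scores):
        raise ValueError("test and retest must have the same length")
    if len(test_scores) < 2:
        raise ValueError("need at least two paired scores")
    diffs = [t - r for t, r in zip(test_scores, retest_scores)]
    mean_diff = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return mean_diff, mean_diff - spread, mean_diff + spread


if __name__ == "__main__":
    # Hypothetical test and retest scores for five participants.
    test = [62.0, 55.5, 70.0, 48.0, 66.5]
    retest = [63.5, 57.0, 69.0, 50.5, 68.0]
    print(limits_of_agreement(test, retest))
```

Narrower limits indicate that repeated measurements by the same tester agree more closely, which is the sense in which the experienced tester in the abstract is described as more reliable.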
Assessing students' conceptual knowledge of electricity and magnetism

NASA Astrophysics Data System (ADS)

McColgan, Michele W.; Finn, Rose A.; Broder, Darren L.; Hassel, George E.

2017-12-01

We present the Electricity and Magnetism Conceptual Assessment (EMCA), a new assessment aligned with second-semester introductory physics courses. Topics covered include electrostatics, electric fields, circuits, magnetism, and induction. We have two motives for writing a new assessment. First, we find other assessments such as the Brief Electricity and Magnetism Assessment and the Conceptual Survey on Electricity and Magnetism not well aligned with the topics and content depth of our courses. We want to test introductory physics content at a level appropriate for our students. Second, we want the assessment to yield scores and gains comparable to the widely used Force Concept Inventory (FCI). After five testing and revision cycles, the assessment was finalized in early 2015 and is available online. We present performance results for a cohort of 225 students at Siena College who were enrolled in our algebra- and calculus-based physics courses during the spring 2015 and 2016 semesters. We provide pretest, post-test, and gain analyses, as well as individual question and whole test statistics to quantify difficulty and reliability. In addition, we compare EMCA and FCI scores and gains, and we find that students' FCI scores are strongly correlated with their performance on the EMCA. Finally, the assessment was piloted in an algebra-based physics course at George Washington University (GWU). We present performance results for a cohort of 130 GWU students and we find that their EMCA scores are comparable to the scores of students in our calculus-based physics course.
Translation and Validation of the Dysphagia Handicap Index in Hebrew-Speaking Patients.

PubMed

Shapira-Galitz, Yael; Drendel, Michael; Yousovich-Ulriech, Ruth; Shtreiffler-Moskovich, Liat; Wolf, Michael; Lahav, Yonatan

2018-06-07

The Dysphagia Handicap Index (DHI) is a 25-item questionnaire assessing the physical, functional, and emotional aspects of dysphagia patients' quality of life (QoL). The study goal was to translate and validate the Hebrew-DHI. 148 patients undergoing fiberoptic endoscopic examination of swallowing (FEES) in two specialized dysphagia clinics between February and August 2017 filled the Hebrew-DHI and self-reported their dysphagia severity on a scale of 1-7. 21 patients refilled the DHI during a 2-week period following their first visit. FEES were scored for residue (1 point per consistency), penetration and aspiration (1 point for penetration, 2 points for aspiration, per consistency). 51 healthy volunteers also filled the DHI. Internal consistency and test-retest reproducibility were used for reliability testing. Validity was established by comparing DHI scores of dysphagia patients and healthy controls. Concurrent validity was established by correlating the DHI score with the FEES score. Internal consistency of the Hebrew-DHI was high (Cronbach's alpha = 0.96), as was the test-retest reproducibility (Spearman's correlation coefficient = 0.82, p < 0.001). The Hebrew-DHI's total score, and its three subscales (physical/functional/emotional) were significantly higher in dysphagia patients compared to those in healthy controls (median 38 pts, IQR 18-56 for dysphagia patients compared to 0, IQR 0-2 for healthy controls, p < 0.0001). A strong correlation was observed between the DHI score and the self-reported dysphagia severity measure (Spearman's correlation coefficient = 0.88, p < 0.0001). A moderate correlation was found between the DHI score and the FEES score (Pearson's correlation coefficient = 0.245, p = 0.003). The Hebrew-DHI is a reliable and valid questionnaire assessing dysphagia patients' QoL.
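The internal-consistency figure quoted in the abstract above (Cronbach's alpha) can be illustrated with a short sketch. This uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name `cronbach_alpha` and the tiny data set are hypothetical and are not taken from the study.

```python
import statistics

def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(responses[0])  # number of questionnaire items
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Variance of each item across respondents (sample variance).
    item_vars = [statistics.variance(col) for col in zip(*responses)]
    # Variance of each respondent's total score.
    total_var = statistics.variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical 2-item questionnaire answered by three respondents.
print(cronbach_alpha([[1, 2], [2, 3], [3, 4]]))
```

In this toy data the two items move together perfectly, so alpha reaches its maximum; a real 25-item instrument would be scored the same way with one row per patient.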
The Relationship of Comorbidities and Patient Navigation to Time to Diagnostic Resolution after Abnormal Cancer Screening

PubMed Central

Whitley, Elizabeth M; Raich, Peter C; Dudley, Donald J; Freund, Karen M; Paskett, Electra D; Patierno, Steven R; Simon, Melissa; Warren-Mears, Victoria; Snyder, Frederick R

2016-01-01

Background: Whether patient navigation improves outcomes in patients with comorbidities is unknown. Study aims were to determine the effect of comorbidities on time to diagnostic resolution following an abnormal cancer screening test, and to examine whether, for patients with comorbidities, patient navigation improves timeliness and likelihood of diagnostic resolution compared to patients without navigation. Methods: A secondary analysis from the Patient Navigation Research Program sites that collected comorbidity data using the Charlson Comorbidity Index (CCI) was conducted. Participants were 6,349 patients with abnormal breast, cervical, colon or prostate cancer screening tests between 2007 and 2011. The intervention was patient navigation or usual care. CCI data were highly skewed across projects and cancer sites and were categorized as 0, no comorbidities identified, CCI score of 0 (76% of cases); 1, CCI score of 1 (16% of cases); or 2, CCI score of ≥2 (8% of cases). A separate adjusted hazards ratio for each site and cancer type was obtained, and then pooled using meta-analysis random effects methodology. Results: Having a CCI score of ≥2 delayed the time to diagnostic resolution following an abnormal cancer screening test compared with those with fewer than one comorbidity. Patient navigation reduced delays in diagnostic resolution, with the greatest benefit seen in those with a CCI score of ≥2. Conclusions: Persons with a CCI score of ≥2 experienced significant delays in timely diagnostic care compared to patients without comorbidities. Patient navigation was effective in reducing delays in diagnostic resolution among those with CCI scores > 1. PMID:27648520
An explorative study of school performance and antipsychotic medication.

PubMed

van der Schans, J; Vardar, S; Çiçek, R; Bos, H J; Hoekstra, P J; de Vries, T W; Hak, E

2016-09-21

Antipsychotic therapy can reduce severe symptoms of psychiatric disorders; however, data on school performance among children on such treatment are lacking. The objective was to explore school performance among children using antipsychotic drugs at the end of primary education. A cross-sectional study was conducted using the University Groningen pharmacy database linked to academic achievement scores at the end of primary school (Dutch Cito-test) obtained from Statistics Netherlands. Mean Cito-test scores and standard deviations were obtained for children on antipsychotic therapy and reference children, and statistically compared using analyses of covariance. In addition, differences between subgroups such as boys versus girls, ethnicity, household income, and late starters (start date within 12 months of the Cito-test) versus early starters (start date > 12 months before the Cito-test) were tested. In all, data from 7994 children could be linked to Cito-test scores. At the time of the Cito-test, 45 (0.6 %) were on treatment with antipsychotics. Children using antipsychotics scored on average 3.6 points lower than the reference peer group (534.5 ± 9.5). Scores were different across gender and levels of household income (p < 0.05). Scores of early starters were significantly higher than starters within 12 months (533.7 ± 1.7 vs. 524.1 ± 2.6). This first exploration showed that children on antipsychotic treatment have lower school performance compared to the reference peer group at the end of primary school. This was most noticeable for girls, but early starters were less affected than later starters. Due to the observational cross-sectional nature of this study, no causality can be inferred, but the results indicate that school performance should be closely monitored and that causes of underperformance despite treatment warrant more research.
Integrated Behavioral Z-Scoring Increases the Sensitivity and Reliability of Behavioral Phenotyping in mice: Relevance to Emotionality and Sex

PubMed Central

Guilloux, Jean-Philippe; Seney, Marianne; Edgar, Nicole; Sibille, Etienne

2011-01-01

Defining anxiety- and depressive-like states in mice (“emotionality”) is best characterized by the use of complementary tests, leading sometimes to puzzling discrepancies and lack of correlation between similar paradigms. To address this issue, we hypothesized that integrating measures along the same behavioral dimensions in different tests would reduce the intrinsic variability of single tests and provide a robust characterization of the underlying “emotionality” of individual mice, similar to the way mood and related syndromes are defined in humans through various related symptoms over time. We describe the use of simple mathematical and integrative tools to help phenotype animals across related behavioral tests (syndrome diagnosis) and experiments (meta-analysis). We applied z-normalization across complementary measures of emotionality in different behavioral tests after unpredictable chronic mild stress (UCMS) or prolonged corticosterone exposure - two approaches to induce anxious-/depressive-like states in mice. Combining z-normalized test values lowered the variance of emotionality measurement, enhanced the reliability of behavioral phenotyping, and increased analytical opportunities. Comparing integrated emotionality scores across studies revealed a robust sexual dimorphism in the vulnerability to develop high emotionality, manifested as higher UCMS-induced emotionality z-scores, but lower corticosterone-induced scores in females compared to males. Interestingly, the distribution of individual z-scores revealed a pattern of increased baseline emotionality in female mice, reminiscent of what is observed in humans. Together, we show that the z-scoring method yields robust measures of emotionality across complementary tests for individual mice and experimental groups, hence facilitating the comparison across studies and refining the translational applicability of these models. PMID:21277897
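The integration step described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions: each measure is z-normalized against a reference (for example, control) group, the sign is flipped where a lower raw value means higher emotionality, and the adjusted z-scores are averaged per animal. The helper names `z_scores` and `integrated_score`, the `directions` parameter, and all numbers are hypothetical; the abstract does not specify these details.

```python
import statistics

def z_scores(values, reference):
    """Z-normalize values against the mean and sample SD of a reference group."""
    mu = statistics.mean(reference)
    sd = statistics.stdev(reference)
    return [(v - mu) / sd for v in values]

def integrated_score(per_test_values, per_test_reference, directions):
    """Average direction-adjusted z-scores across tests for each animal.

    directions: +1 if a higher raw value means higher emotionality, -1 otherwise
    (an assumption of this sketch).
    """
    per_test_z = [
        [d * z for z in z_scores(vals, ref)]
        for vals, ref, d in zip(per_test_values, per_test_reference, directions)
    ]
    # Transpose so that each tuple holds one animal's z-scores across tests.
    return [statistics.mean(animal) for animal in zip(*per_test_z)]

# Two hypothetical tests, two animals; test B is scored in the opposite direction.
values = [[16, 12], [4, 6]]
reference = [[10, 12, 14], [5, 6, 7]]
print(integrated_score(values, reference, directions=[1, -1]))
```

Averaging several noisy measures of the same dimension is what would be expected to reduce variance relative to any single test, which matches the benefit the abstract reports.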
Integrated behavioral z-scoring increases the sensitivity and reliability of behavioral phenotyping in mice: relevance to emotionality and sex.

PubMed

Guilloux, Jean-Philippe; Seney, Marianne; Edgar, Nicole; Sibille, Etienne

2011-04-15

Defining anxiety- and depressive-like states in mice (emotionality) is best characterized by the use of complementary tests, leading sometimes to puzzling discrepancies and lack of correlation between similar paradigms. To address this issue, we hypothesized that integrating measures along the same behavioral dimensions in different tests would reduce the intrinsic variability of single tests and provide a robust characterization of the underlying "emotionality" of individual mice, similar to the way mood and related syndromes are defined in humans through various related symptoms over time. We describe the use of simple mathematical and integrative tools to help phenotype animals across related behavioral tests (syndrome diagnosis) and experiments (meta-analysis). We applied z-normalization across complementary measures of emotionality in different behavioral tests after unpredictable chronic mild stress (UCMS) or prolonged corticosterone exposure - two approaches to induce anxious-/depressive-like states in mice. Combining z-normalized test values lowered the variance of emotionality measurement, enhanced the reliability of behavioral phenotyping, and increased analytical opportunities. Comparing integrated emotionality scores across studies revealed a robust sexual dimorphism in the vulnerability to develop high emotionality, manifested as higher UCMS-induced emotionality z-scores, but lower corticosterone-induced scores in females compared to males. Interestingly, the distribution of individual z-scores revealed a pattern of increased baseline emotionality in female mice, reminiscent of what is observed in humans. Together, we show that the z-scoring method yields robust measures of emotionality across complementary tests for individual mice and experimental groups, hence facilitating the comparison across studies and refining the translational applicability of these models.
Coopersmith Self-Esteem Inventory Scores of Boys with Severe Behavior Problems

ERIC Educational Resources Information Center

Wood, Frank H.; Johnson, Ardes

1972-01-01

Scores on the Coopersmith Self-Esteem Inventory of 44 behaviorally disturbed boys ranging in age from 8 to 12 years were compared with the test's norms, with later retest scores, with teacher assigned self esteem ranks, and with peer group status as measured by sociometric procedures. (DB)

A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

ERIC Educational Resources Information Center

Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu

2013-01-01

Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…
Clock face drawing test performance in children with ADHD.

PubMed

Ghanizadeh, Ahmad; Safavi, Salar; Berk, Michael

2013-01-01

The utility and discriminatory pattern of the clock face drawing test in ADHD is unclear. This study therefore compared Clock Face Drawing test performance in children with ADHD and controls. 95 school children with ADHD and 191 other children were matched for gender ratio and age. ADHD symptom severity was assessed using the DSM-IV ADHD checklist, and intellectual functioning was also assessed. The participants completed three clock-drawing tasks, and the following four functions were assessed: Contour score, Numbers score, Hands setting score, and Center score. All subscale scores on the three clock drawing tests were lower in the ADHD group than in the control group. In children with ADHD, inattention and hyperactivity/impulsivity scores were not related to free-drawn clock test scores. When the pre-drawn contour test was performed, the inattentiveness score was statistically associated with the Numbers score, while none of the other variables (age, gender, intellectual functioning, and hand use preference) were associated with that score. In the pre-drawn clock test, no significant association was found between ADHD symptoms and any CDT subscale. In addition, more errors were observed with the free-drawn clock and pre-drawn contour than with the pre-drawn clock. Numbers and Hands setting are more sensitive measures for screening ADHD than Contour and Center drawing. Test performance, except Hands setting, may have already reached a developmental plateau. It is probable that the Hands setting deficit in children with ADHD may not decrease from age 8 to 14 years. Performance of children with ADHD is associated with the complexity of the CDT.
Integrating the ACR Appropriateness Criteria Into the Radiology Clerkship: Comparison of Didactic Format and Group-Based Learning.

PubMed

Stein, Marjorie W; Frank, Susan J; Roberts, Jeffrey H; Finkelstein, Malka; Heo, Moonseong

2016-05-01

The aim of this study was to determine whether group-based or didactic teaching is more effective to teach ACR Appropriateness Criteria to medical students. An identical pretest, posttest, and delayed multiple-choice test was used to evaluate the efficacy of the two teaching methods. Descriptive statistics comparing test scores were obtained. On the posttest, the didactic group gained 12.5 points (P < .0001), and the group-based learning students gained 16.3 points (P < .0001). On the delayed test, the didactic group gained 14.4 points (P < .0001), and the group-based learning students gained 11.8 points (P < .001). The gains in scores on both tests were statistically significant for both groups. However, the differences in scores were not statistically significant comparing the two educational methods. Compared with didactic lectures, group-based learning is more enjoyable, time efficient, and equally efficacious. The choice of educational method can be individualized for each institution on the basis of group size, time constraints, and faculty availability.
The impact of testing accommodations on MCAT scores: descriptive results.

PubMed

Julian, Ellen R; Ingersoll, Deborah J; Etienne, Patricia M; Hilger, Anthony E

2004-04-01

Medical College Admission Test (MCAT) examinees with disabilities who receive accommodations receive flagged scores indicating nonstandard administration. This report compares MCAT examinees who received accommodations and their performances with standard examinees. Aggregate history records of all 1994-2000 MCAT examinees were identified as flagged (2,401) or standard (297,880), then further sorted by race/ethnicity (broadly identified as underrepresented minority and non-URM, at the time of testing) and gender. Those with flagged scores were also classified by disability (LD = learning disability, ADHD = attention deficit hyperactivity disorder, LD/ADHD = learning disability and attention deficit hyperactivity disorder, and Other = other disability) and type of accommodation. Mean MCAT scores were calculated for all groups. A group of 866 examinees took the MCAT first as a standard administration and subsequently with accommodations. In a separate analysis, their two sets of scores were compared. Less than 1% of examinees (2,401) had accommodations; of these, 55% were LD, 17% ADHD, 5% LD/ADHD, and 23% Other. Extended time was the most frequently provided accommodation. Mean flagged scores slightly exceeded mean standard scores on all MCAT sections. Examinees who retook the MCAT with accommodations after a standard administration increased their scores by six points, quadrupling the average gain of a Standard-Standard retest cohort from another study. The small but statistically significantly higher flagged scores may reflect either appropriate compensation or overly generous accommodations. Extended time had a positive impact on the scores of those who retested with this accommodation. The validity of the flagged MCAT in predicting success in medical school is not known, and further investigation is underway.
The developmental eye movement (DEM) test and Cantonese-speaking children in Hong Kong SAR, China.

PubMed

Pang, Peter C; Lam, Carly S; Woo, George C

2010-07-01

There is no published norm for the Developmental Eye Movement (DEM) Test for Cantonese-speaking Chinese children. This study aimed to determine the normative values of this test for Cantonese-speaking Chinese children in Hong Kong SAR and to compare the results with the published norms of English-speaking and Spanish-speaking children. Cantonese-speaking students aged from 6 to 11 years were tested by the DEM test in Cantonese and a digital recorder was used to record the process. The DEM scores for the 305 students were determined by listening again to the audio records after the test and computed by using the formula from the DEM manual, except that the 'vertical scores' were adjusted by taking the vertical errors into consideration. The results were compared with other norms that have been published. Our subjects made more vertical errors than in other normative studies and adjusted vertical scores were proposed. In both adjusted vertical and horizontal scores, the Cantonese-speaking children completed the tests much faster than the norms for English- and Spanish-speaking children, the differences of the means being significant (p < 0.0001) in all age groups. The DEM norms may be affected by differences in languages, cultures and education systems among different ethnicities. The norms of the DEM test are proposed for Cantonese-speaking children in Hong Kong SAR, China.
CrossFit athletes exhibit high symmetry of fundamental movement patterns. A cross-sectional study

PubMed Central

Tafuri, Silvio; Notarnicola, Angela; Monno, Antonello; Ferretti, Francesco; Moretti, Biagio

2016-01-01

Summary. Background: Although CrossFit training programs currently account for more than 7500 affiliated gyms in the USA and more than 2000 in Europe and involve more than 1 million people, few studies have examined the effect of CrossFit on health and sport performance. The aim of this research was to evaluate performance in 7 fundamental movement patterns using a standardized method, the Functional Movement Screen (FMS). Methods: We enrolled three groups of athletes (age 17–40 years; >6 months of training programs): CrossFitters, body builders and professional weightlifters. The FMS test was performed on all people enrolled. FMS test scores were compared across the three groups. Results: No differences among the three groups were shown in the mean score values of each test or in total score, except for the shoulder mobility test (higher among CrossFitters) and the trunk stability push-up test (higher among weightlifters). Agreement between the tests performed on the two sides was higher in the CrossFit group for hurdle step (93.2%), in-line lunge (86%), rotary stability test (95.3%) and shoulder mobility (90.7%; p<0.001). Conclusions: CrossFitters seem to have a high level of concordance in the scores achieved in bilateral tests. CrossFit seems to produce marked symmetry in some fundamental movements compared to weightlifting and bodybuilding. PMID:27331045
CrossFit athletes exhibit high symmetry of fundamental movement patterns. A cross-sectional study.

PubMed

Tafuri, Silvio; Notarnicola, Angela; Monno, Antonello; Ferretti, Francesco; Moretti, Biagio

2016-01-01

Although CrossFit training programs currently account for more than 7500 affiliated gyms in the USA and more than 2000 in Europe and involve more than 1 million people, few studies have examined the effect of CrossFit on health and sport performance. The aim of this research was to evaluate performance in 7 fundamental movement patterns using a standardized method, the Functional Movement Screen (FMS). We enrolled three groups of athletes (age 17-40 years; >6 months of training programs): CrossFitters, body builders and professional weightlifters. The FMS test was performed on all people enrolled. FMS test scores were compared across the three groups. No differences among the three groups were shown in the mean score values of each test or in total score, except for the shoulder mobility test (higher among CrossFitters) and the trunk stability push-up test (higher among weightlifters). Agreement between the tests performed on the two sides was higher in the CrossFit group for hurdle step (93.2%), in-line lunge (86%), rotary stability test (95.3%) and shoulder mobility (90.7%; p<0.001). CrossFitters seem to have a high level of concordance in the scores achieved in bilateral tests. CrossFit seems to produce marked symmetry in some fundamental movements compared to weightlifting and bodybuilding.
Academic Growth Trajectories of ELLs in NAEP Data: The Case of Fourth- and Eighth-Grade ELLs and Non-ELLs on Mathematics and Reading Tests

ERIC Educational Resources Information Center

Polat, Nihat; Zarecky-Hodge, Ashley; Schreiber, James B.

2016-01-01

Utilizing the National Assessment of Educational Progress (NAEP) data, this study examined (1) how fourth and eighth-grade ELLs' mathematics and reading scores on national tests compared to their non-ELL peers' scores over the testing period between 2003 and 2011, and (2) if gender and ethnicity contributed to variation in the growth patterns…
Are WISC IQ scores in children with mathematical learning disabilities underestimated? The influence of a specialized intervention on test performance.

PubMed

Lambert, Katharina; Spinath, Birgit

2018-01-01

Intelligence measures play a pivotal role in the diagnosis of mathematical learning disabilities (MLD). Probably as a result of math-related material in IQ tests, children with MLD often display reduced IQ scores. However, it remains unclear whether the effects of math remediation extend to IQ scores. The present study investigated the impact of a special remediation program compared to a control group receiving private tutoring (PT) on the WISC IQ scores of children with MLD. We included N=45 MLD children (7-12 years) in a study with a pre- and post-test control group design. Children received remediation for two years on average. The analyses revealed significantly greater improvements in the experimental group on the Full-Scale IQ, and the Verbal Comprehension, Perceptual Reasoning, and Working Memory indices, but not Processing Speed, compared to the PT group. Children in the experimental group showed an average WISC IQ gain of more than ten points. Results indicate that the WISC IQ scores of MLD children might be underestimated and that an effective math intervention can improve WISC IQ test performance. Taking limitations into account, we discuss the use of IQ measures more generally for defining MLD in research and practice.
Intra- and inter-observer reliability of ten major histological scoring systems used for the evaluation of in vivo cartilage repair.

PubMed

Bonasia, Davide Edoardo; Marmotti, Antongiulio; Massa, Alessandro Domenico Felice; Ferro, Andrea; Blonna, Davide; Castoldi, Filippo; Rossi, Roberto

2015-09-01

In the last two decades, many surgical techniques have been described for articular cartilage repair. Reliable histological scoring systems are fundamental tools to evaluate new procedures. Several histological scoring systems have been described, and these can be divided into elementary and comprehensive scores, according to the number of sub-items. The aim of this study was to test the inter- and intra-observer reliability of ten main scores used for the histological evaluation of in vivo cartilage repair. The authors tested the starting hypothesis that elementary scores would show superior intra- and inter-observer reliability compared with comprehensive scores. Fifty histological sections obtained from the trochlea of New Zealand rabbits and stained with Safranin-O fast green were used. The histological sections were analysed by 4 observers: 2 experienced in cartilage histology and 2 inexperienced. Histological evaluations were performed at time 1 and time 2, separated by a 30-day interval. The following scores were used: Mankin, O'Driscoll, Pineda, Wakitani, Fortier, Sellers, ICRS, ICRSII, Oswestry (OsScore) and modified O'Driscoll. Intra- and inter-observer reliability were evaluated for each score. In addition, the pavement-ceiling effect and the Bland-Altman Coefficient of Repeatability were evaluated for each sub-item of every score. Intra-observer reliability was high for all observers in every score, even though the reliability was significantly lower for non-expert observers compared with expert counterparts. In terms of Coefficient of Repeatability, some scores performed better (O'Driscoll, Modified O'Driscoll and ICRSII) than others (Fortier, Sellers). Inter-observer reliability was high for all observers in every score, but significantly lower for non-expert compared with expert observers. In expert hands, all the scores showed high intra- and inter-observer reliability, independently of the complexity. Although every score has advantages and disadvantages, ICRSII, O'Driscoll and Modified O'Driscoll scores should be preferred for the evaluation of in vivo cartilage repair in animal models.
A multinational randomised study comparing didactic lectures with case scenario in a severe sepsis medical simulation course.

PubMed

Li, Chih-Huang; Kuan, Win-Sen; Mahadevan, Malcolm; Daniel-Underwood, Lynda; Chiu, Te-Fa; Nguyen, H Bryant

2012-07-01

Medical simulation has been used to teach critical illness in a variety of settings. This study examined the effect of didactic lectures compared with simulated case scenario in a medical simulation course on the early management of severe sepsis. A prospective multicentre randomised study was performed enrolling resident physicians in emergency medicine from four hospitals in Asia. Participants were randomly assigned to a course that included didactic lectures followed by a skills workshop and simulated case scenario (lecture-first) or to a course that included a skills workshop and simulated case scenario followed by didactic lectures (simulation-first). A pre-test was given to the participants at the beginning of the course, post-test 1 was given after the didactic lectures or simulated case scenario depending on the study group assignment, then a final post-test 2 was given at the end of the course. Performance on the simulated case scenario was evaluated with a performance task checklist. 98 participants were enrolled in the study. Post-test 2 scores were significantly higher than pre-test scores in all participants (80.8 ± 12.0% vs 65.4 ± 12.2%, p<0.01). There was no difference in pre-test scores between the two study groups. The lecture-first group had significantly higher post-test 1 scores than the simulation-first group (78.8 ± 10.6% vs 71.6 ± 12.6%, p<0.01). There was no difference in post-test 2 scores between the two groups. The simulated case scenario task performance completion was 90.8% (95% CI 86.6% to 95.0%) in the lecture-first group compared with 83.8% (95% CI 79.5% to 88.1%) in the simulation-first group (p=0.02). A medical simulation course can improve resident physician knowledge in the early management of severe sepsis. Such a course should include a comprehensive curriculum that includes didactic lectures followed by simulation experience.
Comparison of the efficacy of treating sperm with low hypoosmotic swelling test scores with chymotrypsin followed by intrauterine insemination vs in vitro fertilization with intracytoplasmic sperm injection.

PubMed

Bollendorf, A; Check, D; Check, J H; Hourani, W; McMonagle, K

2011-01-01

To compare the efficacy of two treatments for sperm with low hypoosmotic swelling (HOS) test scores - intrauterine insemination (IUI) with sperm pretreated with the protein digestive enzyme chymotrypsin versus in vitro fertilization (IVF) with intracytoplasmic sperm injection (ICSI). The choice of patient therapy was optional. The pregnancy rates following two IUI cycles vs one IVF cycle with ICSI were then compared. The data were further stratified and compared according to the severity of the HOS score defect. The more severe the HOS test defect the less likely for chymotrypsin therapy to work whereas the severity did not affect IVF with ICSI success. The use of IVF with ICSI was much more effective than IUI with chymotrypsin treatment. Though IVF with ICSI is much more effective, IUI is much less expensive. Couples should be presented with these data and be allowed to make their own choice considering risks and expense versus efficacy and speed of success.
A Randomized Controlled Trial of Team-Based Learning Versus Lectures with Break-Out Groups on Knowledge Retention.

PubMed

Thrall, Grace C; Coverdale, John H; Benjamin, Sophiya; Wiggins, Anna; Lane, Christianne Joy; Pato, Michele T

2016-10-01

The goal of this study was to evaluate the efficacy of team-based learning (TBL) on knowledge retention compared to traditional lectures with small break-out group discussion (teaching as usual (TAU)) using a randomized controlled trial. This randomized controlled trial was conducted during a daylong conference for psychiatric educators on attention-deficit hyperactivity disorder and the research literacy topic of efficacy versus effectiveness trials. Learners (n = 115) were randomized with concealed allocation to either TBL or TAU. Knowledge was measured prior to the intervention, immediately afterward, and 2 months later via multiple-choice tests. Participants were necessarily unblinded. Data enterers, data analysts, and investigators were blinded to group assignment in data analysis. Per-protocol analyses of test scores were performed using change in knowledge from baseline. The primary endpoint was test scores at 2 months. At baseline, there were no statistically significant differences between groups in pre-test knowledge. At immediate post-test, both TBL and TAU groups showed improved knowledge scores compared with their baseline scores. The TBL group performed statistically better on the immediate post-test than the TAU group (Cohen's d = 0.73; p < 0.001), although the differences in knowledge scores were not educationally meaningful, averaging just one additional test question correct (out of 15). On the 2-month remote post-test, there were no group differences in knowledge retention among the 42 % of participants who returned the 2-month test. Both TBL and TAU learners acquired new knowledge at the end of the intervention and retained knowledge over 2 months. At the end of the intervention day and after 2 months, knowledge test scores were not meaningfully different between TBL and TAU completers. In conclusion, this study failed to demonstrate the superiority of TBL over TAU on the primary outcome of knowledge retention at 2 months post-intervention.
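The effect size quoted in the abstract above (Cohen's d) can be illustrated with a brief sketch. It assumes the common pooled-standard-deviation form of d for two independent groups; the abstract does not say which variant the authors used, and the function name `cohens_d` and the scores below are hypothetical.

```python
import math
import statistics

def cohens_d(group_a, group_b):
    """Cohen's d for two independent groups, using the pooled sample SD."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least 2 observations")
    var_a = statistics.variance(group_a)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

# Hypothetical post-test scores (questions correct) for two small groups.
print(cohens_d([12, 13, 14], [10, 11, 12]))
```

Because d is expressed in standard-deviation units, a sizeable d can still correspond to a small raw difference when scores vary little, which is consistent with the abstract's remark that the gap averaged only one question out of 15.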
Simple exercise test score versus cardiac stress test for the prediction of coronary artery disease in patients with type 2 diabetes.

PubMed

Pikto-Pietkiewicz, Witold; Przewłocka, Monika; Chybowska, Barbara; Cyciwa, Alona; Pasierski, Tomasz

2014-01-01

Type 2 diabetes markedly increases the risk of coronary heart disease (CHD), and screening for CHD is suggested by the guidelines. The aim of the study was to compare the diagnostic usefulness of the simple exercise test score, incorporating the clinical data and cardiac stress test results, with the standard stress test in patients with type 2 diabetes. A total of 62 consecutive patients (aged 65.4 ± 8.5 years; 32 men) with type 2 diabetes and clinical symptoms suggesting CHD underwent a stress test followed by coronary angiography. The simple score was calculated for all patients. Significant coronary stenosis was observed in 41 patients (66.1%). Stress test results were positive in 36 patients (58.1%). The mean simple score was high (65.5 ± 14.3 points). A positive linear relationship was observed between the score and the prevalence of CHD (R² = 0.19; P < 0.001) as well as its severity (R² = 0.23; P < 0.001). The area under the receiver-operating characteristic curve for the simple score was 0.74 (95% confidence interval [CI], 0.62-0.86). At the original cut-off value of 60 points, the score had a similar prognostic value to that of the standard stress test. However, in a multivariate analysis, only the simple score (odds ratio [OR], 1.46; 95% CI, 1.11-1.94; P < 0.01 for an increase in the score by 1 point) and male sex (OR, 1.57; 95% CI, 1.24-1.98; P < 0.001) remained independent predictors of CHD. In patients with type 2 diabetes, the simple score correlated with the prevalence and severity of CHD. However, the cut-off value of 60 points was inadequate in the population of diabetic patients with high risk of CHD. The simple score used instead of or together with the stress test was a better predictor of CHD than the stress test alone.
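The percentages quoted in the abstract above follow directly from the stated counts out of 62 patients. A minimal check, with a helper name chosen here for illustration:

```python
def percent(count: int, total: int) -> float:
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


TOTAL_PATIENTS = 62  # consecutive patients reported in the abstract

print(percent(41, TOTAL_PATIENTS))  # significant coronary stenosis -> 66.1
print(percent(36, TOTAL_PATIENTS))  # positive stress test -> 58.1
```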
ADAMTS13 test and/or PLASMIC clinical score in management of acquired thrombotic thrombocytopenic purpura: a cost-effective analysis.

PubMed

Kim, Chong H; Simmons, Sierra C; Williams, Lance A; Staley, Elizabeth M; Zheng, X Long; Pham, Huy P

2017-11-01

The ADAMTS13 test distinguishes thrombotic thrombocytopenic purpura (TTP) from other thrombotic microangiopathies (TMAs). The PLASMIC score helps determine the pretest probability of ADAMTS13 deficiency. Due to inherent limitations of both tests, and potential adverse effects and cost of unnecessary treatments, we performed a cost-effectiveness analysis (CEA) investigating the benefits of incorporating an in-hospital ADAMTS13 test and/or PLASMIC score into our clinical practice. A CEA model was created to compare four scenarios for patients with TMAs, utilizing either an in-house or a send-out ADAMTS13 assay with or without prior risk stratification using PLASMIC scoring. Model variables, including probabilities and costs, were gathered from the medical literature, except for the ADAMTS13 send-out and in-house tests, which were obtained from our institutional data. If only the cost is considered, in-house ADAMTS13 test for patients with intermediate- to high-risk PLASMIC score is the least expensive option ($4,732/patient). If effectiveness is assessed as measured by the number of averted deaths, send-out ADAMTS13 test is the most effective. Considering the cost/effectiveness ratio, the in-house ADAMTS13 test in patients with intermediate- to high-risk PLASMIC score is the best option, followed by the in-house ADAMTS13 test without the PLASMIC score. In patients with clinical presentations of TMAs, having an in-hospital ADAMTS13 test to promptly establish the diagnosis of TTP appears to be cost-effective. Utilizing the PLASMIC score further increases the cost-effectiveness of the in-house ADAMTS13 test. Our findings indicate the benefit of having a rapid and reliable in-house ADAMTS13 test, especially in the tertiary medical center. © 2017 AABB.
Effects of Programmed Learning Sequences on the Mathematics Test Scores of Bermudian Middle School Students

ERIC Educational Resources Information Center

Tully, Derek; Dunn, Rita; Hlawaty, Heide

2006-01-01

This research compared the effects of a Programmed Learning Sequence (PLS) (Dunn & Dunn, 1993) versus Traditional Teaching (TT) on 100 sixth-grade Bermudian students' test scores on a Fractions Unit. Fifty-three males' and forty-seven females' learning styles were identified with the "Learning Style Inventory" (LSI) (Dunn, Dunn,…
Can a Two-Question Test Be Reliable and Valid for Predicting Academic Outcomes?

ERIC Educational Resources Information Center

Bridgeman, Brent

2016-01-01

Scores on essay-based assessments that are part of standardized admissions tests are typically given relatively little weight in admissions decisions compared to the weight given to scores from multiple-choice assessments. Evidence is presented to suggest that more weight should be given to these assessments. The reliability of the writing scores…
America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

ERIC Educational Resources Information Center

Petrilli, Michael J.; Wright, Brandon L.

2016-01-01

At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…
Test Accommodations and Equating Invariance on a Fifth-Grade Science Exam

ERIC Educational Resources Information Center

Huggins, Anne Corinne; Elbaum, Batya

2013-01-01

The purpose of this study is to utilize Score Equity Assessment (SEA) to examine measurement comparability and equity in reported scores on a statewide fifth-grade science assessment with respect to groups of students defined by disability status, English Language Learner status and use of test accommodations. Benefits of SEA include a focus on…
Instructional Comparative Advantages May Exist Despite the "Comprehensive Uniformity" of Traditional Public Schools

ERIC Educational Resources Information Center

Merrifield, John

2012-01-01

A special tabulation of individual student scores from the Texas Assessment of Academic Skills (TAAS) test allowed a ranking of Texas schools according to test score changes ("value added"). The rankings varied greatly by student subpopulation. That is, the vast majority of schools are much more effective with some kinds of students than…

Reading Comprehension as a Factor in Communication with Engineers.

ERIC Educational Resources Information Center

Sacks, George A.; Sacks, Florence

A study of the reading rate and comprehension of 10 aerospace engineers and analysis of the readability of sample company communications were undertaken. The Nelson-Denny Reading Test comprehension scores for the engineers, when compared with scores of a norm group provided by the Nelson-Denny Test Manual, were nearly the same in mean and standard…
Bayesian Ideal Types: Integration of Psychometric Data for Visually Impaired Persons.

ERIC Educational Resources Information Center

Jones, W. P.

1991-01-01

A model is proposed for the clinical synthesis of data from psychological tests of persons with visual impairments. The model integrates the concepts of the ideal type and Bayesian probability and compares actual test scores with ideal scores through use of a pattern similarity coefficient. A pilot study with Business Enterprise Program operators…
Developing Local Oral Reading Fluency Cut Scores for Predicting High-Stakes Test Performance

ERIC Educational Resources Information Center

Grapin, Sally L.; Kranzler, John H.; Waldron, Nancy; Joyce-Beaulieu, Diana; Algina, James

2017-01-01

This study evaluated the classification accuracy of a second grade oral reading fluency curriculum-based measure (R-CBM) in predicting third grade state test performance. It also compared the long-term classification accuracy of local and publisher-recommended R-CBM cut scores. Participants were 266 students who were divided into a calibration…
Multiple Imputation of Item Scores in Test and Questionnaire Data, and Influence on Psychometric Results

ERIC Educational Resources Information Center

van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas

2007-01-01

The performance of five simple multiple imputation methods for dealing with missing data were compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmark, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…
Utility of TICS-M for the assessment of cognitive function in older adults.

PubMed

de Jager, Celeste A; Budge, Marc M; Clarke, Robert

2003-04-01

Routine screening of high-risk elderly people for early cognitive impairment is constrained by the limitations of currently available cognitive function tests. The Telephone Interview of Cognitive Status is a novel instrument for assessment of cognitive function that can be administered in person or by telephone. The aim was to evaluate the determinants and utility of TICS-M (13-item modified version) for assessment of cognitive function in healthy elderly people. The utility of TICS-M was compared with the more widely used MMSE and CAMCOG in a cross-sectional survey of 120 older (62 to 89 years) UK adults. The TICS-M cognitive test scores (27.97, SD 4.15) were normally distributed, in contrast with those for MMSE and CAMCOG, which had a negatively skewed distribution. TICS-M scores were inversely correlated with age (r = -0.21) and with the NART full-scale IQ (r = -0.35), but were independent of years of education in this cohort. TICS-M was highly correlated with MMSE (r = 0.57) and with CAMCOG (r = 0.62) scores. The time required to complete the test is comparable to MMSE and substantially less than CAMCOG. The normal distribution of TICS-M test scores suggests that this test is less constrained by the ceiling effect which limits the utility of MMSE and CAMCOG test scores in detecting early cognitive impairment. TICS-M is an appropriate instrument to assess cognitive function both in research and in clinical practice. Copyright 2003 John Wiley & Sons, Ltd.
Accountancy, teaching methods, sex, and American College Test scores.

PubMed

Heritage, J; Harper, B S; Harper, J P

1990-10-01

This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.
Development of an Itemwise Efficiency Scoring Method: Concurrent, Convergent, Discriminant, and Neuroimaging-Based Predictive Validity Assessed in a Large Community Sample

PubMed Central

Moore, Tyler M.; Reise, Steven P.; Roalf, David R.; Satterthwaite, Theodore D.; Davatzikos, Christos; Bilker, Warren B.; Port, Allison M.; Jackson, Chad T.; Ruparel, Kosha; Savitt, Adam P.; Baron, Robert B.; Gur, Raquel E.; Gur, Ruben C.

2016-01-01

Traditional “paper-and-pencil” testing is imprecise in measuring speed and hence limited in assessing performance efficiency, but computerized testing permits precision in measuring itemwise response time. We present a method of scoring performance efficiency (combining information from accuracy and speed) at the item level. Using a community sample of 9,498 youths age 8-21, we calculated item-level efficiency scores on four neurocognitive tests, and compared the concurrent, convergent, discriminant, and predictive validity of these scores to simple averaging of standardized speed and accuracy-summed scores. Concurrent validity was measured by the scores' abilities to distinguish men from women and their correlations with age; convergent and discriminant validity were measured by correlations with other scores inside and outside of their neurocognitive domains; predictive validity was measured by correlations with brain volume in regions associated with the specific neurocognitive abilities. Results provide support for the ability of itemwise efficiency scoring to detect signals as strong as those detected by standard efficiency scoring methods. We find no evidence of superior validity of the itemwise scores over traditional scores, but point out several advantages of the former. The itemwise efficiency scoring method shows promise as an alternative to standard efficiency scoring methods, with overall moderate support from tests of four different types of validity. This method allows the use of existing item analysis methods and provides the convenient ability to adjust the overall emphasis of accuracy versus speed in the efficiency score, thus adjusting the scoring to the real-world demands the test is aiming to fulfill. PMID:26866796
Reliability and Validity of the Italian Version of the Protocol of Orofacial Myofunctional Evaluation with Scores (I-OMES).

PubMed

Scarponi, Letizia; de Felicio, Claudia Maria; Sforza, Chiarella; Pimenta Ferreira, Claudia Lucia; Ginocchio, Daniela; Pizzorni, Nicole; Barozzi, Stefania; Mozzanica, Francesco; Schindler, Antonio

2018-05-30

To evaluate the reliability, validity, and responsiveness of the Italian OMES (I-OMES). The study consisted of 3 phases: (1) internal consistency and reliability, (2) validity, and (3) responsiveness analysis. The recruited population included 27 patients with orofacial myofunctional disorders (OMD) and 174 healthy volunteers. Forty-seven subjects, 18 healthy and all recruited patients with OMD, were assessed for inter-rater and test-retest reliability analysis. I-OMES and Nordic Orofacial Test - Screening (NOT-S) scores of the patients were correlated for concurrent validity analysis. I-OMES scores from 27 patients with OMD and 27 age- and gender-matched healthy subjects were compared to investigate construct validity. I-OMES scores before and after successful swallowing rehabilitation in patients were compared for responsiveness analysis. Adequate internal consistency (Cronbach α = 0.71) and strong inter-rater and test-retest reliability (intraclass correlation coefficient = 0.97 and 0.98, respectively) were found. I-OMES and NOT-S scores significantly and inversely correlated (r = -0.38). A statistically significant difference (p < 0.001) was found between the pathological group and the control group for the total I-OMES score. The mean I-OMES score improved from 90 (78-102) to 99 (89-103) after myofunctional rehabilitation (p < 0.001). The I-OMES is a reliable and valid tool to evaluate OMD. © 2018 S. Karger AG, Basel.
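For readers unfamiliar with the internal-consistency statistic cited above, Cronbach's α for k items is k/(k-1) times one minus the ratio of the summed item variances to the variance of the total scores. The sketch below implements that standard formula with the Python standard library. The function name and the sample ratings are hypothetical and are not I-OMES data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondent rows of item scores.

    alpha = k/(k-1) * (1 - sum(item variances) / variance(total scores)),
    using sample variances throughout.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    item_columns = list(zip(*scores))  # transpose: one tuple per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical ratings: 4 respondents x 3 items (illustration only).
demo_scores = [[3, 4, 3], [2, 2, 3], [4, 4, 5], [1, 2, 2]]
print(f"alpha = {cronbach_alpha(demo_scores):.2f}")
```

Values near 1 indicate that the items move together; the abstract's α of 0.71 is described by its authors as adequate.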
Wire-bending test as a predictor of preclinical performance by dental students.

PubMed

Kao, E C; Ngan, P W; Wilson, S; Kunovich, R

1990-10-01

Traditional Dental Aptitude Test and academic grade point average have been shown to be poor predictors of clinical performance by dental students. To refine predictors of psychomotor skills, a wire-bending test was given to 105 freshmen at the beginning of their dental education. Grades from seven restorative preclinical courses in their freshman and sophomore years were compared to scores on wire bending and the three traditional predictors: GPA, academic aptitude, and perceptual aptitude scores. Wire-bending scores correlated significantly with six out of seven preclinical restorative courses. The predictive power for preclinical performance was doubled when wire bending was added to traditional predictors in stepwise multiple regression analysis. Wire-bending scores identified students of low performance. These preliminary results suggest that the wire-bending test shows some potential as a screening test for identifying students who may have psychomotor difficulties early in their dental education.
Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries.

PubMed

Li, Liwei; Wang, Bo; Meroueh, Samy O

2011-09-26

The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the two CSAR data sets. The features used to train SVR-KB are knowledge-based pairwise potentials, while SVR-EP is based on physicochemical properties. SVR-KB and SVR-EP were compared to seven other widely used scoring functions, including Glide, X-score, GoldScore, ChemScore, Vina, Dock, and PMF. Results showed that SVR-KB trained with features obtained from three-dimensional complexes of the PDBbind data set outperformed all other scoring functions, including best performing X-score, by nearly 0.1 using three correlation coefficients, namely Pearson, Spearman, and Kendall. It was interesting that higher performance in rank ordering did not translate into greater enrichment in virtual screening assessed using the 40 targets of the Directory of Useful Decoys (DUD). To remedy this situation, a variant of SVR-KB (SVR-KBD) was developed by following a target-specific tailoring strategy that we had previously employed to derive SVM-SP. SVR-KBD showed a much higher enrichment, outperforming all other scoring functions tested, and was comparable in performance to our previously derived scoring function SVM-SP.
Principle-based structured case discussions: do they foster moral competence in medical students? - A pilot study.

PubMed

Friedrich, Orsolya; Hemmerling, Kay; Kuehlmeyer, Katja; Nörtemann, Stefanie; Fischer, Martin; Marckmann, Georg

2017-03-03

Recent findings suggest that medical students' moral competence decreases throughout medical school. This pilot study gives preliminary insights into the effects of two educational interventions in ethics classes on moral competence among medical students in Munich, Germany. Between 2012 and 2013, medical students were tested using Lind's Moral Competence Test (MCT) prior to and after completing different ethics classes. The experimental group (EG, N = 76) participated in principle-based structured case discussions (PBSCDs) and was compared with a control group with theory-based case discussions (TBCDs) (CG, N = 55). The pre/post C-scores were compared using a Wilcoxon Test, ANOVA and effect-size calculation. The C-score improved by around 3.2 C-points in the EG, and by 0.2 C-points in the CG. The mean C-score difference was not statistically significant for the EG (P = 0.14) or between the two groups (P = 0.34). There was no statistical significance for the teachers' influence (P = 0.54) on C-score. In both groups, students with below-average (M = 29.1) C-scores improved and students with above-average C-scores regressed. The increase of the C-Index was greater in the EG than in the CG. The absolute effect-size of the EG compared with the CG was 3.0 C-points, indicating a relevant effect. Teaching ethics with PBSCDs did not provide a statistically significant influence on students' moral competence, compared with TBCDs. Yet, the effect size suggests that PBSCDs may improve moral competence among medical students more effectively. Further research with larger and completely randomized samples is needed to gain definite explanations for the results.
Web-based versus traditional lecture: are they equally effective as a flexible bronchoscopy teaching method?

PubMed

Mata, Caio Augusto Sterse; Ota, Luiz Hirotoshi; Suzuki, Iunis; Telles, Adriana; Miotto, Andre; Leão, Luiz Eduardo Vilaça

2012-01-01

This study compares the traditional live lecture to a web-based approach in the teaching of bronchoscopy and evaluates the positive and negative aspects of both methods. We developed a web-based bronchoscopy curriculum, which integrates texts, images and animations. It was applied to first-year interns, who were later administered a multiple-choice test. Another group of eight first-year interns received the traditional teaching method and the same test. The two groups were compared using the Student's t-test. The mean scores (± SD) of students who used the website were 14.63 ± 1.41 (range 13-17). The test scores of the other group had the same range, with a mean score of 14.75 ± 1. The Student's t-test showed no difference between the test results. The common positive point noted was the presence of multimedia content. The web group cited as positive the ability to review the pages, and the other one the role of the teacher. Web-based bronchoscopy education showed results similar to the traditional live lecture in effectiveness.
Dividing the Force Concept Inventory into two equivalent half-length tests

NASA Astrophysics Data System (ADS)

Han, Jing; Bao, Lei; Chen, Li; Cai, Tianfang; Pi, Yuan; Zhou, Shaona; Tu, Yan; Koenig, Kathleen

2015-06-01

The Force Concept Inventory (FCI) is a 30-question multiple-choice assessment that has been a building block for much of the physics education research done today. In practice, there are often concerns regarding the length of the test and possible test-retest effects. Since many studies in the literature use the mean score of the FCI as the primary variable, it would be useful then to have different shorter tests that can produce FCI-equivalent scores while providing the benefits of being quicker to administer and overcoming the test-retest effects. In this study, we divide the 1995 version of the FCI into two half-length tests; each contains a different subset of the original FCI questions. The two new tests are shorter, still cover the same set of concepts, and produce mean scores equivalent to those of the FCI. Using a large quantitative data set collected at a large midwestern university, we statistically compare the assessment features of the two half-length tests and the full-length FCI. The results show that the mean error of equivalent scores between any two of the three tests is within 3%. Scores from all tests are well correlated. Based on the analysis, it appears that the two half-length tests can be a viable option for score-based assessments that need to administer tests quickly or need to measure short-term gains where using identical pre- and post-test questions is a concern.
Equivalence of Laptop and Tablet Administrations of the Minnesota Multiphasic Personality Inventory-2 Restructured Form.

PubMed

Menton, William H; Crighton, Adam H; Tarescavage, Anthony M; Marek, Ryan J; Hicks, Adam D; Ben-Porath, Yossef S

2017-06-01

The present study investigated the comparability of laptop computer- and tablet-based administration modes for the Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF). Employing a counterbalanced within-subjects design, the MMPI-2-RF was administered via both modes to a sample of college undergraduates (N = 133). Administration modes were compared in terms of mean scale scores, internal consistency, test-retest consistency, external validity, and administration time. Mean scores were generally similar, and scores produced via both methods appeared approximately equal in terms of internal consistency and test-retest consistency. Scores from the two modalities also evidenced highly similar patterns of associations with external criteria. Notably, tablet administration of the MMPI-2-RF was substantially longer than laptop administration in the present study (mean difference 7.2 minutes, Cohen's d = .95). Overall, results suggest that varying administration mode between laptop and tablet has a negligible influence on MMPI-2-RF scores, providing evidence that these modes of administration can be considered psychometrically equivalent.
Examining Exam Reviews: A Comparison of Exam Scores and Attitudes

ERIC Educational Resources Information Center

Hackathorn, Jana; Cornell, Kathryn; Garczynski, Amy M.; Solomon, Erin D.; Blankmeyer, Katheryn E.; Tennial, Rachel E.

2012-01-01

Instructors commonly use exam reviews to help students prepare for exams and to increase student success. The current study compared the effects of traditional, trivia, and practice test-based exam reviews on actual exam scores, as well as students' attitudes toward each review. Findings suggested that students' exam scores were significantly…
Evaluating Score Equity Assessment for State NAEP

ERIC Educational Resources Information Center

Wells, Craig S.; Baldwin, Su; Hambleton, Ronald K.; Sireci, Stephen G.; Karatonis, Ana; Jirka, Stephen

2009-01-01

Score equity assessment is an important analysis to ensure inferences drawn from test scores are comparable across subgroups of examinees. The purpose of the present evaluation was to assess the extent to which the Grade 8 NAEP Math and Reading assessments for 2005 were equivalent across selected states. More specifically, the present study…
An Evaluation of the IntelliMetric[SM] Essay Scoring System

ERIC Educational Resources Information Center

Rudner, Lawrence M.; Garcia, Veronica; Welch, Catherine

2006-01-01

This report provides a two-part evaluation of the IntelliMetric[SM] automated essay scoring system based on its performance scoring essays from the Analytic Writing Assessment of the Graduate Management Admission Test[TM] (GMAT[TM]). The IntelliMetric system performance is first compared to that of individual human raters, a Bayesian system…
Biases and power for groups comparison on subjective health measurements.

PubMed

Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique

2012-01-01

Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald's test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative.
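The CTT-based comparison described above is a two-sample t-test on observed scores. As a minimal sketch of that step only (not the simulation design or the Rasch-model analyses), the pooled-variance t statistic can be computed as follows. The function name and the sample scores are hypothetical.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t_statistic(group_a, group_b):
    """Student's two-sample t statistic assuming equal variances."""
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least 2 observations")
    # Pooled sample variance weighted by each group's degrees of freedom.
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    standard_error = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / standard_error


# Hypothetical questionnaire sum scores for two independent patient groups.
scores_group_1 = [12, 15, 11, 14, 13]
scores_group_2 = [10, 9, 12, 11, 8]
t = pooled_t_statistic(scores_group_1, scores_group_2)
print(f"t = {t:.3f} with {len(scores_group_1) + len(scores_group_2) - 2} df")
```

The statistic would then be compared against a t distribution with n_a + n_b - 2 degrees of freedom to obtain a p-value.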
Comparison of the performance of first-grade and mentally retarded students on the Peabody Mathematics Readiness Test.

PubMed

Richardson, L I; Thurman, R L; Bassler, O C

1978-07-01

The Peabody Mathematics Readiness Test was developed to assess mathematics readiness and identify children who would encounter difficulty in first-grade mathematics. In the present study, we compared performances of mentally retarded subjects and first-grade subjects on this test. Retarded subjects' mean scores were significantly lower than those of the nonretarded subjects on the drawing test; however, there were no significant differences between the mean scores of the groups on the other five subscales.
The American Education Diet: Can U.S. Students Survive on Junk Food?

ERIC Educational Resources Information Center

DeSchryver, Dave

U.S. student scores in science compare unfavorably with those of other nations, and other standardized test scores by U.S. students are also comparatively lower. Polls of parents indicate dissatisfaction with U.S. education. College teachers and employers feel that high school graduates are weak in skills. This document offers negative perceptions…

Using the Teach-Back Method in Patient Education to Improve Patient Satisfaction.

PubMed

Centrella-Nigro, Andrea M; Alexander, Catherine

2017-01-01

This quasi-experimental research study used two similar nursing units to test the effects of teach back on Hospital Consumer Assessment of Healthcare Providers and Systems (HCAHPS) scores. A pretest-posttest design tested 24 nurses' knowledge, attitudes, and beliefs about teach back. Education specialists provided a 1-hour teaching session on teach back to all nurses in the intervention unit. A significant improvement in knowledge scores in the pretest-posttest was found using paired t tests (p = .002). Qualitative analysis of nurses' comments demonstrated strong support for teach back in the post-test. The HCAHPS scores were not significantly improved in the intervention unit when compared with the control unit. More research needs to be conducted to determine the effectiveness of teach back on HCAHPS scores. J Contin Educ Nurs. 2017;48(1):47-52. Copyright 2017, SLACK Incorporated.
Measurement of hepatic functional mass by means of 13C-methacetin and 13C-phenylalanine breath tests in chronic liver disease: Comparison with Child-Pugh score and serum bile acid levels

PubMed Central

Festi, D.; Capodicasa, S.; Sandri, L.; Colaiocco-Ferrante, L.; Staniscia, T.; Vitacolonna, E.; Vestito, A.; Simoni, P.; Mazzella, G.; Portincasa, P.; Roda, E.; Colecchia, A.

2005-01-01

AIM: To evaluate and compare the clinical usefulness of 13C-phenylalanine and 13C-methacetin breath tests in quantitating functional hepatic mass in patients with chronic liver disease and to further compare these results with those of conventional tests, Child-Pugh score and serum bile acid levels. METHODS: One hundred and forty patients (50 HCV-related chronic hepatitis, 90 liver cirrhosis patients) and 40 matched healthy controls were studied. Both breath test and routine liver test, serum levels of cholic and chenodeoxycholic acid conjugates were evaluated. RESULTS: Methacetin breath test, expressed as 60 min cumulative percent of oxidation, discriminated the hepatic functional capacity not only between controls and liver disease patients, but also between different categories of chronic liver disease patients. Methacetin breath test was correlated with liver function tests and serum bile acids. Furthermore, methacetin breath test, as well as serum bile acids, were highly predictive of Child-Pugh scores. The diagnostic power of phenylalanine breath test was always less than that of methacetin breath test. CONCLUSION: Methacetin breath test represents a safe and accurate diagnostic tool in the evaluation of hepatic functional mass in chronic liver disease patients. PMID:15609414
Assessment of body-powered upper limb prostheses by able-bodied subjects, using the Box and Blocks Test and the Nine-Hole Peg Test.

PubMed

Haverkate, Liz; Smit, Gerwin; Plettenburg, Dick H

2016-02-01

The functional performance of currently available body-powered prostheses is unknown. The goal of this study was to objectively assess and compare the functional performance of three commonly used body-powered upper limb terminal devices. Experimental trial. A total of 21 able-bodied subjects (n = 21, age = 22 ± 2) tested three different terminal devices: TRS voluntary closing Hook Grip 2S, Otto Bock voluntary opening hand and Hosmer Model 5XA hook, using a prosthesis simulator. All subjects used each terminal device nine times in two functional tests: the Nine-Hole Peg Test and the Box and Blocks Test. Significant differences were found between the different terminal devices and their scores on the Nine-Hole Peg Test and the Box and Blocks Test. The Hosmer hook scored best in both tests. The TRS Hook Grip 2S scored second best. The Otto Bock hand showed the lowest scores. This study is a first step in the comparison of functional performances of body-powered prostheses. The data can be used as a reference value, to assess the performance of a terminal device or an amputee. The measured scores enable the comparison of the performance of a prosthesis user and his or her terminal device relative to standard scores. © The International Society for Prosthetics and Orthotics 2014.
Persian version of frontal assessment battery: Correlations with formal measures of executive functioning and providing normative data for Persian population.

PubMed

Asaadi, Sina; Ashrafi, Farzad; Omidbeigi, Mahmoud; Nasiri, Zahra; Pakdaman, Hossein; Amini-Harandi, Ali

2016-01-05

Cognitive impairment in patients with Parkinson's disease (PD) mainly involves executive function (EF). The frontal assessment battery (FAB) is an efficient tool for the assessment of EFs. The aims of this study were to determine the validity and reliability of the Persian version of FAB, to assess its correlation with formal measures of EFs, and to provide normative data for the Persian version of FAB in patients with PD. The study recruited 149 healthy participants and 49 patients with idiopathic PD. In PD patients, FAB results were compared to their performance on EF tests. Reliability analysis involved test-retest reliability and internal consistency, whereas validity analysis involved a convergent validity approach. FAB scores were compared between normal controls and PD patients matched for age, education, and Mini-Mental State Examination (MMSE) score. In PD patients, FAB scores were significantly decreased compared to normal controls, and correlated with the Stroop test and Wisconsin Card Sorting Test (WCST). In healthy subjects, FAB scores varied according to age, education, and MMSE. In the FAB subtest analysis, PD patients performed worse than the healthy participants on similarities, fluency tasks, and Luria's motor series. The Persian version of FAB could be used as a reliable scale for the assessment of frontal lobe functions in Iranian patients with PD. Furthermore, normative data provided for the Persian version of this test improve the accuracy and confidence in the clinical application of the FAB.
37: COMPARISON OF TWO METHODS: TBL-BASED AND LECTURE-BASED LEARNING IN NURSING CARE OF PATIENTS WITH DIABETES IN NURSING STUDENTS

PubMed Central

Khodaveisi, Masoud; Qaderian, Khosro; Oshvandi, Khodayar; Soltanian, Ali Reza; Vardanjani, Mehdi molavi

2017-01-01

Background and aims: Learning plays an important role in developing nursing skills and proper care-taking. The present study aims to evaluate two learning methods, team-based learning and lecture-based learning, in teaching the care of patients with diabetes to nursing students. Method: In this quasi-experimental study, 64 term-4 students in the nursing college of Bukan and Miandoab were included. A knowledge and performance questionnaire, comprising 15 knowledge questions and 5 performance questions on the care of patients with diabetes, was used as the data collection tool; its reliability was confirmed by the researcher using Cronbach's alpha (0.83). The paired t-test was used to compare the mean knowledge and performance scores within each group between the pre-test and post-test steps, and the independent t-test was used to compare mean scores between the control and intervention groups. Results: There was no statistically significant difference between the two groups in pre-test knowledge and performance scores (p=0.784). There was a significant difference between the team-based learning group and the lecture-based learning group in mean post-test knowledge and performance scores (p=0.001). There was a significant difference between the mean pre-test and post-test scores of knowledge of diabetes care in the learning groups (p=0.001). Conclusion: Both the team-based and lecture-based learning approaches improved students' learning, but the rate of learning was greater with team-based learning than with lecture-based learning, and it is recommended that this method be used as a higher education method in the education of students.
Sensitivity of a computer adaptive assessment for measuring functional mobility changes in children enrolled in a community fitness programme.

PubMed

Haley, Stephen M; Fragala-Pinkham, Maria; Ni, Pengsheng

2006-07-01

To examine the relative sensitivity to detect functional mobility changes with a full-length parent questionnaire compared with a computerized adaptive testing version of the questionnaire after a 16-week group fitness programme. Prospective, pre- and posttest study with a 16-week group fitness intervention. Three community-based fitness centres. Convenience sample of children (n = 28) with physical or developmental disabilities. A 16-week group exercise programme held twice a week in a community setting. A full-length (161 items) paper version of a mobility parent questionnaire based on the Pediatric Evaluation of Disability Inventory, but expanded to include expected skills of children up to 15 years old was compared with a 15-item computer adaptive testing version. Both measures were administered at pre- and posttest intervals. Both the full-length Pediatric Evaluation of Disability Inventory and the 15-item computer adaptive testing version detected significant changes between pre- and posttest scores, had large effect sizes, and standardized response means, with a modest decrease in the computer adaptive test as compared with the 161-item paper version. Correlations between the computer adaptive and paper formats across pre- and posttest scores ranged from r = 0.76 to 0.86. Both functional mobility test versions were able to detect positive functional changes at the end of the intervention period. Greater variability in score estimates was generated by the computerized adaptive testing version, which led to a relative reduction in sensitivity as defined by the standardized response mean. Extreme scores were generally more difficult for the computer adaptive format to estimate with as much accuracy as scores in the mid-range of the scale. However, the reduction in accuracy and sensitivity, which did not influence the group effect results in this study, is counterbalanced by the large reduction in testing burden.
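The abstract above defines sensitivity through the standardized response mean (SRM) but does not spell out the formula. A minimal sketch follows, assuming the common definition (mean of the paired change scores divided by the sample standard deviation of those changes). The function name and the pre/post scores are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev


def standardized_response_mean(pre, post):
    """Mean paired change divided by the sample SD of the changes.

    Assumed definition; the abstract does not state the formula it used.
    """
    if len(pre) != len(post):
        raise ValueError("pre and post must have the same length")
    changes = [after - before for before, after in zip(pre, post)]
    return mean(changes) / stdev(changes)


# Hypothetical scores for four children (illustration only, not study data).
pre_scores = [40.0, 45.0, 50.0, 55.0]
post_scores = [46.0, 49.0, 58.0, 57.0]

srm = standardized_response_mean(pre_scores, post_scores)
print(f"SRM = {srm:.2f}")
```

Under this definition, the same mean change gives a smaller SRM when the change scores vary more, which matches the abstract's observation that greater variability in the adaptive version's score estimates reduced its relative sensitivity.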
Nutritional screening of patients at a memory clinic--association between patients' and their relatives' self-reports.

PubMed

Lyngroth, Anne Liv; Hernes, Susanne Miriam Sørensen; Madsen, Bengt-Ove; Söderhamn, Ulrika; Grov, Ellen Karine

2016-03-01

To compare individual reports by patients and relatives (proxy) of the Nutritional Form For the Elderly and relate the Nutritional Form For the Elderly scores to Mini Mental Status Examination scores, weight loss, Body Mass Index, five-point Clock Drawing Test and background variables. Undernutrition or risk of undernutrition is a significant problem among people with dementia. A poor nutritional state increases the risk of infections, delayed convalescence after acute illness and reduced quality of life. A cross-sectional study. Application of the Nutritional Form For the Elderly in addition to clinical nutrition parameters and cognitive tests in a memory clinic among 213 persons referred for assessment due to possible cognitive impairment or dementia. Patients' and proxy Nutritional Form For the Elderly scores yielded comparable results. Nutritional Form For the Elderly scores ≥6 (medium to high risk of undernutrition) were found in 32% of the patients vs. 43% of proxy. Mean Mini Mental Status Examination score was 23·2 (SD 4·5) and 50% failed the Clock Drawing Test. Involuntary weight loss was reported by 42% of the patients, and in 26% of the patients, Body Mass Index values were below 22 kg/m², indicating undernutrition. By regression analysis, Clock Drawing Test (p = 0·019) and Mini Mental Status Examination (p = 0·04) might predict the risk of reduced nutritional status. The study demonstrates that a significant proportion of patients at our memory clinic were at nutritional risk. Corresponding results exist between patients' and proxy Nutritional Form For the Elderly scores; however, the patients assessed themselves as better nourished than the proxy assessment indicated. The discrepancies seem to increase with more severe cognitive impairment. Females and single-dwelling individuals were at higher risk of undernutrition compared to males and cohabitants.
Self-reporting and proxy-rating both seem applicable for nutritional screening among people with moderate cognitive impairment. Cognitive decline seems to affect the accuracy when patients rate themselves. A reduced Mini Mental Status Examination and/or failed Clock Drawing Test might predict the risk of undernutrition. © 2016 John Wiley & Sons Ltd.
Randomized, multicenter, comparative study of NEURO versus CIMT in poststroke patients with upper limb hemiparesis: the NEURO-VERIFY Study.

PubMed

Abo, Masahiro; Kakuda, Wataru; Momosaki, Ryo; Harashima, Hiroaki; Kojima, Miki; Watanabe, Shigeto; Sato, Toshihiro; Yokoi, Aki; Umemori, Takuma; Sasanuma, Jinichi

2014-07-01

Many poststroke patients suffer functional motor limitation of the affected upper limb, which is associated with diminished health-related quality of life. The aim of this study is to conduct a randomized, multicenter, comparative study of low-frequency repetitive transcranial magnetic stimulation combined with intensive occupational therapy, NEURO (NovEl intervention Using Repetitive TMS and intensive Occupational therapy) versus constraint-induced movement therapy in poststroke patients with upper limb hemiparesis. In this randomized controlled study of NEURO and constraint-induced movement therapy, 66 poststroke patients with upper limb hemiparesis were randomly assigned at 2:1 ratio to low-frequency repetitive transcranial magnetic stimulation plus occupational therapy (NEURO group) or constraint-induced movement therapy (constraint-induced movement therapy group) for 15 days. Fugl-Meyer Assessment and Wolf Motor Function Test and Functional Ability Score of Wolf Motor Function Test were used for assessment. No differences in patients' characteristics were found between the two groups at baseline. The Fugl-Meyer Assessment score was significantly higher in both groups after the 15-day treatment compared with the baseline. Changes in Fugl-Meyer Assessment scores and Functional Ability Score of Wolf Motor Function Test were significantly higher in the NEURO group than in the constraint-induced movement therapy group, whereas the decrease in the Wolf Motor Function Test log performance time was comparable between the two groups (changes in Fugl-Meyer Assessment score, NEURO: 5·39 ± 4·28, constraint-induced movement therapy: 3·09 ± 4·50 points; mean ± standard error of the mean; P < 0·05) (changes in Functional Ability Score of Wolf Motor Function Test, NEURO: 3·98 ± 2·99, constraint-induced movement therapy: 2·09 ± 2·96 points; P < 0·05). 
The results of the 15-day rehabilitative protocol showed the superiority of NEURO relative to constraint-induced movement therapy; NEURO improved the motion of the whole upper limb and resulted in functional improvement in activities of daily living. © 2013 The Authors. International Journal of Stroke © 2013 World Stroke Organization.
Detailed analysis of the Japanese version of the Rapid Dementia Screening Test, revised version.

PubMed

Moriyama, Yasushi; Yoshino, Aihide; Muramatsu, Taro; Mimura, Masaru

2017-11-01

The number-transcoding task on the Japanese version of the Rapid Dementia Screening Test (RDST-J) requires mutual conversion between Arabic and Chinese numerals (209 to , 4054 to , to 681, to 2027). In this task, question and answer styles of Chinese numerals are written horizontally. We investigated the impact of changing the task so that Chinese numerals are written vertically. Subjects were 211 patients with very mild to severe Alzheimer's disease and 42 normal controls. Mini-Mental State Examination scores ranged from 26 to 12, and Clinical Dementia Rating scores ranged from 0.5 to 3. Scores of all four subtasks of the transcoding task significantly improved in the revised version compared with the original version. The sensitivity and specificity of total scores ≥9 for discriminating between controls and subjects with Clinical Dementia Rating scores of 0.5 were 63.8% and 76.6% on the original version and 60.1% and 85.8% on the revised version. The revised RDST-J total score had lower sensitivity and higher specificity than the original RDST-J for discriminating subjects with Clinical Dementia Rating scores of 0.5 from controls. © 2017 Japanese Psychogeriatric Society.
Making Infection Prevention Education Interactive Can Enhance Knowledge and Improve Outcomes: Results from the Targeted Infection Prevention (TIP) study

PubMed Central

Koo, Evonne; McNamara, Sara; Lansing, Bonnie; Olmsted, Russell N.; Rye, Ruth Anne; Fitzgerald, Thomas; Mody, Lona

2016-01-01

Objectives To assess effectiveness of an interactive educational program in increasing knowledge of key infection prevention and control (IPC) principles with emphasis on indwelling device care, hand hygiene and multi-drug resistant organisms (MDROs) among nursing home (NH) healthcare personnel (HCP). Methods We conducted a multi-modal randomized-controlled study involving HCP at 12 NHs. Ten comprehensive and interactive modules covered common IPC topics. We compared: a) intervention and control scores to assess differences in pre-test scores as a result of field interventions; b) pre- and post-test scores to assess knowledge gain and c) magnitude of knowledge gain based on job categories. Results 4,962 tests were returned over the course of the intervention with 389–633 HCP/module. Participants were mostly female certified nursing assistants (CNAs). Score improvement was highest for modules emphasizing hand hygiene, urinary catheter care and MDROs (15.6%, 15.95%, and 22.0%, respectively). After adjusting for cluster study design, knowledge scores were significantly higher after each educational module, suggesting the education delivery method was effective. When compared to CNAs, nursing and rehabilitation personnel scored significantly higher in their knowledge tests. Conclusion Our intervention significantly improved IPC knowledge in HCP, especially for those involved in direct patient care. This increase in knowledge along with preemptive barrier precautions and active surveillance has enhanced resident safety by reducing MDROs and infections in high-risk NH residents. PMID:27553671
Effects of coordination and manipulation therapy for patients with Parkinson disease.

PubMed

Zhao, Mingming; Hu, Caiyou; Wu, Zhixin; Chen, Yu; Li, Zhengming; Zhang, Mingsheng

2017-09-01

To determine the effects of a new exercise training regimen, i.e. coordination and manipulation therapy (CMT), on motor, balance, and cardiac functions in patients with Parkinson disease (PD). We divided 36 PD patients into the CMT (n = 22) and control (n = 14) groups. The patients in the CMT group performed dry-land swimming (imitation of the breaststroke) and paraspinal muscle stretching for 30 min/workday for 1 year. The control subjects did not exercise regularly. The same medication regimen was maintained in both groups during the study. Clinical characteristics, Unified Parkinson's Disease Rating Scale (UPDRS) scores, Berg balance scale (BBS) scores, mechanical balance measurements, timed up and go (TUG) test, and left ventricular ejection fraction (LVEF) were compared at 0 (baseline), 6, and 12 months. Biochemical test results were compared at 0 and 12 months. The primary outcome was motor ability. The secondary outcome was cardiac function. In the CMT group, UPDRS scores significantly improved, TUG test time and step number significantly decreased, BBS scores significantly increased, and most mechanical balance measurements significantly improved after 1 year of regular exercise therapy (all p < 0.05). In the control group, UPDRS scores significantly deteriorated, TUG test time and step number significantly increased, BBS scores significantly decreased, and most mechanical balance measurements significantly worsened after 1 year (all P < 0.05). LVEF improved in the CMT group only (P = 0.01). This preliminary study suggests that CMT effectively improved mobility disorder, balance, and cardiac function in PD patients over a 1-year period.
Comparing State and District Test Results to National Norms: Interpretations of Scoring "Above the National Average."

ERIC Educational Resources Information Center

Linn, Robert L.; And Others

Norm-referenced test results reported by states and school districts and factors related to those scores were studied through mail and telephone surveys of 35 states and a nationally representative sample of 153 school districts to determine the degree to which "above average" results were being reported. Part of the stimulus for this…
The Perceptions of Standardized Tests, Academic Self-Efficacy, and Academic Performance of African American Graduate Students: a Correlational and Comparative Analysis

ERIC Educational Resources Information Center

Marrah, Arleezah K.

2012-01-01

The academic performance of African American students continues to be a concern for educators, researchers, and most importantly their community. This issue is particularly prevalent in the standardized test scores of African American students where they score on average one or more standard deviations below their Caucasian and Asian American…
Comparing Standardized Test Scores among Arts-Integrated and Non-Arts Integrated Schools in Central Mississippi

ERIC Educational Resources Information Center

Dean, Darlene

2014-01-01

The topic of arts integration creates continuing dialog among educators and arts advocates. This study examined the degree to which student achievement was affected when arts education is limited or eliminated from schools to meet the mandates of NCLB (2001) legislation. Standardized test scores from 12 schools in Central Mississippi were used to…
FUNCTIONAL PERFORMANCE AND KNEE LAXITY IN NORMAL INDIVIDUALS AND IN INDIVIDUALS SUBMITTED TO ANTERIOR CRUCIATE LIGAMENT RECONSTRUCTION

PubMed Central

de Vasconcelos, Rodrigo Antunes; Bevilaqua-Grossi, Débora; Shimano, Antonio Carlos; Jansen Paccola, Cleber Antonio; Salvini, Tânia Fátima; Prado, Christiane Lanatovits; Mello Junior, Wilson A.

2015-01-01

The aim of this study was to analyze the correlation between deficits in the isokinetic peak torque of the knee extensors and flexors with hop tests, postoperative knee laxity and functional scores in normal and ACL-reconstructed subjects with patellar tendon and hamstring tendon autografts. Methods: Sixty male subjects were enrolled and subdivided into three groups: twenty subjects without knee injuries (GC group) and two groups of 20 subjects submitted to ACL reconstruction with patellar tendon (GTP group) and hamstring autografts (GTF group). Results: The results showed a significant correlation between knee extensor peak torque and performance in the hop tests for the GTF and GC groups. There were no significant correlations between postoperative knee laxity and Lysholm score compared with the hop tests and peak torque deficits. Concerning the differences between groups, the GTP group showed greater peak torque deficits in knee extensors, worse Lysholm scores and a higher percentage of individuals with lower limb symmetry index (ISM) < 90% in both hop tests when compared to the other two groups. Conclusion: Using only one measurement instrument for the functional evaluation of ACL-reconstructed patients is not recommended, because significant correlations between peak torque, the subject's functional score, knee laxity and hop tests were not observed in all groups. PMID:26998464
Trends in performance on the psychiatry resident-in-training examination (PRITE®): 10 years of data from a single institution.

PubMed

Cooke, Brian K; Garvan, Cynthia; Hobbs, Jacqueline A

2013-07-01

The purpose of this study was to examine trends in the Psychiatry Resident-In-Training Examination (PRITE®) scores at one institution from 2001 to 2010. The authors hypothesized that two factors, the 2003 implementation of the Accreditation Council for Graduate Medical Education (ACGME) duty-hour restrictions and the residency program's 2008 restructuring of its curriculum to a half-day per week of didactics, would lead to improved scores. Residents in the general psychiatry program at the University of Florida College of Medicine from 2001 to 2010 were included in this study. To examine the effect of the 2003 ACGME duty-hours change, the authors compared test results from 2001-2002 and 2003-2010. To examine the effect of the 2008 didactic restructuring, they compared test results from 2001-2007 and 2008-2010. There were 288 PRITE test scores from 2001 to 2010. The authors did not find a statistical difference between test results before and after the 2003 implementation of ACGME duty-hour restrictions or between test results before and after the 2008 restructuring of residency didactics. The hypothesis was rejected. The results of the literature review suggest that examination scores are affected by other elements of residency training.
A two-factor theory for concussion assessment using ImPACT: memory and speed.

PubMed

Schatz, Philip; Maerlender, Arthur

2013-12-01

We present the initial validation of a two-factor structure of Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) using ImPACT composite scores and document the reliability and validity of this factor structure. Factor analyses were conducted for baseline (N = 21,537) and post-concussion (N = 560) data, yielding "Memory" (Verbal and Visual) and "Speed" (Visual Motor Speed and Reaction Time) Factors; inclusion of Total Symptom Scores resulted in a third discrete factor. Speed and Memory z-scores were calculated, and test-retest reliability (using intra-class correlation coefficients) at 1 month (0.88/0.81), 1 year (0.85/0.75), and 2 years (0.76/0.74) were higher than published data using Composite scores. Speed and Memory scores yielded 89% sensitivity and 70% specificity, which was higher than composites (80%/62%) and comparable with subscales (91%/69%). This emergent two-factor structure has improved test-retest reliability with no loss of sensitivity/specificity and may improve understanding and interpretability of ImPACT test results.
Rehearsal significantly improves immediate and delayed recall on the Rey Auditory Verbal Learning Test.

PubMed

Hessen, Erik

2011-10-01

A repeated observation during memory assessment with the Rey Auditory Verbal Learning Test (RAVLT) is that patients who spontaneously employ a memory rehearsal strategy by repeating the word list more than once achieve better scores than patients who only repeat the word list once. This observation led to concern about the ability of the standard test procedure of RAVLT and similar tests in eliciting the best possible recall scores. The purpose of the present study was to test the hypothesis that a rehearsal recall strategy of repeating the word list more than once would result in improved scores of recall on the RAVLT. We report on differences in outcome after standard administration and after experimental administration on Immediate and Delayed Recall measures from the RAVLT of 50 patients. The experimental administration resulted in significantly improved scores for all the variables employed. Additionally, it was found that patients who failed effort screening showed significantly poorer improvement on Delayed Recall compared with those who passed the effort screening. The general clear improvement both in raw scores and T-scores demonstrates that recall performance can be significantly influenced by the strategy of the patient or by small variations in instructions by the examiner.
Primary Arthrodesis versus Open Reduction and Internal Fixation for Low-Energy Lisfranc Injuries in a Young Athletic Population.

PubMed

Cochran, Grant; Renninger, Christopher; Tompane, Trevor; Bellamy, Joseph; Kuhn, Kevin

2017-09-01

There are 2 Level I studies comparing open reduction and internal fixation (ORIF) and primary arthrodesis (PA) in high-energy Lisfranc injuries. There are no studies comparing ORIF and PA in young athletic patients with low-energy injuries. All operatively managed low-energy Lisfranc injuries sustained by active duty military personnel at a single institution were identified from 2010 to 2015. The injury pattern, method of treatment, and complications were reviewed. Implant removal rates, fitness test scores, return to military duty rates, and Foot and Ankle Ability Measure (FAAM) scores were compared. Thirty-two patients were identified, with an average age of 28 years. PA was performed in 14 patients and ORIF in 18. The PA group returned to full duty at an average of 4.5 months whereas the ORIF group returned at an average of 6.7 months (P = .0066). The PA group ran their fitness test an average of 9 seconds per mile slower than their preoperative average whereas the ORIF group ran it an average of 39 seconds slower per mile (P = .032). There were no differences between the 2 groups in the FAAM scores at an average of 35 months. Implant removal was performed in 15 (83%) in the ORIF group and 2 (14%) in the PA group (P = .005). Low-energy Lisfranc injuries treated with primary arthrodesis had a lower implant removal rate, an earlier return to full military activity, and better fitness test scores after 1 year, but there was no difference in FAAM scores after 3 years. Level III, comparative cohort study.
[Application of the dizziness handicap inventory in the patients with benign paroxysmal positional vertigo].

PubMed

Wang, L Y; Peng, H; Huang, W N; Gao, B

2016-04-20

Objective: This study was designed to observe the dizziness handicap inventory (DHI) scores in patients with BPPV (benign paroxysmal positional vertigo) before and after maneuver repositioning and aimed to discuss the value of DHI scores in the diagnosis and treatment of BPPV. Method: Charts of 72 patients with BPPV diagnosed by positioning test were reviewed. Four DHI scores were used, including the total score (DHIT), the functional score (DHIF), the emotional score (DHIE), and the physical score (DHIP). We compared the pre-repositioning and post-repositioning DHI scores of patients, and also compared the DHI scores of patients with and without residual dizziness. Result: All 72 patients underwent maneuver repositioning and had their DHI scores recorded. The mean post-repositioning scores were dramatically decreased compared with pre-repositioning scores, and the difference was significant (P < 0.01). The difference in DHIP scores between the residual dizziness group and the non-residual dizziness group was not significant, while the DHIF, DHIE, and DHIT scores differed significantly between the two groups. Conclusion: After maneuver repositioning, the dizziness handicap of BPPV patients could be significantly improved. The next treatment program for patients with residual dizziness after successful repositioning could be aimed at the functional and emotional aspects of dizziness. Copyright© by the Editorial Department of Journal of Clinical Otorhinolaryngology Head and Neck Surgery.

The behavioral regulation in sport questionnaire (BRSQ): instrument development and initial validity evidence.

PubMed

Lonsdale, Chris; Hodge, Ken; Rose, Elaine A

2008-06-01

The purpose of the four studies described in this article was to develop and test a new measure of competitive sport participants' intrinsic motivation, extrinsic motivation, and amotivation (self-determination theory; Deci & Ryan, 1985). The items for the new measure, named the Behavioral Regulation in Sport Questionnaire (BRSQ), were constructed using interviews, expert review, and pilot testing. Analyses supported the internal consistency, test-retest reliability, and factorial validity of the BRSQ scores. Nomological validity evidence was also supportive, as BRSQ subscale scores were correlated in the expected pattern with scores derived from measures of motivational consequences. When directly compared with scores derived from the Sport Motivation Scale (SMS; Pelletier, Fortier, Vallerand, Tuson, & Blais, 1995) and a revised version of that questionnaire (SMS-6; Mallett, Kawabata, Newcombe, Otero-Forero, & Jackson, 2007), BRSQ scores demonstrated equal or superior reliability and factorial validity as well as better nomological validity.
Score Distributions of the Balance Outcome Measure for Elder Rehabilitation (BOOMER) in Community-Dwelling Older Adults With Vertebral Fracture.

PubMed

Brown, Zachary M; Gibbs, Jenna C; Adachi, Jonathan D; Ashe, Maureen C; Hill, Keith D; Kendler, David L; Khan, Aliya; Papaioannou, Alexandra; Prasad, Sadhana; Wark, John D; Giangregorio, Lora M

2017-11-28

We sought to evaluate the Balance Outcome Measure for Elder Rehabilitation (BOOMER) in community-dwelling women 65 years and older with vertebral fracture and to describe score distributions and potential ceiling and floor effects. This was a secondary data analysis of baseline data from the Build Better Bones with Exercise randomized controlled trial using the BOOMER. A total of 141 women with osteoporosis and radiographically confirmed vertebral fracture were included. Concurrent validity and internal consistency were assessed in comparison to the Short Physical Performance Battery (SPPB). Normality and ceiling/floor effects of total BOOMER scores and component test items were also assessed. Exploratory analyses of assistive aid use and falls history were performed. Tests for concurrent validity demonstrated moderate correlation between total BOOMER and SPPB scores. The BOOMER component tests showed modest internal consistency. Substantial ceiling effect and nonnormal score distributions were present among overall sample and those not using assistive aids for total BOOMER scores, although scores were normally distributed for those using assistive aids. The static standing with eyes closed test demonstrated the greatest ceiling effects of the component tests, with 92% of participants achieving a maximal score. While the BOOMER compares well with the SPPB in community-dwelling women with vertebral fractures, researchers or clinicians considering using the BOOMER in similar or higher-functioning populations should be aware of the potential for ceiling effects.
Assessment of contrast sensitivity by Spaeth Richman Contrast Sensitivity Test and Pelli Robson Chart Test in patients with varying severity of glaucoma.

PubMed

Thakur, Sahil; Ichhpujani, Parul; Kumar, Suresh; Kaur, Ravneet; Sood, Sunandan

2018-05-14

This study was designed to assess the efficacy, reliability and repeatability of SPARCS (Spaeth Richman Contrast Sensitivity Test) as compared to the conventional Pelli Robson Chart Test for the assessment of contrast sensitivity in patients with glaucoma. We evaluated 135 eyes of 135 patients who were age and sex matched into three groups (controls, disc suspects and glaucoma) of 45 patients each. The glaucoma subgroup was further divided into subgroups of mild, moderate and severe based on the visual field damage. There was a strong positive correlation between Pelli Robson scores and SPARCS scores (S = 0.807, P < 0.001). Intraclass correlation coefficient (ICC) for Pelli Robson Test was 0.952 and 0.988 for SPARCS. The coefficient of repeatability (COR) for mean SPARCS was 5.65%, while COR of Pelli Robson Test was 12.44%. SPARCS was found to have better repeatability than Pelli Robson Test based on COR values. Pelli Robson score had a sensitivity of 80% and a specificity of 65.6% for detecting glaucoma patients as compared to 84.4% and 70%, respectively, for SPARCS scores. SPARCS is a better alternative to conventional Pelli Robson Chart Test for assessment of contrast sensitivity in patients with glaucoma. Being independent of the effects of literacy and educational status, it offers a universal way to measure contrast sensitivity. It can also be reliably used in patients with varying severity of glaucoma.
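The sensitivity and specificity figures quoted above follow from a 2×2 classification table. The sketch below shows the arithmetic under loud assumptions: the abstract does not report the underlying counts or say which groups formed the non-glaucoma comparison. The counts used here (45 glaucoma eyes against 90 non-glaucoma eyes, that is, controls plus disc suspects) are hypothetical values chosen only because they reproduce the reported percentages.

```python
def sensitivity(true_pos, false_neg):
    """Proportion of diseased cases that the test flags as positive."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Proportion of non-diseased cases that the test flags as negative."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts (NOT reported in the abstract), assuming 45 glaucoma
# eyes and 90 non-glaucoma eyes.
tests = {
    "SPARCS": {"tp": 38, "fn": 7, "tn": 63, "fp": 27},
    "Pelli Robson": {"tp": 36, "fn": 9, "tn": 59, "fp": 31},
}

results = {}
for name, c in tests.items():
    results[name] = (
        round(100 * sensitivity(c["tp"], c["fn"]), 1),
        round(100 * specificity(c["tn"], c["fp"]), 1),
    )
    print(name, results[name])
```

Other count combinations could yield the same rounded percentages, so these numbers illustrate the definitions and should not be read as a reconstruction of the study data.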
Changes in Motor Development During a 4-Year Follow-up on Children With Univentricular Heart Defects.

PubMed

Mäenpää, Heidi; Häkkinen, Arja; Sarajuuri, Anne

2016-01-01

To compare changes in motor development from 1 to 5 years of age among 18 children with hypoplastic left heart syndrome and 12 with univentricular heart to 42 children without heart defect. Motor development was assessed with the Alberta Infant Motor Scale and Movement Assessment Battery for Children (Movement ABC). Children with hypoplastic left heart syndrome or univentricular heart had significantly lower scores on the Alberta Infant Motor Scale test at the age of 1 and on the Movement ABC test at the age of 5 years compared with controls. Children with clear abnormalities on brain magnetic resonance imaging had lower scores compared with those with normal images or mild changes, and their relative motor scores decreased during follow-up. Some children with univentricular heart defects may benefit from physiotherapeutic interventions to support their motor development.
Effects of Training on Knowledge, Attitude and Practices of Malaria Prevention and Control among Community Role Model Care Givers in South Western Nigeria.

PubMed

Olalekan, Adebimpe W; Adebukola, Adebimpe M

2015-10-01

Malaria is endemic in Nigeria, with significant records of mortality and morbidity. Adequate community involvement is central to a successful implementation of malaria control programs. This study assessed the effects of a training programme on knowledge of malaria prevention and control among community role model care givers. A descriptive cross-sectional study with a pre- and post-test design was conducted among 400 eligible community members in Osun State. Training was given in the form of organized lectures, health education and practical demonstration sessions. Scores on the pre-test and on the post-test, conducted after a four-month interval, were compared. A multistage sampling method was adopted in selecting study participants, and data were analyzed using SPSS software version 17.0. Mean age was 43.8 (±1.4) years. Average knowledge scores for cause, transmission, risk factors and consequences, awareness of common symptoms and preventive practices improved in the post-training test compared with the pre-training test. The overall mean knowledge scores in the pre-test and post-test were 2.1 and 3.5, respectively, out of an average maximum score of 5.0, an increase of 66.7%. Role model care givers with formal education were about two and three times more likely to know about disease 'transmission' (OR 1.9, 95%CI 0.11-0.19, p=0.002) and 'consequences' (OR 2.9, 95%CI 0.25-0.65, p=0.040), respectively, compared to those without formal education. Training on malaria improved the knowledge of malaria prevention and control among role model community care givers towards a successful implementation of malaria control programmes.
Motorcycle-related hospitalization of adolescents in a Level I trauma center in southern Taiwan: a cross-sectional study.

PubMed

Liang, Chi-Cheng; Liu, Hang-Tsung; Rau, Cheng-Shyuan; Hsu, Shiun-Yuan; Hsieh, Hsiao-Yun; Hsieh, Ching-Hua

2015-08-28

The aim of this study was to investigate and compare the injury pattern, mechanisms, severity, and mortality of adolescents and adults hospitalized for treatment of trauma following motorcycle accidents in a Level I trauma center. Detailed data regarding patients aged 13-19 years (adolescents) and aged 30-50 years (adults) who had sustained trauma due to a motorcycle accident were retrieved from the Trauma Registry System between January 1, 2009 and December 31, 2012. The Pearson's chi-squared test, Fisher's exact test, or the independent Student's t-test were performed to compare the adolescent and adult motorcyclists and to compare the motorcycle drivers and motorcycle pillion. Analysis of Abbreviated Injury Scale (AIS) scores revealed that the adolescent patients had sustained higher rates of facial, abdominal, and hepatic injury and of cranial, mandibular, and femoral fracture but lower rates of thorax and extremity injury; hemothorax; and rib, scapular, clavicle, and humeral fracture compared to the adults. No significant differences were found between the adolescents and adults regarding Injury Severity Score (ISS), New Injury Severity Score (NISS), Trauma-Injury Severity Score (TRISS), mortality, length of hospital stay, or intensive care unit (ICU) admission rate. A significantly greater percentage of adolescents compared to adults were found not to have worn a helmet. Motorcycle riders who had not worn a helmet were found to have a significantly lower first Glasgow Coma Scale (GCS) score, and a significantly higher percentage was found to present with unconscious status, head and neck injury, and cranial fracture compared to those who had worn a helmet. Adolescent motorcycle riders comprise a major population of patients hospitalized for treatment of trauma. 
This population tends to present with a higher injury severity compared to other hospitalized trauma patients and a bodily injury pattern differing from that of adult motorcycle riders, indicating the need to emphasize use of protective equipment, especially helmets, to reduce their rate and severity of injury.
Effectiveness of team-based learning methodology in teaching transfusion medicine to medical undergraduates in third semester: A comparative study.

PubMed

Doshi, Neena Piyush

2017-01-01

Team-based learning (TBL) combines small and large group learning by incorporating multiple small groups in a large group setting. It is a teacher-directed method that encourages student-student interaction. This study compares student learning and teaching satisfaction between conventional lecture and TBL in the subject of pathology. The present study aimed to assess the effectiveness of the TBL method of teaching over the conventional lecture. The study was conducted in the Department of Pathology, GMERS Medical College and General Hospital, Gotri, Vadodara, Gujarat. The study population comprised 126 students of second-year MBBS, in their third semester of the academic year 2015-2016. "Hemodynamic disorders" were taught by the conventional method and "transfusion medicine" by the TBL method. Effectiveness of both methods was assessed. A posttest of multiple-choice questions was conducted at the end of "hemodynamic disorders." Assessment of TBL was based on individual score, team score, and each member's contribution to the success of the team. The individual score and overall score were compared with the posttest score on "hemodynamic disorders." Feedback was taken from the students regarding their experience with TBL. Tukey's multiple comparisons test and ANOVA summary were used to find the significance of scores between didactic and TBL methods. Student feedback was taken using the "Student Satisfaction Scale" based on the Likert scoring method. The mean of student scores by didactic, Individual Readiness Assurance Test (score "A"), and overall (score "D") was 49.8% (standard deviation [SD] 14.8), 65.6% (SD 10.9), and 65.6% (SD 13.8), respectively. The study showed positive educational outcome in terms of knowledge acquisition, participation and engagement, and team performance with TBL.
Neurocognitive performance and symptom profiles of Spanish-speaking Hispanic athletes on the ImPACT test.

PubMed

Ott, Summer; Schatz, Philip; Solomon, Gary; Ryan, Joseph J

2014-03-01

This study documented baseline neurocognitive performance of 23,815 athletes on the Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) test. Specifically, 9,733 Hispanic, Spanish-speaking athletes who completed the ImPACT test in English and 2,087 Hispanic, Spanish-speaking athletes who completed the test in Spanish were compared with 11,955 English-speaking athletes who completed the test in English. Athletes were assigned to age groups (13-15, 16-18). Results revealed a significant effect of language group (p < .001; partial η² = 0.06) and age (p < .001; partial η² = 0.01) on test performance. Younger athletes performed more poorly than older athletes, and Spanish-speaking athletes completing the test in Spanish scored more poorly than Spanish-speaking and English-speaking athletes completing the test in English, on all Composite scores and Total Symptom scores. Spanish-speaking athletes completing the test in English also performed more poorly than English-speaking athletes completing the test in English on three Composite scores. These differences in performance and reported symptoms highlight the need for caution in interpreting ImPACT test data for Hispanic Americans.
Feasibility of TBI Assessment Measures in a Field Environment: A Pilot Study for the Environmental Sensors in Training (ESiT) Project

DTIC Science & Technology

2016-12-22

included assessments and instruments, descriptive statistics were calculated. Independent-samples t-tests were conducted using participant survey scores...integrity tests within a multimodal system. Both conditions included the Military Acute Concussion Evaluation (MACE) and an Ease-of-Use survey . Mean scores...for the Ease-of-Use survey and mean test administration times for each measure were compared. Administrative feedback was also considered for
Validation of undergraduate medical student script concordance test (SCT) scores on the clinical assessment of the acute abdomen.

PubMed

Goos, Matthias; Schubach, Fabian; Seifert, Gabriel; Boeker, Martin

2016-08-17

Health professionals often manage medical problems in critical situations under time pressure and on the basis of vague information. In recent years, dual process theory has provided a framework of cognitive processes to assist students in developing clinical reasoning skills, which are especially critical in surgery because of the high workload and elevated stress levels. However, clinical reasoning skills can be observed only indirectly, and the corresponding constructs are difficult to measure in order to assess student performance. The script concordance test has been established in this field. A number of studies suggest that the test delivers a valid assessment of clinical reasoning. However, different scoring methods have been suggested. They reflect different interpretations of the underlying construct. In this work we want to shed light on the theoretical framework of script theory and give an idea of script concordance testing. We constructed a script concordance test in the clinical context of "acute abdomen" and compared previously proposed scores with regard to their validity. A test comprising 52 items in 18 clinical scenarios was developed, revised along the guidelines and administered to 56 4th and 5th year medical students at the end of a blended-learning seminar. We scored the answers using five different scoring methods (distance (2×), aggregate (2×), single best answer) and compared the scoring keys, the resulting final scores and Cronbach's α after normalization of the raw scores. All scores except the single best answer calculation achieved acceptable reliability scores (≥ 0.75), as measured by Cronbach's α. Students were clearly distinguishable from the experts, whose results were set to a mean of 80 and SD of 5 by the normalization process. With the two aggregate scoring methods, the students' mean values were between 62.5 (AGGPEN) and 63.9 (AGG), equivalent to about three expert SD below the experts' mean value (Cronbach's α: 0.76 (AGGPEN) and 0.75 (AGG)). With the two distance scoring methods, the students' mean was between 62.8 (DMODE) and 66.8 (DMEAN), equivalent to about two expert SD below the experts' mean value (Cronbach's α: 0.77 (DMODE) and 0.79 (DMEAN)). In this study the single best answer (SBA) scoring key yielded the worst psychometric results (Cronbach's α: 0.68). Assuming the psychometric properties of the script concordance test scores are valid, clinical reasoning skills can be measured reliably with different scoring keys in the SCT presented here. Psychometrically, the distance methods seem to be superior, wherein inherent statistical properties of the scales might play a significant role. For methodological reasons, the aggregate methods can also be used. Despite the limitations and complexity of the underlying scoring process and the calculation of reliability, we advocate for SCT because it allows a new perspective on the measurement and teaching of cognitive skills.
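The normalization step mentioned in this abstract (expert results set to a mean of 80 and an SD of 5) can be illustrated with a minimal sketch. The function name `normalize`, the sample data, and the use of the sample (n-1) standard deviation are assumptions made for illustration; the abstract does not state how the authors computed it.

```python
from statistics import mean, stdev

def normalize(raw_scores, expert_raw_scores, target_mean=80.0, target_sd=5.0):
    """Linearly rescale raw scores so that the expert panel has the
    target mean and SD (80 and 5 in the abstract).
    Assumption: the sample standard deviation (n-1) is used."""
    m = mean(expert_raw_scores)
    s = stdev(expert_raw_scores)
    return [target_mean + target_sd * (x - m) / s for x in raw_scores]

# Hypothetical raw scores, not data from the study.
experts = [40.0, 42.0, 44.0, 46.0, 48.0]
students = [30.0, 35.0, 38.0]

expert_norm = normalize(experts, experts)
student_norm = normalize(students, experts)

# Distance of the student mean from the expert mean, in expert SD units.
gap_in_expert_sd = (80.0 - mean(student_norm)) / 5.0
```

Under this rescaling, a student mean of 65 would sit three expert SDs below the expert mean, which is how the "about three expert SD" statements in the abstract can be read.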
Comparison of the Reading Subtests of the Wechsler Individual Achievement Test-Third Edition and the Peabody Individual Achievement Test-Revised/Normative Update

ERIC Educational Resources Information Center

Ott, Lauren M.

2010-01-01

This study compared the reading subtests of the Wechsler Individual Achievement Test-Third Edition and the Peabody Individual Achievement Test-Revised/Normative Update. Scores were compared on these two tests in a group of 28 students ages 7 through 12 who were referred or reevaluated for suspected learning problems. The data were collected…
Effect of foot placements during sit to stand transition on timed up and go test in stroke subjects: A cross sectional study.

PubMed

Joshua, Abraham M; Karnad, Shreekanth D; Nayak, Akshatha; Suresh, B V; Mithra, Prasanna; Unnikrishnan, B

2017-01-01

The Timed Up and Go (TUG) test has been used as a screening tool for the assessment of risk of falling in individuals following stroke. Though the TUG test is a quick test, it has fair sensitivity compared to other tests. This study was carried out to obtain and compare test scores for different types of foot placements during sit to stand transition in stroke subjects. A cross-sectional study with purposive sampling included 28 post-stroke subjects who were able to walk 6 meters with or without assistance. The Timed Up and Go test was carried out with four different types of foot placements and scores were recorded. The data were compared using Kruskal-Wallis one-way analysis of variance and the Wilcoxon signed ranks test. There were comparable differences between the asymmetric 1 test strategy, which involved the affected extremity being placed behind the unaffected one, and the other test strategies (Z = -4.457,-3.848,-4.458; p = 0.000). The initial foot placement during sit to stand transition influenced the time taken to complete the test, which was significantly higher in the asymmetric 1 strategy. Incorporation of the initial foot placement, mainly the asymmetric 1 strategy, into the conventional TUG test would help in accurately identifying the subject's functional mobility and postural stability.
Comparative evaluation of different periods of enamel microabrasion on the microleakage of class V resin-modified glass ionomer and compomer restorations: An In vitro study.

PubMed

Bansal, Disha; Mahajan, Mrinalini

2017-01-01

The design of the class V cavity presents a clinical challenge in the field of adhesive dentistry because the margin lies partly in enamel and partly in dentin, and the problem associated with this design is microleakage at the dentinal margin. When these restorations undergo microabrasion for cosmetic reasons, the problem is aggravated significantly. The aim of this study was to measure the microleakage of class V glass ionomer restorations over two different periods of enamel microabrasion. This in vitro experimental study was conducted on 120 class V cavities which had been prepared on the buccal and lingual surfaces of 60 sound human premolars. One-half of the cavities were restored with the resin-modified glass ionomer cement (GIC) (60 cavities) and the other half with the compomer (60 cavities). Finishing and polishing were performed. Then, the teeth were classified into six groups (n = 20). Microabrasion treatment was performed with Opaluster (Ultradent Product Inc., South Jordan, UT, USA) for 0 (control, no treatment), 60 and 120 s. Then, teeth were thermocycled between 5°C and 55°C, immersed in rhodamine B solution (24 h), and sectioned longitudinally in the buccolingual direction. Dye penetration was examined with a stereomicroscope (×10). Microleakage scores were statistically analyzed. The mean occlusal margin scores and gingival margin scores were compared between all the groups using the Kruskal-Wallis test, Mann-Whitney U-test, Wilcoxon signed-rank test, and post hoc comparison. There was a significant difference between Group 1a, Group 2a, Group 1b, Group 2b, Group 1c, and Group 2c. The least microleakage scores were observed in occlusal margins of control groups (without microabrasion). Moreover, in both restorations, the microleakage scores in occlusal margins were higher than gingival margins, and compoglass had less microleakage in occlusal and occlusal plus axial walls of class V cavities compared with resin-modified GIC, whereas the light-cured glass ionomer had less microleakage in the gingival and gingival plus axial walls of class V cavities when compared with compoglass.
The Flynn effect and U.S. policies: the impact of rising IQ scores on American society via mental retardation diagnoses.

PubMed

Kanaya, Tomoe; Scullin, Matthew H; Ceci, Stephen J

2003-10-01

Over the last century, IQ scores have been steadily rising, a phenomenon dubbed the Flynn effect. Because of the Flynn effect, IQ tests are periodically renormed, making them harder. Given that eligibility for mental retardation (MR) services relies heavily on IQ scores, renormed tests could have a significant impact on MR placements. In longitudinal IQ records from 9 sites around the country, students in the borderline and mild MR range lost an average of 5.6 points when retested on a renormed test and were more likely to be classified MR compared with peers retested on the same test. The magnitude of the effect is large and affects national policies on education, social security, the death penalty, and the military. This paper reports the perceptions of professionals as they relate to IQ score fluctuations in normal, borderline, and/or MR populations.
Use of the binomial distribution to predict impairment: application in a nonclinical sample.

PubMed

Axelrod, Bradley N; Wall, Jacqueline R; Estes, Bradley W

2008-01-01

A mathematical model based on the binomial theory was developed to illustrate when abnormal score variations occur by chance in a multitest battery (Ingraham & Aiken, 1996). It has been successfully used as a comparison for obtained test scores in clinical samples, but not in nonclinical samples. In the current study, this model has been applied to demographically corrected scores on the Halstead-Reitan Neuropsychological Test Battery, obtained from a sample of 94 nonclinical college students. Results found that 15% of the sample had impairments suggested by the Halstead Impairment Index, using criteria established by Reitan and Wolfson (1993). In addition, one-half of the sample obtained impaired scores on one or two tests. These results were compared with those predicted by the binomial model and found to be consistent. The model therefore serves as a useful resource for clinicians considering the probability of impaired test performance.
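As a rough illustration of the binomial reasoning described in this abstract, the sketch below computes the chance that an examinee produces at least k "impaired" scores on a battery of n tests purely by chance. The battery size, the per-test base rate, and the assumption that tests are independent are illustrative choices, not values taken from the study.

```python
from math import comb

def prob_at_least(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p): the chance of k or more
    impaired scores on n independent tests, each with base rate p."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))

# Hypothetical battery: 7 tests, each with a 15% chance of an
# "impaired" score in a healthy examinee (assumed, not from the study).
n_tests = 7
base_rate = 0.15

p_one_or_more = prob_at_least(1, n_tests, base_rate)
p_three_or_more = prob_at_least(3, n_tests, base_rate)
```

With these assumed inputs, one or more impaired scores is more likely than not, while three or more is comparatively rare, which is the kind of contrast a clinician would want before interpreting an isolated low score.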
Greater power and computational efficiency for kernel-based association testing of sets of genetic variants.

PubMed

Lippert, Christoph; Xiang, Jing; Horta, Danilo; Widmer, Christian; Kadie, Carl; Heckerman, David; Listgarten, Jennifer

2014-11-15

Set-based variance component tests have been identified as a way to increase power in association studies by aggregating weak individual effects. However, the choice of test statistic has been largely ignored even though it may play an important role in obtaining optimal power. We compared a standard statistical test, a score test, with a recently developed likelihood ratio (LR) test. Further, when correction for hidden structure is needed, or gene-gene interactions are sought, state-of-the-art algorithms for both the score and LR tests can be computationally impractical. Thus we develop new computationally efficient methods. After reviewing theoretical differences in performance between the score and LR tests, we find empirically on real data that the LR test generally has more power. In particular, on 15 of 17 real datasets, the LR test yielded at least as many associations as the score test (up to 23 more associations), whereas the score test yielded at most one more association than the LR test in the two remaining datasets. On synthetic data, we find that the LR test yielded up to 12% more associations, consistent with our results on real data, but also observe a regime of extremely small signal where the score test yielded up to 25% more associations than the LR test, consistent with theory. Finally, our computational speedups now enable (i) efficient LR testing when the background kernel is full rank, and (ii) efficient score testing when the background kernel changes with each test, as for gene-gene interaction tests. The latter yielded a factor of 2000 speedup on a cohort of size 13,500. Software available at http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/Fastlmm/. Contact: heckerma@microsoft.com. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press.
Scoring Systems to Estimate Intracerebral Control and Survival Rates of Patients Irradiated for Brain Metastases

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rades, Dirk; Dziggel, Liesa; Haatanen, Tiina

2011-07-15

Purpose: To create and validate scoring systems for intracerebral control (IC) and overall survival (OS) of patients irradiated for brain metastases. Methods and Materials: In this study, 1,797 patients were randomly assigned to the test (n = 1,198) or the validation group (n = 599). Two scoring systems were developed, one for IC and another for OS. The scores included prognostic factors found significant on multivariate analyses. Age, performance status, extracerebral metastases, interval tumor diagnosis to RT, and number of brain metastases were associated with OS. Tumor type, performance status, interval, and number of brain metastases were associated with IC. The score for each factor was determined by dividing the 6-month IC or OS rate (given in percent) by 10. The total score represented the sum of the scores for each factor. The score groups of the test group were compared with the corresponding score groups of the validation group. Results: In the test group, 6-month IC rates were 17% for 14-18 points, 49% for 19-23 points, and 77% for 24-27 points (p < 0.0001). IC rates in the validation group were 19%, 52%, and 77%, respectively (p < 0.0001). In the test group, 6-month OS rates were 9% for 15-19 points, 41% for 20-25 points, and 78% for 26-30 points (p < 0.0001). OS rates in the validation group were 7%, 39%, and 79%, respectively (p < 0.0001). Conclusions: Patients irradiated for brain metastases can be given scores to estimate OS and IC. IC and OS rates of the validation group were similar to the test group demonstrating the validity and reproducibility of both scores.
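The scoring rule in this abstract (each factor's score is its 6-month rate in percent divided by 10, and the total is the sum over factors) can be sketched as follows. The per-factor rates in the example are hypothetical, since the abstract does not list them; only the OS point ranges and the test-group rates in the lookup come from the abstract.

```python
def factor_score(six_month_rate_percent: float) -> float:
    """Score for one prognostic factor: 6-month rate (percent) / 10."""
    return six_month_rate_percent / 10

def total_score(rates_percent) -> float:
    """Total score: sum of the per-factor scores."""
    return sum(factor_score(r) for r in rates_percent)

def os_rate_for_points(points: float) -> int:
    """6-month OS rate (percent) reported for the test group,
    by total-score group as given in the abstract."""
    if 15 <= points <= 19:
        return 9
    if 20 <= points <= 25:
        return 41
    if 26 <= points <= 30:
        return 78
    raise ValueError("total score outside the ranges reported in the abstract")

# Hypothetical 6-month OS rates (percent) for one patient's category
# on each of the five OS factors (age, performance status,
# extracerebral metastases, interval, number of brain metastases).
example_rates = [40, 50, 30, 60, 40]

points = total_score(example_rates)
estimated_os = os_rate_for_points(points)
```

In this made-up case the patient totals 22 points and falls in the 20-25 group. The sketch assumes totals land on whole points; how the authors handled fractional scores is not stated in the abstract.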
Modified bathroom scale and balance assessment: a comparison with clinical tests.

PubMed

Duchêne, Jacques; Hewson, David; Rumeau, Pierre

2016-01-01

Frailty and detection of fall risk are major issues in preventive gerontology. A simple tool frequently used in daily life, a bathroom scale (balance quality tester: BQT), was modified to obtain information on the balance of 84 outpatients consulting at a geriatric clinic. The results computed from the BQT were compared to the values of three geriatric tests that are widely used either to detect a fall risk or frailty (timed get up and go: TUG; 10 m walking speed: WS; walking time: WT; one-leg stand: OS). The BQT calculates four parameters that are then scored and weighted, thus creating an overall indicator of balance quality. Raw data, partial scores and the global score were compared with the results of the three geriatric tests. The WT values had the highest correlation with BQT raw data (r = 0.55), while TUG (r = 0.53) and WS (r = 0.56) had the highest correlation with BQT partial scores. ROC curves for OS cut-off values (4 and 5 s) were produced, with the best results obtained for a 5 s cut-off, both with the partial scores combined using Fisher's combination (specificity 85%: <0.11, sensitivity 85%: >0.48), and with the empirical score (specificity 85%: <7, sensitivity 85%: >8). A BQT empirical score of less than seven can detect fall risk in a community-dwelling population.
Outcomes of a pharmacotherapy/research rotation in a family medicine training program.

PubMed

Murphy, Julie A; Shrader, Sarah R; Montooth, Audrey K

2008-06-01

The effects of a required pharmacotherapy/research rotation in family medicine residency programs, precepted by a clinical pharmacist, have not been documented in the literature. This study evaluated the effects that a focused pharmacotherapy/research rotation had on family medicine residents' knowledge of pharmacotherapy and research topics. During the first year of a family medicine residency, 15 residents were required to complete 1 month in pharmacotherapy and research. They spent time observing a pharmacist-run clinic and discussing pharmacotherapy and research topics. Residents completed a 20-question pretest and a posttest consisting of 15 pharmacotherapy and five research questions while on the rotation. Higher scores on the tests indicated higher levels of knowledge. The differences in mean scores were evaluated using paired t tests. Overall, the mean score on the pretest was 10.13 compared to 14.67 on the posttest. Mean scores on the pharmacotherapy and research components for the pretests and posttests were 7.27 compared to 10.47 and 2.87 compared to 4.20, respectively. A focused pharmacotherapy/research rotation, precepted by a clinical pharmacist, increases family medicine residents' knowledge.
Effects of intensive short-term dynamic psychotherapy on social cognition in major depression.

PubMed

Ajilchi, Bita; Kisely, Steve; Nejati, Vahid; Frederickson, Jon

2018-05-23

Social cognition is commonly affected in psychiatric disorders and is a determinant of quality of life. However, there are few studies of treatment. To investigate the efficacy of intensive short-term dynamic psychotherapy on social cognition in major depression. This study used a parallel group randomized control design to compare pre-test and post-test social cognition scores between depressed participants receiving ISTDP and those allocated to a wait-list control group. Participants were adults (19-40 years of age) who were diagnosed with depression. We recruited 32 individuals, with 16 participants allocated to the ISTDP and control groups, respectively. Both groups were similar in terms of age, sex and educational level. Multivariate analysis of variance (MANOVA) demonstrated that the intervention was effective in terms of the total score of social cognition: the experimental group had a significant increase in the post-test compared to the control group. In addition, the experimental group showed a significant reduction in the negative subjective score compared to the control group as well as an improvement in response to positive neutral and negative states. Depressed patients receiving ISTDP show a significant improvement in social cognition post treatment compared to a wait-list control group.

The Importance of Minor Salivary Gland Biopsy in Sjögren Syndrome Diagnosis and the Clinicopathological Correlation.

PubMed

Serin, Gürdeniz; Karabulut, Gonca; Kabasakal, Yasemin; Kandiloğlu, Gülşen; Akalin, Taner

2016-01-01

Minor salivary gland biopsy is one of the objective tests used in the diagnosis of Sjögren syndrome. The aim of our study was to compare the clinical and laboratory data of primary and secondary Sjögren syndrome cases with a lymphocyte score 3 and 4 in the minor salivary gland biopsy. Data from a total of 2346 consecutive minor salivary gland biopsies were retrospectively evaluated in this study. Clinical and autoantibody characteristics of 367 cases with lymphocyte score 3 or 4 and diagnosed with primary or secondary Sjögren syndrome were compared. There was no difference between lymphocyte score 3 and 4 primary Sjögren syndrome patients in terms of dry mouth, dry eye symptoms and Schirmer test results but Anti-Ro and Antinuclear Antibody positivity was statistically significantly higher in cases with lymphocyte score 4 (p= 0.025, p= 0.001). Anti-Ro test results were also found to be statistically significantly higher in secondary Sjögren syndrome patients with lymphocyte score 4 (p= 0.048). In this study, the high proportion of cases with negative autoantibody but positive lymphocyte score is significant in terms of showing the contribution of minor salivary gland biopsy to Sjögren syndrome diagnosis. Lymphocyte score 3 and 4 cases were found to have similar clinical findings but a difference regarding antibody positivity in primary Sjögren syndrome. We believe that cases with lymphocyte score 4 may be Sjögren syndrome cases whose clinical manifestations are relatively established and higher autoantibody levels are therefore found.
The Effects of Georgia's Choice Curricular Reform Model on Third Grade Science Scores on the Georgia Criterion Referenced Competency Test

ERIC Educational Resources Information Center

Phemister, Art W.

2010-01-01

The purpose of this study was to evaluate the effectiveness of the Georgia's Choice reading curriculum on third grade science scores on the Georgia Criterion Referenced Competency Test from 2002 to 2008. In assessing the effectiveness of the Georgia's Choice curriculum model this causal comparative study examined the 105 elementary schools that…
The Reliability and Precision of Total Scores and IRT Estimates as a Function of Polytomous IRT Parameters and Latent Trait Distribution

ERIC Educational Resources Information Center

Culpepper, Steven Andrew

2013-01-01

A classic topic in the fields of psychometrics and measurement has been the impact of the number of scale categories on test score reliability. This study builds on previous research by further articulating the relationship between item response theory (IRT) and classical test theory (CTT). Equations are presented for comparing the reliability and…
The Impact of Year-Round Education on Fifth Grade African American Reading Achievement Scores in an Urban Illinois School

ERIC Educational Resources Information Center

Merrill, Carolyn Ann

2012-01-01

The purpose of this quantitative, causal-comparative study was to determine the impact of the year-round education school calendar on the standardized test performance of fifth grade African American students, as measured by the Illinois Standards Achievement Test (ISAT) in reading. The ISAT reading scores from two year-round education (YRE)…
The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

ERIC Educational Resources Information Center

Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward

2009-01-01

This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic…
Comparing the Effectiveness of Traditional and Active Learning Methods in Business Statistics: Convergence to the Mean

ERIC Educational Resources Information Center

Weltman, David; Whiteside, Mary

2010-01-01

This research shows that active learning is not universally effective and, in fact, may inhibit learning for certain types of students. The results of this study show that as increased levels of active learning are utilized, student test scores decrease for those with a high grade point average. In contrast, test scores increase as active learning…
A "Nonbiased Assessment" of Intelligence Testing.

ERIC Educational Resources Information Center

Vandivier, Phillip L.; Vandivier, Stella Sue

1979-01-01

Arguments and prejudices against the use of individually administered intelligence tests are considered and compared with possible values that may be obtained. Cautions about test score interpretation are discussed. Implications of abolishing intelligence testing are considered and recommendations for effective testing policies are presented. (CTM)
Comparison of Internet versus lecture instructional methods for teaching nursing research.

PubMed

Woo, M A; Kimmick, J V

2000-01-01

Although many higher education programs are using the Internet to teach classes, there are few published reports on the effectiveness of this method on test scores or student satisfaction. The purpose of this study was to compare test and student satisfaction scores of graduate nursing students who take a nursing research course via the Internet with those of students who take the same course via traditional lecture instruction. In addition, student technical support use and Internet student lecture attendance also were examined. A total of 97 students (Internet, 44; lectures, 53) participated. There were no significant differences in test scores and overall course student satisfaction (P > .05). However, the Internet students reported significantly higher (P = .04) stimulation of learning compared with the traditional lecture students. Technical support use by the Internet students was high initially and was related to software problems. Of interest was the large proportion of Internet students (73 percent) who attended at least 3 of the 10 lectures. Use of the Internet to teach graduate-level nursing research can provide comparable learning and student satisfaction to traditional lecture instructional methods.
The relation of functional visual acuity measurement methodology to tear functions and ocular surface status.

PubMed

Kaido, Minako; Ishida, Reiko; Dogru, Murat; Tsubota, Kazuo

2011-09-01

To investigate the relation of functional visual acuity (FVA) measurements with dry eye test parameters and to compare the testing methods with and without blink suppression and anesthetic instillation. A prospective comparative case series. Thirty right eyes of 30 dry eye patients and 25 right eyes of 25 normal subjects seen at Keio University School of Medicine, Department of Ophthalmology were studied. FVA testing was performed using a FVA measurement system with two different approaches, one in which measurements were made under natural blinking conditions without topical anesthesia (FVA-N) and the other in which the measurements were made under the blink suppression condition with topical anesthetic eye drops (FVA-BS). Tear function examinations, such as the Schirmer test, tear film break-up time, and fluorescein and Rose Bengal vital staining as ocular surface evaluation, were performed. The mean logMAR FVA-N scores and logMAR Landolt visual acuity scores were significantly lower in the dry eye subjects than in the healthy controls (p < 0.05), while there were no statistical differences between the logMAR FVA-BS scores of the dry eye subjects and those of the healthy controls. There was a significant correlation between the logMAR Landolt visual acuities and the logMAR FVA-N and logMAR FVA-BS scores. The FVA-N scores correlated significantly with tear quantities, tear stability and, especially, the ocular surface vital staining scores. FVA measurements performed under natural blinking significantly reflected the tear functions and ocular surface status of the eye and would appear to be a reliable method of FVA testing. FVA measurement is also an accurate predictor of dry eye status.
The King-Devick (K-D) test of rapid eye movements: a bedside correlate of disability and quality of life in MS.

PubMed

Moster, Stephen; Wilson, James A; Galetta, Steven L; Balcer, Laura J

2014-08-15

We investigated the King-Devick (K-D) test of rapid number naming as a visual performance measure in a cohort of patients with multiple sclerosis (MS). In this cross-sectional study, 81 patients with MS and 20 disease-free controls from an ongoing study of visual outcomes underwent K-D testing. A test of rapid number naming, K-D requires saccadic eye movements as well as intact vision, attention and concentration. To perform the K-D test, participants are asked to read numbers aloud as quickly as possible from three test cards; the sum of the three test card times in seconds constitutes the summary score. High-contrast visual acuity (VA), low-contrast letter acuity (1.25% and 2.5% levels), retinal nerve fiber layer (RNFL) thickness by optical coherence tomography (OCT), MS Functional Composite (MSFC) and vision-specific quality of life (QOL) measures (25-Item NEI Visual Functioning Questionnaire [NEI-VFQ-25] and 10-Item Neuro-Ophthalmic Supplement) were also assessed. K-D time scores in the MS cohort (total time to read the three test cards) were significantly higher (worse) compared to those for disease-free controls (P=0.003, linear regression, accounting for age). Within the MS cohort, higher K-D scores were associated with worse scores for the NEI-VFQ-25 composite (P<0.001), 10-Item Neuro-Ophthalmic Supplement (P<0.001), binocular low-contrast acuity (2.5%, 1.25%, P<0.001), and high-contrast VA (P=0.003). Monocular low-contrast vision scores (P=0.001-0.009) and RNFL thickness (P=0.001) were also reduced in eyes of patients with worse K-D scores (GEE models accounting for age and within-patient, inter-eye correlations). Patients with a history of optic neuritis (ON) had increased (worse) K-D scores. Patients who classified their work disability status as disabled (receiving disability pension) did worse on K-D testing compared to those working full-time (P=0.001, accounting for age).
The K-D test, a <2 minute bedside test of rapid number naming, is associated with visual dysfunction, neurologic impairment, and reduced vision-specific QOL in patients with MS. Scores reflect work disability as well as structural changes as measured by OCT imaging. History of ON and abnormal binocular acuities were associated with worse K-D scores, suggesting that abnormalities detected by K-D may go along with afferent dysfunction in MS patients. A brief test that requires saccadic eye movements, K-D should be considered for future MS trials as a rapid visual performance measure. Copyright © 2014 Elsevier B.V. All rights reserved.
Specificity and false positive rates of the Test of Memory Malingering, Rey 15-item Test, and Rey Word Recognition Test among forensic inpatients with intellectual disabilities.

PubMed

Love, Christopher M; Glassmire, David M; Zanolini, Shanna Jordan; Wolf, Amanda

2014-10-01

This study evaluated the specificity and false positive (FP) rates of the Rey 15-Item Test (FIT), Word Recognition Test (WRT), and Test of Memory Malingering (TOMM) in a sample of 21 forensic inpatients with mild intellectual disability (ID). The FIT demonstrated an FP rate of 23.8% with the standard quantitative cutoff score. Certain qualitative error types on the FIT showed promise and had low FP rates. The WRT obtained an FP rate of 0.0% with previously reported cutoff scores. Finally, the TOMM demonstrated low FP rates of 4.8% and 0.0% on Trial 2 and the Retention Trial, respectively, when applying the standard cutoff score. FP rates are reported for a range of cutoff scores and compared with published research on individuals diagnosed with ID. Results indicated that although the quantitative variables on the FIT had unacceptably high FP rates, the TOMM and WRT had low FP rates, increasing the confidence clinicians can place in scores reflecting poor effort on these measures during ID evaluations. © The Author(s) 2014.
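The false positive rates in the preceding abstract follow from simple proportions over the 21 examinees. The sketch below is illustrative only: the failure counts (0, 1 and 5) are assumptions back-calculated from the reported percentages and are not stated in the abstract, and the function names are hypothetical.

```python
def false_positive_rate(flagged: int, total: int) -> float:
    """Fraction of examinees presumed credible who fail the validity cutoff."""
    if total <= 0 or not 0 <= flagged <= total:
        raise ValueError("need 0 <= flagged <= total and total > 0")
    return flagged / total


def specificity(flagged: int, total: int) -> float:
    """Specificity is the complement of the false positive rate."""
    return 1.0 - false_positive_rate(flagged, total)


SAMPLE_SIZE = 21  # sample size reported in the abstract

# Hypothetical failure counts, back-calculated from the reported percentages.
for failures in (0, 1, 5):
    fp = false_positive_rate(failures, SAMPLE_SIZE)
    sp = specificity(failures, SAMPLE_SIZE)
    print(f"{failures}/{SAMPLE_SIZE} flagged: FP rate {fp:.1%}, specificity {sp:.1%}")
```

With a sample this small, one additional failure moves the rate by nearly five percentage points, which is worth keeping in mind when comparing cutoffs.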
Prognostic Value of Metabolic Liver Function Tests: a Study on 711 Cirrhotic Patients.

PubMed

Lebossé, Fanny; Guillaud, Olivier; Forestier, Julien; Ecochard, Marie; Boillot, Olivier; Roman, Sabine; Mion, François; Dumortier, Jérôme

2016-09-01

The prognosis of cirrhotic patients is usually assessed by Child-Pugh and MELD scores. Metabolic liver function tests such as aminopyrine breath test (ABT) and indocyanine green clearance (IGC) have been shown to reveal hepatocellular dysfunction. The aim of this retrospective study was to compare the prognostic value of the MELD score, Child-Pugh score, ABT and IGC in a large cohort of cirrhotic patients. Between January 1996 and June 2008, 711 cirrhotic patients were included and the primary endpoint was survival without LT. The ROC curves with c-statistics, correlation coefficient and survival were calculated. Metabolic function tests and scores were strongly correlated. At the time of evaluation, 111 patients had died and 520 had received a transplant. Prognostic ability (estimated by the AUROC curve) to predict survival without LT at 6 months was 0.662, 0.691, 0.738 and 0.715 for ABT, IGC, Child-Pugh score and MELD score, respectively. Similarly, at 1 year, AUROC was 0.738 for Child-Pugh score, 0.716 for MELD score, 0.693 for IGC clearance and 0.651 for ABT. Our results strongly confirm that IGC and ABT have a high prognostic value in cirrhotic patients, similar to Child-Pugh and MELD scores. They could be developed to routinely evaluate the prognosis of patients in addition to clinical and biochemical data.
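The c-statistic (AUROC) used in the preceding abstract to compare prognostic scores can be read as the probability that a randomly chosen patient with the event has a higher score than a randomly chosen patient without it. The sketch below computes it by direct pairwise comparison; the scores are made-up illustration values, not study data, and the function name is hypothetical.

```python
def auroc(scores_event, scores_no_event):
    """Probability that an event case outscores a non-event case (ties count half)."""
    if not scores_event or not scores_no_event:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for s_event in scores_event:
        for s_none in scores_no_event:
            if s_event > s_none:
                wins += 1.0
            elif s_event == s_none:
                wins += 0.5
    return wins / (len(scores_event) * len(scores_no_event))


# Hypothetical prognostic scores (higher = worse prognosis).
died = [22, 18, 25, 15]
survived = [10, 15, 12, 20, 8]
print(auroc(died, survived))  # 0.875
```

A value of 0.5 corresponds to a score with no discriminating ability, so the reported values of roughly 0.65 to 0.74 indicate moderate discrimination.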
Predictive value of the Korean Academy of Family Medicine in-training examination for certifying examination.

PubMed

Cho, Jung-Jin; Kim, Ji-Yong

2011-09-01

In-training examination (ITE) is a cognitive examination similar to the written test, but it is different from the Clinical Practice Examination of the Korean Academy of Family Medicine (KAFM) Certification Examination (CE). The objective of this study is to estimate the positive predictive value of the KAFM-ITE for identifying residents at risk for poor performance on the three types of KAFM-CE. 372 residents who completed the KAFM-CE in 2011 were included. We compared the mean KAFM-CE scores with ITE experience. We evaluated the correlation and the positive predictive value (PPV) of ITE for the multiple choice question (MCQ) scores of 1st written test & 2nd slide examination, the total clinical practice examination scores, and the total sum of 2nd test. 275 out of 372 residents completed ITE. Those who completed ITE had significantly higher MCQ scores of 1st written test than those who did not. The correlation of ITE scores with 1st written MCQ (0.627) was found to be the highest among the other kinds of CE. The PPV of the ITE score for 1st written MCQ scores was 0.672. The PPV of the ITE score ranged from 0.376 to 0.502. The score of the KAFM ITE has acceptable positive predictive value that could be used as a part of comprehensive evaluation system for residents in cognitive field.
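Positive predictive value, as used in the preceding abstract, is the share of residents flagged by the in-training examination who then actually perform poorly on the certifying examination. A minimal sketch follows; the flags are invented for illustration, the abstract does not give the cutoffs that define "flagged" or "poor", and the function name is hypothetical.

```python
def positive_predictive_value(flagged, poor_outcome):
    """PPV = true positives / all flagged; inputs are parallel lists of booleans."""
    if len(flagged) != len(poor_outcome):
        raise ValueError("inputs must have the same length")
    n_flagged = sum(1 for f in flagged if f)
    if n_flagged == 0:
        raise ValueError("PPV is undefined when nobody is flagged")
    true_pos = sum(1 for f, p in zip(flagged, poor_outcome) if f and p)
    return true_pos / n_flagged


# Hypothetical residents: flagged by the ITE vs. poor result on the CE.
flagged_by_ite = [True, True, True, False, False, True]
poor_on_ce = [True, False, True, False, True, True]
print(positive_predictive_value(flagged_by_ite, poor_on_ce))  # 0.75
```

Unlike sensitivity and specificity, PPV depends on how common poor performance is in the cohort, so a value from one cohort does not transfer directly to another.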
Biases and Power for Groups Comparison on Subjective Health Measurements

PubMed Central

Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique

2012-01-01

Subjective health measurements are increasingly used in clinical research, particularly for patient group comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores, and models coming from Item Response Theory (IRT), relying on a response model relating the item responses to a latent parameter, often called the latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient-reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, group comparison was performed using a t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and whether the group effect was included as a covariate or not. Individual latent trait values were estimated using either a deterministic method or stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-step method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald's test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative. PMID:23115620
Reliability of sonographic assessment of tendinopathy in tennis elbow.

PubMed

Poltawski, Leon; Ali, Syed; Jayaram, Vijay; Watson, Tim

2012-01-01

To assess the reliability and compute the minimum detectable change using sonographic scales to quantify the extent of pathology and hyperaemia in the common extensor tendon in people with tennis elbow. The lateral elbows of 19 people with tennis elbow were assessed sonographically twice, 1-2 weeks apart. Greyscale and power Doppler images were recorded for subsequent rating of abnormalities. Tendon thickening, hypoechogenicity, fibrillar disruption and calcification were each rated on four-point scales, and scores were summed to provide an overall rating of structural abnormality; hyperaemia was scored on a five point scale. Inter-rater reliability was established using the intraclass correlation coefficient (ICC) to compare scores assigned independently to the same set of images by a radiologist and a physiotherapist with training in musculoskeletal imaging. Test-retest reliability was assessed by comparing scores assigned by the physiotherapist to images recorded at the two sessions. The minimum detectable change (MDC) was calculated from the test-retest reliability data. ICC values for inter-rater reliability ranged from 0.35 (95% CI: 0.05, 0.60) for fibrillar disruption to 0.77 (0.55, 0.88) for overall greyscale score, and 0.89 (0.79, 0.95) for hyperaemia. Test-retest reliability ranged from 0.70 (0.48, 0.84) for tendon thickening to 0.82 (0.66, 0.90) for overall greyscale score and 0.86 (0.73, 0.93) for calcification. The MDC for the greyscale total score was 2.0/12 and for the hyperaemia score was 1.1/5. The sonographic scoring system used in this study may be used reliably to quantify tendon abnormalities and change over time. A relatively inexperienced imager can conduct the assessment and use the rating scales reliably.
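The preceding abstract derives a minimum detectable change (MDC) from test-retest reliability but does not state the formula. One widely used convention, assumed here rather than taken from the study, is SEM = SD × sqrt(1 − ICC) and MDC95 = 1.96 × sqrt(2) × SEM. The standard deviation in the example is hypothetical, and the function names are illustrative.

```python
import math


def standard_error_of_measurement(sd: float, icc: float) -> float:
    """SEM = SD * sqrt(1 - ICC), a common convention (assumed, not from the study)."""
    if sd < 0 or not 0.0 <= icc <= 1.0:
        raise ValueError("need sd >= 0 and 0 <= icc <= 1")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sd: float, icc: float) -> float:
    """Smallest change exceeding measurement error at 95% confidence."""
    return 1.96 * math.sqrt(2.0) * standard_error_of_measurement(sd, icc)


# Hypothetical between-subject SD of 2.0 scale points with ICC = 0.75.
print(round(mdc95(sd=2.0, icc=0.75), 2))
```

Under this convention the MDC shrinks as the ICC approaches 1, which is why the more reliable scales in a study of this kind can detect smaller changes.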
Updating prognosis of cirrhosis by Cox's regression model using Child-Pugh score and aminopyrine breath test as time-dependent covariates.

PubMed

Merkel, C; Morabito, A; Sacerdoti, D; Bolognesi, M; Angeli, P; Gatta, A

1998-06-01

The determination of aminopyrine breath test on entry into the study was recently shown to improve the accuracy of prediction of death based on the Child-Pugh classification, but the possible usefulness of serial determinations of both parameters has not been assessed. In the present study, we aimed at evaluating whether serial determinations of aminopyrine breath test and Child-Pugh score improve prognostic accuracy in patients with cirrhosis, compared with determinations obtained only on admission. In 74 patients with liver cirrhosis aminopyrine breath test and Child-Pugh score were obtained upon entry into the study. Patients were followed with sequential aminopyrine breath tests and assessments of the Child-Pugh score every 4-6 months. A total number of 232 determinations were obtained. During follow-up 45 patients died, on average after 12 months of follow-up. Child-Pugh score improved in the beginning of follow-up, and then remained fairly constant; aminopyrine breath test showed no improvement in the beginning of follow-up, but rather a slowly progressive decline. In patients who died, both the Child-Pugh score and the metabolism of aminopyrine were significantly more impaired in the last year preceding death (p < 0.05). Applying Cox's regression model with time-dependent covariates, Child-Pugh score and aminopyrine breath test were independent significant predictors of survival. The model with time-dependent covariates explained the observed survival much better than the model with time-fixed covariates (chi-sq. explained by regression = 31.45 vs 11.97; d.f. = 2; p = 0.0000001 vs 0.003). These data suggest that serial determinations of Child-Pugh score and aminopyrine breath test can be used to efficiently update prognosis of cirrhosis.
Further evidence for the increased power of LOD scores compared with nonparametric methods.

PubMed

Durner, M; Vieland, V J; Greenberg, D A

1999-01-01

In genetic analysis of diseases in which the underlying model is unknown, "model free" methods, such as affected sib pair (ASP) tests, are often preferred over LOD-score methods, although LOD-score methods under the correct or even approximately correct model are more powerful than ASP tests. However, there might be circumstances in which nonparametric methods will outperform LOD-score methods. Recently, Dizier et al. reported that, in some complex two-locus (2L) models, LOD-score methods with segregation analysis-derived parameters had less power to detect linkage than ASP tests. We investigated whether these particular models, in fact, represent a situation in which ASP tests are more powerful than LOD scores. We simulated data according to the parameters specified by Dizier et al. and analyzed the data by using (a) a single-locus (SL) LOD-score analysis performed twice, under a simple dominant and a recessive mode of inheritance (MOI), (b) ASP methods, and (c) nonparametric linkage (NPL) analysis. We show that SL analysis performed twice and corrected for the type I-error increase due to multiple testing yields almost as much linkage information as does an analysis under the correct 2L model and is more powerful than either the ASP method or the NPL method. We demonstrate that, even for complex genetic models, the most important condition for linkage analysis is that the assumed MOI at the disease locus being tested is approximately correct, not that the inheritance of the disease per se is correctly specified. In the analysis by Dizier et al., segregation analysis led to estimates of dominance parameters that were grossly misspecified for the locus tested in those models in which ASP tests appeared to be more powerful than LOD-score analyses.
Cross-validation of the Dot Counting Test in a large sample of credible and non-credible patients referred for neuropsychological testing.

PubMed

McCaul, Courtney; Boone, Kyle B; Ermshar, Annette; Cottingham, Maria; Victor, Tara L; Ziegler, Elizabeth; Zeller, Michelle A; Wright, Matthew

2018-01-18

To cross-validate the Dot Counting Test in a large neuropsychological sample. Dot Counting Test scores were compared in credible (n = 142) and non-credible (n = 335) neuropsychology referrals. Non-credible patients scored significantly higher than credible patients on all Dot Counting Test scores. While the original E-score cut-off of ≥17 achieved excellent specificity (96.5%), it was associated with mediocre sensitivity (52.8%). However, the cut-off could be substantially lowered to ≥13.80, while still maintaining adequate specificity (≥90%), and raising sensitivity to 70.0%. Examination of non-credible subgroups revealed that Dot Counting Test sensitivity in feigned mild traumatic brain injury (mTBI) was 55.8%, whereas sensitivity was 90.6% in patients with non-credible cognitive dysfunction in the context of claimed psychosis, and 81.0% in patients with non-credible cognitive performance in depression or severe TBI. Thus, the Dot Counting Test may have a particular role in detection of non-credible cognitive symptoms in claimed psychiatric disorders. Alternative to use of the E-score, failure on ≥1 cut-offs applied to individual Dot Counting Test scores (≥6.0″ for mean grouped dot counting time, ≥10.0″ for mean ungrouped dot counting time, and ≥4 errors), occurred in 11.3% of the credible sample, while nearly two-thirds (63.6%) of the non-credible sample failed one or more of these cut-offs. An E-score cut-off of 13.80, or failure on ≥1 individual score cut-offs, resulted in few false positive identifications in credible patients, and achieved high sensitivity (64.0-70.0%), and therefore appear appropriate for use in identifying neurocognitive performance invalidity.
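The trade-off described in the preceding abstract, where lowering a "score ≥ cut-off" rule raises sensitivity at some cost in specificity, can be sketched as follows. The E-scores below are invented for illustration and are not study data; only the two cut-off values (17 and 13.80) come from the abstract, and the function name is hypothetical.

```python
def sensitivity_specificity(noncredible, credible, cutoff):
    """Classify a score >= cutoff as a failure; return (sensitivity, specificity)."""
    if not noncredible or not credible:
        raise ValueError("both groups must be non-empty")
    true_pos = sum(1 for s in noncredible if s >= cutoff)
    true_neg = sum(1 for s in credible if s < cutoff)
    return true_pos / len(noncredible), true_neg / len(credible)


# Hypothetical E-scores for the two groups.
noncredible_scores = [18.2, 14.1, 12.0, 20.5, 13.8]
credible_scores = [9.0, 11.5, 17.3, 8.2, 12.9]

for cutoff in (17.0, 13.8):
    sens, spec = sensitivity_specificity(noncredible_scores, credible_scores, cutoff)
    print(f"cut-off >= {cutoff}: sensitivity {sens:.0%}, specificity {spec:.0%}")
```

In this toy data the lower cut-off doubles sensitivity without changing specificity; in real samples the specificity usually falls somewhat, which is why the abstract reports it only as remaining at or above 90%.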
Knowledge loss of medical students on first year basic science courses at the University of Saskatchewan

PubMed Central

D'Eon, Marcel F

2006-01-01

Background Many senior undergraduate students from the University of Saskatchewan indicated informally that they did not remember much from their first year courses and wondered why we were teaching content that did not seem relevant to later clinical work or studies. To determine the extent of the problem, a course evaluation study that measured the knowledge loss of medical students on selected first year courses was conducted. This study replicates previous memory decrement studies with three first year medicine basic science courses, something that was not found in the literature. It was expected that some courses would show more and some courses would show less knowledge loss. Methods In the spring of 2004 over 20 students were recruited to retake questions from three first year courses: Immunology, Physiology, and Neuroanatomy. Student scores on the selected questions at the time of the final examination in May 2003 (the 'test') were compared with their scores on the questions 10 or 11 months later (the 're-test') using paired samples t-tests. A repeated-measures MANOVA was used to compare the test and re-test scores among the three courses. The re-test scores were matched with the overall student ratings of the courses and the student scores on the May 2003 examinations. Results A statistically significant main effect of knowledge loss (F = 297.385; p < .001) and an interaction effect by course (F = 46.081; p < .001) were found. The students' scores dropped 13.1% in Immunology, 46.5% in Neuroanatomy, and 16.1% in Physiology. Bonferroni post hoc comparisons showed a significant difference between Neuroanatomy and Physiology (mean difference of 10.7, p = .004). Conclusion There was considerable knowledge loss among medical students in the three basic science courses tested and this loss was not uniform across courses. Knowledge loss does not seem to be related to the marks on the final examination or the assessment of course quality by the students.
PMID:16412241
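The paired-samples t-test named in the preceding record compares each student's test score with the same student's re-test score, so it operates on the per-student differences. A minimal sketch using only the standard library follows; the scores are invented for illustration and the function name is hypothetical.

```python
import math
import statistics


def paired_t(test, retest):
    """Return (t statistic, degrees of freedom) for paired observations."""
    if len(test) != len(retest) or len(test) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [a - b for a, b in zip(test, retest)]
    sd_diff = statistics.stdev(diffs)  # sample SD of the differences
    if sd_diff == 0:
        raise ValueError("t is undefined when all differences are identical")
    standard_error = sd_diff / math.sqrt(len(diffs))
    return statistics.mean(diffs) / standard_error, len(diffs) - 1


# Hypothetical percentage scores for four students.
test_scores = [80, 75, 90, 85]
retest_scores = [70, 70, 80, 70]
t_stat, df = paired_t(test_scores, retest_scores)
print(f"t = {t_stat:.3f}, df = {df}")
```

Because every student serves as their own control, between-student variation in ability drops out of the comparison, which is what makes the paired design suitable for a knowledge-loss study.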
Effects of arginine vasopressin on musical working memory.

PubMed

Granot, Roni Y; Uzefovsky, Florina; Bogopolsky, Helena; Ebstein, Richard P

2013-01-01

Previous genetic studies showed an association between variations in the gene coding for the 1a receptor of the neuro-hormone arginine vasopressin (AVP) and musical working memory (WM). The current study set out to test the influence of intranasal administration (INA) of AVP on musical as compared to verbal WM using a double blind crossover (AVP-placebo) design. Two groups of 25 males were exposed to 20 IU of AVP in one session, and 20 IU of saline water (placebo) in a second session, 1 week apart. In each session subjects completed the tonal subtest from Gordon's "Musical Aptitude Profile," the interval subtest from the "Montreal Battery for Evaluation of Amusias (MBEA)," and the forward and backward digit span tests. Scores in the digit span tests were not influenced by AVP. In contrast, in the music tests there was an AVP effect. In the MBEA test, scores for the group receiving placebo in the first session (PV) were higher than for the group receiving vasopressin in the first session (VP) (p < 0.05) with no main Session effect nor Group × Session interaction. In the Gordon test there was a main Session effect (p < 0.05) with scores higher in the second as compared to the first session, a marginal main Group effect (p = 0.093) and a marginal Group × Session interaction (p = 0.88). In addition we found that the group that received AVP in the first session scored higher on scales indicative of happiness, and alertness on the positive and negative affect scale, (PANAS). Only in this group and only in the music test these scores were significantly correlated with memory scores. Together the results reflect a complex interaction between AVP, musical memory, arousal, and contextual effects such as session, and base levels of memory. The results are interpreted in light of music's universal use as a means to modulate arousal on the one hand, and AVP's influence on mood, arousal, and social interactions on the other.

A Web-based course on infection control for physicians in training: an educational intervention.

PubMed

Fakih, Mohamad G; Enayet, Iram; Minnick, Steven; Saravolatz, Louis D

2006-07-01

To evaluate the effectiveness of a Web-based course on infection control accessed by physicians in training. Educational intervention. A 607-bed urban teaching hospital. A total of 55 physicians in training beginning their first postgraduate year (the iPGY1 group) and 59 physicians completing their first, second, or third postgraduate year (the oPGY group). Individuals in the iPGY1 group took a Web-based course on infection control practices. Persons in the iPGY1 group who took the Web-based course completed an evaluation test consisting of 15 multiple-choice questions (total possible score, 15 points). The same test was given to persons in the oPGY group, who did not take the Web-based course. We compared scores of the Web-based test taken by subjects in the iPGY1 group immediately after the course with scores of the test they took 3 months after the course and with test scores of subjects in the oPGY group. The mean score (+/-SD) for subjects in the iPGY1 group who took the Web-based course was 10.6+/-2.2, compared with 8.0+/-2.5 for subjects in the oPGY group (P<.001). The mean score (+/-SD) for subjects in the iPGY1 group 3 months after completing the course decreased to 8.0+/-2.4 (P<.001 by the paired t test). For the oPGY group, significant differences were found between the scores (+/-SD) for subjects in the internal medicine (9.9+/-2.3), emergency medicine (8.4+/-1.7), pediatrics (7.0+/-1.7), and family medicine (5.8+/-1.6) residency programs (P<.001); there were no significant differences in scores according to the year of residency. Web-based infection control courses are an attractive teaching tool for physicians in training and need to be considered for teaching infection control. The evaluation of information retention will help identify physicians in training who require further training.
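The between-group comparison in the preceding abstract (10.6 ± 2.2 versus 8.0 ± 2.5) can be roughly checked from the summary statistics alone. The sketch below uses Welch's unequal-variance t statistic, which is an assumption: the abstract does not name the test used for this comparison, and the group sizes of 55 and 59 assume that every enrolled subject took the test. The function name is hypothetical.

```python
import math


def welch_t_from_summary(mean1, sd1, n1, mean2, sd2, n2):
    """Welch t statistic and approximate degrees of freedom from summary data."""
    if n1 < 2 or n2 < 2:
        raise ValueError("each group needs at least 2 observations")
    var_of_mean1 = sd1 ** 2 / n1
    var_of_mean2 = sd2 ** 2 / n2
    t = (mean1 - mean2) / math.sqrt(var_of_mean1 + var_of_mean2)
    # Welch-Satterthwaite approximation to the degrees of freedom.
    df = (var_of_mean1 + var_of_mean2) ** 2 / (
        var_of_mean1 ** 2 / (n1 - 1) + var_of_mean2 ** 2 / (n2 - 1)
    )
    return t, df


# Means and SDs from the abstract; group sizes are assumed (see above).
t_stat, df = welch_t_from_summary(10.6, 2.2, 55, 8.0, 2.5, 59)
print(f"t = {t_stat:.2f}, df = {df:.1f}")
```

A t statistic of this size on more than 100 degrees of freedom is consistent with the reported P<.001.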
Relationship between the Wide Range Achievement Test 3 and the Wechsler Individual Achievement Test.

PubMed

Smith, T D; Smith, B L

1998-12-01

The present study examined the relationship between the Wide Range Achievement Test 3 and the Wechsler Individual Achievement Test for a sample of children with learning disabilities in two rural school districts. Data were collected for 87 school children who had been classified as learning disabled and placed in special education resource services. Pearson product-moment correlations between scores on the two measures were significant and moderate to high; however, mean scores were not significantly different on Reading, Spelling, and Arithmetic subtests of the Wide Range Achievement Test 3 compared to those for the basic Reading, Spelling, and Mathematics Reasoning subtests of the Wechsler Individual Achievement Test. Although there were significant mean differences between scores on Reading and Reading Comprehension and on Arithmetic and Numerical Operations, magnitudes were small. It appears that the two tests provide similar results when screening for reading, spelling, and arithmetic.
Effect of January vacations and prior night call status on resident ABSITE performance.

PubMed

Sugar, Jane G; Chu, Quyen D; Cole, Philip A; Li, Benjamin D L; Kim, Roger H

2013-01-01

To determine if vacations in January or on-call status have an effect on American Board of Surgery In-Training Examination (ABSITE) scores. Retrospective review of the performance of general surgery residents on ABSITE. Data collected included ABSITE scores, United States Medical Licensing Examination Step 2 scores, January vacation schedules, and call schedules. ABSITE performance was examined for correlation with vacation or call schedules. Student t test was used for statistical analysis, with a p value of less than 0.05 considered significant. General surgery residency program at the Louisiana State University Health Sciences Center-Shreveport, a university hospital-based program with 5 categorical residents per year. Postgraduate year (PGY) 1 through 5 general surgery categorical residents from 2006 to 2012. A total of 170 ABSITE scores from 55 residents were reviewed. The mean score when vacation was taken was 48.6 as compared with 36.3 when no vacation was taken (p = 0.02). Residents who took a January vacation at least once in their residency had a mean score of 42.8 as compared with 37.7 of those who did not (p = 0.43). The mean United States Medical Licensing Examination Step 2 score of residents who took a January vacation at least once in their residency was 218 as compared with 217 for their peers (p = 0.78). Among residents who took January vacations, the mean score in the years they took vacation was 49.4 as compared with 35.4 in the years they did not (p = 0.02). Prior night call status had no effect on the examination scores (44.2 vs 38.6, p = 0.30). Mean ABSITE scores were higher for residents who took a January vacation before the examination, despite no apparent difference in baseline test-taking ability. Among residents who took January vacations, mean scores were higher in the years they took vacation than in other years. On-call status did not have an effect on ABSITE performance. 
Vacation schedules in January can have a significant effect on ABSITE scores. Copyright © 2013 Association of Program Directors in Surgery. Published by Elsevier Inc. All rights reserved.
A pilot study: the development of a culturally tailored Malaysian Diabetes Education Module (MY-DEMO) based on the Health Belief Model.

PubMed

Ahmad, Badariah; Ramadas, Amutha; Kia Fatt, Quek; Md Zain, Anuar Zaini

2014-04-08

Diabetes education and self-care remains the cornerstone of diabetes management. There are many structured diabetes modules available in the United Kingdom, Europe and United States of America. Contrastingly, few structured and validated diabetes modules are available in Malaysia. This pilot study aims to develop and validate diabetes education material suitable and tailored for a multicultural society like Malaysia. The theoretical framework of this module was founded on the Health Belief Model (HBM). The participants were assessed using 6-item pre- and post-test questionnaires that measured some of the known HBM constructs, namely cues to action, perceived severity and perceived benefit. Data was analysed using PASW Statistics 18.0. The pre- and post-test questionnaires were administered to 88 participants (31 males). In general, there was a significant increase in the total score in post-test (97.34 ± 6.13%) compared to pre-test (92.80 ± 12.83%) (p < 0.05) and a significant increase in excellent score (>85%) at post-test (84.1%) compared to pre-test (70.5%) (p < 0.05). There was an improvement in post-test score in 4 of 6 items tested. The remaining 2 items, which measured the perceived severity and cues to action, had poorer post-test scores. The preliminary results from this pilot study suggest that the contextualised content material embedded within MY DEMO may be suitable for integration with the existing diabetes education programmes. This was the first known validated diabetes education programme available in the Malay language.
Creating a Computer Adaptive Test Version of the Late-Life Function & Disability Instrument

PubMed Central

Jette, Alan M.; Haley, Stephen M.; Ni, Pengsheng; Olarsch, Sippy; Moed, Richard

2009-01-01

Background This study applied Item Response Theory (IRT) and Computer Adaptive Test (CAT) methodologies to develop a prototype function and disability assessment instrument for use in aging research. Herein, we report on the development of the CAT version of the Late-Life Function & Disability instrument (Late-Life FDI) and evaluate its psychometric properties. Methods We employed confirmatory factor analysis, IRT methods, validation, and computer simulation analyses of data collected from 671 older adults residing in residential care facilities. We compared accuracy, precision, and sensitivity to change of scores from CAT versions of two Late-Life FDI scales with scores from the fixed-form instrument. Score estimates from the prototype CAT versus the original instrument were compared in a sample of 40 older adults. Results Distinct function and disability domains were identified within the Late-Life FDI item bank and used to construct two prototype CAT scales. Using retrospective data, scores from computer simulations of the prototype CAT scales were highly correlated with scores from the original instrument. The results of computer simulation, accuracy, precision, and sensitivity to change of the CATs closely approximated those of the fixed-form scales, especially for the 10- or 15-item CAT versions. In the prospective study each CAT was administered in less than 3 minutes and CAT scores were highly correlated with scores generated from the original instrument. Conclusions CAT scores of the Late-Life FDI were highly comparable to those obtained from the full-length instrument with a small loss in accuracy, precision, and sensitivity to change. PMID:19038841
Greater power and computational efficiency for kernel-based association testing of sets of genetic variants

PubMed Central

Lippert, Christoph; Xiang, Jing; Horta, Danilo; Widmer, Christian; Kadie, Carl; Heckerman, David; Listgarten, Jennifer

2014-01-01

Motivation: Set-based variance component tests have been identified as a way to increase power in association studies by aggregating weak individual effects. However, the choice of test statistic has been largely ignored even though it may play an important role in obtaining optimal power. We compared a standard statistical test (a score test) with a recently developed likelihood ratio (LR) test. Further, when correction for hidden structure is needed, or gene–gene interactions are sought, state-of-the-art algorithms for both the score and LR tests can be computationally impractical. Thus we develop new computationally efficient methods. Results: After reviewing theoretical differences in performance between the score and LR tests, we find empirically on real data that the LR test generally has more power. In particular, on 15 of 17 real datasets, the LR test yielded at least as many associations as the score test (up to 23 more associations), whereas the score test yielded at most one more association than the LR test in the two remaining datasets. On synthetic data, we find that the LR test yielded up to 12% more associations, consistent with our results on real data, but also observe a regime of extremely small signal where the score test yielded up to 25% more associations than the LR test, consistent with theory. Finally, our computational speedups now enable (i) efficient LR testing when the background kernel is full rank, and (ii) efficient score testing when the background kernel changes with each test, as for gene–gene interaction tests. The latter yielded a factor of 2000 speedup on a cohort of size 13 500. Availability: Software available at http://research.microsoft.com/en-us/um/redmond/projects/MSCompBio/Fastlmm/. Contact: heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25075117
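The difference between a score test and a likelihood ratio test can be seen on a much simpler problem than the variance component models in this record. The sketch below is only an illustration, assuming a single binomial proportion tested against a null value `p0`; the function names and the data are hypothetical, and this is not the FaST-LMM software referenced above. The score statistic uses only quantities evaluated at the null, while the LR statistic also needs the maximum likelihood estimate.

```python
import math

def score_statistic(x, n, p0):
    """Score (Rao) statistic for H0: p = p0; evaluated entirely at the null value."""
    return (x - n * p0) ** 2 / (n * p0 * (1.0 - p0))

def lr_statistic(x, n, p0):
    """Likelihood ratio statistic: twice the log-likelihood gain of the MLE over p0."""
    def loglik(p):
        total = 0.0
        # Terms with a zero count contribute nothing (avoids log(0)).
        if x > 0:
            total += x * math.log(p)
        if n - x > 0:
            total += (n - x) * math.log(1.0 - p)
        return total
    return 2.0 * (loglik(x / n) - loglik(p0))

def chi2_1df_pvalue(stat):
    """Upper-tail probability of a chi-square variable with 1 degree of freedom."""
    return math.erfc(math.sqrt(stat / 2.0))

# Hypothetical data: 60 successes in 100 trials, null value 0.5.
x, n, p0 = 60, 100, 0.5
for name, stat in [("score", score_statistic(x, n, p0)),
                   ("LR", lr_statistic(x, n, p0))]:
    print(f"{name}: statistic={stat:.3f}, p={chi2_1df_pvalue(stat):.4f}")
```

Both statistics are referred to the same chi-square reference distribution here, so any difference in power comes from the statistic itself, which is the kind of comparison the abstract describes.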
Score tests for independence in semiparametric competing risks models.

PubMed

Saïd, Mériem; Ghazzali, Nadia; Rivest, Louis-Paul

2009-12-01

A popular model for competing risks postulates the existence of a latent unobserved failure time for each risk. Assuming that these underlying failure times are independent is attractive since it allows standard statistical tools for right-censored lifetime data to be used in the analysis. This paper proposes simple independence score tests for the validity of this assumption when the individual risks are modeled using semiparametric proportional hazards regressions. It assumes that covariates are available, making the model identifiable. The score tests are derived for alternatives that specify that copulas are responsible for a possible dependency between the competing risks. The test statistics are constructed by adding to the partial likelihoods for the individual risks an explanatory variable for the dependency between the risks. A variance estimator is derived by writing the score function and the Fisher information matrix for the marginal models as stochastic integrals. Pitman efficiencies are used to compare test statistics. A simulation study and a numerical example illustrate the methodology proposed in this paper.
History of smoking and olfaction in Parkinson's disease.

PubMed

Lucassen, Elisabeth B; Sterling, Nicholas W; Lee, Eun-Young; Chen, Honglei; Lewis, Mechelle M; Kong, Lan; Huang, Xuemei

2014-07-01

Olfactory dysfunction is the most common pre-motor symptom in Parkinson's disease (PD), and smoking is known to be associated with lower risk of PD. This study tested the hypothesis that smoking is associated with better olfaction in PD. Smoking history was obtained from 76 PD subjects (22 with a history of smoking [smokers], 54 who never smoked [nonsmokers]), and 70 controls (17 smokers, 53 nonsmokers). Olfaction was assessed using the 40-item University of Pennsylvania Smell Identification Test (UPSIT). The olfactory scores between groups and subgroups were compared using analysis of covariance with adjustment for age, gender, and monoamine oxidase B (MAO-B) inhibitor usage. Overall the olfactory score was lower in PD compared with controls (olfactory scores: 21.5 vs. 33.5, P < 0.0001). Among controls, there was no significant difference in olfaction between smokers and nonsmokers (olfactory scores, 33.2 vs. 34.2; P = 0.95). Among PD subjects, however, smokers scored significantly better regarding olfaction compared with nonsmokers (olfactory scores: 24.4 vs. 19.9, P = 0.02). These data suggest that a history of smoking is associated with better olfaction among PD patients. The finding may be related to why smoking may be protective against PD. Further studies are needed to confirm this finding and investigate the underlying mechanisms. © 2014 International Parkinson and Movement Disorder Society.
Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.

PubMed

Sawers, Andrew; Hafner, Brian

2018-04-11

To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P<.001). The NBWT also had the largest area under the ROC curve (.81) and was the only test to exhibit an area that was statistically significantly >.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. 
Published by Elsevier Inc. All rights reserved.
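The area under the ROC curve used above for discriminant validity can be read as the probability that a randomly chosen case from one group scores higher than a randomly chosen case from the other. The following is a minimal sketch of that pairwise definition; the function name and the scores are hypothetical and are not taken from the study.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve as the probability that a randomly chosen
    positive case scores higher than a randomly chosen negative case
    (ties count one half)."""
    if not positive_scores or not negative_scores:
        raise ValueError("both groups must be non-empty")
    wins = 0.0
    for p in positive_scores:
        for q in negative_scores:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))

# Hypothetical balance scores. Lower scores mean worse balance, so the
# scores are negated to make "higher" mean "more likely to be a faller".
fallers = [0.20, 0.35, 0.40, 0.55]
non_fallers = [0.45, 0.60, 0.70, 0.80]
print(auc([-s for s in fallers], [-s for s in non_fallers]))
```

An area of 0.50 corresponds to chance, which is why the abstract compares each test's area against that value.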
The King-Devick test as a determinant of head trauma and concussion in boxers and MMA fighters.

PubMed

Galetta, K M; Barrett, J; Allen, M; Madda, F; Delicata, D; Tennant, A T; Branas, C C; Maguire, M G; Messner, L V; Devick, S; Galetta, S L; Balcer, L J

2011-04-26

Sports-related concussion has received increasing attention as a cause of short- and long-term neurologic symptoms among athletes. The King-Devick (K-D) test is based on measurement of the speed of rapid number naming (reading aloud single-digit numbers from 3 test cards), and captures impairment of eye movements, attention, language, and other correlates of suboptimal brain function. We investigated the K-D test as a potential rapid sideline screening for concussion in a cohort of boxers and mixed martial arts fighters. The K-D test was administered prefight and postfight. The Military Acute Concussion Evaluation (MACE) was administered as a more comprehensive but longer test for concussion. Differences in postfight K-D scores and changes in scores from prefight to postfight were compared for athletes with head trauma during the fight vs those without. Postfight K-D scores (n = 39 participants) were significantly higher (worse) for those with head trauma during the match (59.1 ± 7.4 vs 41.0 ± 6.7 seconds, p < 0.0001, Wilcoxon rank sum test). Those with loss of consciousness showed the greatest worsening from prefight to postfight. Worse postfight K-D scores (r(s) = -0.79, p = 0.0001) and greater worsening of scores (r(s) = 0.90, p < 0.0001) correlated well with postfight MACE scores. Worsening of K-D scores by ≥5 seconds was a distinguishing characteristic noted only among participants with head trauma. High levels of test-retest reliability were observed (intraclass correlation coefficient 0.97 [95% confidence interval 0.90-1.0]). The K-D test is an accurate and reliable method for identifying athletes with head trauma, and is a strong candidate rapid sideline screening test for concussion.
The King-Devick test as a determinant of head trauma and concussion in boxers and MMA fighters

PubMed Central

Galetta, K.M.; Barrett, J.; Allen, M.; Madda, F.; Delicata, D.; Tennant, A.T.; Branas, C.C.; Maguire, M.G.; Messner, L.V.; Devick, S.; Galetta, S.L.

2011-01-01

Objective: Sports-related concussion has received increasing attention as a cause of short- and long-term neurologic symptoms among athletes. The King-Devick (K-D) test is based on measurement of the speed of rapid number naming (reading aloud single-digit numbers from 3 test cards), and captures impairment of eye movements, attention, language, and other correlates of suboptimal brain function. We investigated the K-D test as a potential rapid sideline screening for concussion in a cohort of boxers and mixed martial arts fighters. Methods: The K-D test was administered prefight and postfight. The Military Acute Concussion Evaluation (MACE) was administered as a more comprehensive but longer test for concussion. Differences in postfight K-D scores and changes in scores from prefight to postfight were compared for athletes with head trauma during the fight vs those without. Results: Postfight K-D scores (n = 39 participants) were significantly higher (worse) for those with head trauma during the match (59.1 ± 7.4 vs 41.0 ± 6.7 seconds, p < 0.0001, Wilcoxon rank sum test). Those with loss of consciousness showed the greatest worsening from prefight to postfight. Worse postfight K-D scores (rs = −0.79, p = 0.0001) and greater worsening of scores (rs = 0.90, p < 0.0001) correlated well with postfight MACE scores. Worsening of K-D scores by ≥5 seconds was a distinguishing characteristic noted only among participants with head trauma. High levels of test-retest reliability were observed (intraclass correlation coefficient 0.97 [95% confidence interval 0.90–1.0]). Conclusions: The K-D test is an accurate and reliable method for identifying athletes with head trauma, and is a strong candidate rapid sideline screening test for concussion. PMID:21288984
External validation of the HIT Expert Probability (HEP) score.

PubMed

Joseph, Lee; Gomes, Marcelo P V; Al Solaiman, Firas; St John, Julie; Ozaki, Asuka; Raju, Manjunath; Dhariwal, Manoj; Kim, Esther S H

2015-03-01

The diagnosis of heparin-induced thrombocytopenia (HIT) can be challenging. The HIT Expert Probability (HEP) Score has recently been proposed to aid in the diagnosis of HIT. We sought to externally and prospectively validate the HEP score. We prospectively assessed pre-test probability of HIT for 51 consecutive patients referred to our Consultative Service for evaluation of possible HIT between August 1, 2012 and February 1, 2013. Two Vascular Medicine fellows independently applied the 4T and HEP scores for each patient. Two independent HIT expert adjudicators rendered a diagnosis of HIT likely or unlikely. The median (interquartile range) of 4T and HEP scores were 4.5 (3.0, 6.0) and 5 (3.0, 8.5), respectively. There were no significant differences between area under receiver-operating characteristic curves of 4T and HEP scores against the gold standard, confirmed HIT [defined as positive serotonin release assay and positive anti-PF4/heparin ELISA] (0.74 vs 0.73, p = 0.97). HEP score ≥ 2 was 100 % sensitive and 16 % specific for determining the presence of confirmed HIT while a 4T score > 3 was 93 % sensitive and 35 % specific. In conclusion, the HEP and 4T scores are excellent screening pre-test probability models for HIT, however, in this prospective validation study, test characteristics for the diagnosis of HIT based on confirmatory laboratory testing and expert opinion are similar. Given the complexity of the HEP scoring model compared to that of the 4T score, further validation of the HEP score is warranted prior to widespread clinical acceptance.
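The trade-off between sensitivity and specificity at different score cut-offs, as reported above for the HEP and 4T scores, can be sketched as follows. This is an illustration only: the function name, scores, and reference diagnoses are hypothetical and do not reproduce the study data.

```python
def sensitivity_specificity(scores, has_condition, threshold):
    """Treat score >= threshold as test-positive and compare with the reference standard."""
    tp = fn = tn = fp = 0
    for score, truth in zip(scores, has_condition):
        positive = score >= threshold
        if truth and positive:
            tp += 1
        elif truth:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case with and one without the condition")
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical pre-test probability scores and reference diagnoses.
scores = [1, 2, 3, 5, 6, 8, 9, 4]
confirmed = [False, False, False, False, True, True, True, True]
for cutoff in (2, 5):
    sens, spec = sensitivity_specificity(scores, confirmed, cutoff)
    print(f"cutoff >= {cutoff}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

A low cut-off catches every confirmed case at the cost of many false positives, which mirrors the pattern of high sensitivity and low specificity described in the abstract.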
The Comparative Effectiveness of Different Item Analysis Techniques in Increasing Change Score Reliability.

ERIC Educational Resources Information Center

Crocker, Linda M.; Mehrens, William A.

Four new methods of item analysis were used to select subsets of items which would yield measures of attitude change. The sample consisted of 263 students at Michigan State University who were tested on the Inventory of Beliefs as freshmen and retested on the same instrument as juniors. Item change scores and total change scores were computed for…
Intelligent Use of Intelligence Tests: Empirical and Clinical Support for Canadian WAIS-IV Norms

ERIC Educational Resources Information Center

Miller, Jessie L.; Weiss, Lawrence G.; Beal, A. Lynne; Saklofske, Donald H.; Zhu, Jianjun; Holdnack, James A.

2015-01-01

It is well established that Canadians produce higher raw scores than their U.S. counterparts on intellectual assessments. As a result of these differences in ability along with smaller variability in the population's intellectual performance, Canadian normative data will yield lower standard scores for most raw score points compared to U.S. norms.…
Comparing Human and Automated Essay Scoring for Prospective Graduate Students with Learning Disabilities and/or ADHD

ERIC Educational Resources Information Center

Buzick, Heather; Oliveri, Maria Elena; Attali, Yigal; Flor, Michael

2016-01-01

Automated essay scoring is a developing technology that can provide efficient scoring of large numbers of written responses. Its use in higher education admissions testing provides an opportunity to collect validity and fairness evidence to support current uses and inform its emergence in other areas such as K-12 large-scale assessment. In this…
Description, measurement and evaluation of tertiary-education food environments.

PubMed

Roy, R; Hebden, L; Kelly, B; De Gois, T; Ferrone, E M; Samrout, M; Vermont, S; Allman-Farinelli, M

2016-05-01

Obesity in young adults is an increasing health problem in Australia and many other countries. Evidence-based information is needed to guide interventions that reduce the obesity-promoting elements in tertiary-education environments. In a food environmental audit survey, 252 outlets were audited across seven institutions: three universities and four technical and further education institution campuses. A scoring instrument called the food environment-quality index was developed and used to assess all food outlets on these campuses. Information was collated on the availability, accessibility and promotion of foods and beverages and a composite score (maximum score=148; higher score indicates healthier outlets) was calculated. Each outlet and the overall campus were ranked into tertiles based on their 'healthiness'. Differences in median scores for each outcome measure were compared between institutions and outlet types using one-way ANOVA with post hoc Scheffe's testing, χ² tests, Kruskal-Wallis H test and the Mann-Whitney U test. Binomial logistic regressions were used to compare the proportion of healthy v. unhealthy food categories across different types of outlets. Overall, the most frequently available items were sugar-sweetened beverages (20 % of all food/drink items) followed by chocolates (12 %), high-energy (>600 kJ/serve) foods (10 %), chips (10 %) and confectionery (10 %). Healthy food and beverages were observed to be less available, accessible and promoted than unhealthy options. The median score across all outlets was 72 (interquartile range=7). Tertiary-education food environments are dominated by high-energy, nutrient-poor foods and beverages. Interventions to decrease availability, accessibility and promotion of unhealthy foods are needed.
Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

PubMed

Liu, Zhihai; Su, Minyi; Han, Li; Liu, Jie; Yang, Qifan; Li, Yan; Wang, Renxiao

2017-02-21

In structure-based drug design, scoring functions are widely used for fast evaluation of protein-ligand interactions. They are often applied in combination with molecular docking and de novo design methods. Since the early 1990s, a whole spectrum of protein-ligand interaction scoring functions have been developed. Regardless of their technical difference, scoring functions all need data sets combining protein-ligand complex structures and binding affinity data for parametrization and validation. However, data sets of this kind used to be rather limited in terms of size and quality. On the other hand, standard metrics for evaluating scoring function used to be ambiguous. Scoring functions are often tested in molecular docking or even virtual screening trials, which do not directly reflect the genuine quality of scoring functions. Collectively, these underlying obstacles have impeded the invention of more advanced scoring functions. In this Account, we describe our long-lasting efforts to overcome these obstacles, which involve two related projects. On the first project, we have created the PDBbind database. It is the first database that systematically annotates the protein-ligand complexes in the Protein Data Bank (PDB) with experimental binding data. This database has been updated annually since its first public release in 2004. The latest release (version 2016) provides binding data for 16 179 biomolecular complexes in PDB. Data sets provided by PDBbind have been applied to many computational and statistical studies on protein-ligand interaction and various subjects. In particular, it has become a major data resource for scoring function development. On the second project, we have established the Comparative Assessment of Scoring Functions (CASF) benchmark for scoring function evaluation. Our key idea is to decouple the "scoring" process from the "sampling" process, so scoring functions can be tested in a relatively pure context to reflect their quality. 
In our latest work on this track, i.e. CASF-2013, the performance of a scoring function was quantified in four aspects, including "scoring power", "ranking power", "docking power", and "screening power". All four performance tests were conducted on a test set containing 195 high-quality protein-ligand complexes selected from PDBbind. A panel of 20 standard scoring functions were tested as demonstration. Importantly, CASF is designed to be an open-access benchmark, with which scoring functions developed by different researchers can be compared on the same grounds. Indeed, it has become a popular choice for scoring function validation in recent years. Despite the considerable progress that has been made so far, the performance of today's scoring functions still does not meet people's expectations in many aspects. There is a constant demand for more advanced scoring functions. Our efforts have helped to overcome some obstacles underlying scoring function development so that the researchers in this field can move forward faster. We will continue to improve the PDBbind database and the CASF benchmark in the future to keep them as useful community resources.
Pilot study on objective measurement of abdominal wall strength in patients with ventral incisional hernia.

PubMed

Parker, Michael; Goldberg, Ross F; Dinkins, Maryane M; Asbun, Horacio J; Daniel Smith, C; Preissler, Susanne; Bowers, Steven P

2011-11-01

Outcomes after ventral incisional hernia (VIH) repair are measured by recurrence rate and subjective measures. No objective metrics evaluate functional outcomes after abdominal wall reconstruction. This study aimed to develop testing of abdominal wall strength (AWS) that could be validated as a useful metric. Data were prospectively collected during 9 months from 35 patients. A total of 10 patients were evaluated before and after VIH repair, for a total of 45 encounters. The patients were tested simultaneously or in succession by two of three examiners. Data were collected for three tests: double leg lowering (DLL), trunk raising (TR), and supine reaching (SR). Raw data were compared and tested for validity, and continuous data were transformed to categorical data. Agreement was measured using the intraclass correlation coefficient (ICC) for DLL and using kappa for the ordinal measures. Simultaneous testing yielded the following interobserver reliability: DLL (0.96 and 0.87), TR (1.00 and 0.95), and SR (0.76). Reproducibility was assessed by consecutive tests, with correlation as follows: DLL (0.81), TR (0.81), and SR (0.21). Due to poor interobserver reliability for the SR test compared with the DLL and TR tests, the SR test was excluded from calculation of an overall score. Based on raw data distribution from the DLL and TR tests, the DLL data were categorized into 10° increments, allowing construction of a 10-point score. The median AWS score was 5 (interquartile range [IQR], 4-7), and there was agreement within 1 point for 42 of the 45 encounters (93%). The findings from this study demonstrate that the 10-point AWS score may measure AWS in an accurate and reproducible fashion, with potential for objective description of abdominal wall function of VIH patients. This score may help to identify patients suited for abdominal wall reconstruction while measuring progress after VIH repair. Further longitudinal outcomes studies are needed.
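Agreement between two examiners on categorical ratings, as measured by kappa above, can be sketched with the unweighted Cohen's kappa. The abstract does not say whether a weighted variant was used for the ordinal measures, so the unweighted form is an assumption, and the ratings below are hypothetical.

```python
from collections import Counter

def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: observed agreement corrected for chance agreement."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(rater_a)
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    count_a, count_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of each rater's marginal proportions per category.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        return 1.0  # both raters used a single identical category throughout
    return (observed - expected) / (1.0 - expected)

# Hypothetical ordinal scores given by two examiners to six encounters.
examiner_1 = [3, 4, 4, 5, 2, 3]
examiner_2 = [3, 4, 5, 5, 2, 2]
print(round(cohen_kappa(examiner_1, examiner_2), 3))
```

Because unweighted kappa treats a one-point disagreement the same as a five-point one, a weighted form may be preferable for ordinal scores such as these.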
A novel examination of atypical major depressive disorder based on attachment theory.

PubMed

Levitan, Robert D; Atkinson, Leslie; Pedersen, Rebecca; Buis, Tom; Kennedy, Sidney H; Chopra, Kevin; Leung, Eman M; Segal, Zindel V

2009-06-01

While a large body of descriptive work has thoroughly investigated the clinical correlates of atypical depression, little is known about its fundamental origins. This study examined atypical depression from an attachment theory framework. Our hypothesis was that, compared to adults with melancholic depression, those with atypical depression would report more anxious-ambivalent attachment and less secure attachment. As gender has been an important consideration in prior work on atypical depression, this same hypothesis was further tested in female subjects only. One hundred ninety-nine consecutive adults presenting to a tertiary mood disorders clinic with major depressive disorder with either atypical or melancholic features according to the Structured Clinical Interview for DSM-IV Axis-I Disorders were administered a self-report adult attachment questionnaire to assess the core dimensions of secure, anxious-ambivalent, and avoidant attachment. Attachment scores were compared across the 2 depressed groups defined by atypical and melancholic features using multivariate analysis of variance. The study was conducted between 1999 and 2004. When men and women were considered together, the multivariate test comparing attachment scores by depressive group was statistically significant at p < .05. Between-subjects testing indicated that atypical depression was associated with significantly lower secure attachment scores, with a trend toward higher anxious-ambivalent attachment scores, than was melancholia. When women were analyzed separately, the multivariate test was statistically significant at p < .01, with both secure and anxious-ambivalent attachment scores differing significantly across depressive groups. These preliminary findings suggest that attachment theory, and insecure and anxious-ambivalent attachment in particular, may be a useful framework from which to study the origins, clinical correlates, and treatment of atypical depression. 
Gender may be an important consideration when considering atypical depression from an attachment perspective. Copyright 2009 Physicians Postgraduate Press, Inc.
Duration and frequency of migraines affect cognitive function: evidence from neuropsychological tests and event-related potentials.

PubMed

Huang, Lifang; Juan Dong, Hong; Wang, Xi; Wang, Yan; Xiao, Zheman

2017-12-01

The aim of this study was to evaluate the changes in the cognitive performance of migraine patients using a comprehensive series of cognitive/behavioral and electrophysiological tests. A randomized, cross-sectional, within-subject approach was used to compare neuropsychological and electrophysiological evaluations from migraine-affected and healthy subjects. Thirty-four patients with migraine (6 males, 28 females, average 36 years old) were included. Migraineurs performed worse in the majority of the Montreal Cognitive Assessment (MoCA) (p = 0.007) compared to the healthy subjects, significantly in language (p = 0.005), memory (p = 0.006), executive functions (p = 0.042), calculation (p = 0.018) and orientation (p = 0.012). Migraineurs had a lower score on the memory trial of the Rey-Osterrieth complex figure test (ROCF) (p = 0.012). The P3 latency in Fz, Cz, Pz was prolonged in migraineurs compared with the normal control group (P < 0.001). In addition, we analyzed significant correlations between MoCA score and the duration of migraine. We also observed that decreases in the MoCA-executive functions and calculation score and in the ROCF-recall score were both correlated with the frequency of migraine. Migraineurs were more anxious than healthy subjects (p = 0.001), which is independent of cognitive testing. Differences were unrelated to age, gender and literacy. Cognitive performance decreases during migraine, and cognitive dysfunction can be related to the duration and frequency of a migraine attack.

A comparison of paper-and-pencil and computerized forms of Line Orientation and Enhanced Cued Recall Tests.

PubMed

Aşkar, Petek; Altun, Arif; Cangöz, Banu; Cevik, Vildan; Kaya, Galip; Türksoy, Hasan

2012-04-01

The purpose of this study was to assess whether a computerized battery of neuropsychological tests could produce similar results as the conventional forms. Comparisons on 77 volunteer undergraduates were carried out with two neuropsychological tests: Line Orientation Test and Enhanced Cued Recall Test. Firstly, students were assigned randomly across the test medium (paper-and-pencil versus computerized). Secondly, the groups were given the same test in the other medium after a 30-day interval between tests. Results showed that the Enhanced Cued Recall Test-Computer-based did not correlate with the Enhanced Cued Recall Test-Paper-and-pencil results. Line Orientation Test-Computer-based scores, on the other hand, did correlate significantly with the Line Orientation Test-Paper-and-pencil version. In both tests, scores were higher on paper-and-pencil tests compared to computer-based tests. Total score difference between modalities was statistically significant for both Enhanced Cued Recall Tests and for the Line Orientation Test. In both computer-based tests, it took less time for participants to complete the tests.
Objective structured assessment of technical skills evaluation of theoretical compared with hands-on training of shoulder dystocia management: a randomized controlled trial.

PubMed

Buerkle, Bernd; Pueth, Julia; Hefler, Lukas A; Tempfer-Bentz, Eva-Katrin; Tempfer, Clemens B

2012-10-01

To compare the skills of performing a shoulder dystocia management algorithm after hands-on training compared with demonstration. We randomized medical students to a 30-minute hands-on (group 1) and a 30-minute demonstration (group 2) training session teaching a standardized shoulder dystocia management scheme on a pelvic training model. Participants were tested with a 22-item Objective Structured Assessment of Technical Skills scoring system after training and 72 hours thereafter. Objective Structured Assessment of Technical Skills scores were the primary outcome. Performance time, self-assessment, confidence, and global rating scale were the secondary outcomes. Statistics were performed using Mann-Whitney U test, χ² test, and multiple linear regression analysis. Two hundred three participants were randomized. Objective Structured Assessment of Technical Skills scores were significantly higher in group 1 (n=103) compared with group 2 (n=100) (17.95±3.14 compared with 15.67±3.18, respectively; P<.001). The secondary outcomes global rating scale (GRS; 10.94±2.71 compared with 8.57±2.61, respectively; P<.001), self-assessment (3.15±0.94 compared with 2.72±1.01; P=.002), and confidence (3.72±0.98 compared with 3.34±0.90, respectively; P=.005), but not performance time (3:19±0:48 minutes compared with 3:31±1:05 minutes; P=.1), were also significantly different, favoring group 1. After 72 hours, Objective Structured Assessment of Technical Skills scores were still significantly higher in group 1 (n=67) compared with group 2 (n=60) (18.17±2.76 compared with 14.98±3.03, respectively; P<.001) as were GRS (10.80±2.62 compared with 8.15±2.59; P<.001) and self-assessment (SA; 3.44±0.87 compared with 2.95±0.94; P=.003). In a multiple linear regression analysis, group assignment (group 1 compared with 2; P<.001) and sex (P=.002) independently influenced Objective Structured Assessment of Technical Skills scores. 
Hands-on training helps to achieve a significant improvement of shoulder dystocia management on a pelvic training model. www.ClinicalTrials.gov, NCT01618565. Level of evidence: I.
Prediction of acute pancreatitis risk based on PIP score in children with cystic fibrosis.

PubMed

Terlizzi, V; Tosco, A; Tomaiuolo, R; Sepe, A; Amato, N; Casale, A; Mercogliano, C; De Gregorio, F; Improta, F; Elce, A; Castaldo, G; Raia, V

2014-09-01

Currently no tools to predict risk of acute (AP) and recurrent pancreatitis (ARP) in children with cystic fibrosis (CF) are available. We assessed the prevalence of AP/ARP and tested the potential role of Pancreatic Insufficiency Prevalence (PIP) score in a cohort of children with CF. We identified two groups of children, on the basis of presence/absence of AP/ARP, who were compared for age at diagnosis, clinical features, genotypes and sweat chloride level. PIP score was calculated for each patient. 10/167 (5.9%) experienced at least one episode of AP during follow up; 10/10 were pancreatic sufficient (PS). Patients with AP/ARP showed a PIP score ≤0.25 more frequently (6/10) than patients without AP/ARP. The odds ratio (95% CI) of developing pancreatitis was 4.54 (1.22-16.92) for patients with PIP <0.25 when compared with those who have a PIP score >0.25 (p = 0.0151). PIP score was correlated with sweat chloride test (p < 0.01). PIP score, PS status and normal/borderline sweat chloride levels could be applied to predict pancreatitis development in children with CF. ARP could lead to pancreatic insufficiency. Copyright © 2014 European Cystic Fibrosis Society. Published by Elsevier B.V. All rights reserved.
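The odds ratio and confidence interval reported above can be sanity-checked with a few lines of Python. This is only a sketch: it assumes the interval is a Wald-type interval built on the log-odds scale, which the abstract does not state, and it uses only the three reported numbers.

```python
import math

# Values reported in the abstract above.
odds_ratio = 4.54
ci_low, ci_high = 1.22, 16.92

# Assumption: a Wald-type interval, symmetric around log(OR).
# The geometric mean of the bounds should then reproduce the point estimate.
geometric_mean = math.sqrt(ci_low * ci_high)

# Standard error of log(OR) implied by a 95% interval (z = 1.96).
se_log_or = (math.log(ci_high) - math.log(ci_low)) / (2 * 1.96)

print(round(geometric_mean, 2), round(se_log_or, 2))
```

If the geometric mean of the bounds lands on the reported odds ratio, the interval is at least internally consistent with that construction; the check says nothing about how the reported p value was obtained.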
Bleeding Risk and Antithrombotic Strategy in Patients with Sinus Rhythm Heart Failure with Reduced Ejection Fraction Treated with Warfarin or Aspirin

PubMed Central

Ye, Siqin; Cheng, Bin; Lip, Gregory Y. H.; Buchsbaum, Richard; Sacco, Ralph L.; Levin, Bruce; Di Tullio, Marco R.; Qian, Min; Mann, Douglas L.; Pullicino, Patrick M.; Freudenberger, Ronald S.; Teerlink, John R.; Mohr, J.P.; Graham, Susan; Labovitz, Arthur J.; Estol, Conrado J.; Lok, Dirk J.; Ponikowski, Piotr; Anker, Stefan D.; Thompson, John L.P.; Homma, Shunichi

2015-01-01

We sought to assess the performance of existing bleeding risk scores, such as HAS-BLED or OBRI, in patients with heart failure with reduced ejection fraction (HFrEF) in sinus rhythm (SR) treated with warfarin or aspirin. We calculated HAS-BLED and OBRI risk scores for 2,305 patients with HFrEF in SR enrolled in the Warfarin versus Aspirin in Reduced Cardiac Ejection Fraction (WARCEF) trial. Proportional hazards models were used to test whether each score predicted major bleeding, and comparison of different risk scores was performed using Harrell's c-statistic and net-reclassification improvement (NRI) index. For the warfarin arm, both scores predicted bleeding risk, with OBRI having significantly higher c-statistic (0.72 vs 0.61; p=0.03) compared to HAS-BLED, though the NRI for comparing OBRI to HAS-BLED was not significant (0.32, 95% CI -0.18 to 0.37). Performance of the OBRI and HAS-BLED risk scores was similar for the aspirin arm. For participants with OBRI score of 0 to 1, warfarin compared with aspirin reduced ischemic stroke (HR 0.51, 95% CI 0.26-0.98, p=0.042) without significantly increasing major bleeding (HR 1.24, 95% CI 0.66-2.30, p=0.51). For those with OBRI score of ≥2, there was a trend for reduced ischemic stroke with warfarin compared to aspirin (HR 0.56, 95% CI 0.27-1.15, p=0.12), but major bleeding was increased (HR 4.04, 95% CI 1.99-8.22, p<0.001). In conclusion, existing bleeding risk scores can identify bleeding risk in HFrEF patients in SR, and could be tested for potentially identifying patients with a favorable risk / benefit profile for antithrombotic therapy with warfarin. PMID:26189039
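For readers unfamiliar with the c-statistic used above to compare risk scores, here is a minimal Python sketch of a concordance index for right-censored data. The function name `harrell_c` and the four-patient data set are hypothetical, pairs with tied follow-up times are simply skipped, and nothing here reproduces the trial's analysis.

```python
def harrell_c(times, events, risk_scores):
    """Concordance index for right-censored data.

    A pair is usable when the subject with the shorter follow-up time
    had the event. It is concordant when that subject also has the
    higher risk score; tied scores count as 0.5.
    """
    concordant = 0.0
    usable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if events[i] and times[i] < times[j]:
                usable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    return concordant / usable


# Hypothetical follow-up data (NOT from the WARCEF trial).
times = [2, 4, 6, 8]         # time to bleed or to censoring
events = [1, 1, 0, 1]        # 1 = major bleed observed, 0 = censored
risk_scores = [3, 1, 2, 0]   # higher score = higher predicted risk

c_index = harrell_c(times, events, risk_scores)
print(c_index)
```

A value of 0.5 corresponds to chance-level ranking and 1.0 to perfect ranking, which is the scale on which the reported 0.72 and 0.61 are read.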
Effect of a Mobile Web App on Kidney Transplant Candidates' Knowledge About Increased Risk Donor Kidneys: A Randomized Controlled Trial.

PubMed

Gordon, Elisa J; Sohn, Min-Woong; Chang, Chih-Hung; McNatt, Gwen; Vera, Karina; Beauvais, Nicole; Warren, Emily; Mannon, Roslyn B; Ison, Michael G

2017-06-01

Kidney transplant candidates (KTCs) must provide informed consent to accept kidneys from increased risk donors (IRD), but poorly understand them. We conducted a multisite, randomized controlled trial to evaluate the efficacy of a mobile Web application, Inform Me, for increasing knowledge about IRDs. Kidney transplant candidates undergoing transplant evaluation at 2 transplant centers were randomized to use Inform Me after routine transplant education (intervention) or routine transplant education alone (control). Computer adaptive learning method reinforced learning by embedding educational material, and initial (test 1) and additional test questions (test 2) into each chapter. Knowledge (primary outcome) was assessed in person after education (tests 1 and 2), and 1 week later by telephone (test 3). Controls did not receive test 2. Willingness to accept an IRD kidney (secondary outcome) was assessed after tests 1 and 3. Linear regression of test 1 knowledge scores was used to test the significance of Inform Me exposure after controlling for covariates. Multiple imputation was used for intention-to-treat analysis. Two hundred eighty-eight KTCs participated. Intervention participants had higher test 1 knowledge scores (mean difference, 6.61; 95% confidence interval [95% CI], 5.37-7.86) than control participants, representing a 44% higher score than control participants' scores. Intervention participants' knowledge scores increased with educational reinforcement (test 2) compared with control arm test 1 scores (mean difference, 9.50; 95% CI, 8.27-10.73). After 1 week, intervention participants' knowledge remained greater than controls' knowledge (mean difference, 3.63; 95% CI, 2.49-4.78) (test 3). Willingness to accept an IRD kidney did not differ between study arms at tests 1 and 3. Inform Me use was associated with greater KTC knowledge about IRD kidneys above routine transplant education alone.
The New Peabody Picture Vocabulary Test-III: An Illusion of Unbiased Assessment?

PubMed

Stockman, Ida J

2000-10-01

This article examines whether changes in the ethnic minority composition of the standardization sample for the latest edition of the Peabody Picture Vocabulary Test (PPVT-III, Dunn & Dunn, 1997) can be used as the sole explanation for children's better test scores when compared to an earlier edition, the Peabody Picture Vocabulary Test-Revised (PPVT-R, Dunn & Dunn, 1981). Results from a comparative analysis of these two test editions suggest that other factors may explain improved performances. Among these factors are the number of words and age levels sampled, the types of words and pictures used, and characteristics of the standardization sample other than its ethnic minority composition. This analysis also raises questions regarding the usefulness of converting scores from one edition to the other and the type of criteria that could be used to evaluate whether the PPVT-III is an unbiased test of vocabulary for children from diverse cultural and linguistic backgrounds.
Proposal for a new categorization of aseptic processing facilities based on risk assessment scores.

PubMed

Katayama, Hirohito; Toda, Atsushi; Tokunaga, Yuji; Katoh, Shigeo

2008-01-01

Risk assessment of aseptic processing facilities was performed using two published risk assessment tools. Calculated risk scores were compared with experimental test results, including environmental monitoring and media fill run results, in three different types of facilities. The two risk assessment tools used gave a generally similar outcome. However, depending on the tool used, variations were observed in the relative scores between the facilities. For the facility yielding the lowest risk scores, the corresponding experimental test results showed no contamination, indicating that these ordinal testing methods are insufficient to evaluate this kind of facility. A conventional facility having acceptable aseptic processing lines gave relatively high risk scores. The facility showing a rather high risk score demonstrated the usefulness of conventional microbiological test methods. Considering the significant gaps observed in calculated risk scores and in the ordinal microbiological test results between advanced and conventional facilities, we propose a facility categorization based on risk assessment. The most important risk factor in aseptic processing is human intervention. When human intervention is eliminated from the process by advanced hardware design, the aseptic processing facility can be classified into a new risk category that is better suited for assuring sterility based on a new set of criteria rather than on currently used microbiological analysis. To fully benefit from advanced technologies, we propose three risk categories for these aseptic facilities.
Is balance exercise training as effective as aerobic exercise training in fibromyalgia syndrome?

PubMed

Duruturk, Neslihan; Tuzun, Emine Handan; Culhaoglu, Belde

2015-05-01

The aim was to compare the effect of aerobic and balance exercises on pain severity, myalgic score, quality of life, exercise capacity and balance in fibromyalgia syndrome (FMS). A total of 33 females diagnosed with FMS by the American College of Rheumatology criteria were recruited in this randomised controlled study and allocated to aerobic exercise (AE) or balance exercise (BE) groups. Exercises were performed three times a week, for 6 weeks on a treadmill or with a Tetrax interactive balance system (TIBS). Outcome measures were characterised by myalgic score, visual analogue scale, Fibromyalgia Impact Questionnaire (FIQ), exercise testing, Timed Up-Go (TUG) and TIBS measurements. Comparisons from baseline to 6 weeks were evaluated using Wilcoxon test. Mann-Whitney U test was used to compare differences between groups. Effect sizes were also calculated. Improvements in pain, myalgic score and FIQ were found in both groups (p < 0.05). When comparing groups, the difference in myalgic score was significant (p = 0.02, d = -1.77), with the higher value in AE. Exercise duration, Borg scale, resting blood pressures (RBP) and maximal heart rate were significant in AE. In BE, Borg scale and exercise duration were significant (p < 0.05). When comparing groups, diastolic RBP (p = 0.04, d = -0.92) and exercise duration (p = 0.00, d = -1.64) were significant, with higher values in AE. TUG significantly changed in groups (p < 0.05, d ≥ -1.22). Stability scores, eyes open while standing on elastic pads (p = 0.00, d = -0.98) and head back (p = 0.03, d = -0.74), were significant, with higher values in BE. This study showed that BE provided some improvements in FMS, but AE training led to greater gains. BE training should be included in comprehensive programs.
Testing item response theory invariance of the standardized Quality-of-life Disease Impact Scale (QDIS(®)) in acute coronary syndrome patients: differential functioning of items and test.

PubMed

Deng, Nina; Anatchkova, Milena D; Waring, Molly E; Han, Kyung T; Ware, John E

2015-08-01

The Quality-of-life (QOL) Disease Impact Scale (QDIS(®)) standardizes the content and scoring of QOL impact attributed to different diseases using item response theory (IRT). This study examined the IRT invariance of the QDIS-standardized IRT parameters in an independent sample. The differential functioning of items and test (DFIT) of a static short-form (QDIS-7) was examined across two independent sources: patients hospitalized for acute coronary syndrome (ACS) in the TRACE-CORE study (N = 1,544) and chronically ill US adults in the QDIS standardization sample. "ACS-specific" IRT item parameters were calibrated and linearly transformed to compare to "standardized" IRT item parameters. Differences in IRT model-expected item, scale and theta scores were examined. The DFIT results were also compared in a standard logistic regression differential item functioning analysis. Item parameters estimated in the ACS sample showed lower discrimination parameters than the standardized discrimination parameters, but only small differences were found for threshold parameters. In DFIT, results on the non-compensatory differential item functioning index (range 0.005-0.074) were all below the threshold of 0.096. Item differences were further canceled out at the scale level. IRT-based theta scores for ACS patients using standardized and ACS-specific item parameters were highly correlated (r = 0.995, root-mean-square difference = 0.09). Using standardized item parameters, ACS patients scored one-half standard deviation higher (indicating greater QOL impact) compared to chronically ill adults in the standardization sample. The study showed sufficient IRT invariance to warrant the use of standardized IRT scoring of QDIS-7 for studies comparing the QOL impact attributed to acute coronary disease and other chronic conditions.
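The step above in which item parameters are "linearly transformed" before comparison can be sketched as follows. This is an illustration under stated assumptions, not the study's procedure: it uses a dichotomous two-parameter logistic (2PL) model for simplicity, whereas the abstract does not specify the model, and the function names and all numeric values are hypothetical.

```python
import math


def p_2pl(theta, a, b):
    """2PL probability of endorsing an item (a = discrimination, b = threshold)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def link_item(a, b, slope, intercept):
    """Re-express item parameters on the scale theta* = slope * theta + intercept."""
    return a / slope, slope * b + intercept


# Hypothetical values for illustration only.
a, b, theta = 1.2, 0.5, -0.3
slope, intercept = 0.9, 0.4

a_star, b_star = link_item(a, b, slope, intercept)
theta_star = slope * theta + intercept

# The model-expected response is unchanged by a consistent rescaling.
p_original = p_2pl(theta, a, b)
p_linked = p_2pl(theta_star, a_star, b_star)
print(p_original, p_linked)
```

Because a consistent linear rescaling leaves the expected responses unchanged, any differences that remain after linking two calibrations are what a differential functioning analysis is meant to detect.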
Decrease in the traumatic symptoms observed in child survivors within three years of the 2011 Japan earthquake and tsunami.

PubMed

Usami, Masahide; Iwadare, Yoshitaka; Watanabe, Kyota; Kodaira, Masaki; Ushijima, Hirokage; Tanaka, Tetsuya; Harada, Maiko; Tanaka, Hiromi; Sasaki, Yoshinori; Saito, Kazuhiko

2014-01-01

On March 11, 2011, Japan was struck by a massive earthquake and tsunami. The tsunami caused tremendous damage and traumatized several people, including children. The aim of this study was to assess changes in traumatic symptoms 8, 20, and 30 months after the 2011 tsunami. The study comprised three groups. Copies of the Post-Traumatic Stress Symptoms for Children 15 items (PTSSC-15), a self-rating questionnaire on traumatic symptoms, were distributed to 12,524 children (8-month period), 12,193 children (20-month period), and 11,819 children (30-month period). Valid responses 8, 20, and 30 months after the disaster were obtained from 11,639 (92.9%), 10,597 (86.9%), and 10,812 children (91.4%), respectively. We calculated the total score, PTSD subscale, and Depression subscale of PTSSC-15. The PTSSC-15 total score and PTSD subscale of children belonging to 1st-9th grade groups who were tested 30 and 20 months after the tsunami significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 total score and PTSD subscale of children in 1st-9th grade groups tested after 30 months did not decrease significantly compared with those of children tested after 20 months. The PTSSC-15 Depression subscale and PTSD subscale of children in 1st-9th grade groups tested after 30 months significantly decreased compared with those of children tested 8 months after the tsunami. The PTSSC-15 Depression subscale of children in 1st-9th grade groups evaluated after 30 months significantly decreased compared with that of children evaluated after 20 months. This study demonstrates that the traumatic symptoms of children who survived the massive tsunami improved with time. Nonetheless, in some cases the traumatic symptoms did not improve with time.
Five road safety education programmes for young adolescent pedestrians and cyclists: a multi-programme evaluation in a field setting.

PubMed

Twisk, Divera A M; Vlakveld, Willem P; Commandeur, Jacques J F; Shope, Jean T; Kok, Gerjo

2014-05-01

A practical approach was developed to assess and compare the effects of five short road safety education (RSE) programmes for young adolescents that does not rely on injury or crash data but uses self-reported behaviour. Questionnaires were administered just before and about one month after participation in the RSE programmes, both to youngsters who had participated in an RSE programme (the intervention group) and to a comparable group of youngsters who had not (the reference group). For each RSE programme, the answers to the questionnaires in the pre- and post-test were checked for internal consistency and then condensed into a single safety score using categorical principal components analysis. Next, an analysis of covariance was performed on the obtained safety scores in order to compare the post-test scores of the intervention and reference groups, corrected for their corresponding pre-test scores. It was found that three out of five RSE programmes resulted in significantly improved self-reported safety behaviour. However, the proportions of participants that changed their behaviour relative to the reference group were small, ranging from 3% to 20%. Comparisons among programme types showed cognitive approaches not to differ in effect from programmes that used fear-appeal approaches. The method used provides a useful tool to assess and compare the effects of different education programmes on self-reported behaviour. Copyright © 2014 Elsevier Ltd. All rights reserved.
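The analysis of covariance described above, in which post-test scores are compared between groups after correcting for pre-test scores, can be sketched as an ordinary least-squares fit of post = b0 + b1 * pre + b2 * group. The Python below is only an illustration: the helper names are hypothetical, the data are synthetic and noise-free, and significance testing is left out.

```python
def solve_linear(matrix, rhs):
    """Solve a small square system by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    aug = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]


def ancova_fit(pre, post, group):
    """Least-squares fit of post = b0 + b1 * pre + b2 * group via the normal equations."""
    rows = [[1.0, p, g] for p, g in zip(pre, group)]
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    xty = [sum(r[i] * y for r, y in zip(rows, post)) for i in range(3)]
    return solve_linear(xtx, xty)


# Synthetic safety scores (NOT study data), built as post = 2 + 0.8 * pre + 1.5 * group.
pre = [10.0, 12.0, 14.0, 11.0, 13.0, 15.0]
group = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0]  # 0 = reference, 1 = intervention
post = [2 + 0.8 * p + 1.5 * g for p, g in zip(pre, group)]

intercept, pre_slope, group_effect = ancova_fit(pre, post, group)
print(group_effect)
```

The coefficient on `group` is the pre-test-adjusted difference between intervention and reference groups; with real, noisy data it would be reported together with a standard error and a test of significance.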
Standardized Testing Practices: Effect on Graduation and NCLEX® Pass Rates.

PubMed

Randolph, Pamela K

The use of standardized testing in pre-licensure nursing programs has been accompanied by conflicting reports of effective practices. The purpose of this project was to describe standardized testing practices in one state's nursing programs and discover if the use of a cut score or oversight of remediation had any effect on (a) first-time NCLEX® pass rates, (b) on-time graduation (OTG) or (c) the combination of (a) and (b). Administrators of 38 nursing programs in one Southwest state were sent surveys; surveys were returned by 34 programs (89%). Survey responses were compared to each program's NCLEX pass rate and on-time graduation rate; t-tests were conducted for significant differences associated with a required minimum score (cut score) and oversight of remediation. There were no significant differences in NCLEX pass or on-time graduation rates related to establishment of a cut score. There was a significant difference when the NCLEX pass rate and on-time graduation rate were combined (Outcome Index "OI"), with significantly higher program outcomes (P=.02) for programs without cut scores. There were no differences associated with faculty oversight of remediation. The results of this study do not support establishment of a cut score when implementing standardized testing. Copyright © 2016. Published by Elsevier Inc.
The impact of using standardized patients in psychiatric cases on the levels of motivation and perceived learning of the nursing students.

PubMed

Sarikoc, Gamze; Ozcan, Celale Tangul; Elcin, Melih

2017-04-01

The use of standardized patients is not very common in psychiatric nursing education and there has been no study conducted in Turkey. This study evaluated the impact of using standardized patients in psychiatric cases on the levels of motivation and perceived learning of the nursing students. This manuscript addressed the quantitative aspect of a doctoral thesis study in which both quantitative and qualitative methods were used. A pre-test and post-test were employed in the quantitative analysis in a randomized and controlled study design. The motivation scores, and interim and post-test scores for perceived learning were higher in the experimental group compared to pre-test scores and the scores of the control group. The students in the experimental group reported that they felt more competent about practical training in clinical psychiatry, as well as in performing interviews with patients having mental problems, and reported less anxiety about performing an interview when compared to students in the control group. It is considered that the inclusion of standardized patient methodology in the nursing education curriculum in order to improve the knowledge level and skills of students would be beneficial in the training of mental health nurses. Copyright © 2017 Elsevier Ltd. All rights reserved.
Validation and clinical utility of the executive function performance test in persons with traumatic brain injury.

PubMed

Baum, C M; Wolf, T J; Wong, A W K; Chen, C H; Walker, K; Young, A C; Carlozzi, N E; Tulsky, D S; Heaton, R K; Heinemann, A W

2017-07-01

This study examined the relationships between the Executive Function Performance Test (EFPT), the NIH Toolbox Cognitive Function tests, and neuropsychological executive function measures in 182 persons with traumatic brain injury (TBI) and 46 controls to evaluate construct, discriminant, and predictive validity. Construct validity: There were moderate correlations between the EFPT and the NIH Toolbox Crystallized (r = -.479), Fluid Tests (r = -.420), and Total Composite Scores (r = -.496). Discriminant validity: Significant differences were found in the EFPT total and sequence scores across control, complicated mild/moderate, and severe TBI groups. We found differences in the organisation score between control and severe, and between mild and severe TBI groups. Both TBI groups had significantly lower scores in safety and judgement than controls. Compared to the controls, the severe TBI group demonstrated significantly lower performance on all instrumental activities of daily living (IADL) tasks. Compared to the mild TBI group, the controls performed better on the medication task, and the severe TBI group performed worse in the cooking and telephone tasks. Predictive validity: The EFPT predicted the self-perception of independence measured by the TBI-QOL (beta = -0.49, p < .001) for the severe TBI group. Overall, these data support the validity of the EFPT for use in individuals with TBI.
Development and Validation of a Mobile Device-based External Ventricular Drain Simulator.

PubMed

Morone, Peter J; Bekelis, Kimon; Root, Brandon K; Singer, Robert J

2017-10-01

Multiple external ventricular drain (EVD) simulators have been created, yet their cost, bulky size, and nonreusable components limit their accessibility to residency programs. To create and validate an animated EVD simulator that is accessible on a mobile device. We developed a mobile-based EVD simulator that is compatible with iOS (Apple Inc., Cupertino, California) and Android-based devices (Google, Mountain View, California) and can be downloaded from the Apple App and Google Play Store. Our simulator consists of a learn mode, which teaches users the procedure, and a test mode, which assesses users' procedural knowledge. Twenty-eight participants, who were divided into expert and novice categories, completed the simulator in test mode and answered a postmodule survey. This was graded using a 5-point Likert scale, with 5 representing the highest score. Using the survey results, we assessed the module's face and content validity, whereas construct validity was evaluated by comparing the expert and novice test scores. Participants rated individual survey questions pertaining to face and content validity a median score of 4 out of 5. When comparing test scores, generated by the participants completing the test mode, the experts scored higher than the novices (mean, 71.5; 95% confidence interval, 69.2 to 73.8 vs mean, 48; 95% confidence interval, 44.2 to 51.6; P < .001). We created a mobile-based EVD simulator that is inexpensive, reusable, and accessible. Our results demonstrate that this simulator is face, content, and construct valid. Copyright © 2017 by the Congress of Neurological Surgeons
Pretest online discussion groups to augment teaching and learning.

PubMed

Kuhn, Jonathan; Hasbargen, Barbara; Miziniak, Halina

2010-01-01

Tests and final examination scores of three semesters of control students in a nursing foundation course were compared with tests and final examination scores of three semesters of participating students. Participating students were offered access to an asynchronous pretest online discussion activity with a faculty e-moderator. While the simplified Bloom's revised taxonomy assisted in creating appropriate preparatory test and final examination questions for pretest online discussion, Salmon's five-stage online method provided direction to the e-moderator on how to encourage students to achieve Bloom's higher-order thinking skills during the pretest online discussions. Statistical analysis showed the pretest online discussion activity had a generally positive impact on tests and final examination scores, when controlling for a number of possible confounding variables, including instructor, cumulative grade point average, age, and credit hours.
The effects of short-term and long-term pulmonary rehabilitation on functional capacity, perceived dyspnea, and quality of life.

PubMed

Verrill, David; Barton, Cole; Beasley, Will; Lippard, W Michael

2005-08-01

The purposes of this study were as follows: (1) to determine whether physical performance, quality of life, and dyspnea with activities of daily living improved following both short-term and long-term pulmonary rehabilitation (PR) across multiple hospital outpatient programs; (2) to examine the differences in these parameters between men and women; and (3) to determine what relationships existed between the psychosocial parameters and the results of the 6-min walk (6MW) test performance across programs. Non-experimental, prospective, and comparative. Seven outpatient hospital PR programs from urban and rural settings across North Carolina. Three hundred nine women and 281 men who were 20 to 93 years of age (mean [+/- SD] age, 66.7 +/- 11.1 years) with chronic lung disease. All 6MW tests and health surveys were administered prior to and immediately following 12 and 24 weeks of supervised PR participation. Scores from the 6MW tests, the Ferrans and Powers quality of life index-pulmonary version III (QLI), the Medical Outcomes Study 36-item short form (SF-36), and the University of California at San Diego shortness of breath questionnaire (SOBQ) were compared at PR entry, at 12 weeks, and at 24 weeks for differences by gender with repeated-measures analysis of variance. The study entry and follow-up SF-36 physical and mental component summary scores, the QLI health/function and overall scores, and the SOBQ scores were also compared to the 6MW test scores with Pearson correlation coefficient analysis. The mean summary scores on the SF-36 and the QLI increased after 12 weeks of PR (p < 0.05), and improvements were maintained by 24 weeks of PR participation (p < 0.05). Scores on the SOBQ improved after 12 weeks (p < 0.001) among the short-term participants, but not until after 24 weeks among the long-term participants (p = 0.009). The 6MW test performance improved after 12 weeks (p < 0.001) and again from 12 to 24 weeks (p = 0.002) in the long-term participants. 
No relevant correlational relationships were found between 6MW scores and the summary scores of the administered surveys (r = -0.43 to 0.36). Physical performance, as measured by the 6MW test, continued to improve with up to 24 weeks of PR participation. Quality-of-life measures and the perception of dyspnea improved after 12 weeks of PR participation, with improvements maintained by 24 weeks of PR participation. It is recommended that PR patients participate in supervised PR for at least 24 weeks to gain and maintain optimal health benefits.
Effect of promoting self-esteem by participatory learning process on emotional intelligence among early adolescents.

PubMed

Munsawaengsub, Chokchai; Yimklib, Somkid; Nanthamongkolchai, Sutham; Apinanthavech, Suporn

2009-12-01

To study the effect of promoting self-esteem by participatory learning program on emotional intelligence among early adolescents. The quasi-experimental study was conducted in grade 9 students from two schools in Bangbuathong district, Nonthaburi province. The experimental and comparative groups each consisted of 34 students with the lowest scores of emotional intelligence. The instruments were questionnaires, Program to Develop Emotional Intelligence and Handbook of Emotional Intelligence Development. The experimental group attended 8 participatory learning activities in 4 weeks to develop emotional intelligence while the comparative group received the handbook for self-study. Assessment of the effectiveness of the program was done by pre-test and by post-tests, immediately and 4 weeks later, of emotional intelligence. Implementation and evaluation were done during May 24-August 12, 2005. Data were analyzed by frequency, percentage, mean, standard deviation, Chi-square, independent sample t-test and paired sample t-test. Before program implementation, both groups had no statistical difference in mean score of emotional intelligence. After intervention, the experimental group had a higher mean score of emotional intelligence both immediately and 4 weeks later, with statistical significance (p = 0.001 and < 0.001). At 4 weeks after the experiment, the mean score in the experimental group was higher than the mean score immediately after the experiment, with statistical significance (p < 0.001). The program to promote self-esteem by participatory learning process could enhance emotional intelligence in early adolescents. This program could be modified and implemented for early adolescents in the community.
Information Technology and Literacy Assessment.

ERIC Educational Resources Information Center

Balajthy, Ernest

2002-01-01

Compares technology predictions from around 1989 with the technology of 2002. Discusses the place of computer-based assessment today, computer-scored testing, computer-administered formal assessment, Internet-based formal assessment, computerized adaptive tests, placement tests, informal assessment, electronic portfolios, information management,…
Effectiveness of Test-Enhanced Learning (TEL) in lectures for undergraduate medical students

PubMed Central

Ayyub, Aisha; Mahboob, Usman

2017-01-01

Objective: To determine the effectiveness of Test-Enhanced learning as a learning tool in lectures for undergraduate medical students. Method: This quantitative, randomized controlled trial included eighty-four students of 4th year MBBS from Yusra Medical & Dental College, Islamabad. The duration of study was from March 2016 to August 2016. After obtaining informed consent, participants were equally assigned to interventional and non-interventional study groups through stratified randomization. Single best answer MCQs of special pathology were used as the data collection instrument after validation. A pre- and post-test was taken from both groups, before and after the intervention, respectively, and their results were compared using SPSS version 21. Results: There were 13 male (31%) and 29 female (69%) participants in each study group who showed an equivalent baseline performance on pre-test (p=0.95). A statistically significant difference was found among mean scores of interventional and non-interventional study groups at exit exam (p=0.00). The interventional group also showed a significant improvement in their post-test scores (mean: 17.17±1.59) as compared to pre-test scores (mean: 6.19±1.81). Conclusions: Test-enhanced learning has a significant effect on improving the learning of course content delivered to undergraduate medical students through lectures. PMID:29492055

The Effects of Process Oriented Guided Inquiry Learning on Secondary Student ACT Science Scores

NASA Astrophysics Data System (ADS)

Judd, William Lindsey

The purpose of this study was to examine any significant difference on secondary school chemistry students' ACT Science Test scores between students taught by the Process Oriented Guided Inquiry Learning (POGIL) method versus students taught by traditional, teacher-centered pedagogy. This study also examined any difference between students taught by the POGIL method versus students taught by traditional, teacher-centered pedagogy in regard to the three different types of questions on the ACT Science Test: data representation, research summaries, and conflicting viewpoints. The sample consisted of sophomore-level students at two private, suburban Christian schools. A pretest-posttest design was used to compare the mean difference in scores from ACT issued sample test booklets before and after each group had received instruction via the POGIL method or more traditional methods. This study found that there was no significant difference in the mean difference of test scores between the two groups. This study also found that there was not a significant difference in the mean difference of scores in regard to the three different types of questions on the ACT Science Test. Further implications of this study are discussed.
A Comparative Study of the Reliability and Validity of the "Degrees of Reading Power" and the "Iowa Tests of Basic Skills."

ERIC Educational Resources Information Center

Hildebrand, Myrene; Hoover, H. D.

This study compared the reliability and validity of two different measures of reading ability, the Degrees of Reading Power (DRP) and the Iowa Tests of Basic Skills (ITBS) Reading test and the ITBS Vocabulary test. The data consisted of scores of 377 grade 5 and grade 6 students on these tests, along with their assigned reading levels in the…
Parametric analyses of summative scores may lead to conflicting inferences when comparing groups: A simulation study.

PubMed

Khan, Asaduzzaman; Chien, Chi-Wen; Bagraith, Karl S

2015-04-01

To investigate whether using a parametric statistic in comparing groups leads to different conclusions when using summative scores from rating scales compared with using their corresponding Rasch-based measures. A Monte Carlo simulation study was designed to examine between-group differences in the change scores derived from summative scores from rating scales, and those derived from their corresponding Rasch-based measures, using 1-way analysis of variance. The degree of inconsistency between the 2 scoring approaches (i.e. summative and Rasch-based) was examined, using varying sample sizes, scale difficulties and person ability conditions. This simulation study revealed scaling artefacts that could arise from using summative scores rather than Rasch-based measures for determining the changes between groups. The group differences in the change scores were statistically significant for summative scores under all test conditions and sample size scenarios. However, none of the group differences in the change scores were significant when using the corresponding Rasch-based measures. This study raises questions about the validity of the inference on group differences of summative score changes in parametric analyses. Moreover, it provides a rationale for the use of Rasch-based measures, which can allow valid parametric analyses of rating scale data.
A new computer-based Farnsworth Munsell 100-hue test for evaluation of color vision.

PubMed

Ghose, Supriyo; Parmar, Twinkle; Dada, Tanuj; Vanathi, Murugesan; Sharma, Sourabh

2014-08-01

To evaluate a computer-based Farnsworth-Munsell (FM) 100-hue test and compare it with a manual FM 100-hue test in normal and congenital color-deficient individuals. Fifty color defective subjects and 200 normal subjects with a best-corrected visual acuity ≥ 6/12 were compared using a standard manual FM 100-hue test and a computer-based FM 100-hue test under standard operating conditions as recommended by the manufacturer after initial trial testing. Parameters evaluated were total error scores (TES), type of defect and testing time. Pearson's correlation coefficient was used to determine the relationship between the test scores. Cohen's kappa was used to assess agreement of color defect classification between the two tests. A receiver operating characteristic curve was used to determine the optimal cut-off score for the computer-based FM 100-hue test. The mean time was 16 ± 1.5 (range 6-20) min for the manual FM 100-hue test and 7.4 ± 1.4 (range 5-13) min for the computer-based FM 100-hue test, thus reducing testing time to <50 % (p < 0.05). For grading color discrimination, Pearson's correlation coefficient for TES between the two tests was 0.91 (p < 0.001). For color defect classification, Cohen's agreement coefficient was 0.98 (p < 0.01). The computer-based FM 100-hue is an effective and rapid method for detecting, classifying and grading color vision anomalies.
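The agreement statistic named in this abstract, Cohen's kappa, can be illustrated with a minimal sketch. The function name `cohens_kappa` and the six classifications below are hypothetical and are not data from the study; the sketch only shows how observed agreement is corrected for agreement expected by chance.

```python
from collections import Counter

def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two raters (or two tests) classifying the same subjects."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both label lists must be non-empty and of equal length")
    n = len(labels_a)
    # Proportion of subjects on which the two classifications agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each test's marginal label frequencies.
    count_a = Counter(labels_a)
    count_b = Counter(labels_b)
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement is 1")
    return (observed - expected) / (1 - expected)

# Hypothetical defect classifications from a manual and a computer-based test.
manual = ["normal", "normal", "protan", "deutan", "normal", "deutan"]
computer = ["normal", "normal", "protan", "deutan", "normal", "protan"]
kappa = cohens_kappa(manual, computer)
print(round(kappa, 3))
```

A kappa of 1 indicates perfect agreement and 0 indicates agreement no better than chance, which is why a value such as the reported 0.98 is read as near-perfect agreement.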
A validation study on the traditional Chinese version of Spinal Appearance Questionnaire for adolescent idiopathic scoliosis.

PubMed

Guo, Jing; Lau, Ajax Hong Yin; Chau, Jack; Ng, Bobby Kin Wah; Lee, Kwong Man; Qiu, Yong; Cheng, Jack Chun Yiu; Lam, Tsz Ping

2016-10-01

"Simplified Chinese" version of Spinal Appearance Questionnaire (SC-SAQ) for patients with adolescent idiopathic scoliosis (AIS) was available but did not fit for communities using "Traditional Chinese" as their primary language. We developed a traditional Chinese version of SAQ (TC-SAQ) and evaluated its reliability and validity. TC-SAQ was administered to 112 AIS patients, of which 101 bilingual (English and Chinese) patients completed E-SAQ and the traditional Chinese version of Scoliosis Research Society-22 questionnaire (TC-SRS-22). Internal consistency and test-retest reliability were evaluated. Concurrent validity was evaluated by comparing TC-SAQ score with E-SAQ score, and convergent validity by comparing TC-SAQ score with TC-SRS-22 self-image domain score, and discriminant validity by analyzing the relationship between TC-SAQ score and patients' characteristics. Internal consistency of individual TC-SAQ domain was high (Cronbach's α = 0.785 to 0.940), except for general (Cronbach's α = 0.665) and shoulders (Cronbach's α = 0.421) domain. Test-retest reliability of TC-SAQ was good (ICCs of each domain from 0.798 to 0.865). Concurrent validity demonstrated an excellent correlation between TC-SAQ and E-SAQ scores (r = 0.820 to 0.954, P < 0.0001 for all domains). Correlation between TC-SAQ domains and TC-SRS-22 self-image domain was weak to moderate. TC-SAQ total score and individual domain scores (except waist and chest domains) were positively correlated to major curve magnitude. TC-SAQ had good internal consistency and test-retest reliability. Concurrent validity evaluated against the original English version was excellent. TC-SAQ was both reliable and valid for clinical use for AIS patients using traditional Chinese as their primary language.
Device Comparability of Tablets and Computers for Assessment Purposes

ERIC Educational Resources Information Center

Davis, Laurie Laughlin; Kong, Xiaojing; McBride, Yuanyuan; Morrison, Kristin M.

2017-01-01

The definition of what it means to take a test online continues to evolve with the inclusion of a broader range of item types and a wide array of devices used by students to access test content. To assure the validity and reliability of test scores for all students, device comparability research should be conducted to evaluate the impact of…
A twin study of spatial and non-spatial delayed response performance in middle age.

PubMed

Kremen, William S; Mai, Tuan; Panizzon, Matthew S; Franz, Carol E; Blankfeld, Howard M; Xian, Hong; Eisen, Seth A; Tsuang, Ming T; Lyons, Michael J

2011-06-01

Delayed alternation and object alternation are classic spatial and non-spatial delayed response tasks. We tested 632 middle-aged male veteran twins on variants of these tasks in order to compare test difficulty, measure their inter-correlation, test order effects, and estimate heritabilities (proportion of observed variance due to genetic influences). Non-spatial alternation (NSA), which may involve greater reliance on processing of subgoals, was significantly more difficult than spatial alternation (SA). Despite their similarities, NSA and SA scores were uncorrelated. NSA performance was worse when administered second; there was no SA order effect. NSA scores were modestly heritable (h²=.25; 26); SA was not. There was shared genetic variance between NSA scores and general intellectual ability (r_g=.55; .67), but this also suggests genetic influences specific to NSA. Compared with findings from small, selected control samples, high "failure" rates in this community-based sample raise concerns about interpretation of brain dysfunction in elderly or patient samples. Copyright © 2011 Elsevier Inc. All rights reserved.
A Twin Study of Spatial and Non-Spatial Delayed Response Performance in Middle Age

PubMed Central

Kremen, William S.; Mai, Tuan; Panizzon, Matthew S.; Franz, Carol E.; Blankfeld, Howard M.; Xian, Hong; Eisen, Seth A.; Tsuang, Ming T.; Lyons, Michael J.

2011-01-01

Delayed alternation and object alternation are classic spatial and non-spatial delayed response tasks. We tested 632 middle-aged male veteran twins on variants of these tasks in order to compare test difficulty, measure their inter-correlation, test order effects, and estimate heritabilities (proportion of observed variance due to genetic influences). Non-spatial alternation (NSA), which may involve greater reliance on processing of subgoals, was significantly more difficult than spatial alternation (SA). Despite their similarities, NSA and SA scores were uncorrelated. NSA performance was worse when administered second; there was no SA order effect. NSA scores were modestly heritable (h²=.25; 26); SA was not. There was shared genetic variance between NSA scores and general intellectual ability (r_g=.55; .67), but this also suggests genetic influences specific to NSA. Compared with findings from small, selected control samples, high “failure” rates in this community-based sample raise concerns about interpretation of brain dysfunction in elderly or patient samples. PMID:21477911
Inter-Rater and Test-Retest Reliability of the Beery VMI in Schoolchildren

PubMed Central

Harvey, Erin M.; Leonard-Green, Tina K.; Mohan, Kathleen M.; Kulp, Marjean Taylor; Davis, Amy L.; Miller, Joseph M.; Twelker, J. Daniel; Campus, Irene; Dennis, Leslie K.

2017-01-01

Purpose To assess inter-rater and test-retest reliability of the 6th Edition Beery-Buktenica Developmental Test of Visual-Motor Integration (VMI) and test-retest reliability of the VMI Visual Perception Supplemental Test (VMIp) in school-age children. Methods Subjects were 163 Native American 3rd – 8th grade students with no significant refractive error (astigmatism < 1.00 D, myopia: < 0.75 D, hyperopia: < 2.50 D, anisometropia < 1.50 D) or ocular abnormalities. The VMI and VMIp were administered twice, on separate days. All VMI tests were scored by two trained scorers and a subset of 50 tests were also scored by an experienced scorer. Scorers strictly applied objective scoring criteria. Analyses included inter-rater and test-retest assessments of bias, 95% limits of agreement, and intraclass correlation analysis. Results Trained scorers had no significant scoring bias compared to the experienced scorer. One of the two trained scorers tended to provide higher scores than the other (mean difference in standardized scores = 1.54). Inter-rater correlations were strong (0.75 to 0.88). VMI and VMIp test-retest comparisons indicated no significant bias (subjects did not tend to score better on retest). Test-retest correlations were moderate (0.54 to 0.58). The 95% LOAs for the VMI were −24.14 to 24.67 (scorer 1) and −26.06 to 26.58 (scorer 2) and the 95% LOAs for the VMIp were −27.11 to 27.34. Conclusions The 95% LOA for test-retest differences will be useful for determining if the VMI and VMIp have sufficient sensitivity for detecting change with treatment in both clinical and research settings. Further research on test-retest reliability reporting 95% LOAs for children across different age ranges are recommended, particularly if the test is to be used to detect changes due to intervention or treatment. PMID:28422801
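The 95% limits of agreement (LOA) reported in this abstract can be illustrated with a minimal sketch. It assumes the common definition of the LOA as the mean test-retest difference plus or minus 1.96 standard deviations of the differences; the abstract does not spell out its formula, and the function name and scores below are hypothetical.

```python
from statistics import mean, stdev

def limits_of_agreement(first, second):
    """Return (bias, lower LOA, upper LOA) for paired test and retest scores."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need at least two paired scores")
    # Retest minus test: a positive bias means subjects scored higher on retest.
    diffs = [b - a for a, b in zip(first, second)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)  # sample standard deviation of the differences
    return bias, bias - spread, bias + spread

# Hypothetical standardized scores for five children, not data from the study.
test_scores = [90, 100, 110, 95, 105]
retest_scores = [92, 98, 113, 95, 102]
bias, lower, upper = limits_of_agreement(test_scores, retest_scores)
print(round(bias, 2), round(lower, 2), round(upper, 2))
```

Under this reading, a change in an individual child's score that stays inside the interval cannot be distinguished from ordinary test-retest variation, which is why wide limits bear on sensitivity to treatment effects.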
Traditional Nurse Triage vs. Physician Tele-Presence in a Pediatric Emergency Department

PubMed Central

Marconi, Greg P.; Chang, Todd; Pham, Phung K.; Grajower, Daniel N.; Nager, Alan L.

2014-01-01

Objectives To compare traditional nurse triage (TNT) in a Pediatric Emergency Department (PED) to physician tele-presence (PTP). Methods Prospective, 2×2 crossover study with random assignment using a sample of walk-in patients seeking care in a PED at a large, tertiary care children’s hospital, from May 2012 to January 2013. Outcomes of triage times, documentation errors, triage scores, and survey responses were compared between TNT and PTP. Comparison between PTP to actual treating PED physicians regarding the accuracy of ordering blood and urine tests, throat cultures, and radiologic imaging was also studied. Results Paired samples t-tests showed a statistically significant difference in triage time between TNT and PTP (p=0.03), but no significant difference in documentation errors (p=0.10). Triage scores of TNT were 71% accurate, compared to PTP, which were 95% accurate. Both parents and children had favorable scores regarding PTP and the majority indicated they would prefer PTP again at their next PED visit. PTP diagnostic ordering was comparable to the actual PED physician ordering, showing no statistical differences. Conclusions Utilizing physician tele-presence technology to remotely perform triage is a feasible alternative to traditional nurse triage, with no clinically significant differences in time, triage scores, errors and patient and parent satisfaction. PMID:24445223
Development of inquiry-based learning activities integrated with the local learning resource to promote learning achievement and analytical thinking ability of Mathayomsuksa 3 student

NASA Astrophysics Data System (ADS)

Sukji, Paweena; Wichaidit, Pacharee Rompayom; Wichaidit, Sittichai

2018-01-01

The objectives of this study were to: 1) compare the learning achievement and analytical thinking ability of Mathayomsuksa 3 students before and after learning through inquiry-based learning activities integrated with the local learning resource, and 2) compare the average post-test scores for learning achievement and analytical thinking ability with their cutoff scores. The target group was 23 Mathayomsuksa 3 students studying in the second semester of the 2016 academic year at Banchatfang School, Chainat Province. The research instruments comprised: 1) 6 lesson plans on Environment and Natural Resources, 2) the learning achievement test, and 3) the analytical thinking ability test. The results showed that 1) students' learning achievement and analytical thinking ability after learning were higher than before at the .05 level of statistical significance, and 2) students' average post-test scores for learning achievement and analytical thinking ability were higher than the cutoff scores at the .05 level of statistical significance. The implication of this research is that science teachers and curriculum developers should design inquiry activities that relate to students' context.
Comparing perceived self-management practices of adult type 2 diabetic patients after completion of a structured ADA certified diabetes self-management education program with unstructured individualized nurse practitioner led diabetes self-management education.

PubMed

Wooley, Dennis S; Kinner, Tracy J

2016-11-01

The purpose was to compare perceived self-management practices of adult type 2 diabetic patients after completing an American Diabetes Association (ADA) certified diabetes self-management education (DSME) program with unstructured individualized nurse practitioner led DSME. Demographic questions and the Self-Care Inventory-Revised (SCI-R) were given to two convenience sample patient groups: a formal DSME program group and a group within a clinical setting who received informal and unstructured individual education during patient encounters. A t-test was performed between the formal ADA certified education sample's and the informal sample's SCI-R individual scores. A second t-test was performed between the two samples' SCI-R mean scores. The first t-test found no statistically significant difference between the formal ADA structured education and informal education samples' SCI-R individual scores. There was also no statistically significant difference between the samples' SCI-R mean scores. The study results suggest that no DSME setting or instructional approach is superior to the other. Copyright © 2016 Elsevier Inc. All rights reserved.
Recovery in Level 7-10 Women's USA Artistic Gymnastics.

PubMed

Buckner, Stephen B; Bacon, Nicholas T; Bishop, Phillip A

2017-01-01

This study assessed physical performance in women's artistic gymnastics following three variable recovery periods. Participants included fifteen female gymnasts (mean age = 13.5 ± 1.1) who had competed at USA Gymnastics (USAG) levels 7 - 10 within at least one year prior to the study. Each testing session consisted of a warm-up followed by four muscular endurance tests and one explosive maximal test. Assessments included pull-ups, leg lifts, handstand push-ups, vertical jump, and push-ups. After the performance assessments, the participants completed a typical practice session. The performance measures were reassessed at the beginning of each of the recovery periods of 24, 48, and 72 hours in a counterbalanced design. Performance assessments were converted into Z-scores and then averaged for a composite session Z-score. The composite session Z-scores were compared to evaluate the recovery duration. Composite Z-scores were significantly lower (p=0.000) after the 24-hour (z=-1.10) and 48-hour (z=-0.71) recovery periods compared to baseline (z=0.00). However, there was no difference in scores (p=1.00) between baseline and the 72-hour (z=0.004) recovery. Full recovery required 72 hours under the conditions of this study.
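The composite session Z-score described in this abstract can be sketched as follows. The abstract does not state which reference distribution was used for standardization; this sketch assumes each assessment is standardized against the baseline session's mean and sample standard deviation, which would be consistent with the reported baseline composite of 0.00. The function name, test names, and scores are hypothetical.

```python
from statistics import mean, stdev

def composite_z(session, baseline):
    """Average Z-score of a session across all assessments.

    Both arguments map an assessment name to a list of scores, one per athlete.
    Assumption: each assessment is standardized against the baseline session.
    """
    per_test_means = []
    for test, base_scores in baseline.items():
        mu = mean(base_scores)
        sd = stdev(base_scores)
        z_scores = [(x - mu) / sd for x in session[test]]
        per_test_means.append(mean(z_scores))
    # Averaging across assessments gives one composite value per session.
    return mean(per_test_means)

# Hypothetical scores for three gymnasts on two assessments.
baseline = {"pull_ups": [8, 10, 12], "vertical_jump_cm": [40, 45, 50]}
after_24h = {"pull_ups": [6, 8, 10], "vertical_jump_cm": [37.5, 42.5, 47.5]}
print(composite_z(baseline, baseline))
print(composite_z(after_24h, baseline))
```

Standardizing first puts repetitions and jump height on a common scale, so a negative composite indicates performance below baseline across the assessments as a whole.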
Acoustic radiation force impulse elastography: comparison and combination with other noninvasive tests for the diagnosis of compensated liver cirrhosis.

PubMed

Pfeifer, Lukas; Adler, Werner; Zopf, Steffen; Siebler, Jürgen; Wildner, Dane; Goertz, Ruediger S; Schellhaas, Barbara; Neurath, Markus F; Strobel, Deike

2017-05-01

The aim of this study was to compare acoustic radiation force impulse (ARFI) elastography with other noninvasive tests and to develop a new score for the assessment of liver fibrosis/cirrhosis. B-mode ultrasound (including high-frequency liver surface evaluation), routine blood tests, ARFI quantification, and mini-laparoscopic liver evaluation were obtained in compensated patients scheduled for mini-laparoscopic biopsy. Our new cirrhosis score (CS) for the assessment of liver cirrhosis, based on a linear combination of ARFI, platelet (PLT), liver surface, and prothrombin index (PI), was calculated by linear discriminant analysis. Its performance was compared with ARFI-elastography, APRI, FIB-4, alanine aminotransferase (ALT)/aspartate aminotransferase (AST)-ratio, PLT, and PI. For the diagnosis of cirrhosis, a combined gold standard (cirrhosis at histology and/or at macroscopic liver evaluation) was used. In total, 171 patients, of whom 38 had compensated cirrhosis, were included. The CS was significantly better for the diagnosis of cirrhosis compared with ARFI (P=0.028), APRI (P=0.012), PLTs (P=0.013), PI (P=0.025), and ALT/AST ratio (P=0.001), but not the FIB-4 score (P=0.207), with an area under the receiver operating characteristic curve of 0.92 [95% confidence interval (CI): 0.87-0.97], 0.86 (95% CI:0.79-0.93), 0.80 (95% CI: 0.72-0.87), 0.79 (95% CI: 0.7-0.87), 0.81 (95% CI: 0.73-0.89), 0.72 (95% CI:0.64-0.81), and 0.86 (95% CI: 0.8-0.93), respectively. Sensitivity, specificity, positive predictive value, and negative predictive value for CS were 87%, 86%, 63%, and 96%, respectively. The FIB-4 score was significantly superior to the APRI score (P=0.041) and the ALT/AST ratio (P=0.011), with no significant difference from ARFI elastography (P=0.88) for the diagnosis of cirrhosis. 
Combining ARFI elastography with other noninvasive tests that are used routinely in the workup of patients with suspected liver disease can improve diagnostic accuracy for compensated liver cirrhosis as compared with ARFI elastography alone. The FIB-4 score showed an overall comparable diagnostic accuracy to ARFI-elastography for compensated cirrhosis.
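The four accuracy measures reported for the cirrhosis score follow from a 2×2 table of test result against the gold standard. The sketch below uses counts back-calculated as an assumption from the 171 patients, the 38 with cirrhosis, and the reported percentages; the abstract itself does not give the cell counts, so the numbers are illustrative only.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, PPV and NPV from 2x2 confusion-table counts."""
    return {
        "sensitivity": tp / (tp + fn),  # diseased patients correctly flagged
        "specificity": tn / (tn + fp),  # non-diseased patients correctly cleared
        "ppv": tp / (tp + fp),          # positive results that are true positives
        "npv": tn / (tn + fn),          # negative results that are true negatives
    }

# Assumed counts: 38 cirrhosis (33 detected, 5 missed) and
# 133 non-cirrhosis (114 correctly negative, 19 false positive).
metrics = diagnostic_metrics(tp=33, fp=19, fn=5, tn=114)
percentages = {name: round(100 * value) for name, value in metrics.items()}
print(percentages)
```

With these assumed counts the rounded values match the reported 87%, 86%, 63%, and 96%, and they show how a modest positive predictive value can coexist with high sensitivity when only 38 of 171 patients have the condition.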
[Assessment.

ERIC Educational Resources Information Center

Boylan, Hunter R., Ed.; Kerstiens, Gene, Ed.

1989-01-01

These four serial issues examine the effectiveness and appropriateness of a variety of assessment tests as well as their relationship to developmental education. Included are reviews of the following tests: (1) the Comparative Guidance and Placement Program, a self-scoring test of English and mathematics; (2) the Stanford Achievement Test, an…
Cheating in OSCEs: The Impact of Simulated Security Breaches on OSCE Performance.

PubMed

Gotzmann, Andrea; De Champlain, André; Homayra, Fahmida; Fotheringham, Alexa; de Vries, Ingrid; Forgie, Melissa; Pugh, Debra

2017-01-01

Construct: Valid score interpretation is important for constructs in performance assessments such as objective structured clinical examinations (OSCEs). An OSCE is a type of performance assessment in which a series of standardized patients interact with the student or candidate who is scored by either the standardized patient or a physician examiner. In high-stakes examinations, test security is an important issue. Students accessing unauthorized test materials can create an unfair advantage and lead to examination scores that do not reflect students' true ability level. The purpose of this study was to assess the impact of various simulated security breaches on OSCE scores. Seventy-six 3rd-year medical students participated in an 8-station OSCE and were randomized to either a control group or to 1 of 2 experimental conditions simulating test security breaches: station topic (i.e., providing a list of station topics prior to the examination) or egregious security breach (i.e., providing detailed content information prior to the examination). Overall total scores were compared for the 3 groups using both a one-way between-subjects analysis of variance and a repeated measure analysis of variance to compare the checklist, rating scales, and oral question subscores across the three conditions. Overall total scores were highest for the egregious security breach condition (81.8%), followed by the station topic condition (73.6%), and they were lowest for the control group (67.4%). This trend was also found with checklist subscores only (79.1%, 64.9%, and 60.3%, respectively for the security breach, station topic, and control conditions). Rating scale subscores were higher for both the station topic and egregious security breach conditions compared to the control group (82.6%, 83.1%, and 77.6%, respectively). 
Oral question subscores were significantly higher for the egregious security breach condition (88.8%) followed by the station topic condition (64.3%), and they were the lowest for the control group (48.6%). This simulation of different OSCE security breaches demonstrated that student performance is greatly advantaged by having prior access to test materials. This has important implications for medical educators as they develop policies and procedures regarding the safeguarding and reuse of test content.
The effect of rare variants on inflation of the test statistics in case-control analyses.

PubMed

Pirie, Ailith; Wood, Angela; Lush, Michael; Tyrer, Jonathan; Pharoah, Paul D P

2015-02-20

The detection of bias due to cryptic population structure is an important step in the evaluation of findings of genetic association studies. The standard method of measuring this bias in a genetic association study is to compare the observed median association test statistic to the expected median test statistic. This ratio is inflated in the presence of cryptic population structure. However, inflation may also be caused by the properties of the association test itself particularly in the analysis of rare variants. We compared the properties of the three most commonly used association tests: the likelihood ratio test, the Wald test and the score test when testing rare variants for association using simulated data. We found evidence of inflation in the median test statistics of the likelihood ratio and score tests for tests of variants with less than 20 heterozygotes across the sample, regardless of the total sample size. The test statistics for the Wald test were under-inflated at the median for variants below the same minor allele frequency. In a genetic association study, if a substantial proportion of the genetic variants tested have rare minor allele frequencies, the properties of the association test may mask the presence or absence of bias due to population structure. The use of either the likelihood ratio test or the score test is likely to lead to inflation in the median test statistic in the absence of population structure. In contrast, the use of the Wald test is likely to result in under-inflation of the median test statistic which may mask the presence of population structure.
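The ratio described in this abstract, observed median test statistic over expected median, can be sketched as below. The sketch assumes the association statistics follow a chi-square distribution with 1 degree of freedom under the null hypothesis, which the abstract does not state explicitly; the function name and the statistics are hypothetical.

```python
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 degree of freedom, computed as the
# square of the 75th percentile of a standard normal (approximately 0.455).
EXPECTED_MEDIAN = NormalDist().inv_cdf(0.75) ** 2

def inflation_ratio(test_statistics):
    """Observed median association statistic divided by the expected median."""
    return median(test_statistics) / EXPECTED_MEDIAN

# Hypothetical 1-df association test statistics, not data from the study.
stats = [0.1, 0.3, 0.5, 0.9, 2.4]
ratio = inflation_ratio(stats)
print(round(ratio, 3))
```

A ratio near 1 suggests no inflation, a ratio above 1 suggests inflated statistics, and a ratio below 1 corresponds to the under-inflation the authors describe for the Wald test.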
Predictive Value of the Korean Academy of Family Medicine In-Training Examination for Certifying Examination

PubMed Central

Kim, Ji-Yong

2011-01-01

Background In-training examination (ITE) is a cognitive examination similar to the written test, but it is different from the Clinical Practice Examination of the Korean Academy of Family Medicine (KAFM) Certification Examination (CE). The objective of this study is to estimate the positive predictive value of the KAFM-ITE for identifying residents at risk for poor performance on the three types of KAFM-CE. Methods A total of 372 residents who completed the KAFM-CE in 2011 were included. We compared the mean KAFM-CE scores with ITE experience. We evaluated the correlation and the positive predictive value (PPV) of ITE for the multiple choice question (MCQ) scores of 1st written test & 2nd slide examination, the total clinical practice examination scores, and the total sum of 2nd test. Results Of the 372 residents, 275 completed the ITE. Those who completed the ITE had significantly higher MCQ scores on the 1st written test than those who did not. The correlation of ITE scores with 1st written MCQ (0.627) was found to be the highest among the other kinds of CE. The PPV of the ITE score for 1st written MCQ scores was 0.672. The PPV of the ITE score ranged from 0.376 to 0.502. Conclusion The score of the KAFM ITE has an acceptable positive predictive value and could be used as part of a comprehensive evaluation system for residents in the cognitive field. PMID:22745873
Dog-appeasing pheromone collars reduce sound-induced fear and anxiety in beagle dogs: a placebo-controlled study.

PubMed

Landsberg, G M; Beck, A; Lopez, A; Deniaud, M; Araujo, J A; Milgram, N W

2015-09-12

The objective of the study was to assess the effects of a dog-appeasing pheromone (DAP) collar in reducing sound-induced fear and anxiety in a laboratory model of thunderstorm simulation. Twenty-four beagle dogs naïve to the current test were divided into two treatment groups (DAP and placebo) balanced on their fear score in response to a thunderstorm recording. Each group was then exposed to two additional thunderstorm simulation tests on consecutive days. Dogs were video-assessed by a trained observer on a 6-point scale for active, passive and global fear and anxiety (combined). Both global and active fear and anxiety scores were significantly improved during and following thunder compared with placebo on both test days. DAP significantly decreased global fear and anxiety across 'during' and 'post' thunder times when compared with baseline. There was no significant improvement in the placebo group from baseline on the test days. In addition, the DAP group showed significantly greater use of the hide box at any time with increased exposure compared with the placebo group. The DAP collar reduced the scores of fear and anxiety, and increased hide use in response to a thunder recording, possibly by counteracting noise-related increased reactivity. British Veterinary Association.
Dog-appeasing pheromone collars reduce sound-induced fear and anxiety in beagle dogs: a placebo-controlled study

PubMed Central

Landsberg, G. M.; Beck, A.; Lopez, A.; Deniaud, M.; Araujo, J. A.; Milgram, N. W.

2015-01-01

The objective of the study was to assess the effects of a dog-appeasing pheromone (DAP) collar in reducing sound-induced fear and anxiety in a laboratory model of thunderstorm simulation. Twenty-four beagle dogs naïve to the current test were divided into two treatment groups (DAP and placebo) balanced on their fear score in response to a thunderstorm recording. Each group was then exposed to two additional thunderstorm simulation tests on consecutive days. Dogs were video-assessed by a trained observer on a 6-point scale for active, passive and global fear and anxiety (combined). Both global and active fear and anxiety scores were significantly improved during and following thunder compared with placebo on both test days. DAP significantly decreased global fear and anxiety across ‘during’ and ‘post’ thunder times when compared with baseline. There was no significant improvement in the placebo group from baseline on the test days. In addition, the DAP group showed significantly greater use of the hide box at any time with increased exposure compared with the placebo group. The DAP collar reduced the scores of fear and anxiety, and increased hide use in response to a thunder recording, possibly by counteracting noise-related increased reactivity. PMID:26311736

Single measure and gated screening approaches for identifying students at-risk for academic problems: Implications for sensitivity and specificity.

PubMed

Van Norman, Ethan R; Nelson, Peter M; Klingbeil, David A

2017-09-01

Educators need recommendations to improve screening practices without limiting students' instructional opportunities. Repurposing previous years' state test scores has shown promise in identifying at-risk students within multitiered systems of support. However, researchers have not directly compared the diagnostic accuracy of previous years' state test scores with data collected during fall screening periods to identify at-risk students. In addition, the benefit of using previous state test scores in conjunction with data from a separate measure to identify at-risk students has not been explored. The diagnostic accuracy of 3 types of screening approaches were tested to predict proficiency on end-of-year high-stakes assessments: state test data obtained during the previous year, data from a different measure administered in the fall, and both measures combined (i.e., a gated model). Extant reading and math data (N = 2,996) from 10 schools in the Midwest were analyzed. When used alone, both measures yielded similar sensitivity and specificity values. The gated model yielded superior specificity values compared with using either measure alone, at the expense of sensitivity. Implications, limitations, and ideas for future research are discussed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
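The trade-off reported for the gated model can be illustrated with a small sketch. It assumes that "gated" means a student is flagged as at-risk only when both measures flag the student; the abstract does not define the gating rule, and the flags below are hypothetical.

```python
def sensitivity_specificity(flags, truth):
    """flags: screening decisions; truth: True if the student was not proficient."""
    tp = sum(f and t for f, t in zip(flags, truth))
    tn = sum((not f) and (not t) for f, t in zip(flags, truth))
    positives = sum(truth)
    negatives = len(truth) - positives
    return tp / positives, tn / negatives

def gated_flags(first_measure, second_measure):
    """Assumed gating rule: flag a student only if both measures flag them."""
    return [a and b for a, b in zip(first_measure, second_measure)]

# Hypothetical data for eight students, not data from the study.
not_proficient = [True, True, True, True, False, False, False, False]
prior_state_test = [True, True, True, False, True, False, False, False]
fall_screener = [True, True, False, True, False, True, False, False]

single_state = sensitivity_specificity(prior_state_test, not_proficient)
single_fall = sensitivity_specificity(fall_screener, not_proficient)
gated = sensitivity_specificity(
    gated_flags(prior_state_test, fall_screener), not_proficient
)
print(single_state, single_fall, gated)
```

Requiring both measures to agree removes false positives that only one measure produces, which raises specificity, but it also drops truly at-risk students whom only one measure catches, which lowers sensitivity.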
Tests of measurement invariance failed to support the application of the "then-test".

PubMed

Nolte, Sandra; Elsworth, Gerald R; Sinclair, Andrew J; Osborne, Richard H

2009-11-01

The use of then-test (retrospective pre-test) scores has frequently been proposed as a solution to potential confounding of change scores because of response shift, as it is assumed that then-test and post-test responses are provided from the same perspective. However, this assumption has not been formally tested using robust quantitative methods. The aim of this study was to compare the psychometric performance of then-test/post-test with traditional pre-test/post-test data and to assess whether the resulting data structures support the application of the then-test for evaluations of chronic disease self-management interventions. Pre-test, post-test, and then-test data were collected from 314 participants of self-management courses using the Health Education Impact Questionnaire (heiQ). The derived change scores (pre-test/post-test; then-test/post-test) were examined for their psychometric performance using tests of measurement invariance. Few questionnaire items were noninvariant across pre-test/post-test, with four items identified and requiring removal to enable an unbiased comparison of factor means. In contrast, 12 items were identified and required removal in then-test/post-test data to avoid biased change score estimates. Traditional pre-test/post-test data appear to be robust with little indication of response shift. In contrast, the weaker psychometric performance of then-test/post-test data suggests psychometric flaws that may be the result of implicit theory of change, social desirability, and recall bias.
A comparison of the marginal adaptation of cathode-arc vapor-deposited titanium and cast base metal copings

PubMed Central

Wu, JC; Lai, LC; Sheets, CG; Earthman, J; Newcomb, R

2011-01-01

Statement of problem A new fabrication process has been developed in which a titanium coping, which has a gold-colored titanium nitride outer layer, can be reliably fused to porcelain, but the marginal adaptation characteristics are still undetermined. Purpose The primary purpose of this study is to compare the rate of Clinically Acceptable Marginal Adaptation (CAMA, defined as a marginal gap mean ≤60 μm) of cathode-arc vapor-deposited titanium with the CAMA rate for the cast base metal copings. In addition, the study will evaluate the marginal gap scores themselves to assess their mean difference between the two study groups. Finally, the study will present two analyses of group differences in variability to support the contention that the titanium copings perform more consistently than their base metal counterparts. Material and methods Thirty-seven cathode-arc vapor-deposited titanium copings and 40 cast base metal copings were evaluated by computer-based image analysis using an optical microscope. The conventional lost wax technique was used to fabricate the 40 cast base metal copings that were 0.3 mm thick. The titanium copings were 0.3 mm thick and were formed by a collection of atomic titanium vapor onto a refractory die duplicate in a high vacuum chamber. Fifty vertical marginal gap measurements were collected from each of the 77 copings and the mean of these measurements was computed to form a gap score for each coping. Next, the gap score was compared to the 60 μm criterion to classify each coping as to whether it did or did not achieve Clinically Acceptable Marginal Adaption (CAMA). A comparison of the CAMA rates for each type of coping was used to address the primary purpose of this study. In addition, the gap scores themselves were used to test the (one-sided) hypothesis that the mean of the titanium gap scores is smaller than the mean of the base metal gap scores. Finally, the assertion that the titanium copings provide more consistency in their marginal gap performance was tested in two ways. First, the means of the titanium gap scores were compared to the means of the marginal gap scores for the base metal copings. Second, the standard deviations of the marginal gap scores for the titanium copings were compared with those for the base metal copings. Results Statistical comparison of the CAMA rates for each type of coping showed that the CAMA criterion was achieved by 24 of the 37 (64.86%) titanium copings, while 19 of the 40 (47.50%) base metal copings met this same standard. Noninferiority of the titanium copings was established by the 2-sided 90% Confidence Interval for the 17.36% difference in these rates (−0.95%, 35.68%) and noninferiority of titanium coping adaption was also demonstrated by the Wald Test rejection of the tentative hypothesis of inferiority (Z-score=1.9191, one-sided p=0.0275). The mean of the vertical marginal gap scores for the titanium copings (56.9025) was significantly less than the mean of the marginal gap scores for the base metal copings (71.9041) as shown by the Satterthwaite t-score=−2.29 (one-sided p=0.0126). To compare the adaption consistency of the titanium copings to the base metal counterparts, the difference between the variance of the marginal gap scores for the titanium copings (594.843) and the variance of the marginal gap scores for the base metal copings (1510.901) was tested and found to be statistically significant (Folded-F test score=2.63, p=0.0042). Our second method for showing that the titanium copings performed more consistently than the base metal comparisons was to use a one-sided test to show that the mean of the standard deviations of the vertical gap measurements for each titanium coping (29.9835) was significantly lower than the mean of the standard deviations of the vertical gap measurements for each base metal coping (36.1332). This test produced a Satterthwaite’s t-score of −2.24 (one-sided p=0.0141), indicating the titanium adaption was significantly more consistent. Conclusions Cathode-arc vapor deposited titanium copings exhibited a higher rate of Clinically Acceptable Marginal Adaption (CAMA) than the comparison base metal copings. Comparison of the coping marginal adaption score variances and direct assessment of the coping marginal adaption scores provided additional evidence that the titanium copings performed better and with more consistency than their base metal counterparts. PMID:21640242
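The reported difference in CAMA rates and its 90% confidence interval can be checked from the counts given in the abstract. The sketch below assumes an unpooled (Wald) standard error for the difference of two proportions; the abstract does not state the exact interval method, and the variable names are illustrative. The Wald Z-score is not reproduced because the noninferiority margin it depends on is not given in the abstract.

```python
from math import sqrt
from statistics import NormalDist

# Counts taken from the abstract.
ti_ok, ti_n = 24, 37   # titanium copings meeting the CAMA criterion
bm_ok, bm_n = 19, 40   # base metal copings meeting the CAMA criterion

p_ti = ti_ok / ti_n
p_bm = bm_ok / bm_n
diff = p_ti - p_bm

# Assumption: unpooled Wald standard error of the rate difference.
se = sqrt(p_ti * (1 - p_ti) / ti_n + p_bm * (1 - p_bm) / bm_n)

# A two-sided 90% interval uses the 95th percentile of the standard normal.
z90 = NormalDist().inv_cdf(0.95)
ci_low, ci_high = diff - z90 * se, diff + z90 * se

print(f"difference = {diff:.2%}, 90% CI = ({ci_low:.2%}, {ci_high:.2%})")
```

Under this assumption the interval agrees with the published (−0.95%, 35.68%) to within rounding.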
Association Between Medication Use and Performance on Higher Education Entrance Tests in Individuals With Attention-Deficit/Hyperactivity Disorder.

PubMed

Lu, Yi; Sjölander, Arvid; Cederlöf, Martin; D'Onofrio, Brian M; Almqvist, Catarina; Larsson, Henrik; Lichtenstein, Paul

2017-08-01

Individuals with attention-deficit/hyperactivity disorder (ADHD) are at greater risk for academic problems. Pharmacologic treatment is effective in reducing the core symptoms of ADHD, but it is unclear whether it helps to improve academic outcomes. To investigate the association between the use of ADHD medication and performance on higher education entrance tests in individuals with ADHD. This cohort study observed 61 640 individuals with a diagnosis of ADHD from January 1, 2006, to December 31, 2013. Records of their pharmacologic treatment were extracted from Swedish national registers along with data from the Swedish Scholastic Aptitude Test. Using a within-patient design, test scores when patients were taking medication for ADHD were compared with scores when they were not taking such medication. Data analysis was performed from November 24, 2015, to November 4, 2016. Periods with and without ADHD medication use. Scores from the higher education entrance examination (score range, 1-200 points). Among 930 individuals (493 males and 437 females; mean [SD] age, 22.2 [3.2] years) who had taken multiple entrance tests (n = 2524) and used ADHD medications intermittently, the test scores were a mean of 4.80 points higher (95% CI, 2.26-7.34; P < .001) during periods they were taking medication vs nonmedicated periods, after adjusting for age and practice effects. Similar associations between ADHD medication use and test scores were detected in sensitivity analyses. Individuals with ADHD had higher scores on the higher education entrance tests during periods they were taking ADHD medication vs nonmedicated periods. These findings suggest that ADHD medications may help ameliorate educationally relevant outcomes in individuals with ADHD.
The video-based test of communication skills: description, development, and preliminary findings.

PubMed

Mazor, Kathleen M; Haley, Heather-Lyn; Sullivan, Kate; Quirk, Mark E

2007-01-01

The importance of assessing physician-patient communication skills is widely recognized, but assessment methods are limited. Objective structured clinical examinations are time-consuming and resource intensive. For practicing physicians, patient surveys may be useful, but these also require substantial resources. Clearly, it would be advantageous to develop alternative or supplemental methods for assessing communication skills of medical students, residents, and physicians. The Video-based Test of Communication Skills (VTCS) is an innovative, computer-administered test, consisting of 20 very short video vignettes. In each vignette, a patient makes a statement or asks a question. The examinee responds verbally, as if it was a real encounter and he or she were the physician. Responses are recorded for later scoring. Test administration takes approximately 1 h. Generalizability studies were conducted, and scores for two groups of physicians predicted to differ in their communication skills were compared. Preliminary results are encouraging; the estimated g coefficient for the communication score for 20-vignette test (scored by five raters) is 0.79; g for the personal/affective score under the same conditions is 0.62. Differences between physicians were in the predicted direction, with physicians considered "at risk" for communication difficulties scoring lower than those not so identified. The VTCS is a short, portable test of communication skills. Results reported here suggest that scores reflect differences in skill levels and are generalizable. However, these findings are based on very small sample sizes and must be considered preliminary. Additional work is required before it will be possible to argue confidently that this test in particular, and this approach to testing communication skills in general, is valuable and likely to make a substantial contribution to assessment in medical education.
The Face-Symbol Test and the Symbol-Digit Test are not reliable surrogates for the Paced Auditory Serial Addition Test in multiple sclerosis.

PubMed

Williams, J; O'Rourke, K; Hutchinson, M; Tubridy, N

2006-10-01

The Paced Auditory Serial Addition Test (PASAT) is the chosen task for cognitive assessment in the multiple sclerosis functional composite (MSFC) and a widely used task in neuropsychological studies of people with multiple sclerosis (MS), but is unpopular with patients. The Face-Symbol Test (FST) and Symbol-Digit Test (SDT) are alternative methods of cognitive testing in MS, which are easily administered and patient-friendly. In order to evaluate the potential of the FST as a possible surrogate for the PASAT, we directly compared the FST to the PASAT and the SDT in a cohort of 50 MS patients with varying levels of disability. There was significant correlation between SDT and FST scores (Spearman's rho 0.80, 95% CI 0.66-0.88; R² 65%), with moderate inter-test agreement (κ = 0.52). In contrast, SDT and FST scores were less predictive of PASAT scores. We concluded that neither the FST nor the SDT is a reliable surrogate for the PASAT.
Identifying dyslexia in adults: an iterative method using the predictive value of item scores and self-report questions.

PubMed

Tamboer, Peter; Vorst, Harrie C M; Oort, Frans J

2014-04-01

Methods for identifying dyslexia in adults vary widely between studies. Researchers have to decide how many tests to use, which tests are considered to be the most reliable, and how to determine cut-off scores. The aim of this study was to develop an objective and powerful method for diagnosing dyslexia. We took various methodological measures, most of which are new compared to previous methods. We used a large sample of Dutch first-year psychology students, we considered several options for exclusion and inclusion criteria, we collected as many cognitive tests as possible, we used six independent sources of biographical information for a criterion of dyslexia, we compared the predictive power of discriminant analyses and logistic regression analyses, we used both sum scores and item scores as predictor variables, we used self-report questions as predictor variables, and we retested the reliability of predictions with repeated prediction analyses using an adjusted criterion. We were able to identify 74 dyslexic and 369 non-dyslexic students. For 37 students, various predictions were too inconsistent for a final classification. The most reliable predictions were acquired with item scores and self-report questions. The main conclusion is that it is possible to identify dyslexia with a high reliability, although the exact nature of dyslexia is still unknown. We therefore believe that this study yielded valuable information for future methods of identifying dyslexia in Dutch as well as in other languages, and that this would be beneficial for comparing studies across countries.
Generalized functional linear models for gene-based case-control association studies.

PubMed

Fan, Ruzong; Wang, Yifan; Mills, James L; Carter, Tonia C; Lobach, Iryna; Wilson, Alexander F; Bailey-Wilson, Joan E; Weeks, Daniel E; Xiong, Momiao

2014-11-01

By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene region are disease related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease datasets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. © 2014 WILEY PERIODICALS, INC.
Generalized Functional Linear Models for Gene-based Case-Control Association Studies

PubMed Central

Fan, Ruzong; Wang, Yifan; Mills, James L.; Carter, Tonia C.; Lobach, Iryna; Wilson, Alexander F.; Bailey-Wilson, Joan E.; Weeks, Daniel E.; Xiong, Momiao

2014-01-01

By using functional data analysis techniques, we developed generalized functional linear models for testing association between a dichotomous trait and multiple genetic variants in a genetic region while adjusting for covariates. Both fixed and mixed effect models are developed and compared. Extensive simulations show that Rao's efficient score tests of the fixed effect models are very conservative since they generate lower type I errors than nominal levels, and global tests of the mixed effect models generate accurate type I errors. Furthermore, we found that the Rao's efficient score test statistics of the fixed effect models have higher power than the sequence kernel association test (SKAT) and its optimal unified version (SKAT-O) in most cases when the causal variants are both rare and common. When the causal variants are all rare (i.e., minor allele frequencies less than 0.03), the Rao's efficient score test statistics and the global tests have similar or slightly lower power than SKAT and SKAT-O. In practice, it is not known whether rare variants or common variants in a gene are disease-related. All we can assume is that a combination of rare and common variants influences disease susceptibility. Thus, the improved performance of our models when the causal variants are both rare and common shows that the proposed models can be very useful in dissecting complex traits. We compare the performance of our methods with SKAT and SKAT-O on real neural tube defects and Hirschsprung's disease data sets. The Rao's efficient score test statistics and the global tests are more sensitive than SKAT and SKAT-O in the real data analysis. Our methods can be used in either gene-disease genome-wide/exome-wide association studies or candidate gene analyses. PMID:25203683
A pilot study: the development of a culturally tailored Malaysian Diabetes Education Module (MY-DEMO) based on the Health Belief Model

PubMed Central

2014-01-01

Background Diabetes education and self-care remain the cornerstone of diabetes management. There are many structured diabetes modules available in the United Kingdom, Europe and United States of America. In contrast, few structured and validated diabetes modules are available in Malaysia. This pilot study aims to develop and validate diabetes education material suitable and tailored for a multicultural society like Malaysia. Methods The theoretical framework of this module was founded on the Health Belief Model (HBM). The participants were assessed using 6-item pre- and post-test questionnaires that measured some of the known HBM constructs, namely cues to action, perceived severity and perceived benefit. Data were analysed using PASW Statistics 18.0. Results The pre- and post-test questionnaires were administered to 88 participants (31 males). In general, there was a significant increase in the total score in post-test (97.34 ± 6.13%) compared to pre-test (92.80 ± 12.83%) (p < 0.05) and a significant increase in excellent score (>85%) at post-test (84.1%) compared to pre-test (70.5%) (p < 0.05). There was an improvement in post-test score in 4 of 6 items tested. The remaining 2 items, which measured perceived severity and cues to action, had poorer post-test scores. Conclusions The preliminary results from this pilot study suggest that the contextualised content material embedded within MY-DEMO may be suitable for integration with the existing diabetes education programmes. This was the first known validated diabetes education programme available in the Malay language. PMID:24708715
Effect of Video-Assisted Teaching Module (VATM) on Knowledge of ASHAs regarding RNTCP in Kuchinda Block of Sambalpur (Odisha).

PubMed

Pradhan, Malati; Dash, Bijayalakshmi

2015-05-01

Infectious disease is a major public health issue for both developed and developing countries. Among infectious diseases, tuberculosis (TB) is most prevalent in the developing countries. India is the highest TB burden country in the world and accounts for nearly one fifth (20%) of the global burden of tuberculosis. A pre-experimental study with a one-group pre- and post-test design (no control group) was undertaken in Kuchinda block of Sambalpur district (Odisha) with the objective of assessing the effectiveness of a Video-assisted Teaching Module (VATM) on the knowledge of Accredited Social Health Activists (ASHAs) regarding the Revised National Tuberculosis Control Programme (RNTCP). Data were collected through a structured questionnaire from 52 ASHAs selected by systematic random sampling. The overall mean pre-test score was 23.31±3.07, which is 58.27 percent of the maximum score and indicates good knowledge, whereas the mean post-test score was 34.35±3.56, which is 85.87 percent of the maximum score, showing a difference of 27.6 percent in effectiveness. A highly significant (p<0.01) difference was found between pre- and post-test knowledge scores, and no significant (p>0.05) association was found between post-test knowledge score and any of the demographic variables of the ASHAs.
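The percentages in this abstract follow from simple arithmetic on the mean scores. The sketch below assumes a maximum score of 40, which the abstract does not state but which is implied by 23.31 being reported as 58.27 percent of the maximum; the variable names are illustrative.

```python
MAX_SCORE = 40  # assumption: inferred from 23.31 / 0.5827, not stated in the abstract

pre_mean, post_mean = 23.31, 34.35  # mean knowledge scores from the abstract

pre_pct = 100 * pre_mean / MAX_SCORE    # share of the maximum before the module
post_pct = 100 * post_mean / MAX_SCORE  # share of the maximum after the module
gain = post_pct - pre_pct               # change in percentage points

print(f"pre = {pre_pct:.3f}%, post = {post_pct:.3f}%, gain = {gain:.1f} points")
```

Under that assumption the 27.6 figure is a difference in percentage points between the pre-test and post-test means.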
Online Module for Carrier Screening in Ashkenazi Jewish Individuals Compared with In-Person Genetics Education: A Randomized Controlled Trial.

PubMed

Fan, Chia Wei; Castonguay, Lysanne; Rummell, Sonja; Lévesque, Sébastien; Mitchell, John J; Sillon, Guillaume

2018-04-01

To increase accessibility to genetics services for low-urgency patients seeking Ashkenazi Jewish (AJ) carrier screening, we designed an interactive computer (IC) module that provides pre-test genetics education and allows genetics professionals to order the test without meeting the patients beforehand. We compared this module with in-person genetic counseling (GC) using a randomized trial. AJ individuals were randomized to undergo genetics education via the IC module (n = 26) or GC (n = 28). We compared post-interventional genetics knowledge, perceived genetic risk, and anxiety between the two groups, after accounting for pre-interventional scores, using ANCOVA. The Wilcoxon rank-sum test was used to compare post-interventional satisfaction. Post-interventional genetics knowledge, risk perception, and anxiety were not significantly different between the two groups after accounting for baseline scores (p = 0.50-0.54), although the data are inconclusive regarding the module's non-inferiority at a 5% margin. Post-intervention satisfaction scores were generally higher in the GC group than the IC module group. Our IC module has the potential to improve access to clinical genetics services for patients and staff, but it is not suitable for all AJ patients and cannot completely replace the benefits of in-person consultations.
Effect of two additional interventions, test and reflection, added to standard cardiopulmonary resuscitation training on seventh grade students' practical skills and willingness to act: a cluster randomised trial.

PubMed

Nord, Anette; Hult, Håkan; Kreitz-Sandberg, Susanne; Herlitz, Johan; Svensson, Leif; Nilsson, Lennart

2017-06-23

The aim of this research is to investigate if two additional interventions, test and reflection, after standard cardiopulmonary resuscitation (CPR) training facilitate learning by comparing 13-year-old students' practical skills and willingness to act. Seventh grade students in council schools of two municipalities in south-east Sweden. School classes were randomised to CPR training only (O), CPR training with a practical test including feedback (T) or CPR training with reflection and a practical test including feedback (RT). Measures of practical skills and willingness to act in a potential life-threatening situation were studied directly after training and at 6 months using a digital reporting system and a survey. A modified Cardiff test was used to register the practical skills, where scores in each of 12 items resulted in a total score of 12-48 points. The study was conducted in accordance with current European Resuscitation Council guidelines during December 2013 to October 2014. 29 classes for a total of 587 seventh grade students were included in the study. The total score of the modified Cardiff test at 6 months was the primary outcome. Secondary outcomes were the total score directly after training, the 12 individual items of the modified Cardiff test and willingness to act. At 6 months, the T and O groups scored 32 (3.9) and 30 (4.0) points, respectively (p<0.001), while the RT group scored 32 (4.2) points (not significant when compared with T). There were no significant differences in willingness to act between the groups after 6 months. A practical test including feedback directly after training improved the students' acquisition of practical CPR skills. Reflection did not increase further CPR skills. At 6-month follow-up, no intervention effect was found regarding willingness to make a life-saving effort. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. 
Effect of two additional interventions, test and reflection, added to standard cardiopulmonary resuscitation training on seventh grade students’ practical skills and willingness to act: a cluster randomised trial

PubMed Central

Nord, Anette; Hult, Håkan; Kreitz-Sandberg, Susanne; Herlitz, Johan; Svensson, Leif; Nilsson, Lennart

2017-01-01

Objectives The aim of this research is to investigate if two additional interventions, test and reflection, after standard cardiopulmonary resuscitation (CPR) training facilitate learning by comparing 13-year-old students’ practical skills and willingness to act. Settings Seventh grade students in council schools of two municipalities in south-east Sweden. Design School classes were randomised to CPR training only (O), CPR training with a practical test including feedback (T) or CPR training with reflection and a practical test including feedback (RT). Measures of practical skills and willingness to act in a potential life-threatening situation were studied directly after training and at 6 months using a digital reporting system and a survey. A modified Cardiff test was used to register the practical skills, where scores in each of 12 items resulted in a total score of 12–48 points. The study was conducted in accordance with current European Resuscitation Council guidelines during December 2013 to October 2014. Participants 29 classes for a total of 587 seventh grade students were included in the study. Primary and secondary outcome measures The total score of the modified Cardiff test at 6 months was the primary outcome. Secondary outcomes were the total score directly after training, the 12 individual items of the modified Cardiff test and willingness to act. Results At 6 months, the T and O groups scored 32 (3.9) and 30 (4.0) points, respectively (p<0.001), while the RT group scored 32 (4.2) points (not significant when compared with T). There were no significant differences in willingness to act between the groups after 6 months. Conclusions A practical test including feedback directly after training improved the students’ acquisition of practical CPR skills. Reflection did not increase further CPR skills. At 6-month follow-up, no intervention effect was found regarding willingness to make a life-saving effort. PMID:28645953
Surgical Treatment Assessment of Cervical Laminoplasty Using Quantitative Performance Evaluation in Elderly Patients: A Prospective Comparative Study in 505 Patients With Cervical Spondylotic Myelopathy.

PubMed

Machino, Masaaki; Yukawa, Yasutsugu; Imagama, Shiro; Ito, Keigo; Katayama, Yoshito; Matsumoto, Tomohiro; Inoue, Taro; Ouchida, Jun; Tomita, Keisuke; Ishiguro, Naoki; Kato, Fumihiko

2016-05-01

A prospective cohort study. The purpose of this study was to compare surgical outcomes between non-elderly and elderly patients with cervical spondylotic myelopathy (CSM) who underwent laminoplasty. Since age at the time of surgery influences the surgical outcome, we designed a large-scale cohort study to examine the surgical outcome for CSM from a single operative procedure used exclusively in elderly patients. A total of 505 consecutive patients with CSM (311 men; 194 women) were prospectively enrolled. The mean age was 66.6 years (range, 41-91), and the average postoperative follow-up period was 26.5 ± 12.5 months. Patients were divided into three groups according to age: non-elderly (<65 yr, n = 201), young-old (65-74 yr, n = 186), and old-old (≥75 yr, n = 118). Pre- and postoperative neurological status was evaluated using the Japanese Orthopaedic Association scoring system for cervical myelopathy (JOA score) and quantifiable tests-the 10-s grip and release test (10-s G&R test) and the 10-s step test. Mean achieved JOA scores in non-elderly, young-old, and old-old groups were 3.1, 3.2, and 3.0, respectively, with no significant difference among three groups (P = 0.5735). Mean preoperative 10-s G&R test results were 17.3, 14.4, and 13.0, respectively, indicating a significant decrease with increasing age, whereas postoperative results significantly improved in all groups (21.0, 17.9, and 16.3, respectively). Similarly, the 10-s step test significantly decreased with age, with preoperative scores of 14.3, 11.5, and 8.6, respectively, whereas postoperative scores improved to 17.3, 14.9, and 12.5, respectively. The three groups showed no significant difference in the rate of postoperative complications. Elderly patients adequately recovered from laminoplasty in terms of achieved JOA score, the 10-s G&R test, and the 10-s step test. Therefore, laminoplasty for CSM is beneficial in elderly patients. 2.
Patient adjustment to reduced olfactory function.

PubMed

Croy, Ilona; Landis, Basile N; Meusel, Thomas; Seo, Han-Seok; Krone, Franziska; Hummel, Thomas

2011-04-01

To compare the importance of olfaction in daily life between patients with olfactory disorders and healthy normosmic individuals. Quasiexperimental. A total of 470 individuals (235 anosmic or hyposmic patients and 235 normosmic control individuals). The Individual Importance of Olfaction Questionnaire (IO) and olfactory testing using the "Sniffin' Sticks" test kit. The IO scores were lower in people with smell disorders compared with normosmic subjects (P < .001) and lower in patients with anosmia compared with hyposmic patients (P < .001). These scores suggest adjustment processes in the daily use of the sense of smell by patients. Patients attach less importance to their current sense of smell in daily life than do normosmic individuals. This adjustment might be an example of regaining psychological health despite acquired and long-lasting impairments.
Effects of Didactic Instruction and Test-Enhanced Learning in a Nursing Review Course.

PubMed

Tu, Yu-Ching; Lin, Yi-Jung; Lee, Jonathan W; Fan, Lir-Wan

2017-11-01

Determining the most effective approach for students' successful academic performance and achievement on the national licensure examination for RNs is important to nursing education and practice. A quasi-experimental design was used to compare didactic instruction and test-enhanced learning among nursing students divided into two fundamental nursing review courses in their final semester. Students in each course were subdivided into low-, intermediate-, and high-score groups based on their first examination scores. Mixed model of repeated measure and two-way analysis of variance were applied to evaluate students' academic results and both teaching approaches. Intermediate-scoring students' performances improved more through didactic instruction, whereas low-scoring students' performances improved more through test-enhanced learning. Each method had differing effects on individual subgroups within the different performance level groups of their classes, which points to the importance of considering both the didactic and test-enhanced learning approaches. [J Nurs Educ. 2017;56(11):683-687.]. Copyright 2017, SLACK Incorporated.
Some Considerations in Maintaining Adaptive Test Item Pools.

ERIC Educational Resources Information Center

Stocking, Martha L.

The construction of parallel editions of conventional tests for purposes of test security while maintaining score comparability has always been a recognized and difficult problem in psychometrics and test construction. The introduction of new modes of test construction, e.g., adaptive testing, changes the nature of the problem, but does not make…
Convergence Insufficiency Symptom Survey Scores for Reading Versus Other Near Visual Activities in School-Age Children.

PubMed

Clark, Tiana Y; Clark, Robert A

2015-11-01

To measure the difference in Convergence Insufficiency Symptom Survey scores for reading vs favorite near visual activities. Comparative validity analysis of diagnostic tools. At a single clinical private practice, 100 children aged 9-18 with normal binocular vision were recruited to receive either the original survey emphasizing reading or a modified survey replacing "reading" with their favorite near activity. Average survey scores and subscores for questions emphasizing fatigue, discomfort, impaired vision, and cognitive performance were compared using t tests, while responses to individual questions were compared using Mann-Whitney U tests. The average reading survey score was significantly greater than the favorite near activity survey score (14.1 ± 11.5 vs 6.7 ± 5.8, P = .0001). The largest difference resulted from questions emphasizing cognitive performance (subscore 5.8 ± 4.3 vs 2.0 ± 2.1, P = .0000002), although significant differences were also found for fatigue (5.4 ± 3.8 vs 3.0 ± 2.7, P = .0003), discomfort (3.9 ± 4.6 vs 1.8 ± 2.2, P = .004), and impaired vision (3.2 ± 3.9 vs 1.8 ± 2.2, P = .02). Significant differences were found for 7 survey questions, with higher symptom scores for the reading survey in every case. Using survey scores ≥16 to diagnose convergence insufficiency, significantly more children taking the reading survey would have been diagnosed with convergence insufficiency than children taking the favorite near activity survey (19 of 50 [38%] vs 5 of 50 [10%], P = .001). By emphasizing reading, the Convergence Insufficiency Symptom Survey score significantly overestimates near visual symptoms in children with normal binocular vision compared with symptoms caused by preferred near activities that require similar amplitudes of accommodation and convergence. Copyright © 2015 Elsevier Inc. All rights reserved.
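The comparison of diagnosis rates (19 of 50 vs 5 of 50) can be checked with a 2×2 test. The abstract does not name the test used, so the sketch below assumes an uncorrected Pearson chi-square test with one degree of freedom; `chi2_2x2` is a hypothetical helper written for this illustration.

```python
from math import erfc, sqrt

def chi2_2x2(a, b, c, d):
    """Pearson chi-square (no continuity correction) for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    chi2 = n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For 1 degree of freedom, the upper-tail probability is erfc(sqrt(chi2 / 2)).
    p = erfc(sqrt(chi2 / 2))
    return chi2, p

# Rows: reading survey, favorite-activity survey; columns: score >= 16, score < 16.
chi2, p = chi2_2x2(19, 31, 5, 45)
print(f"chi-square = {chi2:.2f}, p = {p:.4f}")
```

Under this assumption the p-value comes out close to the reported P = .001; an exact test or a continuity correction would give a somewhat different value.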
Comparing the Effects of Objective Structured Assessment of Technical Skills (OSATS) and Traditional Method on Learning of Students.

PubMed

Mansoorian, Mohammad Reza; Hosseiny, Marzeih Sadat; Khosravan, Shahla; Alami, Ali; Alaviani, Mehri

2015-06-01

Despite the benefits of the objective structured assessment of technical skills (OSATS) and its appropriateness for evaluating the clinical abilities of nursing students, few studies are available on the application of this method in nursing education. The purpose of this study was to compare the effect of using OSATS and traditional methods on students' learning. We also aimed to determine students' views about these two methods and about the scores they received under each in a medical emergency course. A quasi-experimental study was performed on 45 first-semester students in nursing and medical emergencies taking a course on fundamentals of practice. The students were selected by a census method and evaluated by both the OSATS and traditional methods. Data were collected using checklists based on the 'text book of nursing procedures checklists' published by the Iranian nursing organization and a questionnaire covering learning rate and students' estimates of their received scores. Descriptive statistics as well as paired t-test and independent samples t-test were used in data analysis. The mean student score in OSATS was significantly higher than the mean score in the traditional method (P = 0.01). Moreover, the mean self-evaluation score after the traditional method was approximately the same as the score the students received in the exam. However, the mean self-evaluation score after the OSATS was relatively lower than the scores the students received in the OSATS exam. Most students believed that OSATS can evaluate a wider range of students' knowledge and skills than the traditional method. Results of this study indicated the better effect of OSATS on learning and its relative superiority in precise assessment of clinical skills compared with the traditional evaluation method. Therefore, we recommend using this method in evaluation of students in practical courses.

Relationship between substances in seminal plasma and Acrobeads Test results.

PubMed

Komori, Kazuhiko; Tsujimura, Akira; Okamoto, Yoshio; Matsuoka, Yasuhiro; Takao, Tetsuya; Miyagawa, Yasushi; Takada, Shingo; Nonomura, Norio; Okuyama, Akihiko

2009-01-01

To assess the effects of seminal plasma on sperm function. Retrospective case-control study. University hospital. One hundred fourteen infertile men. Acrobeads Test scores (0-4) and measurement of interleukin (IL)-6, soluble IL-6 receptor, epidermal growth factor, insulin-like growth factor-I (IGF-I), transforming growth factor-beta I, superoxide dismutase, calcitonin, and macrophage migration inhibitory factor (MIF) levels in seminal plasma. Kruskal-Wallis test to compare the concentrations of substances as a nonparametric test for differences among Acrobeads Test scores and a multivariable logistic regression model to find independent risk factors associated with abnormal Acrobeads Test results. The Acrobeads Test score was 0 for 7 samples, 1 for 20 samples, 2 for 18 samples, 3 for 28 samples, and 4 for 41 samples. Age, abstinence period, and semen parameters, except for sperm motility and percentage of sperm with abnormal morphology, had no effect on the Acrobeads Test results. Concentrations of IGF-I and MIF were significantly higher in patients with abnormal Acrobeads Test results. Multivariate analysis indicated that MIF and IGF-I were significantly associated with abnormal Acrobeads Test results (scores 0 to 1). Although further studies are needed, IGF-I and MIF in seminal plasma may have negative effects on sperm function.
Approaches of truck drivers and non-truck drivers toward reckless on-road behavior.

PubMed

Rosenbloom, Tova; Eldror, Ehud; Shahar, Amit

2009-07-01

The purpose of the study was to compare the reported approaches of truck drivers to those of non-truck drivers toward reckless on-road behaviors. One hundred and sixty-seven adult males, including 70 non-truck drivers, completed the questionnaires voluntarily. The truck drivers were employees of a concrete manufacturing company working at various company plants throughout Israel. Seventy were professional mixer truckers and 27 were tip-truckers. The participants completed the Reckless Driving Self-Report Scale based on Taubman Ben-Ari et al. [Taubman Ben-Ari, O., Florian, V., Mikulincer, M., 1999. The impact of mortality salience on reckless driving: a test of terror management mechanisms. Journal of Personality and Social Psychology 76, 35-45], adapted for truck drivers for this study. It was expected that non-professional drivers, as compared to professional (truck) drivers, would be more permissive regarding reckless driving, since driving risks are less prominent in their daily driving experience. An ANOVA performed on mean reckless-driving scores yielded significant results. The post hoc Scheffé test indicated significantly higher reckless-driving scores for automobile drivers as compared to both mixer-truck driver scores and tip-truck driver scores. In addition, the reckless-driving scores for mixer-truck drivers were significantly higher than the tip-truck driver scores. We discuss various explanations for the findings and consider possible implications for training strategies in organizations as well as for media campaigns focused on mutual safe road use of truck drivers and private vehicle drivers.
Lods, wrods, and mods: the interpretation of lod scores calculated under different models.

PubMed

Hodge, S E; Elston, R C

1994-01-01

In this paper we examine the relationships among classical lod scores, "wrod" scores (lod scores calculated under the wrong genetic model), and "mod" scores (lod scores maximized over genetic model parameters). We compare the behavior of these scores when the state of nature is linkage to their behavior when the state of nature is no linkage. We describe sufficient conditions for mod scores to be valid and discuss their use to determine the correct genetic model. We show that lod scores represent a likelihood-ratio test for independence. We explain the "ascertainment-assumption-free" aspect of using mod scores to determine mode of inheritance and we set this aspect into a well-established statistical framework. Finally, we summarize practical guidelines for the use of mod scores.
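As a minimal illustration of the classical lod score discussed in the abstract above, the sketch below assumes the simplest textbook setting: n phase-known, fully informative meioses with k recombinants, so that the likelihood at recombination fraction theta is theta^k (1 - theta)^(n - k). The function name and the example counts are hypothetical and are not taken from the paper. A mod score would additionally maximize over genetic model parameters, which this sketch does not attempt.

```python
import math

def lod_score(k, n, theta):
    """Classical lod score for k recombinants in n phase-known meioses.

    Assumes 0 < theta <= 0.5 (theta == 0 is only valid when k == 0).
    """
    likelihood_linked = theta ** k * (1 - theta) ** (n - k)
    likelihood_unlinked = 0.5 ** n  # theta = 0.5 corresponds to no linkage
    return math.log10(likelihood_linked / likelihood_unlinked)

# Hypothetical data: 1 recombinant in 10 meioses, scanned over a grid of theta.
thetas = [i / 100 for i in range(1, 51)]
best_theta = max(thetas, key=lambda t: lod_score(1, 10, t))
print(best_theta, round(lod_score(1, 10, best_theta), 2))
```

The grid search recovers the maximum-likelihood estimate k/n for this toy case, and the lod score at theta = 0.5 is zero by construction.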
Comparison of an expert system with other clinical scores for the evaluation of severity of asthma.

PubMed

Gautier, V; Rédier, H; Pujol, J L; Bousquet, J; Proudhon, H; Michel, C; Daurès, J P; Michel, F B; Godard, P

1996-01-01

"Asthmaexpert" was produced at the special request of several clinicians in order to obtain a better understanding of the medical decisions taken by clinical experts in the management of asthmatic patients. In order to assess the severity of asthma, a new score called Artificial Intelligence score (AI score), produced by Asthmaexpert, was compared with three other scores (Aas, Hargreave and Brooks). One hundred patients were enrolled prospectively in the study during their first consultation in the out-patient clinic. Distribution of severity level according to the different scores was studied, and the reliability between AI and other scores was evaluated by Kappa and McNemar tests. Correlations with functional parameters were performed. The AI score assessed higher levels of severity than the other scores (Kappa = 18, 28 and 10% for Aas, Hargreave and Brooks, respectively) with significant McNemar test in all cases. There was a significant correlation between AI score and forced expiratory volume in one second (FEV1) (r = 0.73). These data indicate that the AI score is a severity score which defines higher levels of severity than the chosen scores. Correlations for functional parameters are good. This score appears easy to use for the first consultation of an asthmatic patient.
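The Kappa statistic used above to compare severity scores measures agreement beyond chance. The sketch below shows how Cohen's kappa is computed from a square agreement table; the function name and the example counts are hypothetical and are not data from the study, which does not report its cross-tabulations.

```python
def cohen_kappa(table):
    """Cohen's kappa for a square agreement table (rows: score A, columns: score B)."""
    n = sum(sum(row) for row in table)
    size = len(table)
    # Observed agreement: share of patients given the same level by both scores.
    observed = sum(table[i][i] for i in range(size)) / n
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(size)) for j in range(size)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(row_totals[i] * col_totals[i] for i in range(size)) / n ** 2
    return (observed - expected) / (1 - expected)

# Hypothetical 2x2 table of severity classifications.
example = [[20, 5], [10, 15]]
print(round(cohen_kappa(example), 3))
```

Low kappa values such as those reported in the abstract indicate that two scores often place the same patient in different severity levels.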
Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.

PubMed

Zhao, Yue; Chan, Wai; Lo, Barbara Chuen Yee

2017-04-04

Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context. A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability. Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression. 
The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.
Examining the association of injury with the Functional Movement Screen and Landing Error Scoring System in military recruits undergoing 16 weeks of introductory fitness training.

PubMed

Everard, Eoin; Lyons, Mark; Harrison, Andrew J

2018-06-01

To examine the association of injury with the Functional Movement Screen (FMS) and Landing Error Scoring System (LESS) in military recruits undergoing an intensive 16-week training block. Prospective cohort study. One hundred and thirty-two entry-level male soldiers (18-25 years) were tested using the FMS and LESS. The participants underwent an intensive 16-week training program with injury data recorded daily. Chi-squared statistics were used to examine associations between injury risk and (1) poor LESS scores, (2) any score of 1 on the FMS and (3) composite FMS score of ≤14. A composite FMS score of ≤14 was not a significant predictor of injury. LESS scores of >5 and having a score of 1 on any FMS test were significantly associated with injury. LESS scores had greater relative risk, sensitivity and specificity (2.2 (95% CI=1.48-3.34); 71% and 87% respectively) than scores of 1 on the FMS (relative risk=1.32 (95% CI=1.0-1.7); sensitivity=50% and specificity=76%). There was no association between composite FMS score and injury but LESS scores and scores of 1 in the FMS test were significantly associated with injury in varying degrees. LESS scores had a much better association with injury than both any scores of 1 on the FMS and a combination of LESS scores and scores of 1 on the FMS. Furthermore, the LESS provides comparable information related to injury risk as other well-established markers associated with injury such as age, muscular strength and previous injury. Copyright © 2017. Published by Elsevier Ltd.
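Relative risk, sensitivity and specificity, as reported above, all derive from a 2x2 table of screening result against injury outcome. The abstract does not give the underlying counts, so the sketch below uses hypothetical numbers and a hypothetical function name, and it assumes the common log-method approximation for the 95% confidence interval.

```python
import math

def screening_metrics(tp, fp, fn, tn):
    """Sensitivity, specificity, and relative risk with a 95% CI (log method).

    tp: screen-positive and injured,   fp: screen-positive and uninjured,
    fn: screen-negative and injured,   tn: screen-negative and uninjured.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    risk_pos = tp / (tp + fp)  # injury risk among screen-positive recruits
    risk_neg = fn / (fn + tn)  # injury risk among screen-negative recruits
    rr = risk_pos / risk_neg
    se_log_rr = math.sqrt(1 / tp - 1 / (tp + fp) + 1 / fn - 1 / (fn + tn))
    ci = (rr * math.exp(-1.96 * se_log_rr), rr * math.exp(1.96 * se_log_rr))
    return sensitivity, specificity, rr, ci

# Hypothetical counts, not data from the study.
sens, spec, rr, (ci_low, ci_high) = screening_metrics(tp=20, fp=10, fn=10, tn=60)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} RR={rr:.2f} CI=({ci_low:.2f}, {ci_high:.2f})")
```

A confidence interval whose lower bound touches 1.0, as reported for the FMS in the abstract, signals a much weaker association than one that sits well above 1.0.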
Predictors of medical school clerkship performance: a multispecialty longitudinal analysis of standardized examination scores and clinical assessments.

PubMed

Casey, Petra M; Palmer, Brian A; Thompson, Geoffrey B; Laack, Torrey A; Thomas, Matthew R; Hartz, Martha F; Jensen, Jani R; Sandefur, Benjamin J; Hammack, Julie E; Swanson, Jerry W; Sheeler, Robert D; Grande, Joseph P

2016-04-27

Evidence suggests that poor performance on standardized tests before and early in medical school is associated with poor performance on standardized tests later in medical school and beyond. This study aimed to explore relationships between standardized examination scores (before and during medical school) with test and clinical performance across all core clinical clerkships. We evaluated characteristics of 435 students at Mayo Medical School (MMS) who matriculated 2000-2009 and for whom undergraduate grade point average, medical college aptitude test (MCAT), medical school standardized tests (United States Medical Licensing Examination [USMLE] 1 and 2; National Board of Medical Examiners [NBME] subject examination), and faculty assessments were available. We assessed the correlation between scores and assessments and determined USMLE 1 cutoffs predictive of poor performance (≤10th percentile) on the NBME examinations. We also compared the mean faculty assessment scores of MMS students vs visiting students, and for the NBME, we determined the percentage of MMS students who scored at or below the tenth percentile of first-time national examinees. MCAT scores correlated robustly with USMLE 1 and 2, and USMLE 1 and 2 independently predicted NBME scores in all clerkships. USMLE 1 cutoffs corresponding to poor NBME performance ranged from 220 to 223. USMLE 1 scores were similar among MMS and visiting students. For most academic years and clerkships, NBME scores were similar for MMS students vs all first-time examinees. MCAT, USMLE 1 and 2, and subsequent clinical performance parameters were correlated with NBME scores across all core clerkships. Even more interestingly, faculty assessments correlated with NBME scores, affirming patient care as examination preparation. USMLE 1 scores identified students at risk of poor performance on NBME subject examinations, facilitating and supporting implementation of remediation before the clinical years. 
MMS students were representative of medical students across the nation.
Evaluation of Computer-aided Strategies for Teaching Medical Students Prenatal Ultrasound Diagnostic Skills.

PubMed

Amesse, Lawrence S; Callendar, Ealena; Pfaff-Amesse, Teresa; Duke, Janice; Herbert, William N P

2008-09-24

To evaluate whether computer-based learning (CBL) improves newly acquired knowledge and is an effective strategy for teaching prenatal ultrasound diagnostic skills to third-year medical students when compared with instruction by traditional paper-based methods (PBM). We conducted a randomized, prospective study involving volunteer junior (third-year) medical students consecutively rotating through the Obstetrics and Gynecology clerkship during six months of the 2005-2006 academic year. The students were randomly assigned to permuted blocks and divided into two groups. Half of the participants received instruction in prenatal ultrasound diagnostics using an interactive CBL program; the other half received instruction using equivalent material by the traditional PBM. Outcomes were evaluated by comparing changes in pre-tutorial and post-tutorial examination scores. All 36 potential participants (100%) completed the study curriculum. Students were divided equally between the CBL (n = 18) and PBM (n = 18) groups. Pre-tutorial exam scores (mean ± s.d.) were 44% ± 11.1% for the CBL group and 44% ± 10.8% for the PBM cohort, indicating no statistically significant differences (p > 0.05) between the two groups. After instruction, post-tutorial exam scores (mean ± s.d.) increased from the pre-tutorial scores to 74% ± 11% and 67% ± 12% for students in the CBL and the PBM groups, respectively. The improvement in post-tutorial exam scores from the pre-test scores was considered significant (p < 0.05). When post-test scores for the tutorial groups were compared, the CBL subjects achieved a score that was, on average, 7 percentage points higher than their PBM counterparts, a statistically significant difference (p < 0.05). Instruction by either CBL or PBM strategies is associated with improvements in newly acquired knowledge as reflected by increased post-tutorial examination scores.
Students who received CBL had significantly higher post-tutorial exam scores than those in the PBM group, indicating that CBL is an effective instruction strategy in this setting.
Comparison of Immunofluorescence and Desmoglein Enzyme-linked Immunosorbent Assay in the Diagnosis of Pemphigus: A Prospective, Cross-sectional Study in a Tertiary Care Hospital

PubMed Central

Ravi, Deepthi; Prabhu, S Smitha; Rao, Raghavendra; Balachandran, C; Bairy, Indira

2017-01-01

Background: Pemphigus is an acquired immunobullous disorder in which antibodies are directed against epidermal cadherins. Despite the commercial availability and lower cost of enzyme-linked immunosorbent assays (ELISAs) to detect antidesmoglein 1 (Dsg1) and anti-Dsg3, immunofluorescence is still widely used for confirmation of diagnosis. Aims: (1) To compare the usefulness of indirect immunofluorescence (IIF) and ELISA tests in the diagnosis of pemphigus. (2) To find the clinical correlation between the tests and severity of the disease. Materials and Methods: Sixty-one patients (27 women and 34 men, age distribution from 20 to 75) were clinically diagnosed with pemphigus (pemphigus foliaceus - 11, pemphigus vulgaris - 50) and were recruited for the study. IIF and Dsg ELISA were performed and the findings were compared with each other and with the pemphigus area activity score. Data were entered in SPSS and were analyzed using Kruskal–Wallis test. Results: There was a moderate positive correlation between the cutaneous score and Dsg1 titer, and mucosal score and Dsg3 titer. The titer of IIF showed statistically significant positive correlation with the cutaneous score but not the mucosal score. Dsg ELISA showed higher sensitivity (90.2%) than IIF (75.4%) in the diagnosis of pemphigus. Conclusions: Dsg ELISA is a more sensitive method than IIF and shows more correlation with the disease severity. PMID:28400637
Determining if Instructional Delivery Model Differences Exist in Remedial English

ERIC Educational Resources Information Center

Carter, LaTanya Woods

2012-01-01

The purpose of this causal comparative study is to test the theory of no significant difference that compares pre- and post-test assessment scores, controlling for the instructional delivery model of online and face-to-face students at a Mid-Atlantic university. Online education and virtual distance learning programs have increased in popularity…
A Study of Assessments Designed for Student Success

ERIC Educational Resources Information Center

Delepine, Sidney G., III

2012-01-01

The purpose of this quantitative study is to compare a new assessment tool, the SkillsUSA Connect Assessment with the NOCTI assessment to determine which test results in more students achieving success. A quantitative study, designed to compare test scores of students taking the NOCTI assessment and new assessments from SkillsUSA, called the…
Tutor versus computer: a prospective comparison of interactive tutorial and computer-assisted instruction in radiology education.

PubMed

Lieberman, Gillian; Abramson, Richard; Volkan, Kevin; McArdle, Patricia J

2002-01-01

This study compared the educational effectiveness of an interactive tutorial with that of interactive computer-assisted instruction (CAI) and determined the effects of personal preference, learning style, and level of training. Fifty-four medical students and four radiology residents were prospectively, randomly assigned to receive instruction from different sections of an interactive tutorial and an interactive CAI module. Participants took tests of factual knowledge at the beginning and end of the instruction and a test of visual diagnosis at the end. They completed questionnaires to evaluate their preferred learning styles objectively and to elicit their subjective attitudes toward the two formats. Mean test scores of the tutorial and CAI groups were compared by means of analysis of covariance and two-tailed repeated-measures F test. Both the tutorial and CAI groups demonstrated significant improvement in posttest scores (P < .01 and P < .01, respectively) with the tutorial group's mean posttest score marginally but significantly higher (32.84 vs 28.13, P < .001). There were no significant interaction effects with participants' year of training (P = .845), objectively evaluated preferred learning style (P = .312), subjectively elicited attitude toward learning with CAI (P = .703), or visual diagnosis score (tutorial, 7.61; CD-ROM, 7.75; P = .79). Interactive tutorial and optimal CAI are both effective instructional formats. The tutorial was marginally but significantly more effective at teaching factual knowledge, an effect unrelated to students' year of training, learning style, or stated enjoyment of CAI. The superiority of the tutorial is expected to increase when it is compared with commercially expedient CAI modules.
Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores generated from the MMPI-2 and MMPI-2-RF test booklets: internal structure comparability in a sample of criminal defendants.

PubMed

Tarescavage, Anthony M; Alosco, Michael L; Ben-Porath, Yossef S; Wood, Arcangela; Luna-Jones, Lynn

2015-04-01

We investigated the internal structure comparability of Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores derived from the MMPI-2 and MMPI-2-RF booklets in a sample of 320 criminal defendants (229 males and 54 females). After exclusion of invalid protocols, the final sample consisted of 96 defendants who were administered the MMPI-2-RF booklet and 83 who completed the MMPI-2. No statistically significant differences in MMPI-2-RF invalidity rates were observed between the two forms. Individuals in the final sample who completed the MMPI-2-RF did not statistically differ on demographics or referral question from those who were administered the MMPI-2 booklet. Independent t tests showed no statistically significant differences between MMPI-2-RF scores generated with the MMPI-2 and MMPI-2-RF booklets on the test's substantive scales. Statistically significant small differences were observed on the revised Variable Response Inconsistency (VRIN-r) and True Response Inconsistency (TRIN-r) scales. Cronbach's alpha and standard errors of measurement were approximately equal between the booklets for all MMPI-2-RF scales. Finally, MMPI-2-RF intercorrelations produced from the two forms yielded mostly small and a few medium differences, indicating that discriminant validity and test structure are maintained. Overall, our findings reflect the internal structure comparability of MMPI-2-RF scale scores generated from MMPI-2 and MMPI-2-RF booklets. Implications of these results and limitations of these findings are discussed. © The Author(s) 2014.
Using near infrared light to manage symptoms associated with restless legs syndrome.

PubMed

Guffey, J Stephen; Motts, Susan; Barymon, Deanna; Wooten, Amber; Clough, Tim; Payne, Emily; Henderson, McCall; Tice, Neal

2016-01-01

The purpose of this study was to determine whether the application of near infrared (NIR) light could positively modulate symptoms associated with restless legs syndrome (RLS). Twenty-one subjects with RLS were treated with NIR three times weekly for four weeks. Baseline measures of: (1) international restless legs syndrome rating scale (IRLSRS) score; (2) Semmes Weinstein monofilament (SWM) test; (3) visual analog pain scale (VAS); (4) ankle-brachial index (ABI); and (5) sonographic imaging of the popliteal and posterior tibial arteries were compared to post-treatment values. NIR (850 nm) was delivered transcutaneously at 8 J/cm² to four locations on each leg and the plantar surface of each foot. A pre-test-post-test one group design was employed. Baseline and post-treatment measures were compared using either a dependent t-test when data were normal or the Wilcoxon signed rank test in the absence of normality. A significant improvement in IRLSRS scores was observed. Sensation improved from less than protective in 16.6% of sites tested at the baseline to 13.4% post-intervention. There was a significant improvement in ABI scores. VAS and sonographic imaging measures other than ABI remained unchanged. The use of NIR to modulate symptoms associated with RLS was supported by the data.
Short physical performance battery for middle-aged and older adult cardiovascular disease patients: implication for strength tests and lower extremity morphological evaluation.

PubMed

Yasuda, Tomohiro; Fukumura, Kazuya; Nakajima, Toshiaki

2017-04-01

[Purpose] To examine whether Short Physical Performance Battery (SPPB) scores are higher in healthy subjects than in outpatients, and higher in outpatients than in inpatients, and whether the SPPB can be a valid assessment tool alongside strength tests and lower extremity morphological evaluation in cardiovascular disease patients. [Subjects and Methods] Twenty-four middle-aged and older adults with cardiovascular disease were recruited from inpatient and outpatient facilities and assigned to separate experimental groups. Twelve age-matched healthy volunteers were assigned to a control group. The SPPB test was used to assess balance and functional mobility. The test outcomes were compared with level of care (inpatient vs. outpatient), physical characteristics, strength and lower extremity morphology. [Results] Total SPPB scores, strength tests (knee extensor muscle strength), and lower extremity morphological evaluation (muscle thickness of anterior and posterior mid-thigh and posterior lower-leg) were greater in the healthy subjects and outpatient groups compared with inpatients. To predict total Short Physical Performance Battery scores, the predicted knee extension and anterior mid-thigh muscle thickness were calculated. [Conclusion] The SPPB is an effective tool, as are the strength tests and lower extremity morphological evaluation, for middle-aged and older adult cardiovascular disease patients. Notably, high knee extensor muscle strength and quadriceps femoris muscle thickness are positively associated with high SPPB scores.
Deaf Genetic Testing and Psychological Well-Being in Deaf Adults

PubMed Central

Palmer, Christina G.S.; Boudreault, Patrick; Baldwin, Erin E.; Fox, Michelle; Deignan, Joshua L.; Kobayashi, Yoko; Sininger, Yvonne; Grody, Wayne; Sinsheimer, Janet S.

2013-01-01

Limited data suggest that enhanced self-knowledge from genetic information related to non-medical traits can have a positive impact on psychological well-being. Deaf individuals undertake genetic testing for deaf genes to increase self-knowledge. Because deafness is considered a non-medical trait by many individuals, we hypothesized that deaf individuals receiving a genetic explanation for why they are deaf will experience increased psychological well-being. We report results from a prospective, longitudinal study to determine the impact of genetic testing (GJB2, Cx26; GJB6, Cx30) on perceived personal control (PPC), anxiety, and depression in deaf adults (N=209) assessed following pre-test genetic counseling as well as 1-month and 6-months following test result disclosure. Participants were classified as Cx positive (n=82) or Cx negative/inconclusive (n=127). There was significant evidence for Cx group differences in PPC and anxiety over time (PPC: Cx group*time interaction p=0.0007; anxiety: Cx group*time interaction p=0.002), where PPC scores were significantly higher, and anxiety scores were significantly lower for the Cx positive group relative to the negative/inconclusive group following test result disclosure. Compared to pre-test, PPC scores increased at 1-month (p=0.07) and anxiety scores decreased at 6-months for the Cx positive group (p=0.03). In contrast, PPC scores decreased (p=0.009, p<0.0001) and anxiety scores increased (p=0.09, p=0.02) for the Cx negative/inconclusive group at 1- and 6-months post test result disclosure. Genetic testing for deaf genes affects the psychological well-being of deaf individuals. Increasing deaf adults’ access to genetic testing may potentially enhance self-knowledge and increase psychological well-being for those who receive a genetic explanation, which could offer downstream health benefits. PMID:23430402
Using Multiple Technologies to Teach Nursing Students about Adoption

ERIC Educational Resources Information Center

Harrison, Sharonlyn; Henneman, Kris; Herrera, Maida Y.; Hockman, Elaine; Brooks, Evelyn; Darland, Nancy; Kulik, Noel; Sandy-Hanson, Anika E.

2013-01-01

Technology is becoming increasingly more important in the enhancement of educating university students. Very little research has been done regarding how the combination of educational technologies affects test scores, compared to the use of one technology alone. This research article examines whether the post-scores of nursing students increased…
Further Support for Changing Multiple-Choice Answers.

ERIC Educational Resources Information Center

Fabrey, Lawrence J.; Case, Susan M.

1985-01-01

The effect on test scores of changing answers to multiple-choice questions was studied and compared to earlier research. The current setting was a nationally administered, in-training, specialty examination for medical residents in obstetrics and gynecology. Both low and high scorers improved their scores when they changed answers. (SW)
Comparative Predictive Validity of the New MCAT Using Different Admissions Criteria.

ERIC Educational Resources Information Center

Golmon, Melton E.; Berry, Charles A.

1981-01-01

New Medical College Admission Test (MCAT) scores and undergraduate academic achievement were examined for their validity in predicting the performance of two select student populations at Northwestern University Medical School. The data support the hypothesis that New MCAT scores possess substantial predictive validity. (Author/MLW)
Emotional Intelligence Abilities and Traits in Different Career Paths

ERIC Educational Resources Information Center

Kafetsios, Konstantinos; Maridaki-Kassotaki, Aikaterini; Zammuner, Vanda L.; Zampetakis, Leonidas A.; Vouzas, Fotios

2009-01-01

Two studies tested hypotheses about differences in emotional intelligence (EI) abilities and traits between followers of different career paths. Compared to their social science peers, science students had higher scores in adaptability and general mood traits measured with the Emotion Quotient Inventory, but lower scores in strategic EI abilities…

Against Conventional Wisdom: Factors Influencing Hispanic Students' Reading Achievement

ERIC Educational Resources Information Center

Percell, Jay C.; Kaufman, Kristina

2013-01-01

The researchers performed a variable analysis of the 2002 Educational Longitudinal Study data investigating factors that influence students' reading scores on standardized tests. Hispanic and non-Hispanic Scores were analyzed and controlling variables were compared to determine the effect of each on both populations. Certain variables commonly…
The Antinociceptive Effects of Hydroalcoholic Extract of Borago Officinalis Flower in Male Rats Using Formalin Test.

PubMed

Shahraki, Mohammad Reza; Ahmadimoghadm, Mahdieh; Shahraki, Ahmad Reza

2015-10-01

Borago officinalis flower (borage) is a known sedative in herbal medicine; the aim of the present study was to evaluate the antinociceptive effect of borage hydroalcoholic extract in male rats using the formalin test. Fifty-six adult male albino Wistar rats were randomly divided into seven groups: control groups A (intact), B (saline), and C (positive control) plus test groups D, E, F, and G (n=8). Groups D, E, and F received 6.25, 12.5, and 25 mg/kg of Borago officinalis flower hydroalcoholic extract before the test, respectively, whereas group G received 25 mg/kg borage extract and aspirin before the test. Biphasic pain was induced by injection of 1% formalin. The data were analyzed with SPSS software ver. 17 using the Kruskal-Wallis and Mann-Whitney tests. The results were expressed as mean±SD. Statistical differences were considered significant at P<0.05. The acute and chronic pain behavior scores in test groups D, E, F, and G decreased significantly compared with groups A and B but did not differ from group C. Moreover, the chronic pain behavior score in group G was significantly lower than in all other groups. The results indicated that Borago officinalis hydroalcoholic extract affects the acute and chronic pain behavior response in male rats in the formalin test.
Sex Differences in Vestibular/Ocular and Neurocognitive Outcomes After Sport-Related Concussion.

PubMed

Sufrinko, Alicia M; Mucha, Anne; Covassin, Tracey; Marchetti, Greg; Elbin, R J; Collins, Michael W; Kontos, Anthony P

2017-03-01

To examine sex differences in vestibular and oculomotor symptoms and impairment in athletes with sport-related concussion (SRC). The secondary purpose was to replicate previously reported sex differences in total concussion symptoms, and performance on neurocognitive and balance testing. Prospective cross-sectional study of consecutively enrolled clinic patients within 21 days of a SRC. Specialty Concussion Clinic. Included male (n = 36) and female (n = 28) athletes ages 9 to 18 years. Vestibular symptoms and impairment were measured with the Vestibular/Ocular Motor Screening (VOMS). Participants completed the Immediate Post-concussion Assessment and Cognitive Test (ImPACT), Post-concussion Symptom Scale (PCSS), and Balance Error Scoring System (BESS). Sex differences on clinical measures. Females had higher PCSS scores (P = 0.01) and greater VOMS vestibular ocular reflex (VOR) score (P = 0.01) compared with males. There were no sex differences on BESS or ImPACT. Total PCSS scores together with female sex accounted for 45% of the variance in VOR scores. Findings suggest higher VOR scores after SRC in female compared with male athletes. Findings did not extend to other components of the VOMS tool suggesting that sex differences may be specific to certain types of vestibular impairment after SRC. Additional research on the clinical significance of the current findings is needed.
Differences of wells scores accuracy, caprini scores and padua scores in deep vein thrombosis diagnosis

NASA Astrophysics Data System (ADS)

Gatot, D.; Mardia, A. I.

2018-03-01

Deep vein thrombosis (DVT) is a venous thrombus in the lower limbs. Diagnosis is by venography or compression ultrasound. However, these examinations are not yet available in some health facilities, so many scoring systems have been developed for the diagnosis of DVT. The scoring method is practical and safe to use, as well as efficacious and effective in terms of treatment and cost. The existing scoring systems are the Wells, Caprini and Padua scores. Many studies have compared the accuracy of these scores, but not in Medan. Therefore, we were interested in comparing the Wells, Caprini and Padua scores in Medan. An observational, analytical, case-control study was conducted to perform diagnostic tests on the Wells, Caprini and Padua scores to predict the risk of DVT. The study was conducted at H. Adam Malik Hospital in Medan. Of a total of 72 subjects, 39 (54.2%) were men, and the mean age was 53.14 years. The Wells, Caprini and Padua scores had sensitivities of 80.6%, 61.1% and 50%, respectively; specificities of 80.65, 66.7% and 75%, respectively; and accuracies of 87.5%, 64.3% and 65.7%, respectively. The Wells score had better sensitivity, specificity and accuracy than the Caprini and Padua scores in diagnosing DVT.
Revision anterior cruciate ligament reconstruction by double-bundle technique using multi-strand semitendinosus tendon.

PubMed

Muneta, Takeshi; Hara, Kenji; Ju, Young-Jin; Mochizuki, Tomoyuki; Morito, Toshiyuki; Yagishita, Kazuyoshi; Sekiya, Ichiro

2010-06-01

The purpose of the study was to compare the outcome of revision anterior cruciate ligament (ACL) reconstruction by the double-bundle (DB) technique using multi-strand semitendinosus tendon with that of primary reconstruction by use of the same technique. The study included 21 patients who underwent revision ACL reconstruction (mean follow-up, 40 months) with the semitendinosus tendon DB technique between 1995 and 2006 and 86 unilateral primary DB ACL reconstructions (mean follow-up, 33 months) between 2000 and 2004. The outcome of both groups was compared based on differences between operated and unoperated limbs and modified International Knee Documentation Committee grades. Both the overall and sports-related subjective scores were evaluated between the 2 groups. The KT measurements (MEDmetric, San Diego, CA) averaged 1.7 mm (SD, 1.8 mm) in the revision group and 1.5 mm (SD, 1.6 mm) in the primary group. There was no significant difference in KT measurements between the 2 groups. The Lachman test was negative in 83% of revision cases and 87% of primary cases; the anterior drawer test was negative in 83% and 91%, respectively, and the pivot-shift test was negative in 78% and 90%, respectively. There was a tendency toward a higher rate of positive pivot-shift tests in the revision group. The Lysholm score and subjective recovery score were significantly lower in the revision group. The semitendinosus tendon DB revision procedure provided range of motion and anterior stability comparable to those after primary DB surgery and a comparable return to athletic activities. However, the patients tended to have positive pivot-shift test results. The revision cases were also inferior in terms of the general evaluation of recovery of knee condition. The outcome scores were lower overall in the revision group. Level IV, therapeutic case series. Copyright (c) 2010 Arthroscopy Association of North America. Published by Elsevier Inc. All rights reserved.
Using a genetic/clinical risk score to stop smoking (GeTSS): randomised controlled trial.

PubMed

Nichols, John A A; Grob, Paul; Kite, Wendy; Williams, Peter; de Lusignan, Simon

2017-10-23

As genetic tests become cheaper, the possibility of their widespread availability must be considered. This study involves a risk score for lung cancer in smokers that is roughly 50% genetic (50% clinical criteria). The risk score has been shown to be effective as a smoking cessation motivator in hospital recruited subjects (not actively seeking cessation services). This was an RCT set in a United Kingdom National Health Service (NHS) smoking cessation clinic. Smokers were identified from medical records. Subjects that wanted to participate were randomised to a test group that was administered a gene-based risk test and given a lung cancer risk score, or a control group where no risk score was performed. Each group had 8 weeks of weekly smoking cessation sessions involving group therapy and advice on smoking cessation pharmacotherapy and follow-up at 6 months. The primary endpoint was smoking cessation at 6 months. Secondary outcomes included ranking of the risk score and other motivators. 67 subjects attended the smoking cessation clinic. The 6-month quit rates were 29.4% (10/34; 95% CI 14.1-44.7%) for the test group and 42.9% (12/28; 95% CI 24.6-61.2%) for the controls. The difference is not significant. However, the quit rate for test group subjects with a "very high" risk score was 89% (8/9; 95% CI 68.4-100%) which was significant when compared with the control group (p = 0.023) and test group subjects with moderate risk scores had a 9.5% quit rate (2/21; 95% CI 2.7-28.9%) which was significantly lower than for above moderate risk score 61.5% (8/13; 95% CI 35.5-82.3; p = 0.03). Only the sub-group with the highest risk score showed an increased quit rate. Controls and test group subjects with a moderate risk score were relatively unlikely to have achieved and maintained non-smoker status at 6 months. ClinicalTrials.gov ID NCT01176383 (date of registration: 3 August 2010).
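The confidence intervals quoted for the two main quit rates appear consistent with a normal-approximation (Wald) interval, although the abstract does not state which method the authors used. A minimal Python sketch under that assumption follows; the function name `wald_ci` is illustrative and not taken from the study.

```python
import math

def wald_ci(successes, n, z=1.96):
    """Normal-approximation (Wald) confidence interval for a proportion.

    z=1.96 corresponds to a two-sided 95% interval.
    """
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    # Clamp to [0, 1] because the Wald interval can overshoot for extreme p.
    return max(0.0, p - half_width), min(1.0, p + half_width)

# Quit counts reported in the abstract: 10/34 (test group), 12/28 (controls).
for label, quitters, total in [("test", 10, 34), ("control", 12, 28)]:
    low, high = wald_ci(quitters, total)
    print(f"{label}: {quitters / total:.1%} (95% CI {low:.1%} to {high:.1%})")
```

For small subgroups the Wald interval is known to behave poorly, so a Wilson or exact interval is often preferred there; which interval the study used for its subgroup results is not specified in the abstract.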
Comparison of enterovirus detection in cerebrospinal fluid with Bacterial Meningitis Score in children

PubMed Central

Pires, Frederico Ribeiro; Franco, Andréia Christine Bonotto Farias; Gilio, Alfredo Elias; Troster, Eduardo Juan

2017-01-01

ABSTRACT Objective To measure the role of enterovirus detection in cerebrospinal fluid compared with the Bacterial Meningitis Score in children with meningitis. Methods A retrospective cohort based on analysis of medical records of pediatric patients diagnosed as meningitis, seen at a private and tertiary hospital in São Paulo, Brazil, between 2011 and 2014. Excluded were patients with critical illness, purpura, ventricular shunt or recent neurosurgery, immunosuppression, concomitant bacterial infection requiring parenteral antibiotic therapy, and those who received antibiotics 72 hours before lumbar puncture. Results The study included 503 patients. Sixty-four patients were excluded and 94 were not submitted to all tests for analysis. Of the remaining 345 patients, 7 were in the Bacterial Meningitis Group and 338 in the Aseptic Meningitis Group. There was no statistical difference between the groups. In the Bacterial Meningitis Score analysis, of the 338 patients with possible aseptic meningitis (negative cultures), 121 of them had one or more points in the Bacterial Meningitis Score, with sensitivity of 100%, specificity of 64.2%, and negative predictive value of 100%. Of the 121 patients with positive Bacterial Meningitis Score, 71% (86 patients) had a positive enterovirus detection in cerebrospinal fluid. Conclusion Enterovirus detection in cerebrospinal fluid was effective to differentiate bacterial from viral meningitis. When the test was analyzed together with the Bacterial Meningitis Score, specificity was higher when compared to Bacterial Meningitis Score alone. PMID:28767914
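The sensitivity, specificity, and negative predictive value reported for the Bacterial Meningitis Score can be reproduced from a 2×2 table. The sketch below is illustrative Python; the cell counts are inferred from the abstract (7 bacterial cases, all assumed score-positive given the 100% sensitivity; 338 aseptic cases, 121 of them score-positive) rather than taken from the paper's tables.

```python
def diagnostic_metrics(tp, fn, fp, tn):
    """Return sensitivity, specificity, and negative predictive value."""
    sensitivity = tp / (tp + fn)  # diseased cases flagged by the score
    specificity = tn / (tn + fp)  # non-diseased cases cleared by the score
    npv = tn / (tn + fn)          # score-negative cases that are truly non-diseased
    return sensitivity, specificity, npv

# Inferred counts (assumption): 7 bacterial, 338 aseptic, 121 aseptic with score >= 1.
tp, fn = 7, 0
fp = 121
tn = 338 - fp

sens, spec, npv = diagnostic_metrics(tp, fn, fp, tn)
print(f"sensitivity={sens:.1%} specificity={spec:.1%} NPV={npv:.1%}")
```

With only 7 bacterial cases, the 100% sensitivity estimate carries wide uncertainty, which is worth keeping in mind when reading the result.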
Association Between Prenatal Valproate Exposure and Performance on Standardized Language and Mathematics Tests in School-aged Children.

PubMed

Elkjær, Lars Skou; Bech, Bodil Hammer; Sun, Yuelian; Laursen, Thomas Munk; Christensen, Jakob

2018-02-19

Valproate sodium is used for the treatment of epilepsy and other neuropsychiatric disorders in women of childbearing potential. However, there are concerns about impaired cognitive development in children who have been exposed to valproate during pregnancy. To estimate the association between long-term school performance and prenatal exposure to valproate and a number of other antiepileptic drugs (AEDs). In a prospective, population-based cohort study conducted from August 1, 2015, to May 31, 2017, data used in the study were provided by Statistics Denmark on April 15, 2016. All children born alive in Denmark between 1997 and 2006 (n = 656 496) were identified. From this cohort, children who did not participate in the national tests, with presumed coding errors in gestational age and children missing information on their mother's educational level or household income were excluded (n = 177 469) leaving 479 027 children for the analyses. Children were identified and linked across national registers that had information on exposure, covariates, and outcome. The primary outcome was performance in national tests, an academic test taken by students in Danish primary and lower secondary state schools. We assessed performance in Danish and mathematics at different grades among valproate-exposed children and compared their performance with that of unexposed children and children exposed to another AED (lamotrigine). Test scores were standardized to z scores and adjusted for risk factors. Difference in standardized z scores in Danish and mathematics tests among valproate-exposed children compared with unexposed and lamotrigine-exposed children. Of the 656 496 children identified, 479 027 children who participated in the national tests were evaluated, including children exposed to the following AEDs in monotherapy: valproate, 253; phenobarbital, 86; oxcarbazepine, 236; lamotrigine, 396; clonazepam, 188; and carbamazepine, 294. 
The mean (SD) age of the 244 095 children completing the sixth-grade Danish test was 12.9 (0.39) years; 122 774 (50.3%; 95% CI, 50.1% to 50.5%) were boys and 121 321 (49.7%; 95% CI, 49.5% to 49.9%) were girls. Valproate-exposed children scored worse on the sixth-grade Danish tests (adjusted difference, -0.27 SD; 95% CI, -0.42 to -0.12) and sixth-grade mathematics tests (adjusted difference, -0.33 SD; 95% CI, -0.47 to -0.19) compared with unexposed children and children exposed to lamotrigine (adjusted difference, -0.33 SD; 95% CI, -0.60 to -0.06). Also, children exposed to clonazepam scored worse in the sixth-grade Danish tests (adjusted difference, -0.07 SD; 95% CI, -0.12 to -0.02). Carbamazepine, lamotrigine, phenobarbital, and oxcarbazepine were not linked to poor school performance compared with unexposed children. Maternal use of valproate was associated with a significant decrease in school performance in offspring compared with children unexposed to AEDs and children exposed to lamotrigine. Findings of this study further caution against the use of valproate among women of childbearing potential.
Imperfect practice makes perfect: error management training improves transfer of learning.

PubMed

Dyre, Liv; Tabor, Ann; Ringsted, Charlotte; Tolsgaard, Martin G

2017-02-01

Traditionally, trainees are instructed to practise with as few errors as possible during simulation-based training. However, transfer of learning may improve if trainees are encouraged to commit errors. The aim of this study was to assess the effects of error management instructions compared with error avoidance instructions during simulation-based ultrasound training. Medical students (n = 60) with no prior ultrasound experience were randomised to error management training (EMT) (n = 32) or error avoidance training (EAT) (n = 28). The EMT group was instructed to deliberately make errors during training. The EAT group was instructed to follow the simulator instructions and to commit as few errors as possible. Training consisted of 3 hours of simulation-based ultrasound training focusing on fetal weight estimation. Simulation-based tests were administered before and after training. Transfer tests were performed on real patients 7-10 days after the completion of training. Primary outcomes were transfer test performance scores and diagnostic accuracy. Secondary outcomes included performance scores and diagnostic accuracy during the simulation-based pre- and post-tests. A total of 56 participants completed the study. On the transfer test, EMT group participants attained higher performance scores (mean score: 67.7%, 95% confidence interval [CI]: 62.4-72.9%) than EAT group members (mean score: 51.7%, 95% CI: 45.8-57.6%) (p < 0.001; Cohen's d = 1.1, 95% CI: 0.5-1.7). There was a moderate improvement in diagnostic accuracy in the EMT group compared with the EAT group (16.7%, 95% CI: 10.2-23.3% weight deviation versus 26.6%, 95% CI: 16.5-36.7% weight deviation [p = 0.082; Cohen's d = 0.46, 95% CI: -0.06 to 1.0]). No significant interaction effects between group and performance improvements between the pre- and post-tests were found in either performance scores (p = 0.25) or diagnostic accuracy (p = 0.09). 
The provision of error management instructions during simulation-based training improves the transfer of learning to the clinical setting compared with error avoidance instructions. Rather than teaching to avoid errors, the use of errors for learning should be explored further in medical education theory and practice. © 2016 John Wiley & Sons Ltd and The Association for the Study of Medical Education.
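Cohen's d, reported above for the transfer test, expresses the difference between two group means in units of their pooled standard deviation. The abstract does not give the group standard deviations, so the sketch below uses made-up scores purely for illustration; `cohens_d` is a hypothetical helper name, and the pooled-SD form shown is one common definition rather than necessarily the one the authors applied.

```python
import math
import statistics

def cohens_d(group_a, group_b):
    """Cohen's d using the pooled sample standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    var_a = statistics.variance(group_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(
        ((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2)
    )
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd

# Hypothetical performance scores (percent), not data from the study.
emt_scores = [72, 65, 70, 61, 68]
eat_scores = [55, 49, 58, 50, 47]
print(round(cohens_d(emt_scores, eat_scores), 2))
```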
Evaluation of the theory of mind in autism spectrum disorders with the Strange Stories test.

PubMed

Velloso, Renata de Lima; Duarte, Cintia Perez; Schwartzman, José Salomão

2013-11-01

To evaluate the theory of mind in autism spectrum disorders (ASD) and control individuals by applying the Strange Stories test that was translated and adapted to the Portuguese language. Twenty-eight children with ASD and 56 controls who were all male and aged between 6 and 12 years participated in the study. There were significant differences between the median scores of the groups for each of the 12 stories of the test and for the sum total of all the median scores. The median scores for all stories were significantly greater in the control group than those in the experimental group (children with ASD). In addition, the protocol had excellent internal consistency. The theory of mind skills assessed with the Strange Stories test indicated alterations in children with ASD compared with children in the control group.
Differences in Neuropsychological Functioning Between Homicidal and Nonviolent Schizophrenia Samples.

PubMed

Stratton, John; Cobia, Derin J; Reilly, James; Brook, Michael; Hanlon, Robert E

2018-02-07

Few studies have compared performance on neurocognitive measures between violent and nonviolent schizophrenia samples. A better understanding of neurocognitive dysfunction in violent individuals with schizophrenia could increase the efficacy of violence reduction strategies and aid in risk assessment and adjudication processes. This study aimed to compare neuropsychological performance between 25 homicide offenders with schizophrenia and 25 nonviolent schizophrenia controls. The groups were matched for age, race, sex, and handedness. Independent t-tests and Mann-Whitney U-tests were used to compare the schizophrenia groups' performance on measures of cognition, including composite scores assessing domain level functioning and individual neuropsychological tests. Results indicated the violent schizophrenia group performed worse on measures of memory and executive functioning, and the Intellectual Functioning composite score, when compared to the nonviolent schizophrenia sample. These findings replicate previous research documenting neuropsychological deficits specific to violent individuals with schizophrenia and support research implicating fronto-limbic dysfunction among violent offenders with schizophrenia. © 2018 American Academy of Forensic Sciences.
Clinical competency evaluation of Brazilian chiropractic interns

PubMed Central

Facchinato, Ana Paula A.; Benedicto, Camila C.; Mora, Aline G.; Cabral, Dayane M.C.; Fagundes, Djalma J.

2015-01-01

Objective This study compares the results of an objective structured clinical examination (OSCE) between 2 groups of students before an internship and after 6 months of clinical practice in an internship. Methods Seventy-two students participated, with 36 students in each cohort. The OSCEs were performed in the simulation laboratory before the participants' clinical practice internship and after 6 months of the internship. Students were tested in 9 stations for clinical skills and knowledge. The same procedures were repeated for both cohorts. The t test was used for unpaired parametric samples and Fisher's exact test was used for comparison of proportions. Results There was no difference in the mean final score between the 2 groups (p = .34 for test 1; p = .08 for test 2). The performance of the students in group 1 was not significantly different when performed before and after 6 months of clinical practice, but in group 2 there was a significant decrease in the average score after 6 months of clinical practice. Conclusions There was no difference in the cumulative average score for the 2 groups before and after 6 months of clinical practice in the internship. There were differences within the cohorts, however, with a significant decrease in the average score in group 2. Issues pertaining to test standardization and student motivation for test 2 may have influenced the scores. PMID:25588200
Assessing working memory in children with ADHD: Minor administration and scoring changes may improve digit span backward's construct validity.

PubMed

Wells, Erica L; Kofler, Michael J; Soto, Elia F; Schaefer, Hillary S; Sarver, Dustin E

2018-01-01

Pediatric ADHD is associated with impairments in working memory, but these deficits often go undetected when using clinic-based tests such as digit span backward. The current study pilot-tested minor administration/scoring modifications to improve digit span backward's construct and predictive validities in a well-characterized sample of children with ADHD. WISC-IV digit span was modified to administer all trials (i.e., ignore discontinue rule) and count digits rather than trials correct. Traditional and modified scores were compared to a battery of criterion working memory (construct validity) and academic achievement tests (predictive validity) for 34 children with ADHD ages 8-13 (M=10.41; 11 girls). Traditional digit span backward scores failed to predict working memory or KTEA-2 achievement (all ns). Alternate administration/scoring of digit span backward significantly improved its associations with working memory reordering (r=.58), working memory dual-processing (r=.53), working memory updating (r=.28), and KTEA-2 achievement (r=.49). Consistent with prior work, these findings urge caution when interpreting digit span performance. Minor test modifications may address test validity concerns, and should be considered in future test revisions. Digit span backward becomes a valid measure of working memory at exactly the point that testing is traditionally discontinued. Copyright © 2017 Elsevier Ltd. All rights reserved.
Can the calf-raise senior test predict functional fitness in elderly people? A validation study using electromyography, kinematics and strength tests.

PubMed

André, Helô-Isa; Carnide, Filomena; Moço, Andreia; Valamatos, Maria-João; Ramalho, Fátima; Santos-Rocha, Rita; Veloso, António

2018-06-05

The assessment of the plantar-flexors muscle strength in older adults (OA) is of the utmost importance since they are strongly associated with the performance of fundamental tasks of daily life. The objective was to strengthen the validity of the Calf-Raise-Senior (CRS) test by assessing the biomechanical movement pattern of calf muscles in OA with different levels of functional fitness (FF) and physical activity (PA). Twenty-six OA were assessed with CRS, a FF battery, accelerometry, strength tests, kinematics and electromyography (EMG). OA with the best and worst CRS scores were compared. The association between the scores and EMG pattern of ankle muscles was determined. OA with the best CRS scores presented higher levels of FF, PA, strength, power, speed and range of movement, and a more efficient movement pattern during the test. Subjects who scored more at the CRS test demonstrated the possibility to use a stretch-shortening cycle type of action in the PF muscles to increase power during the movements. OA with different levels of FF can be stratified by the muscular activation pattern of the calf muscles and the scores in CRS test. This study reinforced the validity of CRS for evaluating ankle strength and power in OA. Copyright © 2018 Elsevier Ltd. All rights reserved.
The effect of peer-group size on the delivery of feedback in basic life support refresher training: a cluster randomized controlled trial.

PubMed

Cho, Youngsuk; Je, Sangmo; Yoon, Yoo Sang; Roh, Hye Rin; Chang, Chulho; Kang, Hyunggoo; Lim, Taeho

2016-07-04

Students largely provide feedback to one another when the instructor facilitates peer feedback rather than teaching in group training. The number of students in a group affects the learning of students in group training. We aimed to investigate whether a larger group size increases students' test scores on a post-training test with peer feedback facilitated by an instructor after video-guided basic life support (BLS) refresher training. Students' one-rescuer adult BLS skills were assessed by a 2-min checklist-based test 1 year after the initial training. A cluster randomized controlled trial was conducted to evaluate the effect of student number in a group on BLS refresher training. Participants included 115 final-year medical students undergoing their emergency medicine clerkship. The median number of students was 8 in the large groups and 4 in the standard groups. The primary outcome was the group difference in post-training test scores after video-guided BLS training. Secondary outcomes included the feedback time, number of feedback topics, and results of end-of-training evaluation questionnaires. Scores on the post-training test increased over three consecutive tests with instructor-led peer feedback but did not differ between large and standard groups. The feedback time was longer and the number of feedback topics generated by students was higher in standard groups than in large groups on the first and second tests. The end-of-training questionnaire revealed that the students in large groups preferred a smaller group size to their actual group size. In this BLS refresher training, instructor-led group feedback increased the test score after tutorial video-guided BLS learning, irrespective of the group size. A smaller group size allowed more participation in peer feedback.
Electronystagmography outcome and neuropsychological findings in tinnitus patients.

PubMed

Jozefowicz-Korczynska, Magdalena; Ciechomska, Elzbieta Agata; Pajor, Anna Maria

2005-01-01

Because psychological aspects often are underscored in the generation of tinnitus, we assessed the neuropsychological status in our group of patients. We found an increased number of abnormal electronystagmography (ENG) recordings in tinnitus patients. The aim of this study was to compare the ENG outcome with the patients' neuropsychological status. We carried out the study on 69 subjects complaining of tinnitus and on 43 healthy persons. We performed clinical neurootological examinations and ENG tests on all patients. Neuropsychological evaluation was conducted by means of the Beck Depression Inventory (BDI), the Hospital Anxiety and Depression (HAD) test, the Mini Mental Status (MMS) test, and the Trail-Making Test (TMT). In 46 patients (66.6%), we found abnormal ENG outcomes (central, 42%; peripheral, 13.0%; mixed, 11.6%). Neuropsychological tests revealed abnormal scores: for the BDI, 43.5% of patients; for the HAD-A, 72.5%; for the HAD-D, 47.8%; for the MMS, 27.5%; and for the TMT, 55.1%. We did not find a correlation between the overall ENG outcomes and neuropsychological test scores, with one exception: a correlation between the occurrence of abnormal neuropsychological test scores and the ENG outcome indicating central vestibular dysfunction. Our study showed that despite a high frequency of vestibular system dysfunction signs and a high incidence of abnormal neuropsychological test scores in tinnitus patients, only one correlation existed between these two results.
Interpretation of ambiguities by schoolchildren with low birth weight from Embu das Artes, São Paulo state, Brazil.

PubMed

Pessoa, Rebeca Rodrigues; Araújo, Sarah Cueva Cândido Soares de; Isotani, Selma Mie; Puccini, Rosana Fiorini; Perissinoto, Jacy

To assess the development of language regarding the ability to recognize and interpret lexical ambiguity in low-birth-weight schoolchildren enrolled at the school system in the municipality of Embu das Artes, Sao Paulo state, compared with that of schoolchildren with normal birth weight. A case-control, retrospective, cross-sectional study conducted with 378 schoolchildren, both genders, aged 5 to 9.9 years, from the municipal schools of Embu das Artes. Study Group (SG) comprising 210 schoolchildren with birth weight < 2500 g. Control Group (CG) composed of 168 school children with birth weight ≥ 2500 g. Participants of both groups were compared with respect to the skills of recognition and verbal interpretation of sentences containing lexical ambiguity using the Test of Language Competence. Variables of interest: Age and gender of children; age and schooling of mothers. Statistical analysis: Descriptive analysis to characterize the sample and score per group; Student's t test for comparison between the total scores of each skill/subtest; Chi-square test to compare items within each subtest; multiple regression analysis for the intervening variables. Participants of the SG presented lower scores for ambiguous sentences compared with those of participants of the CG. Multiple regression analysis showed that child's current age was a predictor for all metalinguistic skills regarding interpretation of ambiguities in both groups. Participants of the SG presented lower specific and total scores than those of participants of the CG for ambiguity skills. The child's current age factor positively influenced the ambiguity skills in both groups.
Train the trainer? A randomized controlled trial of a multi-tiered oral health education programme in community-based residential services for adults with intellectual disability.

PubMed

Mac Giolla Phadraig, Caoimhin; Guerin, Suzanne; Nunn, June

2013-04-01

To assess the impact of a multi-tiered oral health education programme on care staff caring for people with intellectual disability (ID). Postal questionnaires were sent to all care staff of a community-based residential care service for adults, randomly assigned to control and intervention groups. A specifically developed training programme was delivered to residential staff nominees, who then trained all staff within the intervention group. The control group received no training. Post-test questionnaires were sent to both groups. Paired-samples t-test was used to compare oral health-related knowledge (K) and behaviour, attitude and self-efficacy (BAS) scores. Of the initial 219 respondents, 154 (response rate between 40% and 35.8%, with attrition rate of 29.7% from baseline to repeat) returned completed questionnaires at post-test (M=8.5 months, range=6.5-11 months). Control and intervention groups were comparable for general training, employment and demographic variables. In the intervention group, mean Knowledge Index score rose from K=7.2 to K=7.9 (P<0.001) and mean BAS scale score rose from BAS=4.7 to BAS=5.4 (P<0.001). There was no statistically significant increase in mean scores from test (K=7.0, BAS=4.7) to post-test (K=7.2, BAS=4.9) for the control group. Mean scores regarding knowledge, attitude, self-efficacy and reported behaviour increased significantly at 8.5 months in staff where training was provided. The results indicate that a multi-tiered training programme improved knowledge, attitude, self-efficacy and reported behaviour amongst staff caring for people with ID. © 2012 John Wiley & Sons A/S.
Focused and Corrective Feedback Versus Structured and Supported Debriefing in a Simulation-Based Cardiac Arrest Team Training: A Pilot Randomized Controlled Study.

PubMed

Kim, Ji-Hoon; Kim, Young-Min; Park, Seong Heui; Ju, Eun A; Choi, Se Min; Hong, Tai Yong

2017-06-01

The aim of the study was to compare the educational impact of two postsimulation debriefing methods-focused and corrective feedback (FCF) versus Structured and Supported Debriefing (SSD)-on team dynamics in simulation-based cardiac arrest team training. This was a pilot randomized controlled study conducted at a simulation center. Fourth-year medical students were randomly assigned to the FCF or SSD group, with each team composed of six students and a confederate. Each team participated in two simulations and the assigned debriefing (FCF or SSD) sessions and then underwent a test simulation. Two trained raters blindly assessed all of the recorded simulations using checklists. The primary outcome was the improvement in team dynamics scores between baseline and test simulation. The secondary outcomes were improvements before and after training in team clinical performance scores, self-assessed comprehension of and confidence in cardiac arrest management and team dynamics, as well as evaluations of the postsimulation debriefing intervention. In total, 95 students participated [FCF (8 teams, n = 47) and SSD (8 teams, n = 48)]. The SSD team dynamics score during the test simulation was higher than at baseline [baseline: 74.5 (65.9-80.9), test: 85.0 (71.9-87.6), P = 0.035]. However, there were no differences in the improvement in the team dynamics or team clinical performance scores between the two groups (P = 0.328, respectively). There was no significant difference in improvement in team dynamics scores during the test simulation compared with baseline between the SSD and FCF groups in a simulation-based cardiac arrest team training in fourth-year Korean medical students.
Resident training for eclampsia and magnesium toxicity management: simulation or traditional lecture?

PubMed

Fisher, Nelli; Bernstein, Peter S; Satin, Andrew; Pardanani, Setul; Heo, Hye; Merkatz, Irwin R; Goffman, Dena

2010-10-01

To compare eclampsia and magnesium toxicity management among residents randomly assigned to lecture or simulation-based education. Stratified by year, residents (n = 38) were randomly assigned to 3 educational intervention groups: Simulation→Lecture, Simulation, and Lecture. Postintervention simulations were performed for all and scored using standardized lists. Maternal, fetal, eclampsia management, and magnesium toxicity scores were assigned. Mann-Whitney U, Wilcoxon rank sum and χ² tests were used for analysis. Postintervention maternal (16 and 15 vs 12; P < .05) and eclampsia (19 vs 16; P < .05) scores were significantly better in simulation-based compared with lecture groups. Postintervention magnesium toxicity and fetal scores were not different among groups. Lecture added to simulation did not lead to incremental benefit when eclampsia scores were compared between Simulation→Lecture and Simulation (19 vs 19; P = nonsignificant). Simulation training is superior to traditional lecture alone for teaching crucial skills for the optimal management of both eclampsia and magnesium toxicity, 2 life-threatening obstetric emergencies. Published by Mosby, Inc.

Equating in Small-Scale Language Testing Programs

ERIC Educational Resources Information Center

LaFlair, Geoffrey T.; Isbell, Daniel; May, L. D. Nicolas; Gutierrez Arvizu, Maria Nelly; Jamieson, Joan

2017-01-01

Language programs need multiple test forms for secure administrations and effective placement decisions, but can they have confidence that scores on alternate test forms have the same meaning? In large-scale testing programs, various equating methods are available to ensure the comparability of forms. The choice of equating method is informed by…
Treatment of chemical warfare agent casualties: retention of knowledge and self-perceived competency among military physicians and paramedics.

PubMed

Shiyovich, Arthur; Statlender, Liran; Abu-Tailakh, Muhammad; Plakht, Ygal; Shrot, Shai; Kassirer, Michael

2015-06-01

Specialized training of medical teams for chemical warfare agent (CWA) events is important to save lives. We aimed to evaluate the retention of knowledge (ROK) and self-perceived competency (SPC) of military medical personnel in delivering treatment during CWA events. A questionnaire and a multiple-choice examination were sent to military physicians and paramedics, evaluating their CWA ROK and SPC (study group [SG]). Their assessment was compared to medical personnel immediately post training (reference group [RG]). SG was subdivided into two groups: G1 (≤ 1 year past training) and G2 (> 1 year past training). Overall, 135 participants responded (35-RG, 65% physicians). Self-reported ROK and SPC were significantly higher in RG compared to SG and in G1 compared to G2. Test scores were higher in RG compared to SG, but similar in G1 and G2 groups. SPC was lower compared to ROK in the entire cohort and subgroups. A moderate correlation was found between the self- and test-assessed scores (Pearson correlation coefficient 0.45, p < 0.001). Physicians received significantly (p = 0.01) higher test scores in RG compared with paramedics. ROK and SPC among military medical personnel for treatment of CWA casualties deteriorate significantly as early as 1 year post training, SPC > ROK. Thus, we recommend CWA refresher training at least every year. Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
Relationship Between Cognitive Perceptual Abilities and Accident and Penalty Histories Among Elderly Korean Drivers

PubMed Central

2016-01-01

Objective: To investigate the relationship between cognitive perceptual abilities of elderly drivers based on the Cognitive Perceptual Assessment for Driving (CPAD) test and their accident and penalty histories. Methods: A total of 168 elderly drivers (aged ≥65 years) participated in the study. Participant data included CPAD scores and incidents of traffic accidents and penalties, obtained from the Korea Road Traffic Authority and Korea National Police Agency, respectively. Results: Drivers' mean age was 70.25±4.1 years and the mean CPAD score was 52.75±4.72. Elderly drivers' age was negatively related to the CPAD score (p<0.001). The accident history group had marginally lower CPAD scores, as compared to the non-accident group (p=0.051). However, incidence rates for traffic fines did not differ significantly between the two groups. Additionally, the group that passed the CPAD test had experienced fewer traffic accidents (3.6%), as compared to the group that failed (10.6%). The older age group (12.0%) had also experienced more traffic accidents, as compared to the younger group (2.4%). Conclusion: Overall, elderly drivers who experienced driving accidents had lower CPAD scores than those who did not, without statistical significance. Thus, driving-related cognitive abilities of elderly drivers with insufficient cognitive ability need to be further evaluated to prevent traffic accidents. PMID:28119840
The Role of Item Feedback in Self-Adapted Testing.

ERIC Educational Resources Information Center

Roos, Linda L.; And Others

1997-01-01

The importance of item feedback in self-adapted testing was studied by comparing feedback and no feedback conditions for computerized adaptive tests and self-adapted tests taken by 363 college students. Results indicate that item feedback is not necessary to realize score differences between self-adapted and computerized adaptive testing. (SLD)
Relationships between Visual and Auditory Perceptual Skills and Comprehension in Students with Learning Disabilities.

ERIC Educational Resources Information Center

Weaver, Phyllis A.; Rosner, Jerome

1979-01-01

Scores of 25 learning disabled students (aged 9 to 13) were compared on five tests: a visual-perceptual test (Coloured Progressive Matrices); an auditory-perceptual test (Auditory Motor Placement); a listening and reading comprehension test (Durrell Listening-Reading Series); and a word recognition test (Word Recognition subtest, Diagnostic…
The influence of four different anticoagulants on dynamic light scattering of platelets.

PubMed

Raczat, T; Kraemer, L; Gall, C; Weiss, D R; Eckstein, R; Ringwald, J

2014-08-01

For testing of dynamic light scattering of platelets with ThromboLUX (TLX) in platelet-rich plasma (PRP) derived from venous whole blood (vWB), anticoagulation is needed. We compared TLX score in PRPs containing citrate, ethylene-diamine-tetraacetic-acid (EDTA), citrate-phosphate-dextrose-adenine (CPDA) or citrate-theophylline-adenosine-dipyridamole. Initial and late TLX scores were measured after 30-120 min or four to six hours, respectively. Compared with citrate, mean differences in initial TLX score were only significant for CPDA. Also, mean differences between initial and late TLX scores were only significant for CPDA. TLX failed to detect EDTA-induced platelet alterations. The clinical relevance of TLX needs further studies. © 2014 International Society of Blood Transfusion.
Factors Affecting the Baseline and Post-Treatment Scores on the Hopkins Verbal Learning Test-Revised Japanese Version before and after Whole-Brain Radiation Therapy

PubMed Central

Saito, Hirotake; Tanaka, Kensuke; Kanemoto, Ayae; Nakano, Toshimichi; Abe, Eisuke; Aoyama, Hidefumi

2016-01-01

Our objectives were to (1) investigate the feasibility of the use of the Japanese version of the Hopkins Verbal Learning Test-Revised (HVLT-R); (2) identify the clinical factors influencing the HVLT-R scores of patients undergoing whole-brain radiation therapy (WBRT); and (3) compare the neurocognitive function (NCF) after WBRT in different dose fractionation schedules. We administered the HVLT-R (Japanese version) before (baseline) and at four and eight months after WBRT in 45 patients who received either therapeutic (35Gy-in-14, n = 16; 30Gy-in-10, n = 18) or prophylactic (25Gy-in-10, n = 11) WBRT. Sixteen patients dropped out before the eight-month examination, due mostly to death from cancer. The Karnofsky Performance Status (KPS) 80–100 group had significantly higher baseline total recall (TR) scores (p = 0.0053), delayed recall (DR) scores (p = 0.012), and delayed recognition (DRecog) scores (p = 0.0078). The patients aged ≤65 years also had significantly higher TR scores (p = 0.030) and DRecog scores (p = 0.031). The patients who underwent two examinations (worse-prognosis group) had significantly decreased DR scores four months after WBRT compared to the baseline (p = 0.0073), and they were significantly more likely to have declined individual TR scores (p = 0.0017) and DR scores (p = 0.035) at four months. The eight-month HVLT-R scores did not significantly decline regardless of the WBRT dose fractionation. The baseline NCF was determined by age and KPS, and the early decline in NCF is characteristic of the worse-prognosis group. PMID:27827891
The S.A.C.S. (Satisfaction-Anatomy-Continence-Safety) score for evaluating pelvic organ prolapse surgery: a proposal for an outcome-based scoring system.

PubMed

Mearini, Luigi; Zucchi, Alessandro; Nunzi, Elisabetta; Di Biase, Manuel; Bini, Vittorio; Costantini, Elisabetta

2015-07-01

To date, there is no overall consensus on the definition of cure after surgery for pelvic organ prolapse (POP). The aim of the study was to design and test the scoring system S.A.C.S. (Satisfaction-Anatomy-Continence-Safety) to assess and compare the outcomes of POP repair. A total of 233 women underwent open sacrocolpopexy. The S.A.C.S. outcome scoring system was scheduled at 24 months of follow-up, and each component was assessed as follows: Satisfaction by means of the Patient Global Improvement Inventory scale, Anatomy by means of the POP Quantification system and bulge symptom, Continence by means of pad use, and Safety by means of the Clavien-Dindo classification of surgical complications. Each component produced a binary nominal categorical variable (1 or 0), with a total score of 4 representing cure. As a comparative tool, patients answered a simple yes/no question: "If you had to undergo surgery all over again, would you still do it?". The degree of concordance was estimated using Cohen's Kappa test. According to the S.A.C.S. scoring system, only 160 patients (68.6%) reached the maximum score of cure. Sensitivity of the S.A.C.S. score was 74.1%, specificity was 90%, total diagnostic capacity was 75.5%. The S.A.C.S. score internal consistency was good; the k-coefficient was higher for the satisfaction component of the score (k = 0.560). This study proposes an original, simple post-operative scoring system integrating satisfaction, anatomy, continence, and safety reports for patients undergoing surgery for POP, providing a complete, although perfectible, method to accurately report outcomes in all clinical scenarios.
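The composite rule described in this record (four binary components, with a total of 4 counting as cure) can be sketched as follows. This is an illustrative Python example only: the class and function names are hypothetical, and the abstract does not give the clinical thresholds that turn each instrument into a 0 or 1, so they are not modelled here.

```python
from dataclasses import dataclass


@dataclass
class SacsOutcome:
    """One patient's S.A.C.S. components, each already coded as 0 or 1."""
    satisfaction: int
    anatomy: int
    continence: int
    safety: int

    def total(self) -> int:
        parts = (self.satisfaction, self.anatomy, self.continence, self.safety)
        if any(p not in (0, 1) for p in parts):
            raise ValueError("each component must be 0 or 1")
        return sum(parts)

    def cured(self) -> bool:
        # Per the abstract, only the maximum total of 4 represents cure.
        return self.total() == 4


def cure_rate(n_cured: int, n_total: int) -> float:
    """Percentage of patients reaching the maximum score."""
    return n_cured / n_total * 100


# Hypothetical patient failing the continence component only.
print(SacsOutcome(1, 1, 0, 1).cured())
# Counts from the abstract: 160 of 233 patients reached a score of 4.
print(f"{cure_rate(160, 233):.2f}")
```

With the reported counts the rate comes out at roughly 68.7%, in line with the 68.6% given in the abstract (the small gap is presumably a matter of rounding versus truncation).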
Differences in Empathy Levels of Medical Students Based on Gender, Year of Medical School and Career Choice.

PubMed

Tariq, Nabia; Tayyab, Ali; Jaffery, Tara

2018-04-01

To measure mean empathy scores of Pakistani medical students and to explore any association of empathy scores with gender, medical school year and future career choice. Cross-sectional survey. Shifa College of Medicine, Shifa Tameer-e-Millat University, during the academic year 2015-2016. The student version of the Jefferson Scale of Physician Empathy (JSPE) was distributed to the students electronically via the student portal. Responses that were completed in full were included in the study. Descriptive statistics were used to analyse student demographic data. The student score on the JSPE was reported as the mean (out of 7) of each item. An independent samples t-test was employed to check for significant differences between genders. Empathy score with advancing year of study was investigated using ANOVA. ANOVA with post-hoc Tukey's test was used to study the relationship between career choice and empathy score. The response rate was 70.94%. The mean score was 4.51 ± 0.69. Female students obtained a higher, though not statistically significant (p=0.08), empathy score (4.58) compared with male students (4.45). No statistically significant difference was seen between scores on the survey across the five academic years (F=0.88, p=0.47). Students who selected medicine and allied as a career choice showed a significantly higher empathy score than those who opted for surgery. The internal consistency reliability (Cronbach's alpha) was 0.78. There were low levels of empathy in Pakistani medical students. Students with an interest in medicine and allied showed higher empathy scores compared to surgical or technical specialties. No association of empathy scores with gender and medical school year was observed.
Non-traditional vs. Traditional Academic Delivery Systems: Comparing ETS Scores for Undergraduate Students in Business Programs, 1996-1999. AIR 1999 Annual Forum Paper.

ERIC Educational Resources Information Center

Jonas, Peter M.; Weimer, Don

This two-year study involving five colleges and universities compared the academic achievement, as measured by the Educational Testing Service (ETS) Major Field Achievement Test (MFAT) in Business of students in traditional undergraduate programs and those in non-traditional accelerated adult degree programs. The study also compared the subjects'…
[Autonomy accreditation of private Chilean universities (1994-1998)].

PubMed

Cruz-Coke, R

1998-11-01

In 1995, a score to measure the quality of private universities in Chile, using excellency indicators as predictors of autonomy certification, was devised by the author. To compare this score with autonomy certification results of ensuing years, to assess the usefulness of excellency indicators. During 1995, the records of 21 private universities in Santiago were studied. These universities were qualified using eight indicators of academic excellency. These results were compared with the Superior Education Council qualification results, obtained between 1996 and 1998. The scores obtained by universities ranged from 19 to 137 points. Universities with the better scores obtained autonomy and those with the worst scores were eliminated. There was a good concordance between the score obtained in 1995 and the fate of autonomy certification. The best predictors and indicators of academic excellency for certifying the autonomy of private universities were the magnitude of indirect budget contributed by the state, the size of the academic staff list and the percentage of admitted students with scores over 573 in the national academic aptitude tests.
The effect of teaching method on long-term knowledge retention.

PubMed

Beers, Geri W; Bowden, Susan

2005-11-01

Choosing a teaching strategy that results in knowledge retention on the part of learners can be challenging for educators. Studies on problem-based learning (PBL) have supported its effectiveness, compared to other, more traditional strategies. The results of a previous study comparing the effect of lecture versus PBL on objective test scores indicated there was no significant difference in scores. To measure long-term knowledge retention, the same groups were evaluated 1 year after instruction. The posttest administered in the original study was repeated, and the scores from a comprehensive adult health examination and the endocrine subsection were analyzed. At an alpha level of 0.05, a statistically significant difference was found in the scores on two of the measures. The scores of the PBL group were significantly higher on the endocrine section of the examination and the repeat posttest.
Comparison of effectiveness of abrasive and enzymatic action of whitening toothpastes in removal of extrinsic stains - a clinical trial.

PubMed

Patil, P A; Ankola, A V; Hebbal, M I; Patil, A C

2015-02-01

To compare the effectiveness of abrasive component (perlite/calcium carbonate) and enzymatic component (papain and bromelain) of whitening toothpaste in removal of extrinsic stains. This study is a randomized, triple blind and parallel group study in which 90 subjects aged 18-40 years were included. At baseline, stain scores were assessed by Macpherson's modification of Lobene Stain Index and subjects were randomly assigned to two groups with 45 subjects in each. Group 1 used whitening toothpaste with enzymatic action and group 2 with abrasive action. After 1 month, stain scores were assessed for the effectiveness of the two toothpastes and 2 months later to check the stain prevention efficacy. Wilcoxon's test was used to compare between baseline, 1-month and 2-month stain scores, and the Mann-Whitney U-test was applied for intragroup comparison. The mean baseline total stain score for the subjects allocated to the enzymatic toothpaste was 37.24 ± 2.11 which reduced to 30.77 ± 2.48 in 1 month, and for the abrasive paste, total stain reduced from 35.08 ± 2.96 to 32.89 ± 1.95. The reductions in total stain scores with both the pastes were significant compared with baseline stain scores (at 1 month Group 1, P = 0.0233 and Group 2, P = 0.0324; at 2 months, Group 1 P = 0.0356). Both the toothpastes proved to be equally good in removal of extrinsic stains; however, the enzymatic paste showed better results as compared to abrasive toothpaste. Whitening toothpaste with abrasive action and enzymatic action are equally effective in removal of extrinsic stains; however, whitening toothpaste with abrasive action needs to be used with caution. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The contribution of posterior circulation to memory function during the intracarotid amobarbital procedure.

PubMed

Zijlmans, M; Huibers, C J A; Huiskamp, G J; de Kort, G A P; Alpherts, W C J; Leijten, F S S; Hendrikse, J

2012-08-01

The purpose of this study was to evaluate the contribution of posterior circulation to memory function by comparing memory scores between patients with and without a foetal-type posterior cerebral artery (FTP) during the intracarotid amobarbital procedure (IAP) in epilepsy patients. Patients undergoing bilateral IAP between January 2004 and January 2010 were retrospectively included. Pre-test angiograms were assessed for the presence of a FTP. Memory function scores (% correct) after right and left injections were obtained. Functional significance of FTP was affirmed by relative occipital versus parietal EEG slow-wave increase during IAP. Memory and EEG scores were compared between patients with and without FTP (Mann-Whitney U test). A total of 106 patients were included, 73 with posterior cerebral arteries (PCA) without FTP ('non-FTP'), 28 patients with unilateral FTP and 5 with a bilateral FTP. Memory scores were lower when amytal was injected to the hemisphere contralateral to the presumed seizure focus (on the right decreasing from 98.3 to 59.1, and on the left decreasing from 89.1 to 72.4; p < 0.001). When IAP was performed on the side of FTP memory scores were significantly lower (70.8) compared to non-FTP (82.0; p = 0.02). Relative occipital EEG changes were 0.44 for FTP cases and 0.36 for non-FTP patients (p = 0.01). A relationship between vasculature and brain function was demonstrated by lower memory scores and more slow-wave activity on occipital EEG during IAP in patients with foetal-type PCA compared to patients with non-FTP. This suggests an important contribution of brain areas supplied by the PCA to memory function.
Long term safety and tolerability of Tafluprost 0.0015% vs Timolol 0.1% preservative-free in ocular hypertensive and in primary open-angle glaucoma patients: a cross sectional study.

PubMed

Rolle, Teresa; Spinetta, Roberta; Nuzzi, Raffaele

2017-08-03

The effects of preservatives of antiglaucoma medications on corneal surface and tear function have been widely shown in literature; it's not the same as regards the active compounds themselves. The purpose of our study was to compare Ocular Surface Disease (OSD) signs and symptoms of Tafluprost 0.0015% versus preservative free (PF) Timolol 0.1% eyedrops in ocular hypertensive (OH) and in primary open-angle glaucoma (POAG) patients. A cross-sectional study included patients in monotherapy for at least 36 months with Tafluprost 0.0015% (27) or PF Timolol 0.1% (24) and 20 healthy age and sex-matched volunteers. All subjects underwent clinical tests (Schirmer I and break-up time), in vivo confocal microscopy (IVCM) and were surveyed using Ocular Surface Disease Index (OSDI) and Glaucoma Symptoms Scale (GSS) questionnaires. The groups were compared with ANOVA, Kruskal-Wallis test, t-test, Mann-Whitney test and Bonferroni's adjustment of p-values. No significant differences were found in questionnaires scores, clinical tests, IVCM variables between therapy groups. Tafluprost 0.0015% group showed significantly higher OSDI score, basal epithelial cells density, stromal reflectivity, sub-basal nerves tortuosity (p = 0.0000, 0.037, 0.006, 0.0000) and less GSS score, number of sub-basal nerves (p = 0.0000, 0.037) than controls but similar clinical tests results (p > 0.05). PF Timolol group had significantly higher OSDI score, basal epithelial cells density, stromal reflectivity and sub-basal nerve tortuosity (p = 0.000, 0.014, 0.008, 0.002), less GSS score, BUT and number of sub-basal nerves (p = 0.0000, 0.026, 0.003) than controls. Compared to PF Timolol 0.1%, Tafluprost 0.0015% showed similar safety with regards to tear function and corneal status and a similar tolerability profile. Both therapy groups show some alterations in corneal microstructure but no side effects on tear function except for an increased tear instability in PF Timolol 0.1% group. 
Ophthalmologists should be aware that even PF formulations may lead to a mild ocular surface impairment.
Blinded randomized controlled study of a web-based otoscopy simulator in undergraduate medical education.

PubMed

Stepniak, Camilla; Wickens, Brandon; Husein, Murad; Paradis, Josee; Ladak, Hanif M; Fung, Kevin; Agrawal, Sumit K

2017-06-01

OtoTrain is a Web-based otoscopy simulator that has previously been shown to have face and content validity. The objective of this study was to evaluate the effectiveness of this Web-based otoscopy simulator in teaching diagnostic otoscopy to novice learners. Study design: Prospective, blinded randomized controlled trial. Second-year medical students were invited to participate in the study. A pretest consisted of a series of otoscopy videos followed by an open-answer format assessment pertaining to the characteristics and diagnosis of each video. Participants were then randomly divided into a control group and a simulator group. Following the pretest, both groups attended standard otology lectures, but the simulator group was additionally given unlimited access to OtoTrain for 1 week. A post-test was completed using a separate set of otoscopy videos. Tests were graded based on a comprehensive marking scheme. The pretest and post-test were anonymized, and the three evaluators were blinded to student allotment. A total of 41 medical students were enrolled in the study and randomized to the control group (n = 20) and the simulator group (n = 21). There was no significant difference between the two groups on their pretest scores. With the standard otology lectures, the control group had a 31% improvement in their post-test score (mean ± standard error of the mean, 30.4 ± 1.5) compared with their pretest score (23.3 ± 1.8) (P < .001). The simulator group had the addition of OtoTrain to the otology lectures, and their score improved by 71% on their post-test (37.8 ± 1.6) compared to their pretest (22.1 ± 1.9) (P < .001). Comparing the post-test results, the simulator group had a 24% higher score than the control group (P < .002). Inter-rater reliability between the blinded evaluators was excellent (r = 0.953, P < .001). The use of OtoTrain increased the diagnostic otoscopic performance in novice learners. OtoTrain may be an effective teaching adjunct for undergraduate medical students. Level of evidence: 1b. Laryngoscope, 127:1306-1311, 2017. © 2016 The American Laryngological, Rhinological and Otological Society, Inc.
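The percentage improvements quoted in this record follow from the reported group means. A minimal Python sketch of that arithmetic, assuming each percentage is a simple relative change between two means (the helper name is illustrative):

```python
def percent_change(before: float, after: float) -> float:
    """Relative change from `before` to `after`, in percent."""
    if before == 0:
        raise ValueError("before must be non-zero")
    return (after - before) / before * 100


# Means as reported in the abstract (rounded to one decimal place there).
control = percent_change(23.3, 30.4)    # control group, pretest -> post-test
simulator = percent_change(22.1, 37.8)  # simulator group, pretest -> post-test
between = percent_change(30.4, 37.8)    # post-test, control -> simulator

for label, value in [("control", control), ("simulator", simulator), ("between", between)]:
    print(f"{label}: {value:.1f}%")
```

The rounded means give about 30.5%, 71.0% and 24.3%, which agrees with the reported 31%, 71% and 24% to within what rounding of the published means would explain.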
[Prognostic scores for pulmonary embolism].

PubMed

Junod, Alain

2016-03-23

Nine prognostic scores for pulmonary embolism (PE), based on retrospective and prospective studies, published between 2000 and 2014, have been analyzed and compared. Most of them aim at identifying PE cases with a low risk to validate their ambulatory care. Important differences exist between these scores in the outcomes considered (global mortality, PE-specific mortality, other complications) and in the sizes of the low-risk groups. The most popular score appears to be the PESI and its simplified version. Few good-quality studies have tested the applicability of these scores to PE outpatient care, although this approach is already becoming widespread in medical practice.
Effects of arginine vasopressin on musical working memory

PubMed Central

Granot, Roni Y.; Uzefovsky, Florina; Bogopolsky, Helena; Ebstein, Richard P.

2013-01-01

Previous genetic studies showed an association between variations in the gene coding for the 1a receptor of the neuro-hormone arginine vasopressin (AVP) and musical working memory (WM). The current study set out to test the influence of intranasal administration (INA) of AVP on musical as compared to verbal WM using a double blind crossover (AVP—placebo) design. Two groups of 25 males were exposed to 20 IU of AVP in one session, and 20 IU of saline water (placebo) in a second session, 1 week apart. In each session subjects completed the tonal subtest from Gordon's “Musical Aptitude Profile,” the interval subtest from the “Montreal Battery for Evaluation of Amusias (MBEA),” and the forward and backward digit span tests. Scores in the digit span tests were not influenced by AVP. In contrast, in the music tests there was an AVP effect. In the MBEA test, scores for the group receiving placebo in the first session (PV) were higher than for the group receiving vasopressin in the first session (VP) (p < 0.05) with no main Session effect nor Group × Session interaction. In the Gordon test there was a main Session effect (p < 0.05) with scores higher in the second as compared to the first session, a marginal main Group effect (p = 0.093) and a marginal Group × Session interaction (p = 0.88). In addition we found that the group that received AVP in the first session scored higher on scales indicative of happiness, and alertness on the positive and negative affect scale, (PANAS). Only in this group and only in the music test these scores were significantly correlated with memory scores. Together the results reflect a complex interaction between AVP, musical memory, arousal, and contextual effects such as session, and base levels of memory. The results are interpreted in light of music's universal use as a means to modulate arousal on the one hand, and AVP's influence on mood, arousal, and social interactions on the other. PMID:24151474
Modified Team-Based Learning in an Ophthalmology Clerkship in China

PubMed Central

Zhou, Yuxian; Ao, Yong; Xin, Wei; Jia, Yu; Yang, Ying; Cai, Yu; Xu, Chaochao; Yang, Yangfan; Lin, Haotian

2016-01-01

Objective: Team-based learning (TBL) is an increasingly popular teaching method in medical education. However, TBL hasn’t been well studied in the ophthalmology clerkship context. This study aimed to examine the impact of modified TBL in this context and to assess student evaluations of TBL. Methods: Ninety-nine students of an 8-year clinical medicine program from Zhongshan Ophthalmic Centre, Sun Yat-sen University, were randomly divided into four sequential units and assigned to six teams with the same faculty. The one-week ophthalmology clerkship module included traditional lectures, gross anatomy and a TBL module. The effects of the TBL module on student performance were measured by the Individual Readiness Assurance Test (IRAT), the Group Readiness Assurance Test (GRAT), the Group Application Problem (GAP) and final examination scores (FESs). Students’ evaluations of TBL were measured by a 16-item questionnaire. IRAT and GRAT scores were compared using a paired t-test. One-way analysis of variance (ANOVA) and subgroup analysis compared the effects among quartiles that were stratified by the Basic Ophthalmology Levels (BOLs). The BOLs were evaluated before the ophthalmology clerkship. Results: In TBL classes, the GRAT scores were significantly higher than the IRAT scores in both the full example and the BOL-stratified groups, which highlighted the advantages of TBL compared to individual learning. Quartile-stratified ANOVA comparisons showed significant differences in FES scores (P < 0.01). For IRAT, GRAT and GAP scores, there was no significant result. Moreover, IRAT scores only significantly differed between the first and fourth groups. The FES scores of the first three groups were significantly higher than those of the fourth group. Gender-specific differences were significant in FES but not in the IRAT. Overall, 57.65% of student respondents agreed that TBL was helpful. Male students tended to rate TBL higher than female students. Conclusion: The application of modified TBL to the ophthalmology clerkship curriculum improved students’ performance and increased students’ engagement and satisfaction. TBL should be further optimized and developed to enhance the educational outcomes among multi-BOLs medical students. PMID:27100286
Effect of illumination on colour vision testing with Farnsworth-Munsell 100 hue test: customized colour vision booth versus room illumination.

PubMed

Zahiruddin, Kowser; Banu, Shaj; Dharmarajan, Ramya; Kulothungan, Vaitheeswaran; Vijayan, Deepa; Raman, Rajiv; Sharma, Tarun

2010-06-01

To evaluate a customized, portable Farnsworth-Munsell 100 (FM 100) hue viewing booth for compliance with colour vision testing standards and to compare it with room illumination in subjects with normal colour vision (trichromats), subjects with acquired colour vision defects (secondary to diabetes mellitus), and subjects with congenital colour vision defects (dichromats). Discrete wavelengths of the tube in the customized booth were measured with a spectrometer using the normal incident method and were compared with the spectral distribution of sunlight. Forty-eight subjects were recruited for the study and were divided into 3 groups: Group 1, Normal Trichromats (30 eyes); Group 2, Congenital Colour Vision Defects (16 eyes); and Group 3, Diabetes Mellitus (20 eyes). The FM 100 hue test performance was compared using two illumination conditions, booth illumination and room illumination. Total error scores of the classical method in Group 2 (mean ± SD) for room and booth illumination were 243.05 ± 85.96 and 149.85 ± 54.50, respectively (p=0.0001). Group 2 demonstrated lesser correlation (r=0.50, 0.55), lesser reliability (Cronbach's alpha, 0.625, 0.662) and greater variability (Bland & Altman value, 10.5) in total error scores for the classical method and the moment of inertia method between the two illumination conditions when compared to the other two groups. The customized booth demonstrated illumination meeting CIE standards. The total error scores were overestimated by the classical and moment of inertia methods in all groups for room illumination compared with booth illumination; however, overestimation was more significant in the diabetes group.

No neurocognitive advantage for immediate antiretroviral treatment in adults with greater than 500 CD4+ T-cell counts.

PubMed

Wright, Edwina J; Grund, Birgit; Robertson, Kevin R; Cysique, Lucette; Brew, Bruce J; Collins, Gary L; Poehlman-Roediger, Mollie; Vjecha, Michael J; Penalva de Oliveira, Augusto César; Standridge, Barbara; Carey, Cate; Avihingsanon, Anchalee; Florence, Eric; Lundgren, Jens D; Arenas-Pinto, Alejandro; Mueller, Nicolas J; Winston, Alan; Nsubuga, Moses S; Lal, Luxshimi; Price, Richard W

2018-05-15

To compare the effect of immediate versus deferred antiretroviral treatment (ART) on neuropsychological test performance in treatment-naive HIV-positive adults with more than 500 CD4 cells/μl. Randomized trial. The START parent study randomized participants to commence immediate versus deferred ART until CD4 less than 350 cells/μl. The START Neurology substudy used eight neuropsychological tests, at baseline, months 4, 8, 12 and annually, to compare groups for changes in test performance. Test results were internally standardized to z-scores. The primary outcome was the average of the eight test z-scores (QNPZ-8). Mean changes in QNPZ-8 from baseline were compared by intent-to-treat using longitudinal mixed models. Changes from baseline to specific time points were compared using ANCOVA models. The 592 participants had a median age of 34 years; median baseline CD4 count was 629 cells/μl; the mean follow-up was 3.4 years. ART was used for 94 and 32% of accrued person-years in the immediate and deferred groups, respectively. There was no difference between the immediate and deferred ART groups in QNPZ-8 change through follow-up [-0.018 (95% CI -0.062 to 0.027, P = 0.44)], or at any visit. However, QNPZ-8 scores increased in both arms during the first year, by 0.22 and 0.24, respectively (P < 0.001 for increase from baseline). We observed substantial improvement in neurocognitive test performance during the first year in both study arms, underlining the importance of using a control group in studies assessing neurocognitive performance over time. Immediate ART neither benefitted nor harmed neurocognitive performance in individuals with CD4 cell counts above 500 cells/μl.
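The abstract above describes standardizing each test to z-scores and averaging them into a composite (QNPZ-8). The following is a minimal sketch of that kind of composite, not the study's actual procedure: the function names, the two-test example, and the raw values are all hypothetical, and it assumes standardization against the sample's own mean and SD with higher scores always meaning better performance.

```python
from statistics import mean, stdev

def z_scores(values):
    """Standardize raw scores against the sample's own mean and sample SD."""
    m, s = mean(values), stdev(values)
    return [(v - m) / s for v in values]

def composite_z(raw_by_test):
    """Average each participant's z-scores across all tests.

    raw_by_test maps a test name to a list of raw scores, one per
    participant, with participants in the same order for every test.
    """
    standardized = [z_scores(scores) for scores in raw_by_test.values()]
    # zip(*...) regroups the per-test lists into per-participant tuples.
    return [mean(per_person) for per_person in zip(*standardized)]

# Hypothetical raw scores for three participants on two tests.
raw = {"test_a": [10.0, 12.0, 14.0], "test_b": [30.0, 20.0, 40.0]}
composite = composite_z(raw)  # one composite z-score per participant
```

A real battery would typically reverse the sign of timed tests (where lower is better) before averaging; that step is omitted here.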
Impact of an engineering design-based curriculum compared to an inquiry-based curriculum on fifth graders' content learning of simple machines

NASA Astrophysics Data System (ADS)

Marulcu, Ismail; Barnett, Michael

2016-01-01

Background: Elementary Science Education is struggling with multiple challenges. National and State test results confirm the need for deeper understanding in elementary science education. Moreover, national policy statements and researchers call for increased exposure to engineering and technology in elementary science education. The basic motivation of this study is to suggest a solution to both improving elementary science education and increasing exposure to engineering and technology in it. Purpose/Hypothesis: This mixed-method study examined the impact of an engineering design-based curriculum compared to an inquiry-based curriculum on fifth graders' content learning of simple machines. We hypothesize that the LEGO-engineering design unit is as successful as the inquiry-based unit in terms of students' science content learning of simple machines. Design/Method: We used a mixed-methods approach to investigate our research questions; we compared the control and the experimental groups' scores from the tests and interviews by using Analysis of Covariance (ANCOVA) and compared each group's pre- and post-scores by using paired t-tests. Results: Our findings from the paired t-tests show that both the experimental and comparison groups significantly improved their scores from the pre-test to post-test on the multiple-choice, open-ended, and interview items. Moreover, ANCOVA results show that students in the experimental group, who learned simple machines with the design-based unit, performed significantly better on the interview questions. Conclusions: Our analyses revealed that the design-based Design a people mover: Simple machines unit was, if not better, as successful as the inquiry-based FOSS Levers and pulleys unit in terms of students' science content learning.
Simulation-Based Educational Module Improves Intern and Medical Student Performance of Closed Reduction and Percutaneous Pinning of Pediatric Supracondylar Humeral Fractures.

PubMed

Butler, Bennet A; Lawton, Cort D; Burgess, Jamie; Balderama, Earvin S; Barsness, Katherine A; Sarwark, John F

2017-12-06

Simulation-based education has been integrated into many orthopaedic residency programs to augment traditional teaching models. Here we describe the development and implementation of a combined didactic and simulation-based course for teaching medical students and interns how to properly perform a closed reduction and percutaneous pinning of a pediatric supracondylar humeral fracture. Subjects included in the study were either orthopaedic surgery interns or subinterns at our institution. Subjects all completed a combined didactic and simulation-based course on pediatric supracondylar humeral fractures. The first part of this course was an electronic (e)-learning module that the subjects could complete at home in approximately 40 minutes. The second part of the course was a 20-minute simulation-based skills learning session completed in the simulation center. Subject knowledge of closed reduction and percutaneous pinning of supracondylar humeral fractures was tested using a 30-question, multiple-choice, written test. Surgical skills were tested in the operating room or in a simulated operating room. Subject pre-intervention and post-intervention scores were compared to determine if and how much they had improved. A total of 21 subjects were tested. These subjects significantly improved their scores on both the written, multiple-choice test and skills test after completing the combined didactic and simulation module. Prior to the module, intern and subintern multiple-choice test scores were significantly worse than postgraduate year (PGY)-2 to PGY-5 resident scores (p < 0.01); after completion of the module, there was no significant difference in the multiple-choice test scores. After completing the module, there was no significant difference in skills test scores between interns and PGY-2 to PGY-5 residents. Both tests were validated using the scores obtained from PGY-2 to PGY-5 residents. 
Our combined didactic and simulation course significantly improved intern and subintern understanding of supracondylar humeral fractures and their ability to perform a closed reduction and percutaneous pinning of these fractures.
Recording and evaluation of an American dialect version of the Four Alternative Auditory Feature test.

PubMed

Xu, Jingjing; Cox, Robyn M

2014-09-01

The Four Alternative Auditory Feature test (FAAF) is a word-based closed-set speech recognition test. Because the original test materials were recorded in British English dialect, it is not appropriate for use in the United States. The purpose of this study was to produce an American dialect FAAF (AFAAF). The AFAAF materials spoken by a male native speaker of American English were recorded and digitally edited. In the validation study, the AFAAF was administered monaurally at five signal-to-noise ratios (SNRs) in both ears for each listener. A total of 20 young adults with normal hearing participated in the validation study. For each participant, speech recognition scores were collected in one session. The speech level was fixed at 70 dB SPL and the steady-state talker-matched noise level was varied, resulting in five SNRs from -15 to -5 dB. One full list (80 words) was used for each SNR. For each participant, a performance-intensity (PI) function was fit to the discrete mean percent correct scores for the five SNRs according to a best-fit, three-parameter sigmoid function. In addition, scores for the left and right ears were compared to examine test-retest reliability. Results show that the slope of the PI function is 6% per dB, the mean test-retest difference scores for the five SNRs are within 3 rationalized arcsine units (rau), and the 95% critical difference for the 80-word scores is 12 rau. Compared with the FAAF, the slope of the PI function for the AFAAF is slightly less steep. Test-retest reliability of the AFAAF is at least equal to that of the FAAF. It is concluded that the AFAAF is similar but not identical to the FAAF. The AFAAF is now available for measuring speech recognition performance in listeners who use American English as a native language. American Academy of Audiology.
The power of timing: Adding a time-to-completion cutoff to the Word Choice Test and Recognition Memory Test improves classification accuracy.

PubMed

Erdodi, Laszlo A; Tyson, Bradley T; Shahein, Ayman G; Lichtenstein, Jonathan D; Abeare, Christopher A; Pelletier, Chantalle L; Zuccato, Brandon G; Kucharski, Brittany; Roth, Robert M

2017-05-01

The Recognition Memory Test (RMT) and Word Choice Test (WCT) are structurally similar, but psychometrically different. Previous research demonstrated that adding a time-to-completion cutoff improved the classification accuracy of the RMT. However, the contribution of WCT time-cutoffs to improve the detection of invalid responding has not been investigated. The present study was designed to evaluate the classification accuracy of time-to-completion on the WCT compared to the accuracy score and the RMT. Both tests were administered to 202 adults (M age = 45.3 years, SD = 16.8; 54.5% female) clinically referred for neuropsychological assessment in counterbalanced order as part of a larger battery of cognitive tests. Participants obtained lower and more variable scores on the RMT (M = 44.1, SD = 7.6) than on the WCT (M = 46.9, SD = 5.7). Similarly, they took longer to complete the recognition trial on the RMT (M = 157.2 s, SD = 71.8) than the WCT (M = 137.2 s, SD = 75.7). The optimal cutoff on the RMT (≤43) produced .60 sensitivity at .87 specificity. The optimal cutoff on the WCT (≤47) produced .57 sensitivity at .87 specificity. Time-cutoffs produced comparable classification accuracies for both RMT (≥192 s; .48 sensitivity at .88 specificity) and WCT (≥171 s; .49 sensitivity at .91 specificity). They also identified an additional 6-10% of the invalid profiles missed by accuracy score cutoffs, while maintaining good specificity (.93-.95). Functional equivalence was reached at accuracy scores ≤43 (RMT) and ≤47 (WCT) or time-to-completion ≥192 s (RMT) and ≥171 s (WCT). Time-to-completion cutoffs are valuable additions to both tests. They can function as independent validity indicators or enhance the sensitivity of accuracy scores without requiring additional measures or extending standard administration time.
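The sensitivity and specificity figures reported for a cutoff such as RMT ≤43 can be illustrated with a short sketch. The function name and the eight example profiles are hypothetical and the data are invented for illustration; the only idea taken from the abstract is that a score at or below the cutoff flags a profile as invalid.

```python
def sensitivity_specificity(scores, is_invalid, cutoff):
    """Flag a profile when score <= cutoff and compare with the criterion label."""
    tp = fp = tn = fn = 0
    for score, invalid in zip(scores, is_invalid):
        flagged = score <= cutoff
        if flagged and invalid:
            tp += 1      # invalid profile correctly flagged
        elif flagged and not invalid:
            fp += 1      # valid profile wrongly flagged
        elif not flagged and invalid:
            fn += 1      # invalid profile missed
        else:
            tn += 1      # valid profile correctly passed
    return tp / (tp + fn), tn / (tn + fp)

# Hypothetical accuracy scores and criterion labels (True = invalid profile).
scores = [40, 43, 45, 48, 50, 44, 49, 50]
is_invalid = [True, True, True, False, False, False, False, False]
sens, spec = sensitivity_specificity(scores, is_invalid, cutoff=43)
```

For a time-to-completion cutoff the comparison would flip to `score >= cutoff`, since longer times are the suspicious direction.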
Neurocognitive functioning in children diagnosed with diabetes before age 10 years.

PubMed

Kaufman, F R; Epport, K; Engilman, R; Halvorson, M

1999-01-01

Our objective was to determine scores on tests of neurocognitive functioning in children diagnosed with diabetes before age 10 years and to determine the association of age of diagnosis, duration of diabetes, subtle hypoglycemia, severe hypoglycemia, and history of hypoglycemic seizures with these neurocognitive test scores. Fifty-five of 62 eligible patients with a mean age of 7.9 +/- 1.6 years followed in our center were given the Woodcock-Johnson Psychoeducational Battery, Beery Developmental Test of Visual-Motor Integration, Finger Tapping, Grooved Pegboard, and Verbal Selective Reminding tests to evaluate the following domains: memory/attention, visual-perceptual, broad cognitive function, academic achievement, and fine motor speed/coordination. Fifteen age-matched siblings served as controls. Twenty-seven subjects were less than 5 years of age when diagnosed with diabetes, the mean age at diagnosis was 4.5 +/- 2.1 years of age, and mean diabetes duration was 2.6 +/- 2.0 years. Eighteen patients had a history of severe hypoglycemia, eight of whom had hypoglycemic seizures. The mean HbA1c was 7.8 +/- 1.1% for the year prior to testing. Our results showed that the overall mean scores for the extensive neurocognitive battery were within the normal range and were comparable to the scores of the age-matched sibling controls. Age of diagnosis and duration of diabetes did not relate to neurocognitive test results. Mean HbA1c had a negative association with some tests of memory/attention (p < 0.03-0.04) and academic achievement (p < 0.005-0.03), while number of blood glucose levels less than 70 mg/dL had a positive association with memory/attention (p < 0.004-0.04), verbal comprehension (p < 0.03) and academic achievement (p < 0.018-0.05).
There was no association of neurocognitive test scores with severe hypoglycemia, but subjects with a history of hypoglycemic seizures had a decrease in scores on tests assessing memory skills (p < 0.03), including short-term memory and memory for words. These data suggest that overall neurocognitive test scores were within the normal range and comparable to controls. However, specific aspects of neurocognitive functioning may be adversely affected by having had a hypoglycemic seizure, but not by episodes of severe hypoglycemia without seizure. Lower HbA1c and an increase in the number of blood glucose levels less than 70 mg/dL (subtle hypoglycemia), both of which were associated with higher scores in some domains of academic achievement and memory, suggest that stable glycemia may influence cognitive abilities and/or that successful diabetes management requires cognitive skills. Strategies to diminish the risk of seizures with hypoglycemia should be investigated.
The effectiveness of dentifrices without and with sodium lauryl sulfate on plaque, gingivitis and gingival abrasion--a randomized clinical trial.

PubMed

Sälzer, S; Rosema, N A M; Martin, E C J; Slot, D E; Timmer, C J; Dörfer, C E; van der Weijden, G A

2016-04-01

The aim of this study was to compare the efficacy of a dentifrice without sodium lauryl sulfate (SLS) to a dentifrice with SLS in young adults aged 18-34 years on gingivitis. One hundred twenty participants (non-dental students) with a moderate gingival inflammation (bleeding on probing at 40-70% of test sites) were included in this randomized controlled double-blind clinical trial. According to randomization, participants had to brush their teeth either with dentifrice without SLS or with SLS for 8 weeks. The primary outcome was bleeding on marginal probing (BOMP). The secondary outcomes were plaque scores and gingival abrasion scores (GA) as well as a visual analogue scale (VAS) score at exit survey. Baseline and end differences were analysed by univariate analysis of covariance (ANCOVA) test, between-group differences by independent t test and within-group differences by paired sample t test. BOMP improved within groups from on average 0.80 at baseline to 0.60 in the group without SLS and to 0.56 in the group with SLS. No statistical difference for BOMP, plaque and gingival abrasion was found between the two groups. VAS scores for taste, freshness and foaming effect were significantly in favour of the SLS-containing dentifrice. The test dentifrice without SLS was as effective as a regular SLS dentifrice on gingival bleeding scores and plaque scores. There was no significant difference in the incidence of gingival abrasion. In patients diagnosed with gingivitis, a dentifrice without SLS seems to be equally effective compared to a dentifrice with SLS and did not demonstrate any significant difference in gingival abrasion. In patients with recurrent aphthous ulcers, the absence of SLS may even be beneficial. However, participants indicated that they appreciate the foaming effect of a dentifrice with SLS more.
Using screen-based simulation to improve performance during pediatric resuscitation.

PubMed

Biese, Kevin J; Moro-Sutherland, Donna; Furberg, Robert D; Downing, Brian; Glickman, Larry; Murphy, Alison; Jackson, Cheryl L; Snyder, Graham; Hobgood, Cherri

2009-12-01

To assess the ability of a screen-based simulation-training program to improve emergency medicine and pediatric resident performance in critical pediatric resuscitation knowledge, confidence, and skills. A pre-post, interventional design was used. Three measures of performance were created and assessed before and after intervention: a written pre-course knowledge examination, a self-efficacy confidence score, and a skills-based high-fidelity simulation code scenario. For the high-fidelity skills assessment, independent physician raters recorded and reviewed subject performance. The intervention consisted of eight screen-based pediatric resuscitation scenarios that subjects had 4 weeks to complete. Upon completion of the scenarios, all three measures were repeated. For the confidence assessment, summary pre- and post-test confidence scores were compared using a t-test, and for the skills assessment, pre-scores were compared with post-test measures for each individual using McNemar's chi-square test for paired samples. Twenty-six of 35 (71.3%) enrolled subjects completed the institutional review board-approved study. Increases were observed in written test scores, confidence, and some critical interventions in high-fidelity simulation. The mean improvement in cumulative confidence scores for all residents was 10.1 (SD +/-4.9; range 0-19; p < 0.001), with no resident feeling less confident after the intervention. Although overall performance in simulated codes did not change significantly, with average scores of 6.65 (+/-1.76) to 7.04 (+/-1.37) out of 9 possible points (p = 0.58), improvement was seen in the administering of appropriate amounts of IV fluids (59-89%, p = 0.03). In this study, improvements in resident knowledge, confidence, and performance of certain skills in simulated pediatric cardiac arrest scenarios suggest that screen-based simulations may be an effective way to enhance resuscitation skills of pediatric providers.
These results should be confirmed using a randomized design with an appropriate control group. (c) 2009 by the Society for Academic Emergency Medicine.
Deficits in Physical Function Among Young Childhood Cancer Survivors

PubMed Central

Hoffman, Megan C.; Mulrooney, Daniel A.; Steinberger, Julia; Lee, Jill; Baker, K. Scott; Ness, Kirsten K.

2013-01-01

Purpose: Childhood cancer survivors (CCSs) are at risk for physical disability. The aim of this investigation was to characterize and compare physical performance among CCSs and a group of siblings age < 18 years and determine if diagnosis, treatment, and physical activity levels were associated with lower performance scores. Methods: CCSs ≥ 5 years from diagnosis and a sibling comparison group were recruited and evaluated for strength, mobility, and fitness. Physical performance measures were compared in regression models between survivors and siblings by diagnosis and among survivors by treatment exposures and physical activity levels. Results: CCSs (n = 183; mean age ± standard deviation [SD], 13.5 ± 2.5 years; 53% male) scored lower than siblings (n = 147; mean age ± SD, 13.4 ± 2.4 years; 50% male) on lower-extremity strength testing, the timed up-and-go (TUG) test, and the 6-minute walk (6MW) test, despite reporting similar levels and types of habitual physical activity. The lowest scores were prevalent among survivors of CNS tumors and bone and soft tissue sarcomas on strength testing (score ± SD: CNS tumors, 76.5 ± 4.7; sarcoma, 67.1 ± 7.2 v siblings, 87.3 ± 2.4 Newton-meters quadriceps strength at 90° per second; P = .04 and .01, respectively) and among CNS tumor survivors on the TUG (score ± SD: 5.1 ± 0.1 v siblings, 4.4 ± 0.1 seconds; P < .001) and 6MW tests (score ± SD: 533.3 ± 15.6 v siblings, 594.1 ± 8.3 m; P < .001). Conclusion: CCSs may have underlying physiologic deficits that interfere with function that cannot be completely overcome by participation in regular physical activity. These survivors may need referral for specialized exercise interventions in addition to usual counseling to remain physically active. PMID:23796992
Comparison of Tear Osmolarity in Rheumatoid Arthritis Patients With and Without Secondary Sjogren Syndrome.

PubMed

Ng, Alex L K; Choy, Bonnie N K; Chan, Tommy C Y; Wong, Ian Y H; Lai, Jimmy S M; Mok, Mo Yin

2017-07-01

To compare tear osmolarity (TO) and other dry eye parameters in rheumatoid arthritis (RA) patients with or without secondary Sjogren syndrome (sSS). Consecutive patients with RA were divided into a sSS group and no-sSS group using conventional diagnostic criteria by rheumatologists using symptomatology, Schirmer test score, and anti-Ro or anti-La autoantibody status. The TO, Ocular Surface Disease Index, dry eye disease (DED) parameters [such as tear breakup time (TBUT) and corneal staining score] and the systemic inflammatory markers [erythrocyte sedimentation rate (ESR) and C-reactive protein (CRP)] were compared. Correlation analyses between TO and the DED parameters and inflammatory markers were also performed. A total of 42 cases with mean age 54.8 ± 12.3 were included, with 12 patients (29%) having sSS and 30 (71%) without sSS. TO was increased in both groups (329 ± 20 and 319 ± 25 mOsm/L, respectively), but no statistically significant difference was found between the 2 groups (P = 0.126). RA patients with sSS had significantly shorter TBUT, higher corneal staining scores, and higher ESR and CRP levels (P < 0.05). TO did not correlate with the Schirmer test score, but had significant positive correlations with age, corneal staining score, ESR, and CRP levels, and a significant negative correlation with TBUT. TO was increased in RA patients with or without sSS. There was no significant correlation between TO and the Schirmer test score, and the physician could not use TO to diagnose sSS. However, TO correlated well with both DED parameters (TBUT and corneal staining score) and systemic inflammatory markers (ESR and CRP).
Medical ethical standards in dermatology: an analytical study of knowledge, attitudes and practices.

PubMed

Mostafa, W Z; Abdel Hay, R M; El Lawindi, M I

2015-01-01

Dermatology practice has not been ethically justified at all times. The objective of the study was to find out dermatologists' knowledge about medical ethics, their attitudes towards regulatory measures and their practices, and to study the different factors influencing the knowledge, the attitude and the practices of dermatologists. This is a cross-sectional comparative study conducted among 214 dermatologists, from five Academic Universities and from participants in two conferences. A 54-item structured anonymous questionnaire was designed to describe the demographical characteristics of the study group as well as their knowledge, attitude and practices regarding the medical ethics standards in clinical and research settings. Five scoring indices were estimated regarding knowledge, attitude and practice. Inferential statistics were used to test differences between groups as indicated. The Student's t-test and analysis of variance were carried out for quantitative variables. The chi-squared test was conducted for qualitative variables. The results were considered statistically significant at P < 0.05. Analysis of the possible factors having impact on the overall scores revealed that the highest knowledge scores were among dermatologists who practice in an academic setting plus an additional place; however, this difference was statistically non-significant (P = 0.060). Female dermatologists showed a higher attitude score compared to males (P = 0.028). The highest significant attitude score (P = 0.019) regarding clinical practice was recorded among those practicing cosmetic dermatology. The different studied groups of dermatologists revealed a significant impact on the attitude score (P = 0.049), and the evidence-practice score (P < 0.001). Ethical practices will improve the quality and integrity of dermatology research. © 2014 European Academy of Dermatology and Venereology.
Therapeutic alliance in dietetic practice for weight loss: Insights from health coaching.

PubMed

Nagy, Annaliese; McMahon, Anne; Tapsell, Linda; Deane, Frank; Arenson, Danielle

2018-02-13

The psychological construct of 'therapeutic alliance' can be used to better understand the effectiveness of consultations, particularly goal setting for weight management. We analysed audio-recorded health coaching sessions during a weight loss trial to explore relationships between therapeutic alliance and various contextual factors. Audio recordings of 50 health coaching sessions were analysed. After assessing fidelity to the protocol, therapeutic alliance was measured using an adapted Working Alliance Inventory Observer-rated Short Version (WAI-O-S), and examined by (i) identifying relationships between contextual factors and WAI-O-S scores (Spearman's coefficients); (ii) testing the impact of preparatory exercises and body mass index on WAI-O-S scores (one-way analysis of variance and least-squared differences tests) and (iii) comparing differences in WAI-O-S scores based on relationship status, gender and follow-up session completion (independent samples t-tests). Fidelity was high (mean 88%). WAI-O-S total scores ranged from 55 to 70 (out of 84). Session duration was significantly correlated with WAI-O-S component of 'Bond' (r = 0.42, P = 0.002). Those who completed preparatory exercises had significantly higher total WAI-O-S scores, 'Goal' and 'Task' scores. Participants who completed the follow-up session scored significantly higher for 'Goal' compared to no follow-up. Spending more time in a session appears related to increased bonding, a key component of therapeutic alliance. Preparatory work may help build therapeutic alliance and agreement on goals appears to influence follow-up completion. These exploratory findings provide directions for research addressing the professional relationship in dietetic consultations for weight loss. © 2018 Dietitians Association of Australia.
Improvement in Stroke-induced Motor Dysfunction by Music-supported Therapy: A Systematic Review and Meta-analysis

PubMed Central

Zhang, Yingshi; Cai, Jiayi; Zhang, Yaqiong; Ren, Tianshu; Zhao, Mingyi; Zhao, Qingchun

2016-01-01

To conduct a meta-analysis of clinical trials that examined the effect of music-supported therapy on stroke-induced motor dysfunction, comprehensive literature searches of PubMed, Embase and the Cochrane Library from their inception to April 2016 were performed. A total of 10 studies (13 analyses, 358 subjects) were included; all had acceptable quality according to PEDro scale score. The baseline differences between the two groups were confirmed to be comparable. Compared with the control group, the standardized mean difference of 9-Hole Peg Test was 0.28 (−0.01, 0.57), 0.64 (0.31, 0.97) in Box and Block Test, 0.47 (0.08, 0.87) in Arm Paresis Score and 0.35 (−0.04, 0.75) in Action Research Arm Test for upper-limb motor function, 0.11 (−0.24, 0.46) in Berg Balance Scale score, 0.09 (−0.36, 0.54) in Fugl-Meyer Assessment score, 0.30 (−0.15, 0.74) in Wolf Motor Function Test, 0.30 (−0.15, 0.74) in Wolf Motor Function time, 0.65 (0.14, 1.16) in Stride length and 0.62 (0.01, 1.24) in Gait Velocity for total motor function, and 1.75 (0.94, 2.56) in Frontal Assessment Battery score for executive function. There was evidence of a positive effect of music-supported therapy, supporting its use for the treatment of stroke-induced motor dysfunction. This study was registered at PROSPERO (CRD42016037106). PMID:27917945
Comparing NET and ERI standardized exam scores between baccalaureate graduates who pass or fail the NCLEX-RN.

PubMed

Bondmass, Mary D; Moonie, Sheniz; Kowalski, Susan

2008-01-01

In the United States, nursing programs are commonly evaluated by their graduates' success on the National Council Licensure Examination for Registered Nurses (NCLEX-RN). The purpose of this paper is to describe a change in NCLEX-RN success rates following the addition of standardized exams throughout our program's curriculum, and to compare these exam scores between graduates who pass NCLEX-RN and those who do not. Our results indicate an 8.5% change (p < 0.000) in the NCLEX-RN pass rate from our previous 5-year mean pass rate, and significant differences in standardized test scores for those who pass the NCLEX-RN compared to those who do not (p < 0.03). We conclude that our selected standardized exam scores are able to significantly identify graduates who are more likely to pass NCLEX-RN than not.
Video as an Effective Method to Deliver Pre-Test Information for Rapid HIV Testing

PubMed Central

Clark, Melissa A.; Mayer, Kenneth H.; Seage, George R.; DeGruttola, Victor G.; Becker, Bruce M.

2008-01-01

Objectives: Video-based delivery of HIV pre-test information might assist in streamlining HIV screening and testing efforts in the emergency department (ED). The objectives of this study were to determine if the video “Do you know about rapid HIV testing?” is an acceptable alternative to an in-person information session on rapid HIV pre-test information, with regard to comprehension of rapid HIV pre-test fundamentals; and to identify patients who might have difficulties in comprehending pre-test information. Methods: This was a non-inferiority trial of 574 participants in an ED opt-in rapid HIV screening program who were randomly assigned to receive identical pre-test information from either an animated and live-action 9.5-minute video, or an in-person information session. Pre-test information comprehension was assessed using a questionnaire. The video would be accepted as not inferior to the in-person information session if the 95% confidence interval (CI) of the difference (Δ) in mean scores on the questionnaire between the two information groups was less than a 10% decrease in the in-person information session arm's mean score. Linear regression models were constructed to identify patients with lower mean scores based upon study arm assignment, demographic characteristics, and history of prior HIV testing. Results: The questionnaire mean scores were 20.1 (95% CI = 19.7 to 20.5) for the video arm and 20.8 (95% CI = 20.4 to 21.2) for the in-person information session arm. The difference in mean scores compared to the mean score for the in-person information session met the non-inferiority criterion for this investigation (Δ = 0.68; 95% CI = 0.18 to 1.26). In a multivariable linear regression model, Blacks/African Americans, Hispanics, and those with Medicare and Medicaid insurance exhibited slightly lower mean scores, regardless of the pre-test information delivery format.
There was a strong relationship between fewer years of formal education and lower mean scores on the questionnaire. Age, gender, type of insurance, partner/marital status, and history of prior HIV testing were not predictive of scores on the questionnaire. Conclusions: In terms of patient comprehension of rapid HIV pre-test information fundamentals, the video was an acceptable substitute for pre-test information delivered by an HIV test counselor. Both the video and in-person information session were less effective in providing pre-test information for patients with fewer years of formal education. PMID:19120050
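The non-inferiority criterion in the abstract above can be checked with simple arithmetic. The sketch below is one reading of that criterion, not the authors' analysis code: it assumes the upper bound of the 95% CI for the difference (in-person minus video) is compared against a margin of 10% of the in-person arm's mean. The function name is hypothetical; the numbers are the ones reported in the abstract.

```python
def non_inferiority_check(ci_upper, reference_mean, margin_fraction=0.10):
    """Return (is_non_inferior, margin) for a fixed-fraction margin.

    ci_upper: upper bound of the 95% CI for the difference in mean scores
    reference_mean: mean score in the reference (in-person) arm
    """
    margin = margin_fraction * reference_mean
    return ci_upper < margin, margin

# Reported values: in-person mean 20.8, difference CI upper bound 1.26.
is_non_inferior, margin = non_inferiority_check(ci_upper=1.26, reference_mean=20.8)
# margin is about 2.08 questionnaire points, so 1.26 falls inside it.
```

Under this reading, the whole interval (0.18 to 1.26) lies below the 2.08-point margin, which is consistent with the abstract's conclusion that the criterion was met.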
Evaluation of the validity of osteoporosis and fracture risk assessment tools (IOF One Minute Test, SCORE, and FRAX) in postmenopausal Palestinian women.

PubMed

Kharroubi, Akram; Saba, Elias; Ghannam, Ibrahim; Darwish, Hisham

2017-12-01

Simple self-assessment tools are needed to identify women at high risk of developing osteoporosis. In this study, tools like the IOF One Minute Test, Fracture Risk Assessment Tool (FRAX), and Simple Calculated Osteoporosis Risk Estimation (SCORE) were found to be valid for Palestinian women. The threshold for predicting women at risk for each tool was estimated. The purpose of this study is to evaluate the validity of the updated IOF (International Osteoporosis Foundation) One Minute Osteoporosis Risk Assessment Test, FRAX, and SCORE, as well as age alone, to detect the risk of developing osteoporosis in postmenopausal Palestinian women. Three hundred eighty-two women 45 years and older were recruited, including 131 women with osteoporosis and 251 controls, following bone mineral density (BMD) measurement; 287 completed questionnaires of the different risk assessment tools. Receiver operating characteristic (ROC) curves were evaluated for each tool using BMD as the gold standard for osteoporosis. The area under the ROC curve (AUC) was the highest for FRAX calculated with BMD for predicting hip fractures (0.897) followed by FRAX for major fractures (0.826) with cut-off values >1.5 and >7.8%, respectively. The IOF One Minute Test AUC (0.629) was the lowest compared to other tested tools but with sufficient accuracy for predicting the risk of developing osteoporosis with a cut-off value >4 total yes questions out of 18. The SCORE test and age alone were also good predictors of risk for developing osteoporosis. According to the ROC curve for age, women ≥64 years had a higher risk of developing osteoporosis. A higher percentage of women with low BMD (T-score ≤-1.5) or osteoporosis (T-score ≤-2.5) was found among women who were not exposed to the sun, who had menopause before the age of 45 years, or had lower body mass index (BMI) compared to controls. Women who often fall had lower BMI and approximately 27% of the recruited postmenopausal Palestinian women had accidents that caused fractures. Simple self-assessment tools like FRAX without BMD, SCORE, and the IOF One Minute Test were valid for predicting Palestinian postmenopausal women at high risk of developing osteoporosis.
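As an illustration of the ROC analysis this record describes, the following minimal sketch computes an AUC and a cut-off value for a risk score against a binary reference standard. The abstract does not say how its cut-offs were chosen; maximizing Youden's J (sensitivity + specificity - 1) is an assumption here, and the function names and toy data are hypothetical.

```python
def roc_auc(scores, labels):
    """AUC as the probability that a positive case outscores a negative one.

    Ties between a positive and a negative case count as half a win
    (the Mann-Whitney interpretation of the area under the ROC curve).
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def youden_cutoff(scores, labels):
    """Return (threshold, J) maximizing Youden's J for the rule 'score > threshold'.

    Assumption: the source abstract does not state its cut-off criterion;
    Youden's J is used here only as a common choice.
    """
    n_pos = sum(labels)
    n_neg = len(labels) - n_pos
    best_threshold, best_j = None, -1.0
    for t in sorted(set(scores)):
        true_pos = sum(1 for s, y in zip(scores, labels) if y == 1 and s > t)
        true_neg = sum(1 for s, y in zip(scores, labels) if y == 0 and s <= t)
        j = true_pos / n_pos + true_neg / n_neg - 1
        if j > best_j:  # keep the lowest threshold among ties
            best_threshold, best_j = t, j
    return best_threshold, best_j


if __name__ == "__main__":
    # Hypothetical questionnaire totals; 1 = osteoporosis by the reference standard.
    toy_scores = [2, 3, 4, 5, 6, 7, 3, 8]
    toy_labels = [0, 0, 0, 1, 0, 1, 1, 1]
    print("AUC:", roc_auc(toy_scores, toy_labels))
    print("cut-off, J:", youden_cutoff(toy_scores, toy_labels))
```

The pairwise AUC is quadratic in sample size, which is fine for a few hundred participants; a rank-based computation would be preferable for larger data.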
Cognitive Effects of Adenotonsillectomy for Obstructive Sleep Apnea.

PubMed

Taylor, H Gerry; Bowen, Susan R; Beebe, Dean W; Hodges, Elise; Amin, Raouf; Arens, Raanan; Chervin, Ronald D; Garetz, Susan L; Katz, Eliot S; Moore, Reneé H; Morales, Knashawn H; Muzumdar, Hiren; Paruthi, Shalini; Rosen, Carol L; Sadhwani, Anjali; Thomas, Nina Hattiangadi; Ware, Janice; Marcus, Carole L; Ellenberg, Susan S; Redline, Susan; Giordani, Bruno

2016-08-01

Research reveals mixed evidence for the effects of adenotonsillectomy (AT) on cognitive tests in children with obstructive sleep apnea syndrome (OSAS). The primary aim of the study was to investigate effects of AT on cognitive test scores in the randomized Childhood Adenotonsillectomy Trial. Children ages 5 to 9 years with OSAS without prolonged oxyhemoglobin desaturation were randomly assigned to watchful waiting with supportive care (n = 227) or early AT (eAT, n = 226). Neuropsychological tests were administered before the intervention and 7 months after the intervention. Mixed model analysis compared the groups on changes in test scores across follow-up, and regression analysis examined associations of these changes in the eAT group with changes in sleep measures. Mean test scores were within the average range for both groups. Scores improved significantly (P < .05) more across follow-up for the eAT group than for the watchful waiting group. These differences were found only on measures of nonverbal reasoning, fine motor skills, and selective attention and had small effect sizes (Cohen's d, 0.20-0.24). As additional evidence for AT-related effects on scores, gains in test scores for the eAT group were associated with improvements in sleep measures. Small and selective effects of AT were observed on cognitive tests in children with OSAS without prolonged desaturation. Relative to evidence from Childhood Adenotonsillectomy Trial for larger effects of surgery on sleep, behavior, and quality of life, AT may have limited benefits in reversing any cognitive effects of OSAS, or these benefits may require more extended follow-up to become manifest. Copyright © 2016 by the American Academy of Pediatrics.
Cognitive Effects of Adenotonsillectomy for Obstructive Sleep Apnea

PubMed Central

Bowen, Susan R.; Beebe, Dean W.; Hodges, Elise; Amin, Raouf; Arens, Raanan; Chervin, Ronald D.; Garetz, Susan L.; Katz, Eliot S.; Moore, Reneé H.; Morales, Knashawn H.; Muzumdar, Hiren; Paruthi, Shalini; Rosen, Carol L.; Sadhwani, Anjali; Thomas, Nina Hattiangadi; Ware, Janice; Marcus, Carole L.; Ellenberg, Susan S.; Redline, Susan; Giordani, Bruno

2016-01-01

OBJECTIVE: Research reveals mixed evidence for the effects of adenotonsillectomy (AT) on cognitive tests in children with obstructive sleep apnea syndrome (OSAS). The primary aim of the study was to investigate effects of AT on cognitive test scores in the randomized Childhood Adenotonsillectomy Trial. METHODS: Children ages 5 to 9 years with OSAS without prolonged oxyhemoglobin desaturation were randomly assigned to watchful waiting with supportive care (n = 227) or early AT (eAT, n = 226). Neuropsychological tests were administered before the intervention and 7 months after the intervention. Mixed model analysis compared the groups on changes in test scores across follow-up, and regression analysis examined associations of these changes in the eAT group with changes in sleep measures. RESULTS: Mean test scores were within the average range for both groups. Scores improved significantly (P < .05) more across follow-up for the eAT group than for the watchful waiting group. These differences were found only on measures of nonverbal reasoning, fine motor skills, and selective attention and had small effect sizes (Cohen’s d, 0.20–0.24). As additional evidence for AT-related effects on scores, gains in test scores for the eAT group were associated with improvements in sleep measures. CONCLUSIONS: Small and selective effects of AT were observed on cognitive tests in children with OSAS without prolonged desaturation. Relative to evidence from Childhood Adenotonsillectomy Trial for larger effects of surgery on sleep, behavior, and quality of life, AT may have limited benefits in reversing any cognitive effects of OSAS, or these benefits may require more extended follow-up to become manifest. PMID:27464674
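The effect sizes quoted in this record are Cohen's d values. As a rough illustration of the statistic, the sketch below computes d for two independent groups using the pooled standard deviation. The trial itself used mixed models on change scores, so this is not the authors' computation; the function name and numbers are hypothetical.

```python
import math
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Cohen's d for two independent samples, using the pooled standard deviation.

    d = (mean_a - mean_b) / s_pooled, where s_pooled weights each sample
    variance by its degrees of freedom.
    """
    n_a, n_b = len(group_a), len(group_b)
    pooled_var = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_var)


if __name__ == "__main__":
    # Hypothetical change scores (follow-up minus baseline) for two arms.
    surgery_arm = [2, 4, 6]
    waiting_arm = [1, 3, 5]
    print("Cohen's d:", cohens_d(surgery_arm, waiting_arm))
```

By the usual convention, values near 0.2 are read as small, which matches how the abstract characterizes its 0.20 to 0.24 range.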
The state of Illinois obstetric hemorrhage project: pre-project and post-training examination scores.

PubMed

Wong, Cynthia A; Scott, Shirley; Jones, Robin L; Walzer, Jennifer; Geller, Stacie

2016-03-01

The Illinois Department of Public Health mandated that all clinicians who provide care to obstetric patients participate in the Illinois Obstetric Hemorrhage Project. The aim of the current report is to describe change in knowledge among providers engaged in the project, as assessed by pre- and post-tests. The project, implemented 2008 to 2010, included four components: a written 25-item multiple-choice examination (pre-test), a didactic lecture, skill stations (for teaching blood loss estimation), and a simulation drill and debriefing. Participants completed a post-test 6 months later. Pre- and post-test examination scores were compared. Data from 95 hospitals are included in this analysis (9456 paired test results). The proportion of participants who scored ≥88% correct answers increased from 10.9% on the pre-test to 49.1% on the post-test (p < 0.0001). Registered nurses made greater improvements in test scores than anesthesia and obstetric providers (p < 0.0001). The Illinois Obstetric Hemorrhage Project was successful in improving knowledge of obstetric hemorrhage in a large number of providers with different expertise and experience levels. Further long-term study is essential to determine whether the skills acquired during the Project contribute to improved obstetric hemorrhage outcomes for the women of Illinois.
Assessment of theory of mind in children with communication disorders: role of presentation mode.

PubMed

van Buijsen, Marit; Hendriks, Angelique; Ketelaars, Mieke; Verhoeven, Ludo

2011-01-01

Children with communication disorders have problems with both language and social interaction. The theory-of-mind hypothesis provides an explanation for these problems, and different tests have been developed to test this hypothesis. However, different modes of presentation are used in these tasks, which make the results difficult to compare. In the present study, the performances of typically developing children, children with specific language impairments (SLI), and children with autism spectrum disorders (ASD) were therefore compared using three theory-of-mind tests (the Charlie test, the Smarties test, and the Sally-and-Anne test) presented in three different manners each (spoken, video, and line drawing modes). The results showed differential outcomes for the three types of tests and a significant interaction between group of children and mode of presentation. For the typically developing children, no differential effects of presentation mode were detected. For the children with SLI, the highest test scores were consistently evidenced in the line-drawing mode. For the children with ASD, test performance depended on the mode of presentation. Just how the children's non-verbal age, verbal age, and short-term memory related to their test scores was also explored for each group of children. The test scores of the SLI group correlated significantly with their short-term memory, those of the ASD group with their verbal age. These findings demonstrate that performance on theory-of-mind tests clearly depends upon mode of test presentation as well as the children's cognitive and linguistic abilities. Copyright © 2011 Elsevier Ltd. All rights reserved.

Comparative neurobehavioral study of a polybrominated biphenyl-exposed population in Michigan and a nonexposed group in Wisconsin.

PubMed Central

Valciukas, J A; Lilis, R; Wolff, M S; Anderson, H A

1978-01-01

An analysis of findings regarding the prevalence and time course of symptoms and the results of neurobehavioral testing among Michigan and Wisconsin dairy farmers is reported. Reviewed are: (1) differences in the prevalence of neurological symptoms at the time of examination; (2) differences in the incidence and time course of symptoms for the period 1972-1976; (3) differences among populations and subgroups (sex and age) regarding performance test scores; (4) correlations between performance test scores and neurological symptoms; and (5) correlations between serum PBB levels as indicators of exposure and performance tests and neurological symptoms. PMID:209977
A Comparison of Interactive Multimedia Instruction Designs Addressing Soldiers Learning Needs

DTIC Science & Technology

2016-03-01

tailored training group would have higher point gains from pretest to posttest for the less familiar content domain (Adjust Indirect Fire) compared to...and posttest scores and user experiences, a correlation analysis was conducted. Soldiers who tended to have higher scores on the pretest also had...Soldiers enrolled in the Warrior Leaders Course at Fort Benning, GA. All the IMI variations were associated with increased test scores on posttests, but
Articular cartilage scores in cranial cruciate ligament-deficient dogs with or without bucket handle tears of the medial meniscus.

PubMed

Kaufman, Kathryn; Beale, Brian S; Thames, Howard D; Saunders, W Brian

2017-01-01

To compare articular cartilage scores in cranial cruciate ligament (CCL)-deficient dogs with or without concurrent bucket handle tears (BHT) of the medial meniscus. Retrospective case series. Client-owned dogs treated with arthroscopy and tibial plateau leveling osteotomy or extracapsular repair for complete CCL rupture (290 stifles from 264 dogs). Medical records and arthroscopic images were reviewed. Medial femoral condyle (MFC) and medial tibial plateau (MTP) cartilage was scored using the modified Outerbridge scale. Periarticular osteophytosis (PAO) and injury to the medial meniscus were recorded. Data were analyzed using Student's t-tests, Wilcoxon rank-sum test, and Fisher's exact test for changes in the stifle based on meniscal condition, body weight, and duration of lameness. PAO, MFC, and MTP articular cartilage scores were not significantly different in dogs with or without BHT. There were no significant differences in MFC or MTP scores when dogs were evaluated based on bodyweight and the presence or absence of a BHT. However, PAO formation was significantly increased in dogs weighing >13.6 kg and concurrent meniscal injury vs. dogs weighing <13.6 kg and concurrent meniscal injury (P < .001). Significantly more stifles with chronic lameness (40 of 89; 44.9%) had the highest PAO score of 2 reported compared to only 42 of 182 stifles (23.1%) with acute lameness (P < .001). The presence of a BHT of the medial meniscus was not associated with more severe arthroscopic articular cartilage lesions in the medial joint compartment at the time of surgery. © 2016 The American College of Veterinary Surgeons.
Urdu translation of the Hamilton Rating Scale for Depression: Results of a validation study

PubMed Central

Hashmi, Ali M.; Naz, Shahana; Asif, Aftab; Khawaja, Imran S.

2016-01-01

Objective: To develop a standardized validated version of the Hamilton Rating Scale for Depression (HAM-D) in Urdu. Methods: After translation of the HAM-D into the Urdu language following standard guidelines, the final Urdu version (HAM-D-U) was administered to 160 depressed outpatients. Inter-item correlation was assessed by calculating Cronbach alpha. Correlation between HAM-D-U scores at baseline and after a 2-week interval was evaluated for test-retest reliability. Moreover, scores of two clinicians on HAM-D-U were compared for inter-rater reliability. For establishing concurrent validity, scores of HAM-D-U and BDI-U were compared by using Spearman correlation coefficient. The study was conducted at Mayo Hospital, Lahore, from May to December 2014. Results: The Cronbach alpha for HAM-D-U was 0.71. Composite scores for HAM-D-U at baseline and after a 2-week interval were also highly correlated with each other (Spearman correlation coefficient 0.83, p-value < 0.01) indicating good test-retest reliability. Composite scores for HAM-D-U and BDI-U were positively correlated with each other (Spearman correlation coefficient 0.85, p < 0.01) indicating good concurrent validity. Scores of two clinicians for HAM-D-U were also positively correlated (Spearman correlation coefficient 0.82, p-value < 0.01) indicating good inter-rater reliability. Conclusion: The HAM-D-U is a valid and reliable instrument for the assessment of Depression. It shows good inter-rater and test-retest reliability. The HAM-D-U can be a tool either for clinical management or research. PMID:28083049
Urdu translation of the Hamilton Rating Scale for Depression: Results of a validation study.

PubMed

Hashmi, Ali M; Naz, Shahana; Asif, Aftab; Khawaja, Imran S

2016-01-01

To develop a standardized validated version of the Hamilton Rating Scale for Depression (HAM-D) in Urdu. After translation of the HAM-D into the Urdu language following standard guidelines, the final Urdu version (HAM-D-U) was administered to 160 depressed outpatients. Inter-item correlation was assessed by calculating Cronbach alpha. Correlation between HAM-D-U scores at baseline and after a 2-week interval was evaluated for test-retest reliability. Moreover, scores of two clinicians on HAM-D-U were compared for inter-rater reliability. For establishing concurrent validity, scores of HAM-D-U and BDI-U were compared by using Spearman correlation coefficient. The study was conducted at Mayo Hospital, Lahore, from May to December 2014. The Cronbach alpha for HAM-D-U was 0.71. Composite scores for HAM-D-U at baseline and after a 2-week interval were also highly correlated with each other (Spearman correlation coefficient 0.83, p-value < 0.01) indicating good test-retest reliability. Composite scores for HAM-D-U and BDI-U were positively correlated with each other (Spearman correlation coefficient 0.85, p < 0.01) indicating good concurrent validity. Scores of two clinicians for HAM-D-U were also positively correlated (Spearman correlation coefficient 0.82, p-value < 0.01) indicating good inter-rater reliability. The HAM-D-U is a valid and reliable instrument for the assessment of Depression. It shows good inter-rater and test-retest reliability. The HAM-D-U can be a tool either for clinical management or research.
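For readers unfamiliar with the internal-consistency statistic reported in this validation study, the sketch below shows the standard Cronbach alpha formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total scores). It is an illustration only, not the authors' analysis; the function name and the response matrix are hypothetical.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores)),
    using sample variances throughout.
    """
    k = len(responses[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    item_variances = [variance([row[i] for row in responses]) for i in range(k)]
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


if __name__ == "__main__":
    # Hypothetical ratings: 4 respondents x 3 items that move together perfectly.
    toy_responses = [
        [1, 2, 3],
        [2, 3, 4],
        [3, 4, 5],
        [4, 5, 6],
    ]
    print("alpha:", cronbach_alpha(toy_responses))
```

Items that rise and fall together push alpha toward 1, while unrelated items pull it toward 0; the 0.71 reported above sits in the range usually described as acceptable.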
Measurement of Nonverbal IQ in Autism Spectrum Disorder: Scores in Young Adulthood compared to Early Childhood

PubMed Central

Bishop, Somer L.; Farmer, Cristan; Thurm, Audrey

2014-01-01

Nonverbal IQ (NVIQ) was examined in 84 individuals with ASD followed from age 2 to 19. Most adults who scored in the range of ID also received scores below 70 as children, and the majority of adults with scores in the average range had scored in this range by age 3. However, within the lower ranges of ability, actual scores declined from age 2 to 19, likely due in part to limitations of appropriate tests. Use of Vineland-II DLS scores in place of NVIQ did not statistically improve the correspondence between age 2 and age 19 scores. Clinicians and researchers should use caution when making comparisons based on exact scores or specific ability ranges within or across individuals with ASD of different ages. PMID:25239176
The impact of service-learning on cultural competence.

PubMed

Amerson, Roxanne

2010-01-01

Service-learning provides an excellent pedagogy for introducing students to clients of different cultural backgrounds, helping students become aware of the issues these clients face related to culture and health care, and teaching culturally appropriate care. The Transcultural Self-Efficacy Tool was used to evaluate self-perceived cultural competence in a convenience sample of 60 baccalaureate nursing students enrolled in a community health nursing course following the completion of service-learning projects with local and international communities. Pre- and posttests were analyzed based on total scores and subscale (cognitive, practical, and affective) scores. A paired-samples t test compared the mean pretest total score to the mean posttest total score, which demonstrated a significant increase. In addition, paired-samples t tests demonstrated a significant increase in each subscale.
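The pre/post comparison in this record rests on a paired-samples t test. As a minimal sketch, the statistic is the mean of the within-person differences divided by its standard error, with n − 1 degrees of freedom. The function name and scores below are hypothetical, and the p-value lookup is left out because it needs a t distribution that the Python standard library does not provide.

```python
import math
from statistics import mean, stdev


def paired_t(pre, post):
    """Return (t statistic, degrees of freedom) for paired pre/post scores."""
    if len(pre) != len(post):
        raise ValueError("pre and post must have the same length")
    diffs = [after - before for before, after in zip(pre, post)]
    n = len(diffs)
    standard_error = stdev(diffs) / math.sqrt(n)
    return mean(diffs) / standard_error, n - 1


if __name__ == "__main__":
    # Hypothetical total scores for four students before and after a project.
    pretest = [10, 12, 14, 16]
    posttest = [12, 15, 15, 19]
    t_stat, df = paired_t(pretest, posttest)
    print(f"t = {t_stat:.3f}, df = {df}")
```

Pairing matters because each student serves as their own control, so between-student variability drops out of the standard error.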
Male-female differences in Scoliosis Research Society-30 scores in adolescent idiopathic scoliosis.

PubMed

Roberts, David W; Savage, Jason W; Schwartz, Daniel G; Carreon, Leah Y; Sucato, Daniel J; Sanders, James O; Richards, Benjamin Stephens; Lenke, Lawrence G; Emans, John B; Parent, Stefan; Sarwark, John F

2011-01-01

Longitudinal cohort study. To compare functional outcomes between male and female patients before and after surgery for adolescent idiopathic scoliosis (AIS). There is no clear consensus in the existing literature with respect to sex differences in functional outcomes in the surgical treatment of AIS. A prospective, consecutive, multicenter database of patients who underwent surgical correction for adolescent idiopathic scoliosis was analyzed retrospectively. All patients completed Scoliosis Research Society-30 (SRS-30) questionnaires before and 2 years after surgery. Patients with previous spine surgery were excluded. Data were collected for sex, age, Risser grade, previous bracing history, maximum preoperative Cobb angle, curve correction at 2 years, and SRS-30 domain scores. Paired sample t tests were used to compare preoperative and postoperative scores within each sex. Independent sample t tests were used to compare scores between sexes. A P value of <0.05 was considered statistically significant. Seven hundred forty-four patients (621 females and 123 males) were included. On average, males were 1 year older than females. There were no differences between sexes in Risser grade, bracing history, maximum curve magnitude, or correction after surgery. Both males and females had similar improvement in all SRS-30 domains after surgery. Self-image/appearance had the greatest relative improvement. Males had better self-image/appearance scores preoperatively, better pain scores at 2 years, and better mental health and total scores both preoperatively and at 2 years. Both males and females were similarly satisfied with surgery. Males treated with surgery for AIS report better preoperative self-image, less postoperative pain, and better mental health than females. These differences may be clinically significant. For both males and females, the most beneficial effect of surgery is improved self-image/appearance. 
Overall, the benefits of surgery for AIS are similar for both sexes.
Resilience linked to personality dimensions, alexithymia and affective symptoms in motor functional neurological disorders.

PubMed

Jalilianhasanpour, Rozita; Williams, Benjamin; Gilman, Isabelle; Burke, Matthew J; Glass, Sean; Fricchione, Gregory L; Keshavan, Matcheri S; LaFrance, W Curt; Perez, David L

2018-04-01

Reduced resilience, a construct associated with maladaptive stress coping and a predisposing vulnerability for Functional Neurological Disorders (FND), has been under-studied compared to other neuropsychiatric factors in FND. This prospective case-control study investigated self-reported resilience in patients with FND compared to controls and examined relationships between resilience and affective symptoms, personality traits, alexithymia, health status and adverse life event burden. 50 individuals with motor FND and 47 healthy controls participated. A univariate test followed by a logistic regression analysis investigated group-level differences in Connor-Davidson Resilience Scale (CD-RISC) scores. For within-group analyses performed separately in patients with FND and controls, univariate screening tests followed by multivariate linear regression analyses examined factors associated with self-reported resilience. Adjusting for age, gender, education status, ethnicity and lifetime adverse event burden, patients with FND reported reduced resilience compared to controls. Within-group analyses in patients with FND showed that individual-differences in mental health, extraversion, conscientiousness, and openness positively correlated with CD-RISC scores; post-traumatic stress disorder symptom severity, depression, anxiety, alexithymia and neuroticism scores negatively correlated with CD-RISC scores. Extraversion independently predicted resilience scores in patients with FND. In control subjects, univariate associations were appreciated between CD-RISC scores and gender, personality traits, anxiety, alexithymia and physical health; conscientiousness independently predicted resilience in controls. Patients with FND reported reduced resilience, and CD-RISC scores covaried with other important predisposing vulnerabilities for the development of FND. Future research should investigate if the CD-RISC is predictive of clinical outcomes in patients with FND. 
Copyright © 2018 Elsevier Inc. All rights reserved.
The effect of allergic rhinitis on the degree of stress, fatigue and quality of life in OSA patients.

PubMed

Park, Cheol Eon; Shin, Seung Youp; Lee, Kun Hee; Cho, Joong Saeng; Kim, Sung Wan

2012-09-01

Both allergic rhinitis (AR) and obstructive sleep apnea (OSA) are known to increase stress and fatigue, but the result of their coexistence has not been studied. The objective of this study was to evaluate the amount of stress and fatigue when AR is combined with OSA. One hundred and twelve patients diagnosed with OSA by polysomnography were enrolled. Among them, 37 patients were diagnosed with AR by a skin prick test and symptoms (OSA-AR group) and 75 patients were classified into the OSA group since they tested negative for allergies. We evaluated the Epworth sleepiness scale (ESS), stress score, fatigue score, ability to cope with stress, and rhinosinusitis quality of life questionnaire (RQLQ) with questionnaires and statistically compared the scores of both groups. There were no significant differences in BMI and sleep parameters such as LSAT, AHI, and RERA between the two groups. However, the OSA-AR group showed a significantly higher ESS score compared to the OSA group (13.7 ± 4.7 vs. 9.3 ± 4.8). Fatigue scores were also significantly higher in the OSA-AR group than in the OSA group (39.8 ± 11.0 vs. 30.6 ± 5.4). The OSA-AR group had a significantly higher stress score (60.4 ± 18.6 vs. 51.2 ± 10.4). The ability to cope with stress was higher in the OSA group, although this difference was not statistically significant. RQLQ scores were higher in the OSA-AR group (60.2 ± 16.7 compared to 25.1 ± 13.9). In conclusion, management of allergic rhinitis is very important in treating OSA patients in order to reduce stress, fatigue, and daytime sleepiness and to improve quality of life.
Evaluation of ensemble forecast uncertainty using a new proper score: application to medium-range and seasonal forecasts

NASA Astrophysics Data System (ADS)

Christensen, Hannah; Moroz, Irene; Palmer, Tim

2015-04-01

Forecast verification is important across scientific disciplines as it provides a framework for evaluating the performance of a forecasting system. In the atmospheric sciences, probabilistic skill scores are often used for verification as they provide a way of unambiguously ranking the performance of different probabilistic forecasts. In order to be useful, a skill score must be proper -- it must encourage honesty in the forecaster, and reward forecasts which are reliable and which have good resolution. A new score, the Error-spread Score (ES), is proposed which is particularly suitable for evaluation of ensemble forecasts. It is formulated with respect to the moments of the forecast. The ES is confirmed to be a proper score, and is therefore sensitive to both resolution and reliability. The ES is tested on forecasts made using the Lorenz '96 system, and found to be useful for summarising the skill of the forecasts. The European Centre for Medium-Range Weather Forecasts (ECMWF) ensemble prediction system (EPS) is evaluated using the ES. Its performance is compared to a perfect statistical probabilistic forecast -- the ECMWF high resolution deterministic forecast dressed with the observed error distribution. This generates a forecast that is perfectly reliable if considered over all time, but which does not vary from day to day with the predictability of the atmospheric flow. The ES distinguishes between the dynamically reliable EPS forecasts and the statically reliable dressed deterministic forecasts. Other skill scores are tested and found to be comparatively insensitive to this desirable forecast quality. The ES is used to evaluate seasonal range ensemble forecasts made with the ECMWF System 4. The ensemble forecasts are found to be skilful when compared with climatological or persistence forecasts, though this skill is dependent on region and time of year.
A Causal-Comparative Study of the Affects of Benchmark Assessments on Middle Grades Science Achievement Scores

ERIC Educational Resources Information Center

Galloway, Melissa Ritchie

2016-01-01

The purpose of this causal comparative study was to test the theory of assessment that relates benchmark assessments to the Georgia middle grades science Criterion Referenced Competency Test (CRCT) percentages, controlling for schools who do not administer benchmark assessments versus schools who do administer benchmark assessments for all middle…
Development and validation of parenting measures for body image and eating patterns in childhood.

PubMed

Damiano, Stephanie R; Hart, Laura M; Paxton, Susan J

2015-01-01

Evidence-based parenting interventions are important in assisting parents to help their children develop healthy body image and eating patterns. To adequately assess the impact of parenting interventions, valid parent measures are required. The aim of this study was to develop and assess the validity and reliability of two new parent measures, the Parenting Intentions for Body image and Eating patterns in Childhood (Parenting Intentions BEC) and the Knowledge Test for Body image and Eating patterns in Childhood (Knowledge Test BEC). Participants were 27 professionals working in research or clinical treatment of body dissatisfaction or eating disorders, and 75 parents of children aged 2-6 years, who completed the measures via an online questionnaire. Seven scenarios were developed for the Parenting Intentions BEC to describe common experiences about the body and food that parents might need to respond to in front of their child. Parents ranked four behavioural intentions, derived from the current literature on parenting risk factors for body dissatisfaction and unhealthy eating patterns in children. Two subscales were created, one representing positive behavioural intentions, the other negative behavioural intentions. After piloting a larger pool of items, 13 statements were used to construct the Knowledge Test BEC. These were designed to be factual statements about the influence of parent language, media, family meals, healthy eating, and self-esteem on child eating and body image. The validity of both measures was tested by comparing parent and professional scores, and reliability was assessed by comparing parent scores over two testing occasions. Compared with parents, professionals reported significantly higher scores on the Positive Intentions subscale and significantly lower on the Negative Intentions subscale of the Parenting Intentions BEC; confirming the discriminant validity of six out of the seven scenarios. 
Test-retest reliability was also confirmed as parent scores on the two Parenting Intentions subscales did not differ over time. Eleven out of the 13 Knowledge Test items demonstrated sufficient discriminant validity and test-retest reliability. Overall, results indicated that the six-scenario Parenting Intentions BEC and the 11-item Knowledge Test BEC are valid and reliable measures for parents of young children.
Comparison of Static and Dynamic Balance in Female Collegiate Soccer, Basketball, and Gymnastics Athletes

PubMed Central

Bressel, Eadric; Yonker, Joshua C; Kras, John; Heath, Edward M

2007-01-01

Context: How athletes from different sports perform on balance tests is not well understood. When prescribing balance exercises to athletes in different sports, it may be important to recognize performance variations. Objective: To compare static and dynamic balance among collegiate athletes competing or training in soccer, basketball, and gymnastics. Design: A quasi-experimental, between-groups design. Independent variables included limb (dominant and nondominant) and sport played. Setting: A university athletic training facility. Patients or Other Participants: Thirty-four female volunteers who competed in National Collegiate Athletic Association Division I soccer (n = 11), basketball (n = 11), or gymnastics (n = 12). Intervention(s): To assess static balance, participants performed 3 stance variations (double leg, single leg, and tandem leg) on 2 surfaces (stiff and compliant). For assessment of dynamic balance, participants performed multidirectional maximal single-leg reaches from a unilateral base of support. Main Outcome Measure(s): Errors from the Balance Error Scoring System and normalized leg reach distances from the Star Excursion Balance Test were used to assess static and dynamic balance, respectively. Results: Balance Error Scoring System error scores for the gymnastics group were 55% lower than for the basketball group (P = .01), and Star Excursion Balance Test scores were 7% higher in the soccer group than the basketball group (P = .04). Conclusions: Gymnasts and soccer players did not differ in terms of static and dynamic balance. In contrast, basketball players displayed inferior static balance compared with gymnasts and inferior dynamic balance compared with soccer players. PMID:17597942
Generation of GHS Scores from TEST and online sources ...

EPA Pesticide Factsheets

Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat toxicity, developmental toxicity, endocrine activity, and mutagenicity. It can be used to evaluate ecotoxicity (in terms of acute fathead minnow toxicity) and fate (in terms of bioconcentration factor). It can also be used to estimate a variety of key physicochemical properties such as melting point, boiling point, vapor pressure, water solubility, and bioconcentration factor. A web-based version of T.E.S.T. is currently being developed to allow predictions to be made from other web tools. Online data sources such as NCCT’s Chemistry Dashboard, REACH dossiers, or ChemHat.org can also be utilized to obtain GHS (Globally Harmonized System) scores for comparing alternatives. The purpose of this talk is to show how GHS data can be obtained from literature sources and from T.E.S.T. These data will be used to compare chemical alternatives in the alternatives assessment dashboard (a 2018 CSS product).
A cross-national study of calculus

NASA Astrophysics Data System (ADS)

Chai, Jun; Friedler, Louis M.; Wolff, Edward F.; Li, Jun; Rhea, Karen

2015-05-01

The results from a cross-national study comparing calculus performance of students at East China Normal University (ECNU) in Shanghai and students at the University of Michigan before and after their first university calculus course are presented. Overall, ECNU significantly outperformed Michigan on both the pre- and post-tests, but the Michigan students showed a larger gain and normalized gain, and hence narrowed the gap. ECNU's superior performance was especially striking on the subset of problems requiring only a pre-calculus background. On those, Michigan's post-test scores were below ECNU's pre-test scores and, indeed, ECNU's higher performance on both the overall pre-test and overall post-test is attributable to its success on these problems.
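The abstract above contrasts raw gain with normalized gain but does not give a formula. A minimal sketch, assuming the widely used definition in which the gain is divided by the maximum possible improvement, g = (post − pre) / (max − pre), is shown below. The function name, the 100-point scale, and the sample scores are hypothetical and are not taken from the study.

```python
def normalized_gain(pre, post, max_score=100.0):
    """Fraction of the available improvement that was actually achieved.

    Assumes the common definition g = (post - pre) / (max_score - pre);
    the source abstract does not state which definition it used.
    """
    if pre >= max_score:
        raise ValueError("pre-test score leaves no room for improvement")
    return (post - pre) / (max_score - pre)


if __name__ == "__main__":
    # Hypothetical class averages on a 100-point test.
    for name, pre, post in [("Group A", 80.0, 90.0), ("Group B", 40.0, 70.0)]:
        print(name, "raw gain:", post - pre,
              "normalized gain:", normalized_gain(pre, post))
```

Normalizing by the headroom above the pre-test score lets groups with very different starting points be compared, since a group starting near the ceiling cannot post a large raw gain.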
Test-Retest Reliability and Minimal Detectable Change of Randomized Dichotic Digits in Learning-Disabled Children: Implications for Dichotic Listening Training.

PubMed

Mahdavi, Mohammad Ebrahim; Pourbakht, Akram; Parand, Akram; Jalaie, Shohreh

2018-03-01

Evaluation of dichotic listening to digits is a common part of many studies for diagnosing and managing auditory processing disorders in children. Previous researchers have verified test-retest relative reliability of dichotic digits results in normal children and adults. However, detecting intervention-related changes in the ear scores after dichotic listening training requires information regarding trial-to-trial typical variation of individual ear scores that is estimated using indices of absolute reliability. Previous studies have not addressed absolute reliability of dichotic listening results. This study aimed to compare the results of the Persian randomized dichotic digits test (PRDDT) and its relative and absolute indices of reliability between typical achieving (TA) and learning-disabled (LD) children. It was a repeated-measures observational study. Fifteen LD children were recruited from a previously performed study with age range of 7-12 yr. The control group consisted of 15 TA schoolchildren with age range of 8-11 yr. The Persian randomized dichotic digits test was administered to the children under free recall condition in two test sessions 7-12 days apart. We compared the average of the ear scores and ear advantage between TA and LD children. Relative indices of reliability included Pearson's correlation and intraclass correlation (ICC 2,1) coefficients, and absolute reliability was evaluated by calculation of standard error of measurement (SEM) and minimal detectable change (MDC) using the raw ear scores. The Pearson correlation coefficient indicated that in both groups of children the ear scores of test and retest sessions were strongly and positively (greater than +0.8) correlated. The ear scores showed excellent ICC coefficient of consistency (0.78-0.82) and fair to excellent ICC coefficient of absolute agreement (0.62-0.74) in TA children and excellent ICC coefficients of consistency and absolute agreement in LD children (0.76-0.87).
SEM and SEM% of the ear scores in TA children were 1.46 and 1.44% for the right ear and 4.68 and 5.47% for the left ear. SEM and SEM% of the ear scores in LD children were 4.55 and 5.88% for the right ear and 7.56 and 12.81% for the left ear. MDC and MDC% of the ear scores in TA children varied from 4.03 and 3.99% for the right ear to 12.93 and 15.13% for the left ear. MDC and MDC% of the ear scores in LD children varied from 12.57 and 16.25% for the right ear to 20.89 and 35.39% for the left ear. The LD children showed test-retest relative reliability as high as TA children in the ear scores measured by PRDDT. However, within-subject variations of the ear scores calculated by indices of absolute reliability were considerably higher in LD children versus TA children. The results of the current study could have implications for detecting real training-related changes in the ear scores. American Academy of Audiology
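The abstract reports SEM and MDC values but not the formulas behind them. As a sketch under stated assumptions, the Python below uses the conventional definitions SEM = SD × sqrt(1 − ICC) and MDC at 95% confidence = 1.96 × sqrt(2) × SEM. Whether the authors used exactly these forms is an assumption, and the function names are hypothetical. Applied to the reported SEM values, this formula lands close to the reported MDC values, with small differences that may reflect rounding.

```python
import math


def sem_from_sd_icc(sd, icc):
    """Standard error of measurement from a score SD and an ICC.

    Conventional definition (assumed here): SEM = SD * sqrt(1 - ICC).
    """
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must be in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def mdc95(sem):
    """Minimal detectable change at 95% confidence (assumed formula).

    MDC95 = 1.96 * sqrt(2) * SEM; sqrt(2) accounts for measurement
    error being present in both the test and the retest score.
    """
    return 1.96 * math.sqrt(2.0) * sem


# SEM and MDC values as reported in the abstract above.
reported = {
    "TA right ear": (1.46, 4.03),
    "TA left ear": (4.68, 12.93),
    "LD right ear": (4.55, 12.57),
    "LD left ear": (7.56, 20.89),
}

for label, (sem, reported_mdc) in reported.items():
    print(f"{label}: computed MDC95 = {mdc95(sem):.2f}, reported = {reported_mdc}")
```

Read this way, a change in an individual ear score smaller than the MDC cannot be distinguished from ordinary test-retest variation, which is why the larger MDC values in LD children matter for judging training effects.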
The Four-Day School Week: An Examination of Long-Term Student Achievement at the Middle and Secondary Levels

ERIC Educational Resources Information Center

Fagergren, Peter J.

2003-01-01

Academic achievement under a four-day school week is compared to the traditional five-day school week. Test scores from the CAT [California Achievement Test], ITBS [Iowa Test of Basic Skills], TASK [Stanford Test of Academic Skills], SAT [Stanford Achievement Test], TAP [Tests of Academic Proficiency], and MAT [Metropolitan Achievement Test] were…
Pulmonary outcome in former preterm, very low birth weight children with bronchopulmonary dysplasia: a case-control follow-up at school age.

PubMed

Vom Hove, Maike; Prenzel, Freerk; Uhlig, Holm H; Robel-Tillig, Eva

2014-01-01

To assess and compare long-term pulmonary outcomes in former preterm-born, very low birth weight (VLBW) children with and without bronchopulmonary dysplasia (BPD) born in the surfactant era. Pulmonary function tests (ie, spirometry, body plethysmography, and gas transfer testing) were performed in children with a history of VLBW and BPD (n = 28) and compared with a matched preterm-born VLBW control group (n = 28). Medical history was evaluated by questionnaire. At time of follow-up (mean age, 9.5 years), respiratory symptoms (36% vs 8%) and receipt of asthma medication (21% vs 0%) were significantly more frequent in the preterm-born children with previous BPD than in those with no history of BPD. The children with a history of BPD had significantly lower values for forced expiratory volume in 1 second (z-score -1.27 vs -0.4; P = .008), forced vital capacity (z-score -1.39 vs -0.71; P = .022), and forced expiratory flow rate at 50% of forced vital capacity (z-score -2.21 vs -1.04; P = .048) compared with the preterm control group. Preterm-born children with a history of BPD are significantly more likely to have lung function abnormalities, such as airway obstruction and respiratory symptoms, at school age compared with preterm-born children without BPD. Copyright © 2014 Mosby, Inc. All rights reserved.
Comparison of U.S. Geological Survey and Ohio Environmental Protection Agency fish-collection methods using the index of biotic integrity and modified index of well-being, 1996-97

USGS Publications Warehouse

Covert, S. Alex

2001-01-01

The U.S. Geological Survey (USGS) and Ohio Environmental Protection Agency (OEPA) collected data on fish from 10 stream sites in 1996 and 3 stream sites in 1997 as part of a comparative study of fish community assessment methods. The sites sampled represent a wide range of basin sizes (ranging from 132–6,330 square kilometers) and surrounding land-use types (urban, agricultural, and mixed). Each agency used its own fish-sampling protocol. Using the Index of Biotic Integrity and Modified Index of Well-Being, differences between data sets were tested for significance by means of the Wilcoxon signed-ranks test (α = 0.05). Results showed that the median of Index of Biotic Integrity differences between data sets was not significantly different from zero (p = 0.2521); however, the same statistical test showed the median differences in the Modified Index of Well-Being scores to be significantly different from zero (p = 0.0158). The differences observed in the Index of Biotic Integrity scores are likely due to natural variability, increased variability at sites with degraded water quality, differences in sampling methods, and low-end adjustments in the Index of Biotic Integrity calculation when fewer than 50 fish were collected. The Modified Index of Well-Being scores calculated by OEPA were significantly higher than those calculated by the USGS. This finding was attributed to the comparatively large numbers and biomass of fish collected by the OEPA. By combining the two indices and viewing them in terms of the percentage attainment of Ohio Warmwater Habitat criteria, the two agencies' data seemed comparable, although the Index of Biotic Integrity scores were more similar than the Modified Index of Well-Being scores.

Functional Capacity and Quality of Life in Patients with Chronic Kidney Disease In Pre-Dialytic Treatment and on Hemodialysis--A Cross sectional study.

PubMed

Fassbinder, Tânia Regina Cavinatto; Winkelmann, Eliane Roseli; Schneider, Juliana; Wendland, Juliana; Oliveira, Olvânia Basso de

2015-01-01

Chronic kidney disease (CKD) interferes directly with functional capacity, independence, and therefore quality of life (QOL). To compare the physical fitness and quality of life of patients with chronic kidney disease on hemodialysis (G1) and in predialysis treatment (G2). A cross-sectional study of 54 patients with CKD: 27 in the G1 group (58.15 ± 10.84 years) and 27 in the G2 group (62.04 ± 16.56 years). Cardiovascular risk factors and anthropometric measurements were recorded. Respiratory muscle strength was measured with a manometer as maximal inspiratory pressure (MIP) and maximal expiratory pressure (MEP). Patients also completed the six-minute walk test (TC6'), a cardiopulmonary exercise test, the one-minute sit-to-stand test (TSL1'), and the Short-Form questionnaire (SF-36) to assess QOL. Disease stage ranged from 2 to 5. The Kolmogorov-Smirnov test was applied to check normality; Student's t test or the Mann-Whitney U test was used to compare the means of quantitative variables, and Pearson's chi-square test and Fisher's exact test were used for qualitative variables. Pearson's or Spearman's test was used to identify correlations. No statistically significant difference was found between G1 and G2 in VO2peak (p = 0.259), TC6' (p = 0.433), or MIPmax (p = 0.158); a significant difference appeared only in MEPmax (p = 0.024, for G1). The SF-36 scores in both groups indicated a worse health status, as evidenced by the low QOL scores. Patients with CKD had reduced functional capacity and QOL, and hemodialysis did not show statistically significant negative repercussions when compared with pre-dialysis patients.
Changing physician knowledge, attitudes, and beliefs about migraine: evaluation of a new educational intervention.

PubMed

Patwardhan, Meenal B; Samsa, Gregory P; Lipton, Richard B; Matchar, David B

2006-05-01

Use a presurvey of primary care providers (PCPs) enrolled in a continuing medical education (CME) program on headache management to ascertain their existing knowledge, attitudes, and beliefs regarding migraine and use a postsurvey to determine the extent to which the CME program has brought participant knowledge, attitudes, and skills closer to conformance with best evidence. Migraine is a common and debilitating condition, which PCPs may not always manage satisfactorily. In an effort to improve management, the American Headache Society has developed a CME program called BRAINSTORM that encourages PCPs to adopt the US Headache Consortium Guidelines for headache care. A 20-item questionnaire was developed that covered the essential elements of migraine care. The questionnaire was administered before and after a BRAINSTORM presentation to 254 consenting primary care clinicians attending a medical meeting at 1 of 6 sites. A control group of 112 comparable physicians who did not attend the presentation completed the same questionnaire. Prepresentation scores of attendees were compared to scores of nonattendees to assess the generalizability of results. Prepresentation scores on selected questions were used to assess participant baseline knowledge, attitudes, and beliefs. Pre- and postpresentation scores for attendees at all sites were compared using the Mantel-Haenszel statistic to assess the effectiveness of the BRAINSTORM CME. Pre- and postpresentation scores were compared by site using the Breslow-Day test to evaluate any differential impact based on CME location. Prepresentation scores of attendees and nonattendees were found to be similar. No significant difference in performance was noted across sites. A chi-square analysis revealed a statistically significant difference between pre- and postpresentation scores for 16 of the test's 20 questions. 
In the pretest, all participants scored <66% on 2 questions related to prevalence, impact, and pathophysiology of migraine, 2 questions pertaining to history taking/physical examination, and 3 migraine management questions. Attendee scores improved to >66% posttest on all except 2 questions related to prevalence, impact, and pathophysiology of migraine. Our results indicate that PCPs need to acquire greater understanding about the epidemiology and pathophysiology of migraine and may require guidance in history taking and physical examination of migraine patients. Improvement in scores posttest confirms that the BRAINSTORM program has a significant immediate impact on the knowledge, beliefs, and attitudes of participants. The program could be strengthened to improve emphasis in some areas where posttest scores showed no improvement.
Teaching peroral endoscopic myotomy (POEM) to surgeons in practice: an "into the fire" pre/post-test curriculum.

PubMed

Kishiki, Tomokazu; Lapin, Brittany; Wang, Chi; Jonson, Brandon; Patel, Lava; Zapf, Matthew; Gitelis, Matthew; Cassera, Maria A; Swanström, Lee L; Ujiki, Michael B

2018-03-01

With the increasing adoption of peroral endoscopic myotomy (POEM) as a first-line therapy for achalasia as well as a growing list of other indications, it is apparent that there is a need for effective training methods for both endoscopists in training and those already in practice. We present a hands-on-focused curriculum with pre- and post-testing methodology to teach these skills. Six POEM courses were taught by 11 experienced POEM endoscopists at two independent simulation laboratories. The training curriculum included a pre-training test, lectures and discussion, mentored hands-on instruction using live porcine and ex-plant models, and a post-training test. The scoring sheet for the pre- and post-tests assessed the POEM performance with a Likert-like scale measuring equipment setup, mucosotomy creation, endoscope navigation, visualization, myotomy, and closure. Participants were stratified by their experience with upper-GI endoscopy (Novices <100 cases vs. Experts ≥100 cases), and their data were analyzed and compared. Sixty-five participants with varying degrees of experience in upper-GI endoscopy and laparoscopic achalasia cases completed the training curriculum. Participants improved knowledge scores from 69.7 ± 17.1 (pre-test) to 87.7 ± 10.8 (post-test) (p < 0.01). POEM performance increased from 15.1 ± 5.1 to 25.0 ± 5.5 (out of 30) (p < 0.01) with the greatest gains in mucosotomy [1.7-4.4 (out of 5), p < 0.01] and equipment (3.4-4.7, p < 0.01). Novices had significantly lower pre-test scores compared with Experts in upper-GI endoscopy (overall pre-score: 11.9 ± 5.6 vs. 16.3 ± 4.6, p < 0.01). Both groups improved significantly after the course, and there were no differences in post-test scores (overall post-score: 23.9 ± 6.6 vs. 25.4 ± 5.1, p = 0.34) between Novices and Experts. A multimodal curriculum with procedural practice was an effective curricular design for teaching POEM to practitioners. 
The curriculum was specifically helpful for training surgeons with less upper-GI endoscopy experience.
The Association Between Unintended Births and Poor Child Development in India: Evidence from a Longitudinal Study.

PubMed

Singh, Abhishek; Upadhyay, Ashish Kumar; Singh, Ashish; Kumar, Kaushalendra

2017-03-01

Evidence on the association between unintended births and poor child development in developing countries is limited. We used data from three waves of the Young Lives study on childhood poverty conducted in Andhra Pradesh in 2002, 2006-07, and 2009 to examine the association between unintended births and poor child development in India. Multivariable linear regression models were used to examine the association between unintended births and four indicators of child development: height-for-age Z-score (HAZ), Peabody Picture Vocabulary Test (PPVT) score, Mathematics Achievement Test (MAT) score, and Early Grade Reading Assessment (EGRA) test score. The Propensity Score Matching (PSM) technique was also used to analyze data. Children who were reported as unintended at birth had significantly lower HAZ, PPVT, and EGRA scores compared with those who were reported as intended. PSM results support the findings from the multivariable linear regressions. Our findings provide evidence on the association between unintended births and poor child development in India. There may be a need to reposition family planning within India's reproductive and child health care programs. Future studies must take into account the unobserved heterogeneity that our study could not address fully. © 2017 The Population Council, Inc.
Demonstrating the validity of three general scores of PET in predicting higher education achievement in Israel.

PubMed

Oren, Carmel; Kennet-Cohen, Tamar; Turvall, Elliot; Allalouf, Avi

2014-01-01

The Psychometric Entrance Test (PET), used for admission to higher education in Israel together with the Matriculation (Bagrut), had in the past one general (total) score in which the weights for its domains (Verbal, Quantitative, and English) were 2:2:1, respectively. In 2011, two additional total scores were introduced, with different weights for the Verbal and the Quantitative domains. This study compares the predictive validity of the three general scores of PET, and demonstrates validity in terms of utility. The sample comprised 100,863 freshman students from all Israeli universities in the classes of 2005-2009. Regression weights and correlations of the predictors with first-year grade point average (FYGPA) were computed. Simulations based on these results supplied the utility estimates. On average, PET is slightly more predictive than the Bagrut; using them both yields a better tool than either of them alone. Assigning differential weights to the components in the respective schools further improves the validity. The introduction of the new general scores of PET is validated by gathering and analyzing evidence based on relations of test scores to other variables. The utility of using the test can be demonstrated in ways different from correlations.
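To make the 2:2:1 weighting concrete, the Python below sketches a general score as a weighted mean of the three domain scores. This is an illustration only: the abstract does not say how PET combines or scales its domains, so the weighted-mean form, the function name, and the sample domain scores are assumptions. The alternative weights (3, 1, 1) are hypothetical, since the abstract does not state the weights of the two scores introduced in 2011.

```python
def composite_score(verbal, quantitative, english, weights=(2, 2, 1)):
    """Weighted mean of three domain scores.

    The default weights follow the 2:2:1 ratio mentioned in the abstract;
    treating the general score as a plain weighted mean is an assumption.
    """
    w_verbal, w_quant, w_english = weights
    total_weight = w_verbal + w_quant + w_english
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    weighted_sum = (w_verbal * verbal
                    + w_quant * quantitative
                    + w_english * english)
    return weighted_sum / total_weight


# Hypothetical domain scores for one examinee.
verbal, quantitative, english = 120, 100, 80

print(composite_score(verbal, quantitative, english))             # 2:2:1 weights
print(composite_score(verbal, quantitative, english, (3, 1, 1)))  # hypothetical verbal-heavy weights
```

The same examinee receives a different general score under each weighting, which is why the predictive validity of each weighting scheme has to be checked separately.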
Inter-rater Agreement on Final Competency Testing Utilizing Standardized Patients.

PubMed

Bowman, Dixie H; Ferber, Kyle L; Sima, Adam P

2016-01-01

The purpose of this study was to determine whether licensed physical therapists (n=8) serving as standardized patients (SPs) for practical examinations evaluate physical therapy students (n=51) equivalently to the physical therapy course instructor (n=1). The SPs completed the same assessment based on the evaluation criteria as did the instructor. The scores for the practical examination, answers to three questions, and the documentation note were summarized separately for the SP and the instructor by means and standard deviations. A paired t-test and an intraclass correlation coefficient (ICC) for each aspect of the score were calculated. ICC(1,1) values were reported along with corresponding 95% confidence intervals. The instructor had significantly higher scores for the practical exam and the overall score compared to the ratings from the SPs. No differences were observed between the instructor and SP scores on the three answers to the questions and documentation note scores. Based on the ICC values identified in this study, a physical therapist serving as an SP may not be an adequate replacement for an instructor when it comes to grading physical therapy students on all aspects of their competency tests.
Cognitive performance and aphasia recovery.

PubMed

Fonseca, José; Raposo, Ana; Martins, Isabel Pavão

2018-03-01

Objectives This study assessed cognitive performance of subjects with aphasia during the acute stage of stroke and evaluated how such performance relates to recovery at 3 months. Materials & methods Patients with aphasia following a left hemisphere stroke were evaluated during the first (baseline) and the fourth-month post onset. Assessment comprised non-verbal tests of attention/processing speed (Symbol Search, Cancelation Task), executive functioning (Matrix Reasoning, Tower of Hanoi, Clock Drawing, Motor Initiative), semantic (Camel and Cactus Test), episodic and immediate memory (Memory for Faces Test, 5 Objects Memory Test, and Spatial Span). Recovery was measured by the Token Test score at 3 months. The impact of baseline performance on recovery was evaluated by logistic regression adjusting for age, education, severity of aphasia and the Alberta Stroke Program Early CT (ASPECT) score. Results Thirty-nine subjects (with a mean of 66.5 ± 10.6 years of age, 17 men) were included. Average baseline cognitive performance was within normal range in all tests except in memory tests (semantic, episodic and immediate memory) for which scores were ≤ -1.5 SD. Subjects with poor aphasia recovery (N = 27) were older and had fewer years of formal education but had identical ASPECT score compared to those with favorable recovery. Considering each test individually, the score obtained on the Matrix Reasoning test was the only one to predict aphasia recovery (Exp(B) = 24.085, p = 0.038). Conclusions The Matrix Reasoning Test may contribute to predict aphasia recovery. Cognitive performance is a measure of network disruption but may also indicate the availability of recovery strategies.
Selecting Value-Added Models for Postsecondary Institutional Assessment

ERIC Educational Resources Information Center

Steedle, Jeffrey T.

2012-01-01

Value-added scores from tests of college learning indicate how score gains compare to those expected from students of similar entering academic ability. Unfortunately, the choice of value-added model can impact results, and this makes it difficult to determine which results to trust. The research presented here demonstrates how value-added models…
The Comparability of Three Wechsler Adult Intelligence Scales in a College Sample.

ERIC Educational Resources Information Center

Quereshi, M. Y.; Ostrowski, Michael J.

1985-01-01

Administered three Wechsler adult intelligence scales to 72 undergraduates and tested the equality of means, variances, and covariances, utilizing subtest scale scores and IQs. Results indicated that the three scales were not parallel. Generally, the subtest scaled scores exhibited less similarity across the three scales than the IQ estimates.…
ESEA Title I Linking Project. Final Report.

ERIC Educational Resources Information Center

Holmes, Susan E.

The Rasch model for test score equating was compared with three other equating procedures as methods for implementing the norm referenced method (RMC Model A) of evaluating ESEA Title I projects. The Rasch model and its theoretical limitations were described. The three other equating methods used were: linear observed score equating, linear true…
Dexterity and Bench Assembly Work Productivity in Adults with Mild Mental Retardation.

ERIC Educational Resources Information Center

Serr, Russell; And Others

1994-01-01

This study compared dexterity scores using the Vocational Transit Test System and bench assembly work productivity in 30 adults with mild mental retardation. Moderately high correlations were found between work output and motor coordination, manual dexterity, finger dexterity (with and without assembly), and total dexterity score. Finger dexterity…
The IGAP and the ITBS: A Comparative Study.

ERIC Educational Resources Information Center

Perlman, Carole L.; And Others

This study was designed to examine the extent to which Illinois Goal Assessment Program (IGAP) constructing meaning scores correlate with Iowa Tests of Basic Skills (ITBS) reading scores and with performance on ITBS items dealing with literal meaning, inferences, and generalizations. In addition, this study assessed the ability of the IGAP reading…
The Data Dilemma: Reporting in the Era of Federal Stimulus

ERIC Educational Resources Information Center

Weil, Marty

2009-01-01

Department of Education (DOE) secretary Arne Duncan has put school districts on notice: They must show how their students' scores on state tests compare with their scores in national evaluations. The Obama administration also wants states to track and disclose longitudinal data, according to a recent Wall Street Journal report. Of course,…
A Survey of Professional Licensure Examinations in Texas.

ERIC Educational Resources Information Center

Texas Coll. and Univ. System, Austin. Coordinating Board.

A determination was made of how graduates of Texas professional education programs perform on licensure examinations in comparison with their counterparts in other states. Test scores of Texas graduates were compared with national norms and averages in other states, when available, as well as with scores of graduates of out-of-state programs who…
Interactive laboratory classes enhance neurophysiological knowledge in Thai medical students.

PubMed

Wongjarupong, Nicha; Niyomnaitham, Danai; Vilaisaktipakorn, Pitchamol; Suksiriworaboot, Tanawin; Qureshi, Shaun Peter; Bongsebandhu-Phubhakdi, Saknan

2018-03-01

Interactive laboratory class (ILC) is a two-way communication teaching method that encourages students to correlate laboratory findings with materials from lectures. In Thai medical education, active learning methods are uncommon. This paper aims to establish 1) if ILCs would effectively promote physiology learning; 2) if effectiveness would be found in both previously academically high-performing and low-performing students; and 3) the acceptability of ILCs to Thai medical students as a novel learning method. Two hundred seventy-eight second-year medical students were recruited to this study. We conducted three ILC sessions, which followed corresponding lectures. We carried out multiple-choice pre- and post-ILC assessments of knowledge and compared by repeated-measures ANOVA and unpaired t-test. Subgroup analysis was performed to compare high-performance (HighP) and low-performance (LowP) students. After the ILCs, participants self-rated their knowledge and satisfaction. Post-ILC test scores increased significantly compared with pre-ILC test scores in all three sessions. Mean scores of each post-ILC test increased significantly from pre-ILC test in both LowP and HighP groups. More students self-reported a "very high" and "high" level of knowledge after ILCs. Most students agreed that ILCs provided more discussion opportunity, motivated their learning, and made lessons more enjoyable. As an adjunct to lectures, ILCs can enhance knowledge in medical students, regardless of previous academic performance. Students perceived ILC as useful and acceptable. This study supports the active learning methods in physiology education, regardless of cultural context.
Rescore protein-protein docked ensembles with an interface contact statistics.

PubMed

Mezei, Mihaly

2017-02-01

The recently developed statistical measure for the type of residue-residue contact at protein complex interfaces, based on a parameter-free definition of contact, has been used to define a contact score that is correlated with the likelihood of correctness of a proposed complex structure. Comparing the proposed contact scores on the native structure and on a set of model structures, the proposed measure was shown to generally favor the native structure but in itself was not able to reliably score the native structure to be the best. Adjusting the scores of redocking experiments with the contact score showed that the adjusted score was able to move up the ranking of the native-like structure among the proposed complexes when the native-like structure was not ranked the best by the respective program. Tests on docking of unbound proteins compared the contact scores of the complexes with the contact score of the crystal structure, again showing the tendency of the contact score to favor native-like conformations. The possibility of using the contact score to improve the determination of biological dimers in a crystal structure was also explored. Proteins 2017; 85:235-241. © 2016 Wiley Periodicals, Inc.
Empirical Implications of Matching Children With Specific Language Impairment to Children With Typical Development on Nonverbal IQ.

PubMed

Earle, F Sayako; Gallinat, Erica L; Grela, Bernard G; Lehto, Alexa; Spaulding, Tammie J

This study determined the effect of matching children with specific language impairment (SLI) and their peers with typical development (TD) for nonverbal IQ on the IQ test scores of the resultant groups. Studies published between January 2000 and May 2012 reporting standard nonverbal IQ scores for SLI and age-matched TD controls were categorized into those that matched and did not match children with SLI and TD on nonverbal IQ. We then compared the nonverbal IQ scores across matching criteria within each diagnostic category. In studies that matched children on nonverbal IQ, children with SLI scored significantly higher on nonverbal IQ tests relative to children with SLI in studies that did not match on this criterion. Therefore, it appears that the nonverbal IQ performance of children with SLI is not comparable across studies that do and do not match samples on nonverbal IQ. This suggests that the practice of nonverbal IQ matching may have unintended consequences for the generalization of research findings to the broader SLI population.
Relationships between different nutritional anthropometric statuses and health-related fitness of South African primary school children.

PubMed

Armstrong, M E G; Lambert, M I; Lambert, E V

2017-05-01

A double burden of both under- and over-nutrition exists among South African children. To describe associations between nutritional statuses and health-related fitness test performances. Height and weight of 10 285 children (6-13 years; n = 5604 boys and 4681 girls) were measured and used to calculate body mass index (BMI) and prevalence of overweight and obesity, stunting, wasting and underweight. Physical fitness scores for standing long jump, shuttle run, sit-and-reach, sit-up (EUROFIT) and cricket ball throw were assessed. Age- and gender-specific z-scores were calculated for these variables. Physical fitness for each nutritional status group was compared to children of normal weight. Compared to normal weight children, overweight and obese children scored lower on all fitness tests (p < .001), except cricket ball throw (p = .235) and sit-and-reach (p = .015). Stunted and underweight children performed poorer than normal weight children on most fitness tests (p < .001), except sit-and-reach (stunted: p = .829; underweight: p = .538) and shuttle run (underweight: p = .017). Performance of wasted children was not as highly compromised as other under-nourished groups, but they performed poorer on the cricket ball throw (p < .001). When compared to normal weight children, both under- and over-nourished children performed poorer on some, but not all, health-related fitness tests.
Relationship Suicide, Cognitive Functions, and Depression in Patients with Schizophrenia

PubMed Central

KOCATÜRK, Bülent Kenan; EŞSİZOĞLU, Altan; AKSARAY, Gökay; AKARSU, Ferdane Özlem; MUSMUL, Ahmet

2015-01-01

Introduction The aim of this study was to compare schizophrenic patients with and without a suicide attempt history in terms of sociodemographic and clinical features and cognitive functions and to determine the predictive factors for suicide attempt history. Methods In this study, we assessed and compared 70 patients with schizophrenia: 27 patients with a suicide attempt history and 43 patients without a suicide attempt history. The cognitive functions of patients were assessed by the Stroop test, Wisconsin Card Sorting Test (WCST), and Rey Auditory Verbal Learning Test. In order to evaluate clinical symptoms, the Positive and Negative Syndrome Scale (PANSS) and Calgary Depression Scale for Schizophrenia (CDSS) were used. Results In this study, the number of hospitalizations, PANSS general psychopathology subscale score, CDSS total score, suicide item score, and WCST total number of responses (WCST1) were significantly higher among the patients with a suicide attempt history. The WCST1 and CDSS total scores were predicted using the suicide attempt history. Conclusion Revealing the factors related to suicidal behavior in patients with schizophrenia contributes to the prevention of suicide. Studies with long-term follow-up and with a larger sample group are required to investigate the relationship between suicide, depression, and cognitive impairment, which is one of the core symptoms of schizophrenia. PMID:28360699
Enduring Advantages of Early Cochlear Implantation for Spoken Language Development

PubMed Central

Geers, Ann E.; Nicholas, Johanna G.

2013-01-01

Purpose To determine whether the precise age of implantation (AOI) remains an important predictor of spoken language outcomes in later childhood for those who received a cochlear implant (CI) between 12–38 months of age. Relative advantages of receiving a bilateral CI after age 4.5, better pre-CI aided hearing, and longer CI experience were also examined. Method Sixty children participated in a prospective longitudinal study of outcomes at 4.5 and 10.5 years of age. Twenty-nine children received a sequential second CI. Test scores were compared to normative samples of hearing age-mates and predictors of outcomes identified. Results Standard scores on language tests at 10.5 years of age remained significantly correlated with age of first cochlear implantation. Scores were not associated with receipt of a second, sequentially-acquired CI. Significantly higher scores were achieved for vocabulary as compared with overall language, a finding not evident when the children were tested at younger ages. Conclusion Age-appropriate spoken language skills continued to be more likely with younger AOI, even after an average of 8.6 years of additional CI use. Receipt of a second implant between ages 4–10 years and longer duration of device use did not provide significant added benefit. PMID:23275406
