test score performance: Topics by Science.gov

Sample records for test score performance

Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

PubMed

Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

2001-01-01

Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.
Evaluation of target scores and benchmarks for the traversal task scenario of the Minimally Invasive Surgical Trainer-Virtual Reality (MIST-VR) laparoscopy simulator.

PubMed

Hackethal, A; Immenroth, M; Bürger, T

2006-04-01

The Minimally Invasive Surgical Trainer-Virtual Reality (MIST-VR) simulator is validated for laparoscopy training, but benchmarks and target scores for assessing single tasks are needed. Control data for the MIST-VR traversal task scenario were collected from 61 novices who performed the task 10 times over 3 days (1 h daily). Data were collected on the time taken, error score, economy of movement, and total score. Test differences were analyzed through percentage scores and t-tests for paired samples. Improvement was greatest over tests 1 to 5 (improvement: test(1.2), 38.07%; p = 0.000; test(4.5), 10.66%; p = 0.010): between tests 5 and 10, improvement slowed and scores stabilized. Variation in participants' performance fell steadily over the 10 tests. Trainees should perform at least 10 tests of the traversal task-five to get used to the equipment and task (automation phase; target total score, 95.16) and five to stabilize and consolidate performance (test 10 target total score, 74.11).
Test anxiety and academic performance in chiropractic students.

PubMed

Zhang, Niu; Henderson, Charles N R

2014-01-01

Objective : We assessed the level of students' test anxiety, and the relationship between test anxiety and academic performance. Methods : We recruited 166 third-quarter students. The Test Anxiety Inventory (TAI) was administered to all participants. Total scores from written examinations and objective structured clinical examinations (OSCEs) were used as response variables. Results : Multiple regression analysis shows that there was a modest, but statistically significant negative correlation between TAI scores and written exam scores, but not OSCE scores. Worry and emotionality were the best predictive models for written exam scores. Mean total anxiety and emotionality scores for females were significantly higher than those for males, but not worry scores. Conclusion : Moderate-to-high test anxiety was observed in 85% of the chiropractic students examined. However, total test anxiety, as measured by the TAI score, was a very weak predictive model for written exam performance. Multiple regression analysis demonstrated that replacing total anxiety (TAI) with worry and emotionality (TAI subscales) produces a much more effective predictive model of written exam performance. Sex, age, highest current academic degree, and ethnicity contributed little additional predictive power in either regression model. Moreover, TAI scores were not found to be statistically significant predictors of physical exam skill performance, as measured by OSCEs.
Students' Geographic Knowledge and Skills in Different Kinds of Tests: Multiple-Choice versus Performance Assessment.

ERIC Educational Resources Information Center

Kon, Jane Heckley; Martin-Kniep, Giselle O.

1992-01-01

Describes a case study to determine whether performance tests are a feasible alternative to multiple-choice tests. Examines the difficulties of administering and scoring performance assessments. Explains that the study employed three performance tests and one multiple-choice test. Concludes that performance test administration and scoring was no…
Cognitive skills, student achievement tests, and schools.

PubMed

Finn, Amy S; Kraft, Matthew A; West, Martin R; Leonard, Julia A; Bish, Crystal E; Martin, Rebecca E; Sheridan, Margaret A; Gabrieli, Christopher F O; Gabrieli, John D E

2014-03-01

Cognitive skills predict academic performance, so schools that improve academic performance might also improve cognitive skills. To investigate the impact schools have on both academic performance and cognitive skills, we related standardized achievement-test scores to measures of cognitive skills in a large sample (N = 1,367) of eighth-grade students attending traditional, exam, and charter public schools. Test scores and gains in test scores over time correlated with measures of cognitive skills. Despite wide variation in test scores across schools, differences in cognitive skills across schools were negligible after we controlled for fourth-grade test scores. Random offers of enrollment to oversubscribed charter schools resulted in positive impacts of such school attendance on math achievement but had no impact on cognitive skills. These findings suggest that schools that improve standardized achievement-test scores do so primarily through channels other than improving cognitive skills.
Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

PubMed

Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

2016-03-01

This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
MCAT Verbal Reasoning score: less predictive of medical school performance for English language learners.

PubMed

Winegarden, Babbi; Glaser, Dale; Schwartz, Alan; Kelly, Carolyn

2012-09-01

Medical College Admission Test (MCAT) scores are widely used as part of the decision-making process for selecting candidates for admission to medical school. Applicants who learned English as a second language may be at a disadvantage when taking tests in their non-native language. Preliminary research found significant differences between English language learners (ELLs), applicants who learned English after the age of 11 years, and non-ELL examinees on the Verbal Reasoning (VR) sub-test of the MCAT. The purpose of this study was to determine if relationships between VR sub-test scores and measures of medical school performance differed between ELL and non-ELL students. Scores on the MCAT VR sub-test and student performance outcomes (grades, examination scores, and markers of distinction and difficulty) were extracted from University of California San Diego School of Medicine admissions files and the Association of American Medical Colleges database for 924 students who matriculated in 1998-2005 (graduation years 2002-2009). Regression models were fitted to determine whether MCAT VR sub-test scores predicted medical school performance similarly for ELLs and non-ELLs. For several outcomes, including pre-clerkship grades, academic distinction, US Medical Licensing Examination Step 2 Clinical Knowledge scores and two clerkship shelf examinations, ELL status significantly affects the ability of the VR score to predict performance. Higher correlations between VR score and medical school performance emerged for non-ELL students than for ELL students for each of these outcomes. The MCAT VR score should be used with discretion when assessing ELL applicants for admission to medical school. © Blackwell Publishing Ltd 2012.
Clock face drawing test performance in children with ADHD.

PubMed

Ghanizadeh, Ahmad; Safavi, Salar; Berk, Michael

2013-01-01

The utility and discriminatory pattern of the clock face drawing test in ADHD is unclear. This study therefore compared Clock Face Drawing test performance in children with ADHD and controls. 95 school children with ADHD and 191 other children were matched for gender ratio and age. ADHD symptoms severities were assessed using DSM-IV ADHD checklist and their intellectual functioning was assessed. The participants completed three clock-drawing tasks, and the following four functions were assessed: Contour score, Numbers score, Hands setting score, and Center score. All the subscales scores of the three clock drawing tests of the ADHD group were lower than that of the control group. In ADHD children, inattention and hyperactivity/ impulsivity scores were not related to free drawn clock test scores. When pre-drawn contour test was performed, inattentiveness score was statistically associated with Number score while none of the other variables of age, gender, intellectual functioning, and hand use preference were associated with that kind of score. In pre-drawn clock, no association of ADHD symptoms with any CDT subscales found significant. In addition, more errors are observed with free drawn clock and Pre-drawn contour than pre-drawn clock. Putting Numbers and Hands setting are more sensitive measures to screen ADHD than Contour and Center drawing. Test performance, except Hands setting, may have already reached a developmental plateau. It is probable that Hand setting deficit in children with ADHD may not decrease from age 8 to 14 years. Performance of children with ADHD is associated with complexity of CDT.
The test-retest reliability of the latent construct of executive function depends on whether tasks are represented as formative or reflective indicators.

PubMed

Willoughby, Michael T; Kuhn, Laura J; Blair, Clancy B; Samek, Anya; List, John A

2017-10-01

This study investigates the test-retest reliability of a battery of executive function (EF) tasks with a specific interest in testing whether the method that is used to create a battery-wide score would result in differences in the apparent test-retest reliability of children's performance. A total of 188 4-year-olds completed a battery of computerized EF tasks twice across a period of approximately two weeks. Two different approaches were used to create a score that indexed children's overall performance on the battery-i.e., (1) the mean score of all completed tasks and (2) a factor score estimate which used confirmatory factor analysis (CFA). Pearson and intra-class correlations were used to investigate the test-retest reliability of individual EF tasks, as well as an overall battery score. Consistent with previous studies, the test-retest reliability of individual tasks was modest (rs ≈ .60). The test-retest reliability of the overall battery scores differed depending on the scoring approach (r mean = .72; r factor_ score = .99). It is concluded that the children's performance on individual EF tasks exhibit modest levels of test-retest reliability. This underscores the importance of administering multiple tasks and aggregating performance across these tasks in order to improve precision of measurement. However, the specific strategy that is used has a large impact on the apparent test-retest reliability of the overall score. These results replicate our earlier findings and provide additional cautionary evidence against the routine use of factor analytic approaches for representing individual performance across a battery of EF tasks.
Do candidate reactions relate to job performance or affect criterion-related validity? A multistudy investigation of relations among reactions, selection test scores, and job performance.

PubMed

McCarthy, Julie M; Van Iddekinge, Chad H; Lievens, Filip; Kung, Mei-Chuan; Sinar, Evan F; Campion, Michael A

2013-09-01

Considerable evidence suggests that how candidates react to selection procedures can affect their test performance and their attitudes toward the hiring organization (e.g., recommending the firm to others). However, very few studies of candidate reactions have examined one of the outcomes organizations care most about: job performance. We attempt to address this gap by developing and testing a conceptual framework that delineates whether and how candidate reactions might influence job performance. We accomplish this objective using data from 4 studies (total N = 6,480), 6 selection procedures (personality tests, job knowledge tests, cognitive ability tests, work samples, situational judgment tests, and a selection inventory), 5 key candidate reactions (anxiety, motivation, belief in tests, self-efficacy, and procedural justice), 2 contexts (industry and education), 3 continents (North America, South America, and Europe), 2 study designs (predictive and concurrent), and 4 occupational areas (medical, sales, customer service, and technological). Consistent with previous research, candidate reactions were related to test scores, and test scores were related to job performance. Further, there was some evidence that reactions affected performance indirectly through their influence on test scores. Finally, in no cases did candidate reactions affect the prediction of job performance by increasing or decreasing the criterion-related validity of test scores. Implications of these findings and avenues for future research are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved
The validity of ACT-PEP test scores for predicting academic performance of registered nurses in BSN programs.

PubMed

Yang, J C; Noble, J

1990-01-01

This study investigated the validity of three American College Testing-Proficiency Examination Program (ACT-PEP) tests (Maternal and Child Nursing, Psychiatric/Mental Health Nursing, Adult Nursing) for predicting the academic performance of registered nurses (RNs) enrolled in bachelor's degree BSN programs nationwide. This study also examined RN students' performance on the ACT-PEP tests by their demographic characteristics: student's age, sex, race, student status (full- or part-time), and employment status (full- or part-time). The total sample for the three tests comprised 2,600 students from eight institutions nationwide. The median correlation coefficients between the three ACT-PEP tests and the semester grade point averages ranged from .36 to .56. Median correlation coefficients increased over time, supporting the stability of ACT-PEP test scores for predicting academic performance over time. The relative importance of selected independent variables for predicting academic performance was also examined; the most important variable for predicting academic performance was typically the ACT-PEP test score. Across the institutions, student demographic characteristics did not contribute significantly to explaining academic performance, over and above ACT-PEP scores.
Predictors of medical school clerkship performance: a multispecialty longitudinal analysis of standardized examination scores and clinical assessments.

PubMed

Casey, Petra M; Palmer, Brian A; Thompson, Geoffrey B; Laack, Torrey A; Thomas, Matthew R; Hartz, Martha F; Jensen, Jani R; Sandefur, Benjamin J; Hammack, Julie E; Swanson, Jerry W; Sheeler, Robert D; Grande, Joseph P

2016-04-27

Evidence suggests that poor performance on standardized tests before and early in medical school is associated with poor performance on standardized tests later in medical school and beyond. This study aimed to explore relationships between standardized examination scores (before and during medical school) with test and clinical performance across all core clinical clerkships. We evaluated characteristics of 435 students at Mayo Medical School (MMS) who matriculated 2000-2009 and for whom undergraduate grade point average, medical college aptitude test (MCAT), medical school standardized tests (United States Medical Licensing Examination [USMLE] 1 and 2; National Board of Medical Examiners [NBME] subject examination), and faculty assessments were available. We assessed the correlation between scores and assessments and determined USMLE 1 cutoffs predictive of poor performance (≤10th percentile) on the NBME examinations. We also compared the mean faculty assessment scores of MMS students vs visiting students, and for the NBME, we determined the percentage of MMS students who scored at or below the tenth percentile of first-time national examinees. MCAT scores correlated robustly with USMLE 1 and 2, and USMLE 1 and 2 independently predicted NBME scores in all clerkships. USMLE 1 cutoffs corresponding to poor NBME performance ranged from 220 to 223. USMLE 1 scores were similar among MMS and visiting students. For most academic years and clerkships, NBME scores were similar for MMS students vs all first-time examinees. MCAT, USMLE 1 and 2, and subsequent clinical performance parameters were correlated with NBME scores across all core clerkships. Even more interestingly, faculty assessments correlated with NBME scores, affirming patient care as examination preparation. USMLE 1 scores identified students at risk of poor performance on NBME subject examinations, facilitating and supporting implementation of remediation before the clinical years. MMS students were representative of medical students across the nation.
Cognitive Skills, Student Achievement Tests, and Schools

PubMed Central

Finn, Amy S.; Kraft, Matthew A.; West, Martin R.; Leonard, Julia A.; Bish, Crystal E.; Martin, Rebecca E.; Sheridan, Margaret A.; Gabrieli, Christopher F. O.; Gabrieli, John D. E.

2014-01-01

Cognitive skills predict academic performance, so schools that improve academic performance might also improve cognitive skills. To investigate the impact schools have on both academic performance and cognitive skills, we related standardized achievement test scores to measures of cognitive skills in a large sample (N=1,367) of 8th-grade students attending traditional, exam, and charter public schools. Test scores and gains in test scores over time correlated with measures of cognitive skills. Despite wide variation in test scores across schools, differences in cognitive skills across schools were negligible after controlling for 4th-grade test scores. Random offers of enrollment to over-subscribed charter schools resulted in positive impacts of such school attendance on math achievement, but had no impact on cognitive skills. These findings suggest that schools that improve standardized achievement tests do so primarily through channels other than cognitive skills. PMID:24434238
Neurocognitive performance and symptom profiles of Spanish-speaking Hispanic athletes on the ImPACT test.

PubMed

Ott, Summer; Schatz, Philip; Solomon, Gary; Ryan, Joseph J

2014-03-01

This study documented baseline neurocognitive performance of 23,815 athletes on the Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) test. Specifically, 9,733 Hispanic, Spanish-speaking athletes who completed the ImPACT test in English and 2,087 Hispanic, Spanish-speaking athletes who completed the test in Spanish were compared with 11,955 English-speaking athletes who completed the test in English. Athletes were assigned to age groups (13-15, 16-18). Results revealed a significant effect of language group (p < .001; partial η(2) = 0.06) and age (p < .001; partial η(2) = 0.01) on test performance. Younger athletes performed more poorly than older athletes, and Spanish-speaking athletes completing the test in Spanish scored more poorly than Spanish-speaking and English-speaking athletes completing the test in English, on all Composite scores and Total Symptom scores. Spanish-speaking athletes completing the test in English also performed more poorly than English-speaking athletes completing the test in English on three Composite scores. These differences in performance and reported symptoms highlight the need for caution in interpreting ImPACT test data for Hispanic Americans.
Assessment of body-powered upper limb prostheses by able-bodied subjects, using the Box and Blocks Test and the Nine-Hole Peg Test.

PubMed

Haverkate, Liz; Smit, Gerwin; Plettenburg, Dick H

2016-02-01

The functional performance of currently available body-powered prostheses is unknown. The goal of this study was to objectively assess and compare the functional performance of three commonly used body-powered upper limb terminal devices. Experimental trial. A total of 21 able-bodied subjects (n = 21, age = 22 ± 2) tested three different terminal devices: TRS voluntary closing Hook Grip 2S, Otto Bock voluntary opening hand and Hosmer Model 5XA hook, using a prosthesis simulator. All subjects used each terminal device nine times in two functional tests: the Nine-Hole Peg Test and the Box and Blocks Test. Significant differences were found between the different terminal devices and their scores on the Nine-Hole Peg Test and the Box and Blocks Test. The Hosmer hook scored best in both tests. The TRS Hook Grip 2S scored second best. The Otto Bock hand showed the lowest scores. This study is a first step in the comparison of functional performances of body-powered prostheses. The data can be used as a reference value, to assess the performance of a terminal device or an amputee. The measured scores enable the comparison of the performance of a prosthesis user and his or her terminal device relative to standard scores. © The International Society for Prosthetics and Orthotics 2014.
Training improves laparoscopic tasks performance and decreases operator workload.

PubMed

Hu, Jesse S L; Lu, Jirong; Tan, Wee Boon; Lomanto, Davide

2016-05-01

It has been postulated that increased operator workload during task performance may increase fatigue and surgical errors. The National Aeronautics and Space Administration-Task Load Index (NASA-TLX) is a validated tool for self-assessment for workload. Our study aims to assess the relationship of workload and performance of novices in simulated laparoscopic tasks of different complexity levels before and after training. Forty-seven novices without prior laparoscopic experience were recruited in a trial to investigate whether training improves task performance as well as mental workload. The participants were tested on three standard tasks (ring transfer, precision cutting and intracorporeal suturing) in increasing complexity based on the Fundamentals of Laparoscopic Surgery (FLS) curriculum. Following a period of training and rest, participants were tested again. Test scores were computed from time taken and time penalties for precision errors. Test scores and NASA-TLX scores were recorded pre- and post-training and analysed using paired t tests. One-way repeated measures ANOVA was used to analyse differences in NASA-TLX scores between the three tasks. NASA-TLX score was lowest with ring transfer and highest with intracorporeal suturing. This was statistically significant in both pre-training (p < 0.001) and post-training (p < 0.001). NASA-TLX scores mirror the changes in test scores for the three tasks. Workload scores decreased significantly after training for all three tasks (ring transfer = 2.93, p < 0.001, precision cutting = 3.74, p < 0.001, intracorporeal suturing = 2.98, p < 0.001). NASA-TLX score is an accurate reflection of the complexity of simulated laparoscopic tasks in the FLS curriculum. This also correlates with the relationship of test scores between the three tasks. Simulation training improves both performance score and workload score across the tasks.
Transforming Biology Assessment with Machine Learning: Automated Scoring of Written Evolutionary Explanations

NASA Astrophysics Data System (ADS)

Nehm, Ross H.; Ha, Minsu; Mayfield, Elijah

2012-02-01

This study explored the use of machine learning to automatically evaluate the accuracy of students' written explanations of evolutionary change. Performance of the Summarization Integrated Development Environment (SIDE) program was compared to human expert scoring using a corpus of 2,260 evolutionary explanations written by 565 undergraduate students in response to two different evolution instruments (the EGALT-F and EGALT-P) that contained prompts that differed in various surface features (such as species and traits). We tested human-SIDE scoring correspondence under a series of different training and testing conditions, using Kappa inter-rater agreement values of greater than 0.80 as a performance benchmark. In addition, we examined the effects of response length on scoring success; that is, whether SIDE scoring models functioned with comparable success on short and long responses. We found that SIDE performance was most effective when scoring models were built and tested at the individual item level and that performance degraded when suites of items or entire instruments were used to build and test scoring models. Overall, SIDE was found to be a powerful and cost-effective tool for assessing student knowledge and performance in a complex science domain.
Validity and Reliability of Baseline Testing in a Standardized Environment.

PubMed

Higgins, Kathryn L; Caze, Todd; Maerlender, Arthur

2017-08-11

The Immediate Postconcussion Assessment and Cognitive Testing (ImPACT) is a computerized neuropsychological test battery commonly used to determine cognitive recovery from concussion based on comparing post-injury scores to baseline scores. This model is based on the premise that ImPACT baseline test scores are a valid and reliable measure of optimal cognitive function at baseline. Growing evidence suggests that this premise may not be accurate and a large contributor to invalid and unreliable baseline test scores may be the protocol and environment in which baseline tests are administered. This study examined the effects of a standardized environment and administration protocol on the reliability and performance validity of athletes' baseline test scores on ImPACT by comparing scores obtained in two different group-testing settings. Three hundred-sixty one Division 1 cohort-matched collegiate athletes' baseline data were assessed using a variety of indicators of potential performance invalidity; internal reliability was also examined. Thirty-one to thirty-nine percent of the baseline cases had at least one indicator of low performance validity, but there were no significant differences in validity indicators based on environment in which the testing was conducted. Internal consistency reliability scores were in the acceptable to good range, with no significant differences between administration conditions. These results suggest that athletes may be reliably performing at levels lower than their best effort would produce. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Collaborative Testing Improves Performance but Not Content Retention in a Large-Enrollment Introductory Biology Class

PubMed Central

Leight, Hayley; Saunders, Cheston; Calkins, Robin; Withers, Michelle

2012-01-01

Collaborative testing has been shown to improve performance but not always content retention. In this study, we investigated whether collaborative testing could improve both performance and content retention in a large, introductory biology course. Students were semirandomly divided into two groups based on their performances on exam 1. Each group contained equal numbers of students scoring in each grade category (“A”–“F”) on exam 1. All students completed each of the four exams of the semester as individuals. For exam 2, one group took the exam a second time in small groups immediately following the individually administered test. The other group followed this same format for exam 3. Individual and group exam scores were compared to determine differences in performance. All but exam 1 contained a subset of cumulative questions from the previous exam. Performances on the cumulative questions for exams 3 and 4 were compared for the two groups to determine whether there were significant differences in content retention. Even though group test scores were significantly higher than individual test scores, students who participated in collaborative testing performed no differently on cumulative questions than students who took the previous exam as individuals. PMID:23222835
Exploring the gender gap in the conceptual survey of electricity and magnetism

NASA Astrophysics Data System (ADS)

Henderson, Rachel; Stewart, Gay; Stewart, John; Michaluk, Lynnette; Traxler, Adrienne

2017-12-01

The "gender gap" on various physics conceptual evaluations has been extensively studied. Men's average pretest scores on the Force Concept Inventory and Force and Motion Conceptual Evaluation are 13% higher than women's, and post-test scores are on average 12% higher than women's. This study analyzed the gender differences within the Conceptual Survey of Electricity and Magnetism (CSEM) in which the gender gap has been less well studied and is less consistent. In the current study, data collected from 1407 students (77% men, 23% women) in a calculus-based physics course over ten semesters showed that male students outperformed female students on the CSEM pretest (5%) and post-test (6%). Separate analyses were conducted for qualitative and quantitative problems on lab quizzes and course exams and showed that male students outperformed female students by 3% on qualitative quiz and exam problems. Male and female students performed equally on the quantitative course exam problems. The gender gaps within CSEM post-test scores, qualitative lab quiz scores, and qualitative exam scores were insignificant for students with a CSEM pretest score of 25% or less but grew as pretest scores increased. Structural equation modeling demonstrated that a latent variable, called Conceptual Physics Performance/Non-Quantitative (CPP/NonQnt), orthogonal to quantitative test performance was useful in explaining the differences observed in qualitative performance; this variable was most strongly related to CSEM post-test scores. The CPP/NonQnt of male students was 0.44 standard deviations higher than female students. The CSEM pretest measured CPP/NonQnt much less accurately for women (R2=4 % ) than for men (R2=17 % ). The failure to detect a gender gap for students scoring 25% or less on the pretest suggests that the CSEM instrument itself is not gender biased. The failure to find a performance difference in quantitative test performance while detecting a gap in qualitative performance suggests the qualitative differences do not result from psychological factors such as science anxiety or stereotype threat.

The gender difference on the Mental Rotations test is not due to performance factors.

PubMed

Masters, M S

1998-05-01

Men score higher than women on the Mental Rotations test (MRT), and the magnitude of this gender difference is the largest of that on any spatial test. Goldstein, Haldane, and Mitchell (1990) reported finding that the gender difference on the MRT disappears when "performance factors" are controlled--specifically, when subjects are allowed sufficient time to attempt all items on the test or when a scoring procedure that controls for the number of items attempted is used. The present experiment also explored whether eliminating these performance factors results in a disappearance of the gender difference on the test. Male and female college students were allowed a short time period or unlimited time on the MRT. The tests were scored according to three different procedures. The results showed no evidence that the gender difference on the MRT was affected by the scoring method or the time limit. Regardless of the scoring procedure, men scored higher than women, and the magnitude of the gender difference persisted undiminished when subjects completed all items on the test. Thus there was no evidence that performance factors produced the gender difference on the MRT. These results are consistent with the results of other investigators who have attempted to replicate Goldstein et al.'s findings.
Does the MCAT predict medical school and PGY-1 performance?

PubMed

Saguil, Aaron; Dong, Ting; Gingerich, Robert J; Swygert, Kimberly; LaRochelle, Jeffrey S; Artino, Anthony R; Cruess, David F; Durning, Steven J

2015-04-01

The Medical College Admissions Test (MCAT) is a high-stakes test required for entry to most U. S. medical schools; admissions committees use this test to predict future accomplishment. Although there is evidence that the MCAT predicts success on multiple choice-based assessments, there is little information on whether the MCAT predicts clinical-based assessments of undergraduate and graduate medical education performance. This study looked at associations between the MCAT and medical school grade point average (GPA), Medical Licensing Examination (USMLE) scores, observed patient care encounters, and residency performance assessments. This study used data collected as part of the Long-Term Career Outcome Study to determine associations between MCAT scores, USMLE Step 1, Step 2 clinical knowledge and clinical skill, and Step 3 scores, Objective Structured Clinical Examination performance, medical school GPA, and PGY-1 program director (PD) assessment of physician performance for students graduating 2010 and 2011. MCAT data were available for all students, and the PGY PD evaluation response rate was 86.2% (N = 340). All permutations of MCAT scores (first, last, highest, average) were weakly associated with GPA, Step 2 clinical knowledge scores, and Step 3 scores. MCAT scores were weakly to moderately associated with Step 1 scores. MCAT scores were not significantly associated with Step 2 clinical skills Integrated Clinical Encounter and Communication and Interpersonal Skills subscores, Objective Structured Clinical Examination performance or PGY-1 PD evaluations. MCAT scores were weakly to moderately associated with assessments that rely on multiple choice testing. The association is somewhat stronger for assessments occurring earlier in medical school, such as USMLE Step 1. The MCAT was not able to predict assessments relying on direct clinical observation, nor was it able to predict PD assessment of PGY-1 performance. Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
Ocular dominance stability and reading skill: a controversial relationship.

PubMed

Zeri, Fabrizio; De Luca, Maria; Spinelli, Donatella; Zoccolotti, Pierluigi

2011-11-01

Evidence is mixed concerning the relationship between stability of ocular dominance and reading deficits. Contrasting results may be due to the use of different tests of dominance, different samples of readers, and different scoring methods. The aim of this study was to investigate the relationship among ocular dominance, general visual abilities, and reading performance, and to evaluate the consistency and reliability of different tests of ocular dominance and the effects of different types of eye dominance scoring. In a group of young adults, we measured: (a) main optometric parameters; (b) reading time and accuracy; and (c) ocular dominance in two sighting and four motor tests. Dominance was determined using different scoring methods (relative, absolute, and binary scores). All dominance tests showed good levels of internal reliability. Sighting tests were consistent regardless of the scoring method, and all participants had stable dominance. Three of four motor tests were moderately consistent when dominance was measured with relative scores but not when it was measured with absolute or binary scores. No relationship was found between stability of dominance and reading performance, regardless of the type of test or scoring method. No systematic pattern of correlation was found between binocular vision variables and dominance measures. Choosing the type of motor test to measure ocular dominance is crucial, because the level of consistency among tests is low to moderate. Furthermore, motor tests were not correlated with reading performances. Present results suggest caution when trying to link reading difficulties with specific profiles of ocular dominance.
Wire-bending test as a predictor of preclinical performance by dental students.

PubMed

Kao, E C; Ngan, P W; Wilson, S; Kunovich, R

1990-10-01

Traditional Dental Aptitude Test and academic grade point average have been shown to be poor predictors of clinical performance by dental students. To refine predictors of psychomotor skills, a wire-bending test was given to 105 freshmen at the beginning of their dental education. Grades from seven restorative preclinical courses in their freshman and sophomore years were compared to scores on wire bending and the three traditional predictors: GPA, academic aptitude, and perceptual aptitude scores. Wire-bending scores correlated significantly with six out of seven preclinical restorative courses. The predictive power for preclinical performance was doubled when wire bending was added to traditional predictors in stepwise multiple regression analysis. Wire-bending scores identified students of low performance. These preliminary results suggest that the wire-bending test shows some potential as a screening test for identifying students who may hae psychomotor difficulties, early in their dental education.
How Accurate Is a Test Score?

ERIC Educational Resources Information Center

Doppelt, Jerome E.

1956-01-01

The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

ERIC Educational Resources Information Center

Zenisky, April L.; Hambleton, Ronald K.

2012-01-01

Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…
The Michigan Context and Performance Report Card: Public Elementary & Middle Schools, 2013

ERIC Educational Resources Information Center

Spalding, Audrey

2013-01-01

The Michigan Context and Performance Report Card measures school performance by adjusting standardized test scores to account for student background. Comparing schools using unadjusted test scores ignores the significant relationship between academic performance and student socioeconomic background--a dynamic outside a school's control. The…
Predicting performance and injury resilience from movement quality and fitness scores in a basketball team over 2 years.

PubMed

McGill, Stuart M; Andersen, Jordan T; Horne, Arthur D

2012-07-01

The purpose of this study was to see if specific tests of fitness and movement quality could predict injury resilience and performance in a team of basketball players over 2 years (2 playing seasons). It was hypothesized that, in a basketball population, movement and fitness scores would predict performance scores and that movement and fitness scores would predict injury resilience. A basketball team from a major American university (N = 14) served as the test population in this longitudinal trial. Variables linked to fitness, movement ability, speed, strength, and agility were measured together with some National Basketball Association (NBA) combine tests. Dependent variables of performance indicators (such as games and minutes played, points scored, assists, rebounds, steal, and blocks) and injury reports were tracked for the subsequent 2 years. Results showed that better performance was linked with having a stiffer torso, more mobile hips, weaker left grip strength, and a longer standing long jump, to name a few. Of the 3 NBA combine tests administered here, only a faster lane agility time had significant links with performance. Some movement qualities and torso endurance were not linked. No patterns with injury emerged. These observations have implications for preseason testing and subsequent training programs in an attempt to reduce future injury and enhance playing performance.
The Impact of Conditional Scores on the Performance of DETECT.

ERIC Educational Resources Information Center

Zhang, Yanwei Oliver; Yu, Feng; Nandakumar, Ratna

DETECT is a nonparametric, conditional covariance-based procedure to identify dimensional structure and the degree of multidimensionality of test data. The ability composite or conditional score used to estimate conditional covariance plays a significant role in the performance of DETECT. The number correct score of all items in the test (T) and…
Rater Expertise in a Second Language Speaking Assessment: The Influence of Training and Experience

ERIC Educational Resources Information Center

Davis, Lawrence Edward

2012-01-01

Speaking performance tests typically employ raters to produce scores; accordingly, variability in raters' scoring decisions has important consequences for test reliability and validity. One such source of variability is the rater's level of expertise in scoring. Therefore, it is important to understand how raters' performance is influenced by…
What's in a Teacher Test? Assessing the Relationship between Teacher Test Scores and Student Secondary STEM Achievement. CEDR Working Paper. WP #2016-4

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the predictive validity of teacher credential test scores for student performance in secondary STEM classrooms in Washington state. After replicating earlier findings that teacher basic skills licensure test scores are a modest and statistically significant predictor of student math test score gains in elementary grades, we focus on…
Continuous multiword recognition performance of young and elderly listeners in ambient noise

NASA Astrophysics Data System (ADS)

Sato, Hiroshi

2005-09-01

Hearing threshold shift due to aging is known as a dominant factor to degrade speech recognition performance in noisy conditions. On the other hand, cognitive factors of aging-relating speech recognition performance in various speech-to-noise conditions are not well established. In this study, two kinds of speech test were performed to examine how working memory load relates to speech recognition performance. One is word recognition test with high-familiarity, four-syllable Japanese words (single-word test). In this test, each word was presented to listeners; the listeners were asked to write the word down on paper with enough time to answer. In the other test, five continuous word were presented to listeners and listeners were asked to write the word down after just five words were presented (multiword test). Both tests were done in various speech-to-noise ratios under 50-dBA Hoth spectrum noise with more than 50 young and elderly subjects. The results of two experiments suggest that (1) Hearing level is related to scores of both tests. (2) Scores of single-word test are well correlated with those of multiword test. (3) Scores of multiword test are not improved as speech-to-noise ratio improves in the condition where scores of single-word test reach their ceiling.
Surgical simulation tasks challenge visual working memory and visual-spatial ability differently.

PubMed

Schlickum, Marcus; Hedman, Leif; Enochsson, Lars; Henningsohn, Lars; Kjellin, Ann; Felländer-Tsai, Li

2011-04-01

New strategies for selection and training of physicians are emerging. Previous studies have demonstrated a correlation between visual-spatial ability and visual working memory with surgical simulator performance. The aim of this study was to perform a detailed analysis on how these abilities are associated with metrics in simulator performance with different task content. The hypothesis is that the importance of visual-spatial ability and visual working memory varies with different task contents. Twenty-five medical students participated in the study that involved testing visual-spatial ability using the MRT-A test and visual working memory using the RoboMemo computer program. Subjects were also trained and tested for performance in three different surgical simulators. The scores from the psychometric tests and the performance metrics were then correlated using multivariate analysis. MRT-A score correlated significantly with the performance metrics Efficiency of screening (p = 0.006) and Total time (p = 0.01) in the GI Mentor II task and Total score (p = 0.02) in the MIST-VR simulator task. In the Uro Mentor task, both the MRT-A score and the visual working memory 3-D cube test score as presented in the RoboMemo program (p = 0.02) correlated with Total score (p = 0.004). In this study we have shown that some differences exist regarding the impact of visual abilities and task content on simulator performance. When designing future cognitive training programs and testing regimes, one might have to consider that the design must be adjusted in accordance with the specific surgical task to be trained in mind.
Cognitive performance and aphasia recovery.

PubMed

Fonseca, José; Raposo, Ana; Martins, Isabel Pavão

2018-03-01

Objectives This study assessed cognitive performance of subjects with aphasia during the acute stage of stroke and evaluated how such performance relates to recovery at 3 months. Materials & methods Patients with aphasia following a left hemisphere stroke were evaluated during the first (baseline) and the fourth-month post onset. Assessment comprised non-verbal tests of attention/processing speed (Symbol Search, Cancelation Task), executive functioning (Matrix Reasoning, Tower of Hanoi, Clock Drawing, Motor Initiative), semantic (Camel and Cactus Test), episodic and immediate memory (Memory for Faces Test, 5 Objects Memory Test, and Spatial Span. Recovery was measured by the Token Test score at 3 months. The impact of baseline performance on recovery was evaluated by logistic regression adjusting for age, education, severity of aphasia and the Alberta Stroke Program Early CT (ASPECT) score. Results Thirty-nine subjects (with a mean of 66.5 ± 10.6 years of age, 17 men) were included. Average baseline cognitive performance was within normal range in all tests except in memory tests (semantic, episodic and immediate memory) for which scores were ≤-1.5sd. Subjects with poor aphasia recovery (N = 27) were older and had fewer years of formal education but had identical ASPECT score compared to those with favorable recovery. Considering each test individually, the score obtained on the Matrix Reasoning test was the only one to predict aphasia recovery (Exp(B)=24.085 p = 0.038). Conclusions The Matrix Reasoning Test may contribute to predict aphasia recovery. Cognitive performance is a measure of network disruption but may also indicate the availability of recovery strategies.
Validation of the Narrowing Beam Walking Test in Lower Limb Prosthesis Users.

PubMed

Sawers, Andrew; Hafner, Brian

2018-04-11

To evaluate the content, construct, and discriminant validity of the Narrowing Beam Walking Test (NBWT), a performance-based balance test for lower limb prosthesis users. Cross-sectional study. Research laboratory and prosthetics clinic. Unilateral transtibial and transfemoral prosthesis users (N=40). Not applicable. Content validity was examined by quantifying the percentage of participants receiving maximum or minimum scores (ie, ceiling and floor effects). Convergent construct validity was examined using correlations between participants' NBWT scores and scores or times on existing clinical balance tests regularly administered to lower limb prosthesis users. Known-groups construct validity was examined by comparing NBWT scores between groups of participants with different fall histories, amputation levels, amputation etiologies, and functional levels. Discriminant validity was evaluated by analyzing the area under each test's receiver operating characteristic (ROC) curve. No minimum or maximum scores were recorded on the NBWT. NBWT scores demonstrated strong correlations (ρ=.70‒.85) with scores/times on performance-based balance tests (timed Up and Go test, Four Square Step Test, and Berg Balance Scale) and a moderate correlation (ρ=.49) with the self-report Activities-specific Balance Confidence scale. NBWT performance was significantly lower among participants with a history of falls (P=.003), transfemoral amputation (P=.011), and a lower mobility level (P<.001). The NBWT also had the largest area under the ROC curve (.81) and was the only test to exhibit an area that was statistically significantly >.50 (ie, chance). The results provide strong evidence of content, construct, and discriminant validity for the NBWT as a performance-based test of balance ability. The evidence supports its use to assess balance impairments and fall risk in unilateral transtibial and transfemoral prosthesis users. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

ERIC Educational Resources Information Center

McIntosh, James; Munk, Martin D.

2014-01-01

Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…
Pharmacy students' test-taking motivation-effort on a low-stakes standardized test.

PubMed

Waskiewicz, Rhonda A

2011-04-11

To measure third-year pharmacy students' level of motivation while completing the Pharmacy Curriculum Outcomes Assessment (PCOA) administered as a low-stakes test to better understand use of the PCOA as a measure of student content knowledge. Student motivation was manipulated through an incentive (ie, personal letter from the dean) and a process of statistical motivation filtering. Data were analyzed to determine any differences between the experimental and control groups in PCOA test performance, motivation to perform well, and test performance after filtering for low motivation-effort. Incentivizing students diminished the need for filtering PCOA scores for low effort. Where filtering was used, performance scores improved, providing a more realistic measure of aggregate student performance. To ensure that PCOA scores are an accurate reflection of student knowledge, incentivizing and/or filtering for low motivation-effort among pharmacy students should be considered fundamental best practice when the PCOA is administered as a low-stakes test.
Error Rates in Measuring Teacher and School Performance Based on Student Test Score Gains. NCEE 2010-4004

ERIC Educational Resources Information Center

Schochet, Peter Z.; Chiang, Hanley S.

2010-01-01

This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…
Higher mental workload is associated with poorer laparoscopic performance as measured by the NASA-TLX tool.

PubMed

Yurko, Yuliya Y; Scerbo, Mark W; Prabhu, Ajita S; Acker, Christina E; Stefanidis, Dimitrios

2010-10-01

Increased workload during task performance may increase fatigue and facilitate errors. The National Aeronautics and Space Administration-Task Load Index (NASA-TLX) is a previously validated tool for workload self-assessment. We assessed the relationship of workload and performance during simulator training on a complex laparoscopic task. NASA-TLX workload data from three separate trials were analyzed. All participants were novices (n = 28), followed the same curriculum on the fundamentals of laparoscopic surgery suturing model, and were tested in the animal operating room (OR) on a Nissen fundoplication model after training. Performance and workload scores were recorded at baseline, after proficiency achievement, and during the test. Performance, NASA-TLX scores, and inadvertent injuries during the test were analyzed and compared. Workload scores declined during training and mirrored performance changes. NASA-TLX scores correlated significantly with performance scores (r = -0.5, P < 0.001). Participants with higher workload scores caused more inadvertent injuries to adjacent structures in the OR (r = 0.38, P < 0.05). Increased mental and physical workload scores at baseline correlated with higher workload scores in the OR (r = 0.52-0.82; P < 0.05) and more inadvertent injuries (r = 0.52, P < 0.01). Increased workload is associated with inferior task performance and higher likelihood of errors. The NASA-TLX questionnaire accurately reflects workload changes during simulator training and may identify individuals more likely to experience high workload and more prone to errors during skill transfer to the clinical environment.
Assessing English Language Learners' Oral Performance: A Comparison of Monologue, Interview, and Group Oral Test

ERIC Educational Resources Information Center

Ahmadi, Alireza; Sadeghi, Elham

2016-01-01

In the present study we investigated the effect of test format on oral performance in terms of test scores and discourse features (accuracy, fluency, and complexity). Moreover, we explored how the scores obtained on different test formats relate to such features. To this end, 23 Iranian EFL learners participated in three test formats of monologue,…

Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

ERIC Educational Resources Information Center

Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

2017-01-01

We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…
Can business and economics students perform elementary arithmetic?

PubMed

Standing, Lionel G; Sproule, Robert A; Leung, Ambrose

2006-04-01

Business and economics majors (N=146) were tested on the D'Amore Test of Elementary Arithmetic, which employs third-grade test items from 1932. Only 40% of the subjects passed the test by answering 10 out of 10 items correctly. Self-predicted scores were a good predictor of actual scores, but performance was not associated with demographic variables, grades in calculus courses, liking for science or computers, or mathematics anxiety. Scores decreased over the subjects' initial years on campus. The hardest test item, with an error rate of 23%, required the subject to evaluate (36 x 7) + (33 x 7). The results are similar to those of Standing in 2006, despite methodological changes intended to maximize performance.
A descriptive study of the U.S. Marine Corps fitness tests (2000-2012).

PubMed

Bartlett, Jamie L; Phillips, Jennifer; Galarneau, Michael R

2015-05-01

This article describes the performance of active duty U.S. Marines on the Physical Fitness Test (PFT) and Combat Fitness Test (CFT) during calendar years 2000 through 2012. Our study sample included PFT composite scores (n = 543,185), PFT and CFT composite scores (n = 160,936), and PFT and CFT event scores (n = 135,926 and n = 201,953, respectively). In general, all Marines performed very well on each fitness test, with overall annual improvements. Interestingly, the majority of female Marines passed the minimum male standard on the CFT. Further studies will evaluate the relationship of fitness test performance and injury. Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

ERIC Educational Resources Information Center

Han, Chao

2016-01-01

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Teacher Greetings Increase College Students' Test Scores

ERIC Educational Resources Information Center

Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

2009-01-01

The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…
Relationships of Declining Test Scores and Grade Inflation.

ERIC Educational Resources Information Center

Bellott, Fred K.

The relationship between declining scores on national standardized tests and grade inflation is explored. Grade inflation refers to the indicated measure of evaluation of student performance having higher placement than is usual based on the performances. Data for this study were taken from the American College Testing (ACT) Program Class Profile…
A Direct Comparison of Real-World and Virtual Navigation Performance in Chronic Stroke Patients.

PubMed

Claessen, Michiel H G; Visser-Meily, Johanna M A; de Rooij, Nicolien K; Postma, Albert; van der Ham, Ineke J M

2016-04-01

An increasing number of studies have presented evidence that various patient groups with acquired brain injury suffer from navigation problems in daily life. This skill is, however, scarcely addressed in current clinical neuropsychological practice and suitable diagnostic instruments are lacking. Real-world navigation tests are limited by geographical location and associated with practical constraints. It was, therefore, investigated whether virtual navigation might serve as a useful alternative. To investigate the convergent validity of virtual navigation testing, performance on the Virtual Tubingen test was compared to that on an analogous real-world navigation test in 68 chronic stroke patients. The same eight subtasks, addressing route and survey knowledge aspects, were assessed in both tests. In addition, navigation performance of stroke patients was compared to that of 44 healthy controls. A correlation analysis showed moderate overlap (r = .535) between composite scores of overall real-world and virtual navigation performance in stroke patients. Route knowledge composite scores correlated somewhat stronger (r = .523) than survey knowledge composite scores (r = .442). When comparing group performances, patients obtained lower scores than controls on seven subtasks. Whereas the real-world test was found to be easier than its virtual counterpart, no significant interaction-effects were found between group and environment. Given moderate overlap of the total scores between the two navigation tests, we conclude that virtual testing of navigation ability is a valid alternative to navigation tests that rely on real-world route exposure.
Impact of a weekly reading program on orthopedic surgery residents' in-training examination.

PubMed

Weglein, Daniel G; Gugala, Zbigniew; Simpson, Suzanne; Lindsey, Ronald W

2015-05-01

In response to a decline in individual residents' performance and overall program performance on the Orthopaedic In-Training Examination (OITE), the authors' department initiated a daily literature reading program coupled with weekly tests on the assigned material. The goal of this study was to assess the effect of the reading program on individual residents' scores and the training program's OITE scores. The reading program consisted of daily review articles from the Journal of the American Academy of Orthopaedic Surgeons, followed by a weekly written examination consisting of multiple-choice or fill-in-the-blank questions. All articles were selected and all questions were written by the departmental chair. A questionnaire was given to assess residents' perceptions of the weekly tests. As a result of implementing the reading program for a 10-month period, residents' subsequent performance on the OITE significantly improved (mean score increase, 4, P<.0001; percentile score increase, 11, P=.0007). The difference in mean score was significant for residents in postgraduate years 3, 4, and 5. A statistically significant correlation was found between weekly test scores and performance on the OITE, with a significant correlation between weekly test scores and OITE percentile ranking. The study results also showed a positive correlation between reading test attendance and weekly test scores. Residents' anonymous questionnaire responses also demonstrated the reading program to be a valuable addition to the residency training curriculum. In conclusion, the study strongly supports the benefits of a weekly reading and examination program in enhancing the core knowledge of orthopedic surgery residents. Copyright 2015, SLACK Incorporated.
Predictive validity of pre-admission assessments on medical student performance.

PubMed

Dabaliz, Al-Awwab; Kaadan, Samy; Dabbagh, M Marwan; Barakat, Abdulaziz; Shareef, Mohammad Abrar; Al-Tannir, Mohamad; Obeidat, Akef; Mohamed, Ayman

2017-11-24

To examine the predictive validity of pre-admission variables on students' performance in a medical school in Saudi Arabia. In this retrospective study, we collected admission and college performance data for 737 students in preclinical and clinical years. Data included high school scores and other standardized test scores, such as those of the National Achievement Test and the General Aptitude Test. Additionally, we included the scores of the Test of English as a Foreign Language (TOEFL) and the International English Language Testing System (IELTS) exams. Those datasets were then compared with college performance indicators, namely the cumulative Grade Point Average (cGPA) and progress test, using multivariate linear regression analysis. In preclinical years, both the National Achievement Test (p=0.04, B=0.08) and TOEFL (p=0.017, B=0.01) scores were positive predictors of cGPA, whereas the General Aptitude Test (p=0.048, B=-0.05) negatively predicted cGPA. Moreover, none of the pre-admission variables were predictive of progress test performance in the same group. On the other hand, none of the pre-admission variables were predictive of cGPA in clinical years. Overall, cGPA strongly predict-ed students' progress test performance (p<0.001 and B=19.02). Only the National Achievement Test and TOEFL significantly predicted performance in preclinical years. However, these variables do not predict progress test performance, meaning that they do not predict the functional knowledge reflected in the progress test. We report various strengths and deficiencies in the current medical college admission criteria, and call for employing more sensitive and valid ones that predict student performance and functional knowledge, especially in the clinical years.
Predictive validity of pre-admission assessments on medical student performance

PubMed Central

Dabaliz, Al-Awwab; Kaadan, Samy; Dabbagh, M. Marwan; Barakat, Abdulaziz; Shareef, Mohammad Abrar; Al-Tannir, Mohamad; Obeidat, Akef

2017-01-01

Objectives To examine the predictive validity of pre-admission variables on students’ performance in a medical school in Saudi Arabia. Methods In this retrospective study, we collected admission and college performance data for 737 students in preclinical and clinical years. Data included high school scores and other standardized test scores, such as those of the National Achievement Test and the General Aptitude Test. Additionally, we included the scores of the Test of English as a Foreign Language (TOEFL) and the International English Language Testing System (IELTS) exams. Those datasets were then compared with college performance indicators, namely the cumulative Grade Point Average (cGPA) and progress test, using multivariate linear regression analysis. Results In preclinical years, both the National Achievement Test (p=0.04, B=0.08) and TOEFL (p=0.017, B=0.01) scores were positive predictors of cGPA, whereas the General Aptitude Test (p=0.048, B=-0.05) negatively predicted cGPA. Moreover, none of the pre-admission variables were predictive of progress test performance in the same group. On the other hand, none of the pre-admission variables were predictive of cGPA in clinical years. Overall, cGPA strongly predict-ed students’ progress test performance (p<0.001 and B=19.02). Conclusions Only the National Achievement Test and TOEFL significantly predicted performance in preclinical years. However, these variables do not predict progress test performance, meaning that they do not predict the functional knowledge reflected in the progress test. We report various strengths and deficiencies in the current medical college admission criteria, and call for employing more sensitive and valid ones that predict student performance and functional knowledge, especially in the clinical years. PMID:29176032
Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study.

PubMed

Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I

2015-11-01

Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
Impairment of executive function in Kenyan children exposed to severe falciparum malaria with neurological involvement.

PubMed

Kariuki, Symon M; Abubakar, Amina; Newton, Charles R J C; Kihara, Michael

2014-09-16

Persistent neurocognitive impairments occur in a fifth of children hospitalized with severe falciparum malaria. There is little data on the association between different neurological phenotypes of severe malaria (seizures, impaired consciousness and prostration) and impairments in executive function. Executive functioning of children exposed to severe malaria with different neurological phenotypes (N = 58) and in those unexposed (N = 56) was examined using neuropsychological tests such as vigilance test, test for everyday attention test for children (TEA-Ch), contingency naming test (CNT) and self-ordered pointing test (SOPT). Linear regression was used to determine the association between neurological phenotypes of severe malaria and executive function performance scores, accounting for potential confounders. Children with complex seizures in severe malaria performed more poorly than unexposed controls in the vigilance (median efficiency scores (interquartile range) = 4.84 (1.28-5.68) vs. 5.84 (4.71-6.42), P = 0.030) and SOPT (mean errors (standard deviation) = 29.50 (8.82) vs. 24.80 (6.50), P = 0.029) tests, but no differences were observed in TEA-Ch and CNT tests. Performance scores for other neurological phenotypes of severe malaria were similar with those of unexposed controls. After accounting for potential confounders, such as child's age, sex, schooling; maternal age, schooling and economic activity; perinatal factors and history of seizures, complex seizures remained associated with efficiency scores in the vigilance test (beta coefficient (β) (95% confidence interval (CI)) = -0.40 (-0.67, -0.13), P = 0.006) and everyday attention scores of the TEA-Ch test (β (95% CI) = -0.57 (-1.04, -0.10), P = 0.019); the association with SOPT error scores was weak (β (95% CI) = 4.57 (-0.73-9.89), P = 0.089). Combined neurological phenotypes were not significantly associated with executive function performance scores. Executive function impairment in children with severe malaria is associated with specific neurological phenotypes, particularly complex seizures. Effective prophylaxis and management of malaria-associated acute seizures may improve executive functioning performance scores of children.
The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction

PubMed Central

Peng, Jiangjun; Leung, Yee; Leung, Kwong-Sak; Wong, Man-Hon; Lu, Gang; Ballester, Pedro J.

2018-01-01

It has recently been claimed that the outstanding performance of machine-learning scoring functions (SFs) is exclusively due to the presence of training complexes with highly similar proteins to those in the test set. Here, we revisit this question using 24 similarity-based training sets, a widely used test set, and four SFs. Three of these SFs employ machine learning instead of the classical linear regression approach of the fourth SF (X-Score which has the best test set performance out of 16 classical SFs). We have found that random forest (RF)-based RF-Score-v3 outperforms X-Score even when 68% of the most similar proteins are removed from the training set. In addition, unlike X-Score, RF-Score-v3 is able to keep learning with an increasing training set size, becoming substantially more predictive than X-Score when the full 1105 complexes are used for training. These results show that machine-learning SFs owe a substantial part of their performance to training on complexes with dissimilar proteins to those in the test set, against what has been previously concluded using the same data. Given that a growing amount of structural and interaction data will be available from academic and industrial sources, this performance gap between machine-learning SFs and classical SFs is expected to enlarge in the future. PMID:29538331
The Impact of Protein Structure and Sequence Similarity on the Accuracy of Machine-Learning Scoring Functions for Binding Affinity Prediction.

PubMed

Li, Hongjian; Peng, Jiangjun; Leung, Yee; Leung, Kwong-Sak; Wong, Man-Hon; Lu, Gang; Ballester, Pedro J

2018-03-14

It has recently been claimed that the outstanding performance of machine-learning scoring functions (SFs) is exclusively due to the presence of training complexes with highly similar proteins to those in the test set. Here, we revisit this question using 24 similarity-based training sets, a widely used test set, and four SFs. Three of these SFs employ machine learning instead of the classical linear regression approach of the fourth SF (X-Score which has the best test set performance out of 16 classical SFs). We have found that random forest (RF)-based RF-Score-v3 outperforms X-Score even when 68% of the most similar proteins are removed from the training set. In addition, unlike X-Score, RF-Score-v3 is able to keep learning with an increasing training set size, becoming substantially more predictive than X-Score when the full 1105 complexes are used for training. These results show that machine-learning SFs owe a substantial part of their performance to training on complexes with dissimilar proteins to those in the test set, against what has been previously concluded using the same data. Given that a growing amount of structural and interaction data will be available from academic and industrial sources, this performance gap between machine-learning SFs and classical SFs is expected to enlarge in the future.
Assessing Growth in Young Children: A Comparison of Raw, Age-Equivalent, and Standard Scores Using the Peabody Picture Vocabulary Test

ERIC Educational Resources Information Center

Sullivan, Jeremy R.; Winter, Suzanne M.; Sass, Daniel A.; Svenkerud, Nicole

2014-01-01

Many tests provide users with several different types of scores to facilitate interpretation and description of students' performance. Common examples include raw scores, age- and grade-equivalent scores, and standard scores. However, when used within the context of assessing growth among young children, these scores should not be interchangeable…
Monitoring the Performance of Human and Automated Scores for Spoken Responses

ERIC Educational Resources Information Center

Wang, Zhen; Zechner, Klaus; Sun, Yu

2018-01-01

As automated scoring systems for spoken responses are increasingly used in language assessments, testing organizations need to analyze their performance, as compared to human raters, across several dimensions, for example, on individual items or based on subgroups of test takers. In addition, there is a need in testing organizations to establish…
Hispanics' SAT Scores: The Influences of Level of Parental Education, Performance-Avoidance Goals, and Knowledge about Learning

ERIC Educational Resources Information Center

Hannon, Brenda

2015-01-01

This study uncovers which learning (epistemic belief of learning), socioeconomic background (level of parental education, family income) or social-personality factors (performance-avoidance goals, test anxiety) mitigate the ethnic gap in SAT (Scholastic Assessment Test) scores. Measures assessing achievement motivation, test anxiety, socioeconomic…
Self Adapted Testing as Formative Assessment: Effects of Feedback and Scoring on Engagement and Performance

ERIC Educational Resources Information Center

Arieli-Attali, Meirav

2016-01-01

This dissertation investigated the feasibility of self-adapted testing (SAT) as a formative assessment tool with the focus on learning. Under two different orientation goals--to excel on a test (performance goal) or to learn from the test (learning goal)--I examined the effect of different scoring rules provided as interactive feedback, on test…
Enhancing the Interpretability of the Overall Results of an International Test of English-Language Proficiency

ERIC Educational Resources Information Center

Papageorgiou, Spiros; Morgan, Rick; Becker, Valerie

2015-01-01

The purpose of this study was to enhance the meaning of the scores of an English-language test by developing performance levels and descriptors for reporting overall test performance. The levels and descriptors were intended to accompany the total scale scores of TOEFL Junior® Standard, an international test of English as a second/foreign…
Exploring the motivation jungle: predicting performance on a novel task by investigating constructs from different motivation perspectives in tandem.

PubMed

Van Nuland, Hanneke J C; Dusseldorp, Elise; Martens, Rob L; Boekaerts, Monique

2010-08-01

Different theoretical viewpoints on motivation make it hard to decide which model has the best potential to provide valid predictions on classroom performance. This study was designed to explore motivation constructs derived from different motivation perspectives that predict performance on a novel task best. Motivation constructs from self-determination theory, self-regulation theory, and achievement goal theory were investigated in tandem. Performance was measured by systematicity (i.e. how systematically students worked on a problem-solving task) and test score (i.e. score on a multiple-choice test). Hierarchical regression analyses on data from 259 secondary school students showed a quadratic relation between a performance avoidance orientation and both performance outcomes, indicating that extreme high and low performance avoidance resulted in the lowest performance. Furthermore, two three-way interaction effects were found. Intrinsic motivation seemed to play a key role in test score and systematicity performance, provided that effort regulation and metacognitive skills were both high. Results indicate that intrinsic motivation in itself is not enough to attain a good performance. Instead, a moderate score on performance avoidance, together with the ability to remain motivated and effectively regulate and control task behavior, is needed to attain a good performance. High time management skills also contributed to higher test score and systematicity performance and a low performance approach orientation contributed to higher systematicity performance. We concluded that self-regulatory skills should be trained in order to have intrinsically motivated students perform well on novel tasks in the classroom.

Performance Pay Path to Improvement

ERIC Educational Resources Information Center

Gratz, Donald B.

2011-01-01

The primary goal of performance pay for the past decade has been higher test scores, and the most prominent strategy has been to increase teacher performance through financial incentives. If teachers are rewarded for success, according to this logic, they will try harder. If they try harder, more children will achieve higher test scores. The…
The Role of Test Scores in Explaining Race and Gender Differences in Wages

ERIC Educational Resources Information Center

Blackburn, McKinley L.

2004-01-01

Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…
Effects of handcuffs on neuropsychological testing: Implications for criminal forensic evaluations.

PubMed

Biddle, Christine M; Fazio, Rachel L; Dyshniku, Fiona; Denney, Robert L

2018-01-01

Neuropsychological evaluations are increasingly performed in forensic contexts, including in criminal settings where security sometimes cannot be compromised to facilitate evaluation according to standardized procedures. Interpretation of nonstandardized assessment results poses significant challenges for the neuropsychologist. Research is limited in regard to the validation of neuropsychological test accommodation and modification practices that deviate from standard test administration; there is no published research regarding the effects of hand restraints upon neuropsychological evaluation results. This study provides preliminary results regarding the impact of restraints on motor functioning and common neuropsychological tests with a motor component. When restrained, performance on nearly all tests utilized was significantly impacted, including Trail Making Test A/B, a coding test, and several tests of motor functioning. Significant performance decline was observed in both raw scores and normative scores. Regression models are also provided in order to help forensic neuropsychologists adjust for the effect of hand restraints on raw scores of these tests, as the hand restraints also resulted in significant differences in normative scores; in the most striking case there was nearly a full standard deviation of discrepancy.
Heparin-Induced Thrombocytopenia Antibody Test

MedlinePlus

... HIT II is clinically suspected. There is a pre-test scoring system that is typically used to determine ... The HIT antibody test is performed when this pre-scoring test shows that a person has a moderate to ...
Structured didactic teaching sessions improve medical student neurology clerkship test scores: a pilot study.

PubMed

Menkes, Daniel L; Reed, Mary

2008-01-01

To determine the effectiveness of didactic case-based instruction methodology to improve medical student comprehension of common neurological illnesses and neurological emergencies. Neurology department, academic university. 415 third and fourth year medical students performing a required four week neurology clerkship. Raw test scores on a 1 hour, 50-item clinical vignette based examination and open-ended questions in a post-clerkship feedback session. There was a statistically significant improvement in overall test scores (p<0.001). Didactic teaching sessions have a significant positive impact on neurology student clerkship test score performance and perception of their educational experience. Confirmation of these results across multiple specialties in a multi-center trial is warranted.
The Bay Area Verbal Learning Test (BAVLT): Normative Data and the Effects of Repeated Testing, Simulated Malingering, and Traumatic Brain Injury

PubMed Central

Woods, David L.; Wyma, John M.; Herron, Timothy J.; Yund, E. William

2017-01-01

Verbal learning tests (VLTs) are widely used to evaluate memory deficits in neuropsychiatric and developmental disorders. However, their validity has been called into question by studies showing significant differences in VLT scores obtained by different examiners. Here we describe the computerized Bay Area Verbal Learning Test (BAVLT), which minimizes inter-examiner differences by incorporating digital list presentation and automated scoring. In the 10-min BAVLT, a 12-word list is presented on three acquisition trials, followed by a distractor list, immediate recall of the first list, and, after a 30-min delay, delayed recall and recognition. In Experiment 1, we analyzed the performance of 195 participants ranging in age from 18 to 82 years. Acquisition trials showed strong primacy and recency effects, with scores improving over repetitions, particularly for mid-list words. Inter-word intervals (IWIs) increased with successive words recalled. Omnibus scores (summed over all trials except recognition) were influenced by age, education, and sex (women outperformed men). In Experiment 2, we examined BAVLT test-retest reliability in 29 participants tested with different word lists at weekly intervals. High intraclass correlation coefficients were seen for omnibus and acquisition scores, IWIs, and a categorization index reflecting semantic reorganization. Experiment 3 examined the performance of Experiment 2 participants when feigning symptoms of traumatic brain injury. Although 37% of simulated malingerers showed abnormal (p < 0.05) omnibus z-scores, z-score cutoffs were ineffective in discriminating abnormal malingerers from control participants with abnormal scores. In contrast, four malingering indices (recognition scores, primacy/recency effects, learning rate across acquisition trials, and IWIs) discriminated the two groups with 80% sensitivity and 80% specificity. Experiment 4 examined the performance of a small group of patients with mild or severe TBI. Overall, both patient groups performed within the normal range, although significant performance deficits were seen in some patients. The BAVLT improves the speed and replicability of verbal learning assessments while providing comprehensive measures of retrieval timing, semantic organization, and primacy/recency effects that clarify the nature of performance. PMID:28127280
Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

ERIC Educational Resources Information Center

Klesch, Heather S.

2010-01-01

The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
The effects of individual differences, prior experience and cognitive load on the transfer of dynamic decision-making performance.

PubMed

Nicholson, Brad; O'Hare, David

2014-01-01

Situational awareness is recognised as an important factor in the performance of individuals and teams in dynamic decision-making (DDM) environments (Salmon et al. 2014 ). The present study was designed to investigate whether the scores on the WOMBAT™ Situational Awareness and Stress Tolerance Test (Roscoe and North 1980 ) would predict the transfer of DDM performance from training under different levels of cognitive load to a novel situation. Participants practised a simulated firefighting task under either low or high conditions of cognitive load and then performed a (transfer) test in an alternative firefighting environment under an intermediate level of cognitive load. WOMBAT™ test scores were a better predictor of DDM performance than scores on the Raven Matrices. Participants with high WOMBAT™ scores performed better regardless of their training condition. Participants with recent gaming experience who practised under low cognitive load showed better practice phase performance but worse transfer performance than those who practised under high cognitive load. The relationship between task experience, situational awareness ability, cognitive load and the transfer of dynamic decision-making (DDM) performance was investigated. Results showed that the WOMBAT™ test predicted transfer of DDM performance regardless of task cognitive load. The effects of cognitive load on performance varied according to previous task-relevant experience.
Fundamentals of endoscopic surgery: creation and validation of the hands-on test.

PubMed

Vassiliou, Melina C; Dunkin, Brian J; Fried, Gerald M; Mellinger, John D; Trus, Thadeus; Kaneva, Pepa; Lyons, Calvin; Korndorffer, James R; Ujiki, Michael; Velanovich, Vic; Kochman, Michael L; Tsuda, Shawn; Martinez, Jose; Scott, Daniel J; Korus, Gary; Park, Adrian; Marks, Jeffrey M

2014-03-01

The Fundamentals of Endoscopic Surgery™ (FES) program consists of online materials and didactic and skills-based tests. All components were designed to measure the skills and knowledge required to perform safe flexible endoscopy. The purpose of this multicenter study was to evaluate the reliability and validity of the hands-on component of the FES examination, and to establish the pass score. Expert endoscopists identified the critical skill set required for flexible endoscopy. They were then modeled in a virtual reality simulator (GI Mentor™ II, Simbionix™ Ltd., Airport City, Israel) to create five tasks and metrics. Scores were designed to measure both speed and precision. Validity evidence was assessed by correlating performance with self-reported endoscopic experience (surgeons and gastroenterologists [GIs]). Internal consistency of each test task was assessed using Cronbach's alpha. Test-retest reliability was determined by having the same participant perform the test a second time and comparing their scores. Passing scores were determined by a contrasting groups methodology and use of receiver operating characteristic curves. A total of 160 participants (17 % GIs) performed the simulator test. Scores on the five tasks showed good internal consistency reliability and all had significant correlations with endoscopic experience. Total FES scores correlated 0.73, with participants' level of endoscopic experience providing evidence of their validity, and their internal consistency reliability (Cronbach's alpha) was 0.82. Test-retest reliability was assessed in 11 participants, and the intraclass correlation was 0.85. The passing score was determined and is estimated to have a sensitivity (true positive rate) of 0.81 and a 1-specificity (false positive rate) of 0.21. The FES hands-on skills test examines the basic procedural components required to perform safe flexible endoscopy. It meets rigorous standards of reliability and validity required for high-stakes examinations, and, together with the knowledge component, may help contribute to the definition and determination of competence in endoscopy.
The reliability and validity of the Complex Task Performance Assessment: A performance-based assessment of executive function.

PubMed

Wolf, Timothy J; Dahl, Abigail; Auen, Colleen; Doherty, Meghan

2017-07-01

The objective of this study was to evaluate the inter-rater reliability, test-retest reliability, concurrent validity, and discriminant validity of the Complex Task Performance Assessment (CTPA): an ecologically valid performance-based assessment of executive function. Community control participants (n = 20) and individuals with mild stroke (n = 14) participated in this study. All participants completed the CTPA and a battery of cognitive assessments at initial testing. The control participants completed the CTPA at two different times one week apart. The intra-class correlation coefficient (ICC) for inter-rater reliability for the total score on the CTPA was .991. The ICCs for all of the sub-scores of the CTPA were also high (.889-.977). The CTPA total score was significantly correlated to Condition 4 of the DKEFS Color-Word Interference Test (p = -.425), and the Wechsler Test of Adult Reading (p = -.493). Finally, there were significant differences between control subjects and individuals with mild stroke on the total score of the CTPA (p = .007) and all sub-scores except interpretation failures and total items incorrect. These results are also consistent with other current executive function performance-based assessments and indicate that the CTPA is a reliable and valid performance-based measure of executive function.
The Cognitive Change Index as a Measure of Self and Informant Perception of Cognitive Decline: Relation to Neuropsychological Tests.

PubMed

Rattanabannakit, Chatchawan; Risacher, Shannon L; Gao, Sujuan; Lane, Kathleen A; Brown, Steven A; McDonald, Brenna C; Unverzagt, Frederick W; Apostolova, Liana G; Saykin, Andrew J; Farlow, Martin R

2016-01-01

The perception of cognitive decline by individuals and those who know them well ("informants") has been inconsistently associated with objective cognitive performance, but strongly associated with depressive symptoms. We investigated associations of self-report, informant-report, and discrepancy between self- and informant-report of cognitive decline obtained from the Cognitive Change Index (CCI) with cognitive test performance and self-reported depressive symptoms. 267 participants with normal cognition, mild cognitive impairment (MCI), or mild dementia were included from a cohort study and memory clinic. Association of test performance and self-rated depression (Geriatric Depression Scale, GDS) with CCI scores obtained from subjects (CCI-S), their informants (CCI-I), and discrepancy scores between subjects and informants (CCI-D; CCI-S minus CCI-I) were analyzed using correlation and analysis of covariance (ANCOVA) models. CCI-S and CCI-I scores showed high internal consistency (Cronbach alpha 0.96 and 0.98, respectively). Higher scores on CCI-S and CCI-I, and lower scores on the CCI-D, were associated with lower performance on various cognitive tests in both univariate and in ANCOVA models adjusted for age, gender, and education. Adjustment for GDS slightly weakened the relationships between CCI and test performance but most remained significant. Self- and informant-report of cognitive decline, as measured by the CCI, show moderately strong relationships with objective test performance independent of age, gender, education, and depressive symptoms. The CCI appears to be a valid cross-sectional measure of self and informant perception of cognitive decline across the continuum of functioning. Studies are needed to address the relationship of CCI scores to longitudinal outcome.
The Influence of Training and Experience on Rater Performance in Scoring Spoken Language

ERIC Educational Resources Information Center

Davis, Larry

2016-01-01

Two factors were investigated that are thought to contribute to consistency in rater scoring judgments: rater training and experience in scoring. Also considered were the relative effects of scoring rubrics and exemplars on rater performance. Experienced teachers of English (N = 20) scored recorded responses from the TOEFL iBT speaking test prior…
Still under the microscope: can a surgical aptitude test predict otolaryngology resident performance?

PubMed

Moore, Eric J; Price, Daniel L; Van Abel, Kathryn M; Carlson, Matthew L

2015-02-01

Application to otolaryngology-head and neck surgery residency is highly competitive, and the interview process strives to select qualified applicants with a high aptitude for the specialty. Commonly employed criteria for applicant selection have failed to show correlation with proficiency during residency training. We evaluate the correlation between the results of a surgical aptitude test administered to otolaryngology resident applicants and their performance during residency. Retrospective study at an academic otolaryngology-head and neck surgery residency program. Between 2007 and 2013, 224 resident applicants participated in a previously described surgical aptitude test administered at a microvascular surgical station. The composite score and attitudinal scores for 24 consecutive residents who matched at our institution were recorded, and their residency performance was analyzed by faculty survey on a five-point scale. The composite and attitudinal scores were analyzed for correlation with residency performance score by regression analysis. Twenty-four residents were evaluated for overall quality as a clinician by eight faculty members who were blinded to the results of surgical aptitude testing. The results of these surveys showed good inter-rater reliability. Both the overall aptitude test scores and the subset attitudinal score showed reliability in predicting performance during residency training. The goal of the residency selection process is to evaluate the candidate's potential for success in residency and beyond. The results of this study suggest that a simple-to-administer clinical skills test may have predictive value for success in residency and clinician quality. 4. © 2014 The American Laryngological, Rhinological and Otological Society, Inc.
Fine-motor skills testing and prediction of endovascular performance.

PubMed

Bech, Bo; Lönn, Lars; Schroeder, Torben V; Ringsted, Charlotte

2013-12-01

Performing endovascular procedures requires good control of fine-motor digital movements and hand-eye coordination. Objective assessment of such skills is difficult. Trainees acquire control of catheter/wire movements at various paces. However, little is known to what extent talent plays for novice candidates at entry to practice. To study the association between performance in a novel aptitude test of fine-motor skills and performance in simulated procedures. The test was based on manual course-tracking using a proprietary hand-operated roller-bar device coupled to a personal computer with monitor view rotation. A total of 40 test repetitions were conducted separately with each hand. Test scores were correlated with simulator performance. Group A (n = 14), clinicians with various levels of endovascular experience, performed a simulated procedure of contralateral iliac artery stenting. Group B (n = 19), medical students, performed 10 repetitions of crossing a challenging aortic bifurcation in a simulator. The test score differed markedly between the individuals in both groups, in particular with the non-dominant hand. Group A: the test score with the non-dominant hand correlated significantly with simulator performance assessed with the global rating scale SAVE (R = -0.69, P = 0.007). There was no association observed from performances with the dominant hand. Group B: there was no significant association between the test score and endovascular skills acquisition neither with the dominant nor with the non-dominant hand. Clinicians with increasing levels of endovascular technical experience had developed good fine-motor control of the non-dominant hand, in particular, that was associated with good procedural performance in the simulator. The aptitude test did not predict endovascular skills acquisition among medical students, thus, cannot be suggested for selection of novice candidates. Procedural experience and practice probably supplant the influence of innate abilities (talent) over time.
Characteristics and clinical correlates of prospective memory performance in first-episode schizophrenia.

PubMed

Zhou, Fu-Chun; Xiang, Yu-Tao; Wang, Chuan-Yue; Dickerson, Faith; Au, Raymond W C; Zhou, Jing-Jing; Zhou, Yan; Shum, David H K; Chiu, Helen F K; Man, David; Lee, Edwin H M; Yu, Xin; Chan, Raymond C K; Ungvari, Gabor S

2012-03-01

The aim of this study was to examine prospective memory (PM) and its socio-demographic, clinical, and neurocognitive correlates in first episode schizophrenia (FES). Fifty-one FES patients and 42 healthy controls formed the study sample. Time- and event-based PM (TBPM and EBPM) performance were measured with the Chinese version of the Cambridge Prospective Memory Test (C-CAMPROMPT). A battery of neuropsychological tests was also administered. Patients' clinical symptoms were evaluated with the Positive and Negative Symptom Scale (PANSS). Patients performed significantly worse in both TBPM (8.7 ± 5.3 vs. 14.8 ± 3.5) and EBPM (11.3 ± 4.7 vs. 15.7 ± 2.7) than the controls. After controlling for age, gender, education level and neurocognitive test score, the difference in performance on the two types of PM tasks between patients and controls was no longer present. In multiple linear regression analyses, longer duration of untreated psychosis (DUP), lower scores of the Hopkins Verbal Learning Test-Revised (HVLT-R) and the categories completed of the Wisconsin Card Sorting Test (WCST-CC) and higher score of the Color Trails Test-2 (CTT-2) contributed to poorer TBPM performance, while lower score of HVLT-R, higher score of the perseverative errors of the Wisconsin Card Sorting Test (WCST-PE) and longer DUP contributed to worse performance on EBPM. Both subtypes of PM are impaired in first-episode schizophrenia suggesting that PM deficits are an integral part of the cognitive dysfunction in the disease process. Copyright © 2011 Elsevier B.V. All rights reserved.
Pharmacy Students' Test-Taking Motivation-Effort on a Low-Stakes Standardized Test

PubMed Central

2011-01-01

Objective To measure third-year pharmacy students' level of motivation while completing the Pharmacy Curriculum Outcomes Assessment (PCOA) administered as a low-stakes test to better understand use of the PCOA as a measure of student content knowledge. Methods Student motivation was manipulated through an incentive (ie, personal letter from the dean) and a process of statistical motivation filtering. Data were analyzed to determine any differences between the experimental and control groups in PCOA test performance, motivation to perform well, and test performance after filtering for low motivation-effort. Results Incentivizing students diminished the need for filtering PCOA scores for low effort. Where filtering was used, performance scores improved, providing a more realistic measure of aggregate student performance. Conclusions To ensure that PCOA scores are an accurate reflection of student knowledge, incentivizing and/or filtering for low motivation-effort among pharmacy students should be considered fundamental best practice when the PCOA is administered as a low-stakes test PMID:21655395
An explorative study of school performance and antipsychotic medication.

PubMed

van der Schans, J; Vardar, S; Çiçek, R; Bos, H J; Hoekstra, P J; de Vries, T W; Hak, E

2016-09-21

Antipsychotic therapy can reduce severe symptoms of psychiatric disorders, however, data on school performance among children on such treatment are lacking. The objective was to explore school performance among children using antipsychotic drugs at the end of primary education. A cross-sectional study was conducted using the University Groningen pharmacy database linked to academic achievement scores at the end of primary school (Dutch Cito-test) obtained from Statistics Netherlands. Mean Cito-test scores and standard deviations were obtained for children on antipsychotic therapy and reference children, and statistically compared using analyses of covariance. In addition, differences in subgroups as boys versus girls, ethnicity, household income, and late starters (start date within 12 months of the Cito-test) versus early starters (start date > 12 months before the Cito-test) were tested. In all, data from 7994 children could be linked to Cito-test scores. At the time of the Cito-test, 45 (0.6 %) were on treatment with antipsychotics. Children using antipsychotics scored on average 3.6 points lower than the reference peer group (534.5 ± 9.5). Scores were different across gender and levels of household income (p < 0.05). Scores of early starters were significantly higher than starters within 12 months (533.7 ± 1.7 vs. 524.1 ± 2.6). This first exploration showed that children on antipsychotic treatment have lower school performance compared to the reference peer group at the end of primary school. This was most noticeable for girls, but early starters were less affected than later starters. Due to the observational cross-sectional nature of this study, no causality can be inferred, but the results indicate that school performance should be closely monitored and causes of underperformance despite treatment warrants more research.
Effects of Didactic Instruction and Test-Enhanced Learning in a Nursing Review Course.

PubMed

Tu, Yu-Ching; Lin, Yi-Jung; Lee, Jonathan W; Fan, Lir-Wan

2017-11-01

Determining the most effective approach for students' successful academic performance and achievement on the national licensure examination for RNs is important to nursing education and practice. A quasi-experimental design was used to compare didactic instruction and test-enhanced learning among nursing students divided into two fundamental nursing review courses in their final semester. Students in each course were subdivided into low-, intermediate-, and high-score groups based on their first examination scores. Mixed model of repeated measure and two-way analysis of variance were applied to evaluate students' academic results and both teaching approaches. Intermediate-scoring students' performances improved more through didactic instruction, whereas low-scoring students' performances improved more through test-enhanced learning. Each method had differing effects on individual subgroups within the different performance level groups of their classes, which points to the importance of considering both the didactic and test-enhanced learning approaches. [J Nurs Educ. 2017;56(11):683-687.]. Copyright 2017, SLACK Incorporated.
Preoperative prediction of inpatient recovery of function after total hip arthroplasty using performance-based tests: a prospective cohort study.

PubMed

Oosting, Ellen; Hoogeboom, Thomas J; Appelman-de Vries, Suzan A; Swets, Adam; Dronkers, Jaap J; van Meeteren, Nico L U

2016-01-01

The aim of this study was to evaluate the value of conventional factors, the Risk Assessment and Predictor Tool (RAPT) and performance-based functional tests as predictors of delayed recovery after total hip arthroplasty (THA). A prospective cohort study in a regional hospital in the Netherlands with 315 patients was attending for THA in 2012. The dependent variable recovery of function was assessed with the Modified Iowa Levels of Assistance scale. Delayed recovery was defined as taking more than 3 days to walk independently. Independent variables were age, sex, BMI, Charnley score, RAPT score and scores for four performance-based tests [2-minute walk test, timed up and go test (TUG), 10-meter walking test (10 mW) and hand grip strength]. Regression analysis with all variables identified older age (>70 years), Charnley score C, slow walking speed (10 mW >10.0 s) and poor functional mobility (TUG >10.5 s) as the best predictors of delayed recovery of function. This model (AUC 0.85, 95% CI 0.79-0.91) performed better than a model with conventional factors and RAPT scores, and significantly better (p = 0.04) than a model with only conventional factors (AUC 0.81, 95% CI 0.74-0.87). The combination of performance-based tests and conventional factors predicted inpatient functional recovery after THA. Two simple functional performance-based tests have a significant added value to a more conventional screening with age and comorbidities to predict recovery of functioning immediately after total hip surgery. Patients over 70 years old, with comorbidities, with a TUG score >10.5 s and a walking speed >1.0 m/s are at risk for delayed recovery of functioning. Those high risk patients need an accurate discharge plan and could benefit from targeted pre- and postoperative therapeutic exercise programs.
School Performance: A Matter of Health or Socio-Economic Background? Findings from the PIAMA Birth Cohort Study

PubMed Central

Ruijsbroek, Annemarie; Wijga, Alet H.; Gehring, Ulrike; Kerkhof, Marjan; Droomers, Mariël

2015-01-01

Background Performance in primary school is a determinant of children’s educational attainment and their socio-economic position and health inequalities in adulthood. We examined the relationship between five common childhood health conditions (asthma symptoms, eczema, general health, frequent respiratory infections, and overweight), health related school absence and family socio-economic status on children’s school performance. Methods We used data from 1,865 children in the Dutch PIAMA birth cohort study. School performance was measured as the teacher’s assessment of a suitable secondary school level for the child, and the child’s score on a standardized achievement test (Cito Test). Both school performance indicators were standardised using Z-scores. Childhood health was indicated by eczema, asthma symptoms, general health, frequent respiratory infections, overweight, and health related school absence. Children’s health conditions were reported repeatedly between the age of one to eleven. School absenteeism was reported at age eleven. Highest attained educational level of the mother and father indicated family socio-economic status. We used linear regression models with heteroskedasticity-robust standard errors for our analyses with adjustment for sex of the child. Results The health indicators used in our study were not associated with children’s school performance, independently from parental educational level, with the exception of asthma symptoms (-0.03 z-score / -0.04 z-score with Cito Test score after adjusting for respectively maternal and paternal education) and missing more than 5 schooldays due to illness (-0.18 z-score with Cito Test score and -0.17 z-score with school level assessment after adjustment for paternal education). The effect estimates for these health indicators were much smaller though than the effect estimates for parental education, which was strongly associated with children’s school performance. Conclusion Children’s school performance was affected only slightly by a number of common childhood health problems, but was strongly associated with parental education. PMID:26247468

School Performance: A Matter of Health or Socio-Economic Background? Findings from the PIAMA Birth Cohort Study.

PubMed

Ruijsbroek, Annemarie; Wijga, Alet H; Gehring, Ulrike; Kerkhof, Marjan; Droomers, Mariël

2015-01-01

Performance in primary school is a determinant of children's educational attainment and their socio-economic position and health inequalities in adulthood. We examined the relationship between five common childhood health conditions (asthma symptoms, eczema, general health, frequent respiratory infections, and overweight), health related school absence and family socio-economic status on children's school performance. We used data from 1,865 children in the Dutch PIAMA birth cohort study. School performance was measured as the teacher's assessment of a suitable secondary school level for the child, and the child's score on a standardized achievement test (Cito Test). Both school performance indicators were standardised using Z-scores. Childhood health was indicated by eczema, asthma symptoms, general health, frequent respiratory infections, overweight, and health related school absence. Children's health conditions were reported repeatedly between the age of one to eleven. School absenteeism was reported at age eleven. Highest attained educational level of the mother and father indicated family socio-economic status. We used linear regression models with heteroskedasticity-robust standard errors for our analyses with adjustment for sex of the child. The health indicators used in our study were not associated with children's school performance, independently from parental educational level, with the exception of asthma symptoms (-0.03 z-score / -0.04 z-score with Cito Test score after adjusting for respectively maternal and paternal education) and missing more than 5 schooldays due to illness (-0.18 z-score with Cito Test score and -0.17 z-score with school level assessment after adjustment for paternal education). The effect estimates for these health indicators were much smaller though than the effect estimates for parental education, which was strongly associated with children's school performance. Children's school performance was affected only slightly by a number of common childhood health problems, but was strongly associated with parental education.
The first OSCE; does students' experience of performing in public affect their results?

PubMed

Chan, Michael; Bax, Nigel; Woodley, Caroline; Jennings, Michael; Nicolson, Rod; Chan, Philip

2015-03-26

Personal qualities have been shown to affect students' exam results. We studied the effect of experience, and level, of public performance in music, drama, dance, sport, and debate at the time of admission to medical school as a predictor of student achievement in their first objective structured clinical examination (OSCE). A single medical school cohort (n = 265) sitting their first clinical exam in 2011 as third year students were studied. Pre-admission statements made at the time of application were coded for their stated achievements in the level of public performance; participation in each activity was scored 0-3, where 0 was no record, 1 = leisure time activity, 2 = activity at school or local level, 3 = activity at district, regional or national level. These scores were correlated to OSCE results by linear regression and t-test. Comparison was made between the highest scoring students in each area, and students scoring zero by t-test. There was a bell shaped distribution in public performance score in this cohort. There was no significant linear regression relationship between OSCE results and overall performance score, or between any subgroups. There was a significant difference between students with high scores in theatre, debate and vocal music areas, grouped together as verbal performance, and students scoring zero in these areas. (p < 0.05, t-test) with an effect size of 0.4. We found modest effects from pre-admission experience of verbal performance on students' scores in the OSCE examination. As these data are taken from students' admission statements, we call into question the received wisdom that such statements are unreliable.
Undergraduate GPAs, MCAT scores, and academic performance the first 2 years in podiatric medical school at Des Moines University.

PubMed

Yoho, Robert M; Antonopoulos, Kosta; Vardaxis, Vassilios

2012-01-01

This study was performed to determine the relationship between undergraduate academic performance and total Medical College Admission Test score and academic performance in the podiatric medical program at Des Moines University. The allopathic and osteopathic medical professions have published educational research examining this relationship. To our knowledge, no such educational research has been published for podiatric medical education. The undergraduate cumulative and science grade point averages and total Medical College Admission Test scores of four podiatric medical classes (2007-2010, N = 169) were compared with their academic performance in the first 2 years of podiatric medical school using pairwise Pearson product moment correlations and multiple regression analysis. Significant low to moderate positive correlations were identified between undergraduate cumulative and science grade point averages and student academic performance in years 1 and 2 of podiatric medical school for each of the four classes (except one) and the pooled data. There was no significant correlation between Medical College Admission Test score and academic performance in years 1 and 2 (except one) and the pooled data. These results identify undergraduate cumulative grade point average as the strongest cognitive admissions variable in predicting academic performance in the podiatric medicine program at Des Moines University, followed by undergraduate science grade point average. These results also suggest limitations of the total Medical College Admission Test score in predicting academic performance. Information from this study can be used in the admissions process and to monitor student progress.
Linking the Smarter Balanced Assessments to NWEA MAP Assessments

ERIC Educational Resources Information Center

Northwest Evaluation Association, 2015

2015-01-01

Concordance tables have been used for decades to relate scores on different tests measuring similar but distinct constructs. These tables, typically derived from statistical linking procedures, provide a direct link between scores on different tests and serve various purposes. Aside from describing how a score on one test relates to performance on…
Insulin resistance and cognitive test performance in elderly adults: National health and nutrition examination survey (NHANES).

PubMed

Sherzai, Ayesha Z; Shaheen, Magda; Yu, Jeffrey J; Talbot, Konrad; Sherzai, Dean

2018-05-15

To examine the relationship between homeostatic model of insulin resistance (HOMA-IR) and cognitive test performance among population≥60years in a national database. Higher insulin resistance is associated with lower cognitive test performance score in the population≥60years. We analyzed data from the National Health and Nutrition Examination Survey (NHANES) 1999-2000 and 2001-2002. Cognitive test performance was measured by the Digit Symbol Substitution (DSS) exercise score. The main independent variable was the homeostasis model assessment of insulin resistance (HOMA-IR). We used bivariate analysis and generalized linear model adjusting for age, gender, race, education, body mass index, and systolic and diastolic blood pressures; total cholesterol, low density lipoprotein (LDL), high density lipoprotein (HDL) and triglyceride levels; and physical activity, diabetes mellitus, stroke, and congestive heart failure. STATA 14 was used to analyze the data taking into consideration the design, strata and weight. Of the 1028 participants, 44% were male and 85% were white. The mean age was 70.0±0.28 (SE) years. Their average HOMA-IR was 3.6±0.14 and they had a mean of 49.2±0.8 correct DSS score in the cognitive test. Adjusting for the confounding variables, HOMA-IR was associated with decline in DSS score (B=-0.30, 95% confidence interval=-0.54 and -0.05, p=0.01). The model explained 44% of the variability of the DSS score (R 2 =0.44). Significant predictors of decline in DSS score were age, gender, race, and education (p=0.01). Insulin resistance as measured by HOMA-IR was independently associated with lower cognitive test performance score among elderly participants aged ≥60years. Longitudinal studies are needed to test the mechanism and the causal relationship. Copyright © 2017. Published by Elsevier B.V.
Association of Health Sciences Reasoning Test scores with academic and experiential performance.

PubMed

Cox, Wendy C; McLaughlin, Jacqueline E

2014-05-15

To assess the association of scores on the Health Sciences Reasoning Test (HSRT) with academic and experiential performance in a doctor of pharmacy (PharmD) curriculum. The HSRT was administered to 329 first-year (P1) PharmD students. Performance on the HSRT and its subscales was compared with academic performance in 29 courses throughout the curriculum and with performance in advanced pharmacy practice experiences (APPEs). Significant positive correlations were found between course grades in 8 courses and HSRT overall scores. All significant correlations were accounted for by pharmaceutical care laboratory courses, therapeutics courses, and a law and ethics course. There was a lack of moderate to strong correlation between HSRT scores and academic and experiential performance. The usefulness of the HSRT as a tool for predicting student success may be limited.
Automated Essay Scoring versus Human Scoring: A Correlational Study

ERIC Educational Resources Information Center

Wang, Jinhao; Brown, Michelle Stallone

2008-01-01

The purpose of the current study was to analyze the relationship between automated essay scoring (AES) and human scoring in order to determine the validity and usefulness of AES for large-scale placement tests. Specifically, a correlational research design was used to examine the correlations between AES performance and human raters' performance.…
Factors Contributing to Single- and Dual-Task Timed "Up & Go" Test Performance in Middle-Aged and Older Adults Who Are Active and Dwell in the Community.

PubMed

Chen, Hui-Ya; Tang, Pei-Fang

2016-03-01

Dual-task Timed "Up & Go" (TUG) tests are likely to have applications different from those of a single-task TUG test and may have different contributing factors. The purpose of this study was to compare factors contributing to performance on single- and dual-task TUG tests. This investigation was a cross-sectional study. Sixty-four adults who were more than 50 years of age and dwelled in the community were recruited. Interviews and physical examinations were performed to identify potential contributors to TUG test performance. The time to complete the single-task TUG test (TUGsingle) or the dual-task TUG test, which consisted of completing the TUG test while performing a serial subtraction task (TUGcognitive) or while carrying water (TUGmanual), was measured. Age, hip extensor strength, walking speed, general mental function, and Stroop scores for word and color were significantly associated with performance on all TUG tests. Hierarchical multiple regression models, without the input of walking speed, revealed different independent factors contributing to TUGsingle performance (Mini-Mental Status Examination score, β=-0.32), TUGmanual performance (age, β=0.35), and TUGcognitive performance (Stroop word score, β=-0.40; Mini-Mental Status Examination score, β=-0.31). At least 40% of the variance in the performance on the 3 TUG tests was not explained by common clinical measures, even when the factor of walking speed was considered. However, this study successfully identified some important factors contributing to performance on different TUG tests, and other studies have reported similar findings for single-task TUG test and dual-task gait performance. Although the TUGsingle and the TUGcognitive shared general mental function as a common factor, the TUGmanual was uniquely influenced by age and the TUGcognitive was uniquely influenced by focused attention. These results suggest that both common and unique factors contribute to performance on single- and dual-task TUG tests and suggest important applications of the combined use of the 3 TUG tests. © 2016 American Physical Therapy Association.
Impairment of Concept Formation Ability in Children with ADHD: Comparisons between Lower Grades and Higher Grades

PubMed Central

Hong, Hye Jeong; Kim, Jin Sung; Seo, Wan Seok; Koo, Bon Hoon; Bai, Dai Seg; Jeong, Jin Young

2010-01-01

Objective We investigated executive functions (EFs), as evaluated by the Wisconsin Card Sorting Test (WCST), and other EF between lower grades (LG) and higher grades (HG) in elementary-school-age attention deficit hyperactivity disorder (ADHD) children. Methods We classified a sample of 112 ADHD children into 4 groups (composed of 28 each) based on age (LG vs. HG) and WCST performance [lower vs. higher performance on WCST, defined by the number of completed categories (CC)] Participants in each group were matched according to age, gender, ADHD subtype, and intelligence. We used the Wechsler intelligence Scale for Children 3rd edition to test intelligence and the Computerized Neurocognitive Function Test-IV, which included the WCST, to test EF. Results Comparisons of EFs scores in LG ADHD children showed statistically significant differences in performing digit spans backward, some verbal learning scores, including all memory scores, and Stroop test scores. However, comparisons of EF scores in HG ADHD children did not show any statistically significant differences. Correlation analyses of the CC and EF variables and stepwise multiple regression analysis in LG ADHD children showed a combination of the backward form of the Digit span test and Visual span test in lower-performance ADHD participants significantly predicted the number of CC (R2=0.273, p<0.001). Conclusion This study suggests that the design of any battery of neuropsychological tests for measuring EF in ADHD children should first consider age before interpreting developmental variations and neuropsychological test results. Researchers should consider the dynamics of relationships within EF, as measured by neuropsychological tests. PMID:20927306
Test-retest reliability and minimal detectable change scores for the timed "up & go" test, the six-minute walk test, and gait speed in people with Alzheimer disease.

PubMed

Ries, Julie D; Echternach, John L; Nof, Leah; Gagnon Blodgett, Michelle

2009-06-01

With the increasing incidence of Alzheimer disease (AD), determining the validity and reliability of outcome measures for people with this disease is necessary. The goals of this study were to assess test-retest reliability of data for the Timed "Up & Go" Test (TUG), the Six-Minute Walk Test (6MWT), and gait speed and to calculate minimal detectable change (MDC) scores for each outcome measure. Performance differences between groups with mild to moderate AD and moderately severe to severe AD (as determined by the Functional Assessment Staging [FAST] scale) were studied. This was a prospective, nonexperimental, descriptive methodological study. Background data collected for 51 people with AD included: use of an assistive device, Mini-Mental Status Examination scores, and FAST scale scores. Each participant engaged in 2 test sessions, separated by a 30- to 60-minute rest period, which included 2 TUG trials, 1 6MWT trial, and 2 gait speed trials using a computerized gait assessment system. A specific cuing protocol was followed to achieve optimal performance during test sessions. Test-retest reliability values for the TUG, the 6MWT, and gait speed were high for all participants together and for the mild to moderate AD and moderately severe to severe AD groups separately (intraclass correlation coefficients > or = .973); however, individual variability of performance also was high. Calculated MDC scores at the 90% confidence interval were: TUG=4.09 seconds, 6MWT=33.5 m (110 ft), and gait speed=9.4 cm/s. The 2 groups were significantly different in performance of clinical tests, with the participants who were more cognitively impaired being more physically and functionally impaired. A single researcher for data collection limited sample numbers and prohibited blinding to dementia level. The TUG, the 6MWT, and gait speed are reliable outcome measures for use with people with AD, recognizing that individual variability of performance is high. Minimal detectable change scores at the 90% confidence interval can be used to assess change in performance over time and the impact of treatment.
Performance on a virtual reality angled laparoscope task correlates with spatial ability of trainees.

PubMed

Rosenthal, Rachel; Hamel, Christian; Oertli, Daniel; Demartines, Nicolas; Gantert, Walter A

2010-08-01

The aim of the present study was to investigate whether trainees' performance on a virtual reality angled laparoscope navigation task correlates with scores obtained on a validated conventional test of spatial ability. 56 participants of a surgery workshop performed an angled laparoscope navigation task on the Xitact LS 500 virtual reality Simulator. Performance parameters were correlated with the score of a validated paper-and-pencil test of spatial ability. Performance at the conventional spatial ability test significantly correlated with performance at the virtual reality task for overall task score (p < 0.001), task completion time (p < 0.001) and economy of movement (p = 0.035), not for endoscope travel speed (p = 0.947). In conclusion, trainees' performance in a standardized virtual reality camera navigation task correlates with their innate spatial ability. This VR session holds potential to serve as an assessment tool for trainees.
Comparing the performance plateau in adult cochlear implant patients using HINT and AzBio.

PubMed

Massa, Sean T; Ruckenstein, Michael J

2014-04-01

This study aims to characterize the performance plateau in adult cochlear implant recipients after the initial postimplantation increase by using word recognition testing and an explicit definition of performance plateau. Retrospective review. Urban, tertiary referral center. One hundred twenty-five patients with 138 devices tested with AzBio were matched to 130 patients with 138 devices tested with HINT based on performed on CNC monosyllable tests. Patient's performance was measured overtime using AzBio and HINT tests to determine when and at what score their performance reached a plateau. Time from implantation to reach a performance plateau and plateau score with each test. Thirty-four devices reached a HINT plateau and 30 devices reached an AzBio plateau. Patients reached plateaus at similar times postoperatively using HINT and AzBio, 18.8 and 16.5 weeks, respectively (p = 0.476). Five patients tested with HINT plateaued at scores of 99% to 100%, whereas no patients plateaued above 92% with AzBio. Patients reached a plateau in performance at similar median times using AzBio and HINT, despite the ceiling effect of HINT in some patients. Most patients who reach a plateau did so within 4 months, but exactly when and if a patient's performance plateaus varies significantly among individuals. Further study is required to determine which test best reflects when a patient reaches his or her maximal performance in natural listening conditions.
Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

ERIC Educational Resources Information Center

Lee, Yi-Hsuan; Zhang, Jinming

2017-01-01

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
Association of Cognitive Performance with Time at Altitude, Sleep Quality, and Acute Mountain Sickness Symptoms.

PubMed

Issa, Amine N; Herman, Nicole M; Wentz, Robert J; Taylor, Bryan J; Summerfield, Doug C; Johnson, Bruce D

2016-09-01

It is well documented that cognitive performance may be altered with ascent to altitude, but the association of various cognitive performance tests with symptoms of acute mountain sickness (AMS) is not well understood. Our objective was to assess and compare cognitive performance during a high-altitude expedition using several tests and to report the association of each test with AMS, headache, and quality of sleep. During an expedition to Mount Everest, 3 cognitive tests (Stroop, Trail Making, and the real-time cognitive assessment tool, an in-house developed motor accuracy test) were used along with a questionnaire to assess health and AMS. Eight team members were assessed pre-expedition, postexpedition, and at several time points during the expedition. There were no significant differences (P >.05) found among scores taken at 3 time points at base camp and the postexpedition scores for all 3 tests. Changes in the Stroop test scores were significantly associated with the odds of AMS (P <.05). The logistic regression results show that the percent change from baseline for Stroop score (β = -5.637; P = .032) and Stroop attempts (β = -5.269; P = .049) are significantly associated with the odds of meeting the criteria for AMS. No significant changes were found in overall cognitive performance at altitude, but a significant relationship was found between symptoms of AMS and performance in certain cognitive tests. This research shows the need for more investigation of objective physiologic assessments to associate with self-perceived metrics of AMS to gauge effect on cognitive performance. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.
Embedded measures of performance validity using verbal fluency tests in a clinical sample.

PubMed

Sugarman, Michael A; Axelrod, Bradley N

2015-01-01

The objective of this study was to determine to what extent verbal fluency measures can be used as performance validity indicators during neuropsychological evaluation. Participants were clinically referred for neuropsychological evaluation in an urban-based Veteran's Affairs hospital. Participants were placed into 2 groups based on their objectively evaluated effort on performance validity tests (PVTs). Individuals who exhibited credible performance (n = 431) failed 0 PVTs, and those with poor effort (n = 192) failed 2 or more PVTs. All participants completed the Controlled Oral Word Association Test (COWAT) and Animals verbal fluency measures. We evaluated how well verbal fluency scores could discriminate between the 2 groups. Raw scores and T scores for Animals discriminated between the credible performance and poor-effort groups with 90% specificity and greater than 40% sensitivity. COWAT scores had lower sensitivity for detecting poor effort. A combination of FAS and Animals scores into logistic regression models yielded acceptable group classification, with 90% specificity and greater than 44% sensitivity. Verbal fluency measures can yield adequate detection of poor effort during neuropsychological evaluation. We provide suggested cut points and logistic regression models for predicting the probability of poor effort in our clinical setting and offer suggested cutoff scores to optimize sensitivity and specificity.
Long-term and within-day variability of working memory performance and EEG in individuals.

PubMed

Gevins, Alan; McEvoy, Linda K; Smith, Michael E; Chan, Cynthia S; Sam-Vargas, Lita; Baum, Cliff; Ilan, Aaron B

2012-07-01

Assess individual-subject long-term and within-day variability of a combined behavioral and EEG test of working memory. EEGs were recorded from 16 adults performing n-back working memory tasks, with 10 tested in morning and afternoon sessions over several years. Participants were also tested after ingesting non-prescription medications or recreational substances. Performance and EEG measures were analyzed to derive an Overall score and three constituent sub-scores characterizing changes in performance, cortical activation, and alertness from each individual's baseline. Long-term and within-day variability were determined for each score; medication effects were assessed by reference to each individual's normal day-to-day variability. Over the several year period, the mean Overall score and sub-scores were approximately zero with standard deviations less than one. Overall scores were lower and their variability higher in afternoon relative to morning sessions. At the group level, alcohol, diphenhydramine and marijuana produced significant effects, but there were large individual differences. Objective working memory measures incorporating performance and EEG are stable over time and sensitive at the level of individual subjects to interventions that affect neurocognitive function. With further research these measures may be suitable for use in individualized medical care by providing a sensitive assessment of incipient illness and response to treatment. Published by Elsevier Ireland Ltd.
The Effect of Higher Education Variables on Cadet Performance during 1987 Light Aircraft Training

DTIC Science & Technology

1989-05-01

Affecting Performance .............................. 50 Academic Majors ........................... 50 Scholastic Aptitude Test Scores ....... 51 Quality...undergo a project such as the LATR program would not be feasible or rational. Perhaps a pure sample of flying talent in reference to academic performance ... performed that requirement was logged for inclusion in the total. Part 3, Academic Scores. Part three of the total performance score was the summation
Relationships between the Kaufman Brief Intelligence Test and the Wechsler Adult Intelligence Scale-Third Edition.

PubMed

Walters, Steven O; Weaver, Kenneth A

2003-06-01

The Kaufman Brief Intelligence Test detects learning problems of young students and is a screen for whether a more comprehensive test of intelligence is needed. A study to assess whether this test was valid as an adult intelligence test was conducted with 20 undergraduate psychology majors. The correlations between the Kaufman Brief Intelligence Test's Composite, Vocabulary, and Matrices test scores and their corresponding Wechsler Adult Intelligence Scale-Third Edition test scores, the Full Scale (r=.88), Verbal (r=.77), and Performance scores (r=.87), indicated very strong relationships. In addition, no significant differences were obtained between the Composite, Vocabulary, and Matrices means of the Kaufman Brief Intelligence Test and the Full Scale, Verbal, and Performance means of the WAIS-III. The Kaufman Brief Intelligence Test appears to be a valid test of intelligence for adults.
Low aerobic fitness and obesity are associated with lower standardized test scores in children.

PubMed

Roberts, Christian K; Freed, Benjamin; McCarthy, William J

2010-05-01

To investigate whether aerobic fitness and obesity in school children are associated with standardized test performance. Ethnically diverse (n = 1989) 5th, 7th, and 9th graders attending California schools comprised the sample. Aerobic fitness was determined by a 1-mile run/walk test; body mass index (BMI) was obtained from state-mandated measurements. California standardized test scores were obtained from the school district. Students whose mile run/walk times exceeded California Fitnessgram standards or whose BMI exceeded Centers for Disease Control sex- and age-specific body weight standards scored lower on California standardized math, reading, and language tests than students with desirable BMI status or fitness level, even after controlling for parent education among other covariates. Ethnic differences in standardized test scores were consistent with ethnic differences in obesity status and aerobic fitness. BMI-for-age was no longer a significant multivariate predictor when covariates included fitness level. Low aerobic fitness is common among youth and varies among ethnic groups, and aerobic fitness level predicts performance on standardized tests across ethnic groups. More research is needed to uncover the physiological mechanisms by which aerobic fitness may contribute to performance on standardized academic tests.
A job-related fitness test for the Dutch police.

PubMed

Strating, M; Bakker, R H; Dijkstra, G J; Lemmink, K A P M; Groothoff, J W

2010-06-01

The variety of tasks that characterize police work highlights the importance of being in good physical condition. To take a first step at standardizing the administration of a job-related test to assess a person's ability to perform the physical demands of the core tasks of police work. The principal research questions were: are test scores related to gender, age and function and are test scores related to body mass index (BMI) and the number of hours of physical exercise? Data of 6999 police officers, geographically spread over all parts of The Netherlands, who completed a physical competence test over a 1 year period were analysed. Women performed the test significantly more slowly than men. The mean test score was also related to age; the older a person the longer it took to complete the test. A higher BMI was associated with less hours of body exercise a week and a slower test performance, both in women and men. The differences in individual test scores, based on gender and age, have implications for future strategy within the police force. From a viewpoint of 'same job, same standard' one has to accept that test-score differences may lead to the exclusion of certain staff. However, from a viewpoint of 'diversity as a business issue', one may have to accept that on average, both female and older police officers are physically less tailored to their jobs than their male and younger colleagues.

Does familiarity with computers affect computerized neuropsychological test performance?

PubMed

Iverson, Grant L; Brooks, Brian L; Ashton, V Lynn; Johnson, Lynda G; Gualtieri, C Thomas

2009-07-01

The purpose of this study was to determine whether self-reported computer familiarity is related to performance on computerized neurocognitive testing. Participants were 130 healthy adults who self-reported whether their computer use was "some" (n = 65) or "frequent" (n = 65). The two groups were individually matched on age, education, sex, and race. All completed the CNS Vital Signs (Gualtieri & Johnson, 2006b) computerized neurocognitive battery. There were significant differences on 6 of the 23 scores, including scores derived from the Symbol-Digit Coding Test, Stroop Test, and the Shifting Attention Test. The two groups were also significantly different on the Psychomotor Speed (Cohen's d = 0.37), Reaction Time (d = 0.68), Complex Attention (d = 0.40), and Cognitive Flexibility (d = 0.64) domain scores. People with "frequent" computer use performed better than people with "some" computer use on some tests requiring rapid visual scanning and keyboard work.
Reliability and validity of the Assessment of Daily Activity Performance (ADAP) in community-dwelling older women.

PubMed

de Vreede, Paul L; Samson, Monique M; van Meeteren, Nico L; Duursma, Sijmen A; Verhaar, Harald J

2006-08-01

The Assessment of Daily Activity Performance (ADAP) test was developed, and modeled after the Continuous-scale Physical Functional Performance (CS-PFP) test, to provide a quantitative assessment of older adults' physical functional performance. The aim of this study was to determine the intra-examiner reliability and construct validity of the ADAP in a community-living older population, and to identify the importance of tester experience. Forty-three community-dwelling, older women (mean age 75 yr +/-4.3) were randomized to the test-retest reliability study (n=19) or validation study (n=24). The intra-examiner reliability of an experienced (tester 1) and an inexperienced tester (tester 2) was assessed by comparing test and retest scores of 19 participants. Construct validity was assessed by comparing the ADAP scores of 24 participants with self-perceived function by the SF-36 Health Survey, muscle function tests, and the Timed Up and Go test (TUG). Tester 1 had good consistency and reliability scores (mean difference between test and retest scores (DIF), -1.05+/-1.99; 95% confidence interval (CI), -2.58 to 0.48; Cronbach's alpha (alpha) range, 0.83 to 0.98; intraclass correlation (ICC) range, 0.75 to 0.96; Limits of Agreement (LoA), -2.58 to 4.95). Tester 2 had lower reliability scores (DIF, -2.45+/-4.36; 95% CI, -5.56 to 0.67; alpha range, 0.53 to 0.94; ICC range, 0.36 to 0.90; LoA, -6.09 to 10.99), with a systematic difference between test and retest scores for the ADAP domain lower-body strength (-3.81; 95% CI, -6.09 to -1.54), ADAP correlated with SF-36 Physical Functioning scale (r=0.67), TUG test (r=-0.91) and with isometric knee extensor strength (r=0.80). The ADAP test is a reliable and valid instrument. Our results suggest that testers should practise using the test, to improve reliability, before applying it to clinical settings.
The influence of critical thinking skills on performance and progression in a pre-registration nursing program.

PubMed

Pitt, Victoria; Powis, David; Levett-Jones, Tracy; Hunter, Sharyn

2015-01-01

The importance of developing critical thinking skills in preregistration nursing students is recognized worldwide. Yet, there has been limited exploration of how students' critical thinking skill scores on entry to pre-registration nursing education influence their academic and clinical performance and progression. The aim of this study was to: i) describe entry and exit critical thinking scores of nursing students enrolled in a three year bachelor of nursing program in Australia in comparison to norm scores; ii) explore entry critical thinking scores in relation to demographic characteristics, students' performance and progression. This longitudinal correlational study used the Health Sciences Reasoning Test (HSRT) to measure critical thinking skills in a sample (n=134) of students, at entry and exit (three years later). A one sample t-test was used to determine if differences existed between matched student critical thinking scores between entry and exit points. Academic performance, clinical performance and progression data were collected and correlations with entry critical thinking scores were examined. There was a significant relationship between critical thinking scores, academic performance and students' risk of failing, especially in the first semester of study. Critical thinking scores were predictive of program completion within three years. The increase in critical thinking scores from entry to exit was significant for the 28 students measured. In comparison to norm scores, entry level critical thinking scores were significantly lower, but exit scores were comparable. Critical thinking scores had no significant relationship to clinical performance. Entry critical thinking scores significantly correlate to academic performance and predict students risk of course failure and ability to complete a nursing degree in three years. Students' critical thinking scores are an important determinant of their success and as such can inform curriculum development and selection strategies. Copyright © 2014 Elsevier Ltd. All rights reserved.
Levels of mania and cognitive performance two years after ECT in patients with bipolar I disorder - results from a follow-up study.

PubMed

Haghighi, Mohammad; Barikani, Reza; Jahangard, Leila; Ahmadpanah, Mohammad; Bajoghli, Hafez; Sadeghi Bahmani, Dena; Holsboer-Trachsler, Edith; Brand, Serge

2016-08-01

There is limited evidence on the long-term outcomes for patients with bipolar I disorder (BP-I-D) and treated with ECT. Therefore, we asked whether mania scores and cognitive performance at the end of ECT treatment (baseline/BL) predicted mania scores, cognitive performance, recurrence, treatment adherence, and mood (depression; hypomania) two years later (follow-up/FU). 38 patients with BP-I-D undergoing ECT at baseline were followed up two years later. A brief psychiatric and cognitive assessment (Mini Mental State Examination; short-term verbal memory test) was performed; patients completed questionnaires covering recurrence, treatment adherence, and mood (depression; hypomania). High cognitive performance at BL predicted high cognitive performance at FU; low mania scores at BL predicted low mania scores at FU. By FU, cognitive performance had increased and mania scores decreased. Mania scores and cognitive performance at BL did not predict recurrence, or adherence to medication, or mood (depression; hypomania). The pattern of results suggests that after two years of successful treatment of acute mania with ECT, cognitive impairment, measured by MMSE and a short-term verbal memory test, is not impaired and mood symptom recurrence seems to be improved. Mania scores and cognitive performance at the end of ECT treatment predicted neither mood (depression; hypomania), nor recurrence, or adherence to medication two years later. Copyright © 2016 Elsevier Inc. All rights reserved.
The culture of time in neuropsychological assessment: exploring the effects of culture-specific time attitudes on timed test performance in Russian and American samples.

PubMed

Agranovich, Anna V; Panter, A T; Puente, Antonio E; Touradji, Pegah

2011-07-01

Cultural differences in time attitudes and their effect on timed neuropsychological test performance were examined in matched non-clinical samples of 100 Russian and American adult volunteers using 8 tests that were previously reported to be relatively free of cultural bias: Color Trails Test (CTT); Ruff Figural Fluency Test (RFFT); Symbol Digit Modalities Test (SDMT); and Tower of London-Drexel Edition (ToL(Dx)). A measure of time attitudes, the Culture of Time Inventory (COTI-33) was used to assess time attitudes potentially affecting time-limited testing. Americans significantly outscored Russians on CTT, SDMT, and ToL(Dx) (p,.05) while differences in RFFT scores only approached statistical significance. Group differences also emerged in COTI-33 factor scores, which partially mediated differences in performance on CTT-1, SDMT, and ToL(Dx) initiation time, but did not account for the effect of culture on CTT-2. Significant effect of culture was revealed in ratings of familiarity with testing procedures that was negatively related to CTT, ToL(Dx), and SDMT scores. Current findings indicated that attitudes toward time may influence results of time limited testing and suggested that individuals who lack familiarity with timed testing procedures tend to obtain lower scores on timed tests.
Vision and academic performance of learning disabled children.

PubMed

Wharry, R E; Kirkpatrick, S W

1986-02-01

The purpose of this study was to assess difference in academic performance among myopic, hyperopic, and emmetropic children who were learning disabled. More specifically, myopic children were expected to perform better on mathematical and spatial tasks than would hyperopic ones and that hyperopic and emmetropic children would perform better on verbal measures than would myopic ones. For 439 learning disabled students visual anomalies were determined via a Generated Retinal Reflex Image Screening System. Test data were obtained from school files. Partial support for the hypothesis was obtained. Myopic learning disabled children outperformed hyperopic and emmetropic children on the Key Math test. Myopic children scored better than hyperopic children on the WRAT Reading subtest and on the Durrell Analysis of Reading Difficulty Oral Reading Comprehension, Oral Rate, Flashword, and Spelling subtests, and on the Key Math Measurement and Total Scores. Severity of refractive error significantly affected the Wechsler Intelligence Scale for Children--Revised Full Scale, Performance Scale, Verbal Scale, and Digit Span scores but did not affect any academic test scores. Several other findings were also reported. Those with nonametropic problems scored higher than those without problems on the Key Math Time subtest. Implications supportive of the theories of Benbow and Benbow and Geschwind and Behan were stated.
Universality, correlations, and rankings in the Brazilian universities national admission examinations

NASA Astrophysics Data System (ADS)

da Silva, Roberto; Lamb, Luis C.; Barbosa, Marcia C.

2016-09-01

We analyze the scores obtained by students who have taken the ENEM examination, The Brazilian High School National Examination which is used in the admission process at Brazilian universities. The average high schools scores from different disciplines are compared through the Pearson correlation coefficient. The results show a very large correlation between the performance in the different school subjects. Even though the students' scores in the ENEM form a Gaussian due to the standardization, we show that the high schools' scores form a bimodal distribution that cannot be used to evaluate and compare students performance over time. We also show that this high schools distribution reflects the correlation between school performance and the economic level (based on the average family income) of the students. The ENEM scores are compared with a Brazilian non standardized exam, the entrance examination from the Universidade Federal do Rio Grande do Sul. The analysis of the performance of the same individuals in both tests shows that the two tests not only select different abilities, but also lead to the admission of different sets of individuals. Our results indicate that standardized tests might be an interesting tool to compare performance of individuals over the years, but not of institutions.
42 CFR 493.17 - Test categorization.

Code of Federal Regulations, 2012 CFR

2012-10-01

..., analytic or postanalytic phases of the testing. (2) Training and experience—(i) Score 1. (A) Minimal training is required for preanalytic, analytic and postanalytic phases of the testing process; and (B... necessary for analytic test performance. (3) Reagents and materials preparation—(i) Score 1. (A) Reagents...
42 CFR 493.17 - Test categorization.

Code of Federal Regulations, 2013 CFR

2013-10-01

..., analytic or postanalytic phases of the testing. (2) Training and experience—(i) Score 1. (A) Minimal training is required for preanalytic, analytic and postanalytic phases of the testing process; and (B... necessary for analytic test performance. (3) Reagents and materials preparation—(i) Score 1. (A) Reagents...
Association Between Medication Use and Performance on Higher Education Entrance Tests in Individuals With Attention-Deficit/Hyperactivity Disorder.

PubMed

Lu, Yi; Sjölander, Arvid; Cederlöf, Martin; D'Onofrio, Brian M; Almqvist, Catarina; Larsson, Henrik; Lichtenstein, Paul

2017-08-01

Individuals with attention-deficit/hyperactivity disorder (ADHD) are at greater risk for academic problems. Pharmacologic treatment is effective in reducing the core symptoms of ADHD, but it is unclear whether it helps to improve academic outcomes. To investigate the association between the use of ADHD medication and performance on higher education entrance tests in individuals with ADHD. This cohort study observed 61 640 individuals with a diagnosis of ADHD from January 1, 2006, to December 31, 2013. Records of their pharmacologic treatment were extracted from Swedish national registers along with data from the Swedish Scholastic Aptitude Test. Using a within-patient design, test scores when patients were taking medication for ADHD were compared with scores when they were not taking such medication. Data analysis was performed from November 24, 2015, to November 4, 2016. Periods with and without ADHD medication use. Scores from the higher education entrance examination (score range, 1-200 points). Among 930 individuals (493 males and 437 females; mean [SD] age, 22.2 [3.2] years) who had taken multiple entrance tests (n = 2524) and used ADHD medications intermittently, the test scores were a mean of 4.80 points higher (95% CI, 2.26-7.34; P < .001) during periods they were taking medication vs nonmedicated periods, after adjusting for age and practice effects. Similar associations between ADHD medication use and test scores were detected in sensitivity analyses. Individuals with ADHD had higher scores on the higher education entrance tests during periods they were taking ADHD medication vs nonmedicated periods. These findings suggest that ADHD medications may help ameliorate educationally relevant outcomes in individuals with ADHD.
Tests of measurement invariance failed to support the application of the "then-test".

PubMed

Nolte, Sandra; Elsworth, Gerald R; Sinclair, Andrew J; Osborne, Richard H

2009-11-01

The use of then-test (retrospective pre-test) scores has frequently been proposed as a solution to potential confounding of change scores because of response shift, as it is assumed that then-test and post-test responses are provided from the same perspective. However, this assumption has not been formally tested using robust quantitative methods. The aim of this study was to compare the psychometric performance of then-test/post-test with traditional pre-test/post-test data and assessing whether the resulting data structures support the application of the then-test for evaluations of chronic disease self-management interventions. Pre-test, post-test, and then-test data were collected from 314 participants of self-management courses using the Health Education Impact Questionnaire (heiQ). The derived change scores (pre-test/post-test; then-test/post-test) were examined for their psychometric performance using tests of measurement invariance. Few questionnaire items were noninvariant across pre-test/post-test, with four items identified and requiring removal to enable an unbiased comparison of factor means. In contrast, 12 items were identified and required removal in then-test/post-test data to avoid biased change score estimates. Traditional pre-test/post-test data appear to be robust with little indication of response shift. In contrast, the weaker psychometric performance of then-test/post-test data suggests psychometric flaws that may be the result of implicit theory of change, social desirability, and recall bias.
Interpreting Linked Psychomotor Performance Scores

ERIC Educational Resources Information Center

Looney, Marilyn A.

2013-01-01

Given that equating/linking applications are now appearing in kinesiology literature, this article provides an overview of the different types of linked test scores: equated, concordant, and predicted. It also addresses the different types of evidence required to determine whether the scores from two different field tests (measuring the same…
Trait impulsivity predicts D-KEFS tower test performance in university students.

PubMed

Lyvers, Michael; Basch, Vanessa; Duff, Helen; Edwards, Mark S

2015-01-01

The present study examined a widely used self-report index of trait impulsiveness in relation to performance on a well-known neuropsychological executive function test in 70 university undergraduate students (50 women, 20 men) aged 18 to 24 years old. Participants completed the Barratt Impulsiveness Scale (BIS-11) and the Frontal Systems Behavior Scale (FrSBe), after which they performed the Tower Test of the Delis-Kaplan Executive Function System. Hierarchical linear regression showed that after controlling for gender, current alcohol consumption, age at onset of weekly alcohol use, and FrSBe scores, BIS-11 significantly predicted Tower Test Achievement scores, β = -.44, p < .01. The results indicate that self-reported impulsiveness is associated with poorer executive cognitive performance even in a sample likely to be characterized by relatively high general cognitive functioning (i.e., university students). The results also support the role of inhibition as a key aspect of executive task performance. Elevated scores on the BIS-11 and FrSBe are known to be linked to risky drinking in young adults as confirmed in this sample; however, only BIS-11 predicted Tower Test performance.
Relationships between the handball-specific complex test, non-specific field tests and the match performance score in elite professional handball players.

PubMed

Hermassi, Souhail; Chelly, Mohamed-Souhaiel; Wollny, Rainer; Hoffmeyer, Birgit; Fieseler, Georg; Schulze, Stephan; Irlenbusch, Lars; Delank, Karl-Stefan; Shephard, Roy J; Bartels, Thomas; Schwesig, René

2018-06-01

This study assessed the validity of the handball-specific complex test (HBCT) and two non-specific field tests in professional elite handball athletes, using the match performance score (MPS) as the gold standard of performance. Thirteen elite male handball players (age: 27.4±4.8 years; premier German league) performed the HBCT, the Yo-Yo Intermittent Recovery (YYIR) test and a repeated shuttle sprint ability (RSA) test at the beginning of pre-season training. The RSA results were evaluated in terms of best time, total time, and fatigue decrement. Heart rates (HR) were assessed at selected times throughout all tests; the recovery HR was measured immediately post-test and 10 minutes later. The match performance score was based on various handball specific parameters (e.g., field goals, assists, steals, blocks, and technical mistakes) as seen during all matches of the immediately subsequent season (2015/2016). The parameters of run 1, run 2, and HR recovery at minutes 6 and 10 of the RSA test all showed a variance of more than 10% (range: 11-15%). However, the variance of scores for the YYIR test was much smaller (range: 1-7%). The resting HR (r2=0.18), HR recovery at minute 10 (r2=0.10), lactate concentration at rest (r2=0.17), recovery of heart rate from 0 to 10 minutes (r2=0.15), and velocity of second throw at first trial (r2=0.37) were the most valid HBCT parameters. Much effort is necessary to assess MPS and to develop valid tests. Speed and the rate of functional recovery seem the best predictors of competitive performance for elite handball players.
Highlights of Conference on Using Student Test Scores to Measure Teacher Performance: The State of the Art in Research and Practice

ERIC Educational Resources Information Center

Guarino, Cassandra; Reckase, Mark D.; Wooldridge, Jeffrey M.

2013-01-01

The push for accountability in public schooling has extended to the measurement of teacher performance, accelerated by federal efforts through Race to the Top. Currently, a large number of states and districts across the country are computing measures of teacher performance based on the standardized test scores of their students and using them to…
School Budgeting and School Performance: The Impact of New York City's Performance Driven Budgeting Initiative.

ERIC Educational Resources Information Center

Stiefel, Leanna; Schwartz, Amy Ellen; Portas, Carole; Kim, Dae Yeop

2003-01-01

Analyzes the impact of Performance Driven Budgeting (PDB), a school-based budgeting initiative, on student test scores in the fourth and fifth grades and on spending patterns in selected New York City schools. Finds that PDB has a positive effect on some student test scores and leads to a change in the mix of spending, but not its level. (Contains…
Speech-discrimination scores modeled as a binomial variable.

PubMed

Thornton, A R; Raffin, M J

1978-09-01

Many studies have reported variability data for tests of speech discrimination, and the disparate results of these studies have not been given a simple explanation. Arguments over the relative merits of 25- vs 50-word tests have ignored the basic mathematical properties inherent in the use of percentage scores. The present study models performance on clinical tests of speech discrimination as a binomial variable. A binomial model was developed, and some of its characteristics were tested against data from 4120 scores obtained on the CID Auditory Test W-22. A table for determining significant deviations between scores was generated and compared to observed differences in half-list scores for the W-22 tests. Good agreement was found between predicted and observed values. Implications of the binomial characteristics of speech-discrimination scores are discussed.
Cross-cultural adaptation, reliability and validity of the Turkish version of the Hospital for Special Surgery (HSS) Knee Score.

PubMed

Narin, Selnur; Unver, Bayram; Bakırhan, Serkan; Bozan, Ozgür; Karatosun, Vasfi

2014-01-01

The purpose of this study was to adapt the English version of the Hospital for Special Surgery (HSS) knee score for use in a Turkish population and to evaluate its validity, reliability and cultural adaptation. Standard forward-back translation of the HSS knee score was performed and the Turkish version was applied in 73 patients. The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC), Mini-Mental State Examination and sit-to-stand test were also performed and analyzed. Internal consistency reliability was tested using Cronbach's alpha. The intraclass correlation coefficient (ICC) was used to calculate the test-retest reliability at one-week intervals. Validity was assessed by calculating the Pearson correlation between the HSS, WOMAC and sit-to-stand test scores. The ICC ranged from 0.98 to 0.99 with high internal consistency (Cronbach's alpha: 0.87). The WOMAC score correlated with total HSS score (r: -0.80, p<0.001) and sit-to-stand score (r: 0.12, p: 0.312). The Turkish version of the HSS knee score is reliable and valid in evaluating the total knee arthroplasty in Turkish patients.
Performance of machine-learning scoring functions in structure-based virtual screening.

PubMed

Wójcikowski, Maciej; Ballester, Pedro J; Siedlecki, Pawel

2017-04-25

Classical scoring functions have reached a plateau in their performance in virtual screening and binding affinity prediction. Recently, machine-learning scoring functions trained on protein-ligand complexes have shown great promise in small tailored studies. They have also raised controversy, specifically concerning model overfitting and applicability to novel targets. Here we provide a new ready-to-use scoring function (RF-Score-VS) trained on 15 426 active and 893 897 inactive molecules docked to a set of 102 targets. We use the full DUD-E data sets along with three docking tools, five classical and three machine-learning scoring functions for model building and performance assessment. Our results show RF-Score-VS can substantially improve virtual screening performance: RF-Score-VS top 1% provides 55.6% hit rate, whereas that of Vina only 16.2% (for smaller percent the difference is even more encouraging: RF-Score-VS top 0.1% achieves 88.6% hit rate for 27.5% using Vina). In addition, RF-Score-VS provides much better prediction of measured binding affinity than Vina (Pearson correlation of 0.56 and -0.18, respectively). Lastly, we test RF-Score-VS on an independent test set from the DEKOIS benchmark and observed comparable results. We provide full data sets to facilitate further research in this area (http://github.com/oddt/rfscorevs) as well as ready-to-use RF-Score-VS (http://github.com/oddt/rfscorevs_binary).
Can Percentiles Replace Raw Scores in the Statistical Analysis of Test Data?

ERIC Educational Resources Information Center

Zimmerman, Donald W.; Zumbo, Bruno D.

2005-01-01

Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…

Effect of online formative assessment on summative performance in integrated musculoskeletal system module.

PubMed

Mitra, Nilesh Kumar; Barua, Ankur

2015-03-03

The impact of web-based formative assessment practices on performance of undergraduate medical students in summative assessments is not widely studied. This study was conducted among third-year undergraduate medical students of a designated university in Malaysia to compare the effect, on performance in summative assessment, of repeated computer-based formative assessment with automated feedback with that of single paper-based formative assessment with face-to face feedback. This quasi-randomized trial was conducted among two groups of undergraduate medical students who were selected by stratified random technique from a cohort undertaking the Musculoskeletal module. The control group C (n = 102) was subjected to a paper-based formative MCQ test. The experimental group E (n = 65) was provided three online formative MCQ tests with automated feedback. The summative MCQ test scores for both these groups were collected after the completion of the module. In this study, no significant difference was observed between the mean summative scores of the two groups. However, Band 1 students from group E with higher entry qualification showed higher mean score in the summative assessment. A trivial, but significant and positive correlation (r(2) = +0.328) was observed between the online formative test scores and summative assessment scores of group E. The proportionate increase of performance in group E was found to be almost double than group C. The use of computer based formative test with automated feedback improved the performance of the students with better academic background in the summative assessment. Computer-based formative test can be explored as an optional addition to the curriculum of pre-clinical integrated medical program to improve the performance of the students with higher academic ability.
Time and Performance on the California Critical Thinking Skills Test.

ERIC Educational Resources Information Center

Frisby, Craig L.; Traffanstedt, Bobby K.

2003-01-01

Investigates the relationship between total scores on the California Critical Thinking Skills Test (CCTST) and the time taken to complete it. Finds that slower test takers obtained significantly higher scores. Discusses implications of these findings for college instruction. (SG)
Development of an Itemwise Efficiency Scoring Method: Concurrent, Convergent, Discriminant, and Neuroimaging-Based Predictive Validity Assessed in a Large Community Sample

PubMed Central

Moore, Tyler M.; Reise, Steven P.; Roalf, David R.; Satterthwaite, Theodore D.; Davatzikos, Christos; Bilker, Warren B.; Port, Allison M.; Jackson, Chad T.; Ruparel, Kosha; Savitt, Adam P.; Baron, Robert B.; Gur, Raquel E.; Gur, Ruben C.

2016-01-01

Traditional “paper-and-pencil” testing is imprecise in measuring speed and hence limited in assessing performance efficiency, but computerized testing permits precision in measuring itemwise response time. We present a method of scoring performance efficiency (combining information from accuracy and speed) at the item level. Using a community sample of 9,498 youths age 8-21, we calculated item-level efficiency scores on four neurocognitive tests, and compared the concurrent, convergent, discriminant, and predictive validity of these scores to simple averaging of standardized speed and accuracy-summed scores. Concurrent validity was measured by the scores' abilities to distinguish men from women and their correlations with age; convergent and discriminant validity were measured by correlations with other scores inside and outside of their neurocognitive domains; predictive validity was measured by correlations with brain volume in regions associated with the specific neurocognitive abilities. Results provide support for the ability of itemwise efficiency scoring to detect signals as strong as those detected by standard efficiency scoring methods. We find no evidence of superior validity of the itemwise scores over traditional scores, but point out several advantages of the former. The itemwise efficiency scoring method shows promise as an alternative to standard efficiency scoring methods, with overall moderate support from tests of four different types of validity. This method allows the use of existing item analysis methods and provides the convenient ability to adjust the overall emphasis of accuracy versus speed in the efficiency score, thus adjusting the scoring to the real-world demands the test is aiming to fulfill. PMID:26866796
Cortical Thickness Correlates of Specific Cognitive Performance Accounted for by the General Factor of Intelligence in Healthy Children Aged 6 to 18

PubMed Central

Karama, Sherif; Colom, Roberto; Johnson, Wendy; Deary, Ian J.; Haier, Richard; Waber, Deborah P.; Lepage, Claude; Ganjavi, Hooman; Jung, Rex; Evans, Alan C.

2011-01-01

Prevailing psychometric theories of intelligence posit that individual differences in cognitive performance are attributable to three main sources of variance: the general factor of intelligence (g), cognitive ability domains, and specific test requirements and idiosyncrasies. Cortical thickness has been previously associated with g. In the present study, we systematically analyzed associations between cortical thickness and cognitive performance with and without adjusting for the effects of g in a representative sample of children and adolescents (N = 207, Mean age = 11.8; SD = 3.5; Range = 6 to 18.3 years). Seven cognitive tests were included in a measurement model that identified three first-order factors (representing cognitive ability domains) and one second-order factor representing g. Residuals of the cognitive ability domain scores were computed to represent g-independent variance for the three domains and seven tests. Cognitive domain and individual test scores as well as residualized scores were regressed against cortical thickness, adjusting for age, gender and a proxy measure of brain volume. g and cognitive domain scores were positively correlated with cortical thickness in very similar areas across the brain. Adjusting for the effects of g eliminated associations of domain and test scores with cortical thickness. Within a psychometric framework, cortical thickness correlates of cognitive performance on complex tasks are well captured by g in this demographically representative sample. PMID:21241809
CrossFit athletes exhibit high symmetry of fundamental movement patterns. A cross-sectional study

PubMed Central

Tafuri, Silvio; Notarnicola, Angela; Monno, Antonello; Ferretti, Francesco; Moretti, Biagio

2016-01-01

Summary Background even if CrossFit training programs accounted actually more than 7500 gyms affiliated in the USA and more than 2000 in Europe and involved today more than 1 million of people, actually there were not several studies about the effect of the CrossFit on the health and sport performance. The aim of these research was to evaluate the performance in 7 fundamental movement patterns using a standardized methods, the Functional Movement Screen (FMS). Methods we enrolled three groups of athletes (age 17–40 years; >6 months of training programs): CrossFitters, body builders and professional weightlifters. FMS test was performed to all people enrolled. Scores of FMS test was examined comparing three groups. Results no differences in the three groups were showed in the mean score values of each test and in total score, except for shoulder mobility test (higher among CrossFitters) and trunk stability push-up test (higher among weightlifter). Agreement between the test performed on the two sides was higher in CrossFit groups for hurdle step (93.2%), in line lung (86%), rotary stability test (95.3%) and shoulder mobility (90.7%; p<0.001). Conclusions CrossFitters seem to have a high level of concordance in the scores achieved in bilateral test. CrossFit seems to produce marked symmetry in some fundamental movements compared to weightlifting and bodybuilding. PMID:27331045
CrossFit athletes exhibit high symmetry of fundamental movement patterns. A cross-sectional study.

PubMed

Tafuri, Silvio; Notarnicola, Angela; Monno, Antonello; Ferretti, Francesco; Moretti, Biagio

2016-01-01

even if CrossFit training programs accounted actually more than 7500 gyms affiliated in the USA and more than 2000 in Europe and involved today more than 1 million of people, actually there were not several studies about the effect of the CrossFit on the health and sport performance. The aim of these research was to evaluate the performance in 7 fundamental movement patterns using a standardized methods, the Functional Movement Screen (FMS). we enrolled three groups of athletes (age 17-40 years; >6 months of training programs): CrossFitters, body builders and professional weightlifters. FMS test was performed to all people enrolled. Scores of FMS test was examined comparing three groups. no differences in the three groups were showed in the mean score values of each test and in total score, except for shoulder mobility test (higher among CrossFitters) and trunk stability push-up test (higher among weightlifter). Agreement between the test performed on the two sides was higher in CrossFit groups for hurdle step (93.2%), in line lung (86%), rotary stability test (95.3%) and shoulder mobility (90.7%; p<0.001). CrossFitters seem to have a high level of concordance in the scores achieved in bilateral test. CrossFit seems to produce marked symmetry in some fundamental movements compared to weightlifting and bodybuilding.
Perception and Practice: The Impact of Teachers' Scoring Experience on Performance-Based Instruction and Classroom Assessment.

ERIC Educational Resources Information Center

Goldberg, Gail Lynn; Roswell, Barbara Sherr

Teachers' reactions to the administration and scoring of the Maryland School Performance Assessment Program tests (MSPAP) were studied, focusing on their direct and indirect exposure to tasks and evaluative criteria through the experience of scoring the MSPAP. Since its inception in 1991, the MSPAP has been scored in-state by certified teachers…
Cognitive predictors of skilled performance with an advanced upper limb multifunction prosthesis: a preliminary analysis.

PubMed

Hancock, Laura; Correia, Stephen; Ahern, David; Barredo, Jennifer; Resnik, Linda

2017-07-01

Purpose The objectives were to 1) identify major cognitive domains involved in learning to use the DEKA Arm; 2) specify cognitive domain-specific skills associated with basic versus advanced users; and 3) examine whether baseline memory and executive function predicted learning. Method Sample included 35 persons with upper limb amputation. Subjects were administered a brief neuropsychological test battery prior to start of DEKA Arm training, as well as physical performance measures at the onset of, and following training. Multiple regression models controlling for age and including neuropsychological tests were developed to predict physical performance scores. Prosthetic performance scores were divided into quartiles and independent samples t-tests compared neuropsychological test scores of advanced scorers and basic scorers. Baseline neuropsychological test scores were used to predict change in scores on physical performance measures across time. Results Cognitive domains of attention and processing speed were statistically significantly related to proficiency of DEKA Arm use and predicted level of proficiency. Conclusions Results support use of neuropsychological tests to predict learning and use of a multifunctional prosthesis. Assessment of cognitive status at the outset of training may help set expectations for the duration and outcomes of treatment. Implications for Rehabilitation Cognitive domains of attention and processing speed were significantly related to level of proficiencyof an advanced multifunctional prosthesis (the DEKA Arm) after training. Results provide initial support for the use of neuropsychological tests to predict advanced learningand use of a multifunctional prosthesis in upper-limb amputees. Results suggest that assessment of patients' cognitive status at the outset of upper limb prosthetictraining may, in the future, help patients, their families and therapists set expectations for theduration and intensity of training and may help set reasonable proficiency goals.
Physical performance testing in mucopolysaccharidosis I: a pilot study.

PubMed

Dumas, Helene M; Fragala, Maria A; Haley, Stephen M; Skrinar, Alison M; Wraith, James E; Cox, Gerald F

2004-01-01

To develop and field-test a physical performance measure (MPS-PPM) for individuals with Mucopolysaccharidosis I (MPS I), a rare genetic disorder. Motor performance and endurance items were developed based on literature review, clinician feedback, feasibility, and equipment and training needs. A standardized testing protocol and scoring rules were created. The MPS-PPM includes: Arm Function (7 items), Leg Function (5 items), and Endurance (2 items). Pilot data were collected for 10 subjects (ages 5-29 years). We calculated Spearman's rho correlations between age, severity and summary z-scores on the MPS-PPM. Subjects had variable presentations, as correlations among the three sub-test scores were not significant. Increasing age was related to greater severity in physical performance (r = 0.72, p<0.05) and lower scores on the Leg Function (r = -0.67, p<0.05) and Endurance (r = -0.65, p<0.05) sub-tests. The MPS-PPM was sensitive to detecting physical performance deficits, as six subjects could not complete the full battery of Arm Function items and eight subjects were unable to complete all Leg Function items. Subjects walked more slowly and expended more energy than typically developing peers. Individuals with MPS I have difficulty with arm and leg function and reduced endurance. The MPS-PPM is a clinically feasible measure that detects limitations in physical performance and may have potential to quantify changes in function following intervention. Copyright 2004 Taylor and Francis Ltd.
Clinical competency evaluation of Brazilian chiropractic interns

PubMed Central

Facchinato, Ana Paula A.; Benedicto, Camila C.; Mora, Aline G.; Cabral, Dayane M.C.; Fagundes, Djalma J.

2015-01-01

Objective This study compares the results of an objective structured clinical examination (OSCE) between 2 groups of students before an internship and after 6 months of clinical practice in an internship. Methods Seventy-two students participated, with 36 students in each cohort. The OSCEs were performed in the simulation laboratory before the participants' clinical practice internship and after 6 months of the internship. Students were tested in 9 stations for clinical skills and knowledge. The same procedures were repeated for both cohorts. The t test was used for unpaired parametric samples and Fisher's exact test was used for comparison of proportions. Results There was no difference in the mean final score between the 2 groups (p = .34 for test 1; p = .08 for test 2). The performance of the students in group 1 was not significantly different when performed before and after 6 months of clinical practice, but in group 2 there was a significant decrease in the average score after 6 months of clinical practice. Conclusions There was no difference in the cumulative average score for the 2 groups before and after 6 months of clinical practice in the internship. There were differences within the cohorts, however, with a significant decrease in the average score in group 2. Issues pertaining to test standardization and student motivation for test 2 may have influenced the scores. PMID:25588200
Concussion Baseline Testing: Preexisting Factors, Symptoms, and Neurocognitive Performance.

PubMed

Cottle, Jordan E; Hall, Eric E; Patel, Kirtida; Barnes, Kenneth P; Ketcham, Caroline J

2017-02-01

Neurocognitive test scores are often considered an important aspect of concussion management. To best use these data, clinicians must understand potential factors that may influence baseline performance on these tests. To determine preexisting factors that may influence performance on the Immediate Post-Concussion Assessment and Cognitive Test (ImPACT). Cross-sectional study. Research laboratory. A total of 486 National Collegiate Athletic Association Division I collegiate student-athletes. To determine neurocognitive functioning and total symptom score at baseline, ImPACT was administered. Outcomes were verbal memory, visual memory, visual motor speed, reaction time, and total symptom score. A self-report demographic section at the beginning of ImPACT was used to gather information concerning previous treatment for headaches, migraines, and psychiatric conditions; diagnosis of attention-deficit/hyperactivity disorder; and exposure to previous strenuous exercise. We conducted multivariate analyses of variance to determine if the ImPACT composite and total symptom scores differed according to preexisting factors (P < .0083). Sex showed an effect on verbal memory (P = .001), visual motor speed (P < .001), and reaction time (P = .006), with women performing better than men. A previous diagnosis of attention-deficit/hyperactivity disorder affected visual motor speed (P = .008). Previous treatment for headaches (P < .001), migraines (P = .001), a psychiatric condition (P < .001), or a diagnosis of attention-deficit/hyperactivity disorder (P < .001) all showed effects on the total symptom score. Strenuous exercise did not affect neurocogntive performance or total symptom score. Based on our findings and the previous literature, we suggest that many preexisting factors influence baseline neurocognitive data. Baseline testing is an important aspect of concussion management. Sports medicine professionals should be cognizant of these factors when developing concussion-management protocols.
Concussion Baseline Testing: Preexisting Factors, Symptoms, and Neurocognitive Performance

PubMed Central

Cottle, Jordan E.; Hall, Eric E.; Patel, Kirtida; Barnes, Kenneth P.; Ketcham, Caroline J.

2017-01-01

Context: Neurocognitive test scores are often considered an important aspect of concussion management. To best use these data, clinicians must understand potential factors that may influence baseline performance on these tests. Objective: To determine preexisting factors that may influence performance on the Immediate Post-Concussion Assessment and Cognitive Test (ImPACT). Design: Cross-sectional study. Setting: Research laboratory. Patients or Other Participants: A total of 486 National Collegiate Athletic Association Division I collegiate student-athletes. Main Outcome Measure(s): To determine neurocognitive functioning and total symptom score at baseline, ImPACT was administered. Outcomes were verbal memory, visual memory, visual motor speed, reaction time, and total symptom score. A self-report demographic section at the beginning of ImPACT was used to gather information concerning previous treatment for headaches, migraines, and psychiatric conditions; diagnosis of attention-deficit/hyperactivity disorder; and exposure to previous strenuous exercise. We conducted multivariate analyses of variance to determine if the ImPACT composite and total symptom scores differed according to preexisting factors (P < .0083). Results: Sex showed an effect on verbal memory (P = .001), visual motor speed (P < .001), and reaction time (P = .006), with women performing better than men. A previous diagnosis of attention-deficit/hyperactivity disorder affected visual motor speed (P = .008). Previous treatment for headaches (P < .001), migraines (P = .001), a psychiatric condition (P < .001), or a diagnosis of attention-deficit/hyperactivity disorder (P < .001) all showed effects on the total symptom score. Strenuous exercise did not affect neurocogntive performance or total symptom score. Conclusions: Based on our findings and the previous literature, we suggest that many preexisting factors influence baseline neurocognitive data. Baseline testing is an important aspect of concussion management. Sports medicine professionals should be cognizant of these factors when developing concussion-management protocols. PMID:28071936
Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

NASA Astrophysics Data System (ADS)

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while additional 7% were explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.
Academic self-handicapping: relationships with learning specific and general self-perceptions and academic performance over time.

PubMed

Gadbois, Shannon A; Sturgeon, Ryan D

2011-06-01

Academic self-handicapping (ASH) tendencies, strategies students employ that increase their chances of failure on assessments while protecting self-esteem, are correlated with classroom goal structures and to learners' general self-perceptions and learning strategies. In particular, greater ASH is related to poorer academic performance but has yet to be examined with respect to learners' performance across a series of tests. This research was designed to examine the relationship between students' ASH tendencies and their self-concept clarity, learning strategies, and performance on a series of tests in a university course. A total of 209 (153 female; 56 male) Canadian university psychology students participated in this study. Participants' ASH tendencies, self-concept clarity, approaches to learning, and self-regulatory learning strategies were assessed along with expected grades and hours of study in the course from which they were recruited. Finally, students' grades were obtained for the three tests for the course from which they were recruited. Students reporting greater self-handicapping tendencies reported lower self-concept clarity, lower academic self-efficacy, greater test anxiety, more superficial learning strategies, and scored lower on all tests in the course. The relationships of ASH scores and learner variables with performance varied across the three performance indices. In particular, ASH scores were more strongly related to second and third tests, and prior performances were accounted for. ASH scores accounted for a relatively small but significant proportion of variance for all three tests. These results showed that ASH is a unique contributing factor in student performance outcomes, and may be particularly important after students complete the initial assessment in a course. ©2010 The British Psychological Society.
Cardiovascular risk factors associated with lower baseline cognitive performance in HIV-positive persons.

PubMed

Wright, E J; Grund, B; Robertson, K; Brew, B J; Roediger, M; Bain, M P; Drummond, F; Vjecha, M J; Hoy, J; Miller, C; Penalva de Oliveira, A C; Pumpradit, W; Shlay, J C; El-Sadr, W; Price, R W

2010-09-07

To determine factors associated with baseline neurocognitive performance in HIV-infected participants enrolled in the Strategies for Management of Antiretroviral Therapy (SMART) neurology substudy. Participants from Australia, North America, Brazil, and Thailand were administered a 5-test neurocognitive battery. Z scores and the neurocognitive performance outcome measure, the quantitative neurocognitive performance z score (QNPZ-5), were calculated using US norms. Neurocognitive impairment was defined as z scores <-2 in two or more cognitive domains. Associations of test scores, the QNPZ-5, and impairment with baseline factors including demographics and risk factors for HIV-associated dementia (HAD) and cardiovascular disease (CVD) were determined in multiple regression. The 292 participants had a median CD4 cell count of 536 cells/mm(3), 88% had an HIV viral load < or =400 copies/mL, and 92% were taking antiretrovirals. Demographics, HIV, and clinical factors differed between locations. The mean QNPZ-5 score was -0.72; 14% of participants had neurocognitive impairment. For most tests, scores and z scores differed significantly between locations, with and without adjustment for age, sex, education, and race. Prior CVD was associated with neurocognitive impairment. Prior CVD, hypercholesterolemia, and hypertension were associated with poorer neurocognitive performance but conventional HAD risk factors and the CNS penetration effectiveness rank of antiretroviral regimens were not. In this HIV-positive population with high CD4 cell counts, neurocognitive impairment was associated with prior CVD. Lower neurocognitive performance was associated with prior CVD, hypertension, and hypercholesterolemia, but not conventional HAD risk factors. The contribution of CVD and cardiovascular risk factors to the neurocognition of HIV-positive populations warrants further investigation.
The Relationship between Academic Averages of Primary School Science and Technology Class and Test Sub-Test Scores of Placement Test of Science

ERIC Educational Resources Information Center

Guzeller, Cem Oktay

2012-01-01

In this research, the relationship between written exam scores of science and technology class of 6th, 7th, and 8th grades, project, participation in class activities and performance work, year-end academic success point averages and sub-test raw scores of LDT science of 6th, 7th and 8th grades. Academic success point averages were used as…
Timed activity performance in persons with upper limb amputation: A preliminary study.

PubMed

Resnik, Linda; Borgia, Mathew; Acluche, Frantzy

55 subjects with upper limb amputation were administered the T-MAP twice within one week. To develop a timed measure of activity performance for persons with upper limb amputation (T-MAP); examine the measure's internal consistency, test-retest reliability and validity; and compare scores by prosthesis use. Measures of activity performance for persons with upper limb amputation are needed The time required to perform daily activities is a meaningful metric that implication for participation in life roles. Internal consistency and test-retest reliability were evaluated. Construct validity was examined by comparing scores by amputation level. Exploratory analyses compared sub-group scores, and examined correlations with other measures. Scale alpha was 0.77, ICC was 0.93. Timed scores differed by amputation level. Subjects using a prosthesis took longer to perform all tasks. T-MAP was not correlated with other measures of dexterity or activity, but was correlated with pain for non-prosthesis users. The timed scale had adequate internal consistency and excellent test-retest reliability. Analyses support reliability and construct validity of the T-MAP. 2c "outcomes" research. Published by Elsevier Inc.
Test Scores, Dropout Rates, and Transfer Rates as Alternative Indicators of High School Performance

ERIC Educational Resources Information Center

Rumberger, Russell W.; Palardy, Gregory J.

2005-01-01

This study investigated the relationships among several different indicators of high school performance: test scores, dropout rates, transfer rates, and attrition rates. Hierarchical linear models were used to analyze panel data from a sample of 14,199 students who took part in the National Education Longitudinal Survey of 1988. The results…
Developing Local Oral Reading Fluency Cut Scores for Predicting High-Stakes Test Performance

ERIC Educational Resources Information Center

Grapin, Sally L.; Kranzler, John H.; Waldron, Nancy; Joyce-Beaulieu, Diana; Algina, James

2017-01-01

This study evaluated the classification accuracy of a second grade oral reading fluency curriculum-based measure (R-CBM) in predicting third grade state test performance. It also compared the long-term classification accuracy of local and publisher-recommended R-CBM cut scores. Participants were 266 students who were divided into a calibration…
Race to the Paycheck: Merit Pay and Theories of Teacher Motivation

ERIC Educational Resources Information Center

Horne, Jason; Foley, Virginia P.; Flora, Bethany H.

2014-01-01

Recent reforms in teacher evaluation tie these evaluations to student performance as measured by test scores and merit pay has been offered as a way to reward high test scores and improve teacher performance. Thus, the federal Race to the Top program has led several states toward teacher evaluation instruments that incorporate outcome data in the…

The Validity of ITBS Reading Comprehension Test Scores for Learning Disabled and Non Learning Disabled Students under Extended-Time Conditions.

ERIC Educational Resources Information Center

Huesman, Ronald L., Jr.; Frisbie, David A.

This study investigated the effect of extended-time limits in terms of performance levels and score comparability for reading comprehension scores on the Iowa Tests of Basic Skills (ITBS). The first part of the study compared the average reading comprehension scores on the ITBS of 61 sixth-graders with learning disabilities and 397 non learning…
Why do Women Perform Better With Women Than With Men?

ERIC Educational Resources Information Center

Page, Richard H.; Orton, Julie

In a 1973 study Morgan and Mausner administered the second half of the Hidden Figures Test to pairs of high school students who had scored in either the upper or lower quartile on the first half of the test. High-scoring females showed a significant tendency to lower their scores when working with a low-scoring male partner. This tendency was not…
Simulation-Based Educational Module Improves Intern and Medical Student Performance of Closed Reduction and Percutaneous Pinning of Pediatric Supracondylar Humeral Fractures.

PubMed

Butler, Bennet A; Lawton, Cort D; Burgess, Jamie; Balderama, Earvin S; Barsness, Katherine A; Sarwark, John F

2017-12-06

Simulation-based education has been integrated into many orthopaedic residency programs to augment traditional teaching models. Here we describe the development and implementation of a combined didactic and simulation-based course for teaching medical students and interns how to properly perform a closed reduction and percutaneous pinning of a pediatric supracondylar humeral fracture. Subjects included in the study were either orthopaedic surgery interns or subinterns at our institution. Subjects all completed a combined didactic and simulation-based course on pediatric supracondylar humeral fractures. The first part of this course was an electronic (e)-learning module that the subjects could complete at home in approximately 40 minutes. The second part of the course was a 20-minute simulation-based skills learning session completed in the simulation center. Subject knowledge of closed reduction and percutaneous pinning of supracondylar humeral fractures was tested using a 30-question, multiple-choice, written test. Surgical skills were tested in the operating room or in a simulated operating room. Subject pre-intervention and post-intervention scores were compared to determine if and how much they had improved. A total of 21 subjects were tested. These subjects significantly improved their scores on both the written, multiple-choice test and skills test after completing the combined didactic and simulation module. Prior to the module, intern and subintern multiple-choice test scores were significantly worse than postgraduate year (PGY)-2 to PGY-5 resident scores (p < 0.01); after completion of the module, there was no significant difference in the multiple-choice test scores. After completing the module, there was no significant difference in skills test scores between interns and PGY-2 to PGY-5 residents. Both tests were validated using the scores obtained from PGY-2 to PGY-5 residents. Our combined didactic and simulation course significantly improved intern and subintern understanding of supracondylar humeral fractures and their ability to perform a closed reduction and percutaneous pinning of these fractures.
Predicting dementia using socio-demographic characteristics and the Free and Cued Selective Reminding Test in the general population.

PubMed

Mura, Thibault; Baramova, Marieta; Gabelle, Audrey; Artero, Sylvaine; Dartigues, Jean-François; Amieva, Hélène; Berr, Claudine

2017-03-23

Our study aimed to determine whether the consideration of socio-demographic features improves the prediction of Alzheimer's dementia (AD) at 5 years when using the Free and Cued Selective Reminding Test (FCSRT) in the general older population. Our analyses focused on 2558 subjects from the prospective Three-City Study, a cohort of community-dwelling individuals aged 65 years and over, with FCSRT scores. Four "residual scores" and "risk scores" were built that included the FCSRT scores and socio-demographic variables. The predictive performance of crude, residual and risk scores was analyzed by comparing the areas under the ROC curve (AUC). In total, 1750 subjects were seen 5 years after completing the FCSRT. AD was diagnosed in 116 of them. Compared with the crude free-recall score, the predictive performances of the residual score and of the risk score were not significantly improved (AUC: 0.83 vs 0.82 and 0.88 vs 0.89 respectively). Using socio-demographic features in addition to the FCSRT does not improve its predictive performance for dementia or AD.
KATTS: a framework for maximizing NCLEX-RN performance.

PubMed

McDowell, Betsy M

2008-04-01

A key indicator of the quality of a nursing education program is the performance of its graduates as first-time takers of the NCLEX-RN. As a result, nursing schools are open to strategies that strengthen the performance of their graduates on the examination. The Knowledge base, Anxiety control, Test-Taking Skills (KATTS) framework focuses on the three components of achieving a maximum score on an examination. In KATTS, all three components must be present and in proper balance to maximize a test taker's score. By strengthening not just one but all of these components, graduates can improve their overall test scores significantly. Suggested strategies for strengthening each component of KATTS are provided. This framework has been used successfully in designing remedial tutoring programs and in assisting first-time NCLEX test takers in preparing for the licensing examination.
Estimating verbal fluency and naming ability from the test of premorbid functioning and demographic variables: Regression equations derived from a regional UK sample.

PubMed

Jenkinson, Toni-Marie; Muncer, Steven; Wheeler, Miranda; Brechin, Don; Evans, Stephen

2018-06-01

Neuropsychological assessment requires accurate estimation of an individual's premorbid cognitive abilities. Oral word reading tests, such as the test of premorbid functioning (TOPF), and demographic variables, such as age, sex, and level of education, provide a reasonable indication of premorbid intelligence, but their ability to predict other related cognitive abilities is less well understood. This study aimed to develop regression equations, based on the TOPF and demographic variables, to predict scores on tests of verbal fluency and naming ability. A sample of 119 healthy adults provided demographic information and were tested using the TOPF, FAS, animal naming test (ANT), and graded naming test (GNT). Multiple regression analyses, using the TOPF and demographics as predictor variables, were used to estimate verbal fluency and naming ability test scores. Change scores and cases of significant impairment were calculated for two clinical samples with diagnosed neurological conditions (TBI and meningioma) using the method in Knight, McMahon, Green, and Skeaff (). Demographic variables provided a significant contribution to the prediction of all verbal fluency and naming ability test scores; however, adding TOPF score to the equation considerably improved prediction beyond that afforded by demographic variables alone. The percentage of variance accounted for by demographic variables and/or TOPF score varied from 19 per cent (FAS), 28 per cent (ANT), and 41 per cent (GNT). Change scores revealed significant differences in performance in the clinical groups, particularity the TBI group. Demographic variables, particularly education level, and scores on the TOPF should be taken into consideration when interpreting performance on tests of verbal fluency and naming ability. © 2017 The British Psychological Society.
Has the UK Clinical Aptitude Test improved medical student selection?

PubMed

Wright, Sarah R; Bradley, Philip M

2010-11-01

In 2006, the United Kingdom Clinical Aptitude Test (UKCAT) was introduced as a new medical school admissions tool. The aim of this cohort study was to determine whether the UKCAT has made any improvements to the way medical students are selected. Regression analysis was performed in order to study the ability of previous school type and gender to predict UKCAT, personal statement or interview scores in two cohorts of accepted students. The ability of admissions scores and demographic data to predict performance on knowledge and skills examinations was also studied. Previous school type was not a significant predictor of either interview or UKCAT scores amongst students who had been accepted onto the programme (n = 307). However, it was a significant predictor of personal statement score, with students from independent and grammar schools performing better than students from state-maintained schools. Previous school type, personal statements and interviews were not significant predictors of knowledge examination performance. UKCAT scores were significant predictors of knowledge examination performance for all but one examination administered in the first 2 years of medical school. Admissions data explained very little about performance on skills (objective structured clinical examinations [OSCEs]) assessments. The use of personal statements as a basis for selection results in a bias towards students from independent and grammar schools. However, no evidence was found to suggest that students accepted from these schools perform any better than students from maintained schools on Year 1 and 2 medical school examinations. Previous school type did not predict interview or UKCAT scores of accepted students. UKCAT scores are predictive of Year 1 and 2 examination performance at this medical school, whereas interview scores are not. The results of this study challenge claims made by other authors that aptitude tests do not have a place in medical school selection in the UK. © Blackwell Publishing Ltd 2010.
Comparison of the performance of first-grade and mentally retarded students on the Peabody Mathematics Readiness Test.

PubMed

Richardson, L I; Thurman, R L; Bassler, O C

1978-07-01

The Peabody Mathematics Readiness Test was developed to assess mathematics readiness and identify children who would encounter difficulty in first-grade mathematics. In the present study, we compared performances of mentally retarded subjects and first-grade subjects on this test. Retarded subjects' mean scores were significantly lower than those of the nonretarded subjects on the drawing test; however, there were no significant differences between the mean scores of the groups on the other five subscales.
Ohio District Tests Performance Pay--for Students

ERIC Educational Resources Information Center

Viadero, Debra

2007-01-01

Coshocton district in Ohio takes part in an unusual experiment that pays students who pass or scores high in the state exams. Pupils here in grades 3 through 6 earn $15 for every "proficient" score and $20 for "accelerated" or "advanced" scores. With annual tests given in five subjects, students can earn up to $100 if…
Motor performance in children with Noonan syndrome.

PubMed

Croonen, Ellen A; Essink, Marlou; van der Burgt, Ineke; Draaisma, Jos M; Noordam, Cees; Nijhuis-van der Sanden, Maria W G

2017-09-01

Although problems with motor performance in daily life are frequently mentioned in Noonan syndrome, the motor performance profile has never been systematically investigated. The aim of this study was to examine whether a specific profile in motor performance in children with Noonan syndrome was seen using valid norm-referenced tests. The study assessed motor performance in 19 children with Noonan syndrome (12 females, mean age 9 years 4 months, range 6 years 1 month to 11 years and 11 months, SDS 1 year and 11 months). More than 60% of the parents of the children reported pain, decreased muscle strength, reduced endurance, and/or clumsiness in daily functioning. The mean standard scores on the Visual Motor Integration (VMI) test and Movement Assessment Battery for Children 2, Dutch version (MABC-2-NL) items differed significantly from the reference scores. Grip strength, muscle force, and 6 min Walking Test (6 MWT) walking distance were significantly lower, and the presence of generalized hypermobility was significantly higher. All MABC-2-NL scores (except manual dexterity) correlated significantly with almost all muscle strength tests, VMI total score, and VMI visual perception score. The 6 MWT was only significantly correlated to grip strength. This is the first study that confirms that motor performance, strength, and endurance are significantly impaired in children with Noonan syndrome. Decreased functional motor performance seems to be related to decreased visual perception and reduced muscle strength. Research on causal relationships and the effectiveness of interventions is needed. Physical and/or occupational therapy guidance should be considered to enhance participation in daily life. © 2017 Wiley Periodicals, Inc.
A Comparison of the Performance of Graduate and Undergraduate School Applicants on the Test of Written English. TOEFL Research Reports Report 50.

ERIC Educational Resources Information Center

Zwick, Rebecca; Thayer, Dorothy T.

The performance of graduate and undergraduate school applicants on the Test of Written English (TWE) was compared for each of 66 data sets, dating from 1988 to 1993. The analyses compared the average TWE score for graduates and undergraduates after matching examinees on the total score on the Test of English as a Foreign Language (TOEFL). The main…
Factors contributing to speech perception scores in long-term pediatric cochlear implant users.

PubMed

Davidson, Lisa S; Geers, Ann E; Blamey, Peter J; Tobey, Emily A; Brenner, Christine A

2011-02-01

The objectives of this report are to (1) describe the speech perception abilities of long-term pediatric cochlear implant (CI) recipients by comparing scores obtained at elementary school (CI-E, 8 to 9 yrs) with scores obtained at high school (CI-HS, 15 to 18 yrs); (2) evaluate speech perception abilities in demanding listening conditions (i.e., noise and lower intensity levels) at adolescence; and (3) examine the relation of speech perception scores to speech and language development over this longitudinal timeframe. All 112 teenagers were part of a previous nationwide study of 8- and 9-yr-olds (N = 181) who received a CI between 2 and 5 yrs of age. The test battery included (1) the Lexical Neighborhood Test (LNT; hard and easy word lists); (2) the Bamford Kowal Bench sentence test; (3) the Children's Auditory-Visual Enhancement Test; (4) the Test of Auditory Comprehension of Language at CI-E; (5) the Peabody Picture Vocabulary Test at CI-HS; and (6) the McGarr sentences (consonants correct) at CI-E and CI-HS. CI-HS speech perception was measured in both optimal and demanding listening conditions (i.e., background noise and low-intensity level). Speech perception scores were compared based on age at test, lexical difficulty of stimuli, listening environment (optimal and demanding), input mode (visual and auditory-visual), and language age. All group mean scores significantly increased with age across the two test sessions. Scores of adolescents significantly decreased in demanding listening conditions. The effect of lexical difficulty on the LNT scores, as evidenced by the difference in performance between easy versus hard lists, increased with age and decreased for adolescents in challenging listening conditions. Calculated curves for percent correct speech perception scores (LNT and Bamford Kowal Bench) and consonants correct on the McGarr sentences plotted against age-equivalent language scores on the Test of Auditory Comprehension of Language and Peabody Picture Vocabulary Test achieved asymptote at similar ages, around 10 to 11 yrs. On average, children receiving CIs between 2 and 5 yrs of age exhibited significant improvement on tests of speech perception, lipreading, speech production, and language skills measured between primary grades and adolescence. Evidence suggests that improvement in speech perception scores with age reflects increased spoken language level up to a language age of about 10 yrs. Speech perception performance significantly decreased with softer stimulus intensity level and with introduction of background noise. Upgrades to newer speech processing strategies and greater use of frequency-modulated systems may be beneficial for ameliorating performance under these demanding listening conditions.
An Evaluation of the IntelliMetric[SM] Essay Scoring System

ERIC Educational Resources Information Center

Rudner, Lawrence M.; Garcia, Veronica; Welch, Catherine

2006-01-01

This report provides a two-part evaluation of the IntelliMetric[SM] automated essay scoring system based on its performance scoring essays from the Analytic Writing Assessment of the Graduate Management Admission Test[TM] (GMAT[TM]). The IntelliMetric system performance is first compared to that of individual human raters, a Bayesian system…
Score Distributions of the Balance Outcome Measure for Elder Rehabilitation (BOOMER) in Community-Dwelling Older Adults With Vertebral Fracture.

PubMed

Brown, Zachary M; Gibbs, Jenna C; Adachi, Jonathan D; Ashe, Maureen C; Hill, Keith D; Kendler, David L; Khan, Aliya; Papaioannou, Alexandra; Prasad, Sadhana; Wark, John D; Giangregorio, Lora M

2017-11-28

We sought to evaluate the Balance Outcome Measure for Elder Rehabilitation (BOOMER) in community-dwelling women 65 years and older with vertebral fracture and to describe score distributions and potential ceiling and floor effects. This was a secondary data analysis of baseline data from the Build Better Bones with Exercise randomized controlled trial using the BOOMER. A total of 141 women with osteoporosis and radiographically confirmed vertebral fracture were included. Concurrent validity and internal consistency were assessed in comparison to the Short Physical Performance Battery (SPPB). Normality and ceiling/floor effects of total BOOMER scores and component test items were also assessed. Exploratory analyses of assistive aid use and falls history were performed. Tests for concurrent validity demonstrated moderate correlation between total BOOMER and SPPB scores. The BOOMER component tests showed modest internal consistency. Substantial ceiling effect and nonnormal score distributions were present among overall sample and those not using assistive aids for total BOOMER scores, although scores were normally distributed for those using assistive aids. The static standing with eyes closed test demonstrated the greatest ceiling effects of the component tests, with 92% of participants achieving a maximal score. While the BOOMER compares well with the SPPB in community-dwelling women with vertebral fractures, researchers or clinicians considering using the BOOMER in similar or higher-functioning populations should be aware of the potential for ceiling effects.
Meal composition and shift work performance.

PubMed

Love, Heather L; Watters, Corilee A; Chang, Wei-Ching

2005-01-01

Research indicates that the ability to perform a task can be affected by the composition of the meal preceding the task. This study investigated the effect of shift workers' consumption of a medium-fat, medium-carbohydrate meal on alertness scores. Six subjects (four men, two women) aged 19 to 44 recorded food intake, sleep, and quality of sleep for two weeks, and measured their body temperature and performed cognitive tests during two night shifts at baseline and in test periods. The Stanford Sleepiness Scale (SSS) was used to quantify sleepiness, and a Paced Auditory Serial Addition Test (PASAT) was used to measure cognitive performance. In comparison with the score at baseline, when subjects had a low-fat, high-carbohydrate dietary intake (1,335 kcal/5,588 kJ, 56% carbohydrate, 28% fat), the 1.6-second PASAT score improved significantly (p=0.042) during night shifts when subjects consumed a test meal (987 kcal/4,131 kJ, 46% carbohydrate, 42% fat). No statistically significant difference in SSS was found between baseline and test periods. The reduced body temperature between 2400 hours and 0530 hours was similar for both baseline and test periods. Meal composition and size during night shifts may affect cognitive performance.
Performance of machine-learning scoring functions in structure-based virtual screening

PubMed Central

Wójcikowski, Maciej; Ballester, Pedro J.; Siedlecki, Pawel

2017-01-01

Classical scoring functions have reached a plateau in their performance in virtual screening and binding affinity prediction. Recently, machine-learning scoring functions trained on protein-ligand complexes have shown great promise in small tailored studies. They have also raised controversy, specifically concerning model overfitting and applicability to novel targets. Here we provide a new ready-to-use scoring function (RF-Score-VS) trained on 15 426 active and 893 897 inactive molecules docked to a set of 102 targets. We use the full DUD-E data sets along with three docking tools, five classical and three machine-learning scoring functions for model building and performance assessment. Our results show RF-Score-VS can substantially improve virtual screening performance: RF-Score-VS top 1% provides 55.6% hit rate, whereas that of Vina only 16.2% (for smaller percent the difference is even more encouraging: RF-Score-VS top 0.1% achieves 88.6% hit rate for 27.5% using Vina). In addition, RF-Score-VS provides much better prediction of measured binding affinity than Vina (Pearson correlation of 0.56 and −0.18, respectively). Lastly, we test RF-Score-VS on an independent test set from the DEKOIS benchmark and observed comparable results. We provide full data sets to facilitate further research in this area (http://github.com/oddt/rfscorevs) as well as ready-to-use RF-Score-VS (http://github.com/oddt/rfscorevs_binary). PMID:28440302
Gross Olfaction Before and After Laparoscopic Gastric Bypass.

PubMed

Zerrweck, Carlos; Gallardo, Vannia Castañeda; Calleja, Carmen; Sepúlveda, Elisa; Guilber, Lizbeth

2017-11-01

Obesity leads to olfaction alterations, and this can further impact food choices, appetite, and nutritional status. Bariatric procedures induce weight loss and change in taste and smell perception, but more information is needed, especially using objective olfaction tests. A prospective study was conducted during 6 months, with candidates to laparoscopic gastric bypass at a single institution. A preoperative nasofibroscopy and gross smell identification test (The Pocket Smell Test ®) were performed in those meeting the inclusion criteria. After 6 months, a new test was performed, and the primary objective was to determine if there was an improvement in the olfaction score. Weight loss and comorbidities improvement were also analyzed. From the 30 patients with morbid obesity enrolled, 21 met the inclusion criteria and ENT evaluation. At baseline, 42.8% of patients scored 3 points, 53.3% scored 2 points, and 4.7% scored 1 point. After 6 months, there was a -81.1% of change. Seventeen patients scored 3 points (p = 0.002 vs initial) and two scored 2 points (p = 0.006 vs initial). There were no patients with less than 2 points. Weight and comorbidities had a significant improvement as well. Laparoscopic gastric bypass improves the olfaction scores of the Pocket Smell Test in morbidly obese patients 6 months after their procedure. More complex tests can be used in candidates to bariatric surgery if low scores are detected initially. Other causes of olfaction dysfunctions should be determined if there is no improvement after weight loss.
The Impact of Settable Test Item Exposure Control Interface Format on Postsecondary Business Student Test Performance

ERIC Educational Resources Information Center

Truell, Allen D.; Zhao, Jensen J.; Alexander, Melody W.

2005-01-01

The purposes of this study were to determine if there is a significant difference in postsecondary business student scores and test completion time based on settable test item exposure control interface format, and to determine if there is a significant difference in student scores and test completion time based on settable test item exposure…
Effect of ice massage on lower extremity functional performance and weight discrimination ability in collegiate footballers.

PubMed

Sharma, Geeta; Noohu, Majumi Mohamad

2014-09-01

Cryotherapy, in the form of ice massge is used to reduce inflammation after acute musculoskeletal injury or trauma. The potential negative effects of ice massage on proprioception are unknown, despite equivocal evidence supporting its effectiveness. The purpose of the study was to test the influence of cooling on weight discrimination ability and hence the performance in footballers. The study was of same subject experimental design (pretest-posttest design). Thirty male collegiate football players, whose mean age was 21.07 years, participated in the study. The participants were assessed for two functional performance tests, single leg hop test and crossed over hop test and weight discrimination ability before and after ice massage for 5 minutes on hamstrings muscle tendon. Pre cooling scores of Single Leg Hop Test of the dominant leg in the subjects was 166.65 (± 10.16) cm and post cooling scores of the dominant leg was 167.25 (± 11.77) cm. Pre cooling scores of Crossed Over Hop Test of the dominant leg in the subjects was 174.14 (± 8.60) cm and post cooling scores of the dominant leg was 174.45 (± 9.28) cm. Pre cooling scores of Weight Discrimination Differential Threshold of the dominant leg in the subjects was 1.625 ± 1.179 kg compared with post cooling scores of the dominant leg 1.85 (± 1.91) kg. Pre cooling scores of single leg hop and crossed over hop test of the dominant leg in the subjects compared with post cooling scores of the dominant leg showed no significant differences and it was also noted that the weight discrimination ability (weight discrimination differential threshold) didn't show any significant difference. All the values are reported as mean ± SD. This study provides additional evidence that proprioceptive acuity in the hamstring muscles (biceps femoris) remains largely unaffected after ice application to the hamstrings tendon (biceps femoris).
Improved neurocognitive test performance in both arms of the SMART study: impact of practice effect

PubMed Central

Grund, Birgit; Wright, Edwina J.; Brew, Bruce J.; Price, Richard W.; Roediger, Mollie P.; Bain, Margaret P.; Hoy, Jennifer F.; Shlay, Judith C.; Vjecha, Michael J.; Robertson, Kevin R.

2013-01-01

We evaluated factors associated with improvement in neurocognitive performance in 258 HIV-infected adults with baseline CD4 lymphocyte counts above 350 cells/mm3 randomized to intermittent, CD4-guided antiretroviral therapy (ART) (128 participants) versus continuous therapy (130) in the Neurology substudy of the Strategies for Management of Antiretroviral Therapy trial. Participants were enrolled in Australia, North America, Brazil, and Thailand, and neurocognitive performance was assessed by a five-test battery at baseline and month 6. The primary outcome was change in the quantitative neurocognitive performance z score (QNPZ-5), the average of the z scores of the five tests. Associations of the 6-month change in test scores with ART use, CD4 cell counts, HIV RNA levels, and other factors were determined using multiple regression models. At baseline, median age was 40 years, median CD4 cell count was 513 cells/mm3, 88 % had plasma HIV RNA ≤400 copies/mL, and mean QNPZ-5 was −0.68. Neurocognitive performance improved in both treatment groups by 6 months; QNPZ-5 scores increased by 0.20 and 0.13 in the intermittent and continuous ART groups, respectively (both P<0.001 for increase and P=0.26 for difference). ART was used on average for 3.6 and 5.9 out of the 6 months in the intermittent and continuous ART groups, respectively, but the increase in neurocognitive test scores could not be explained by ART use, changes in CD4, or plasma HIV RNA, which suggests a practice effect. The impact of a practice effect after 6 months emphasizes the need for a control group in HIV studies that measure intervention effects using neurocognitive tests similar to ours. PMID:23943468

Measures of Partial Knowledge and Unexpected Responses in Multiple-Choice Tests

ERIC Educational Resources Information Center

Chang, Shao-Hua; Lin, Pei-Chun; Lin, Zih-Chuan

2007-01-01

This study investigates differences in the partial scoring performance of examinees in elimination testing and conventional dichotomous scoring of multiple-choice tests implemented on a computer-based system. Elimination testing that uses the same set of multiple-choice items rewards examinees with partial knowledge over those who are simply…
42 CFR 493.859 - Standard; ABO group and D (Rho) typing.

Code of Federal Regulations, 2013 CFR

2013-10-01

... attain a score of at least 100 percent of acceptable responses for each analyte or test in each testing event is unsatisfactory analyte performance for the testing event. (b) Failure to attain an overall.... (2) For any unacceptable analyte or unsatisfactory testing event score, remedial action must be taken...
42 CFR 493.859 - Standard; ABO group and D (Rho) typing.

Code of Federal Regulations, 2012 CFR

2012-10-01

... attain a score of at least 100 percent of acceptable responses for each analyte or test in each testing event is unsatisfactory analyte performance for the testing event. (b) Failure to attain an overall.... (2) For any unacceptable analyte or unsatisfactory testing event score, remedial action must be taken...
42 CFR 493.859 - Standard; ABO group and D (Rho) typing.

Code of Federal Regulations, 2014 CFR

2014-10-01

... attain a score of at least 100 percent of acceptable responses for each analyte or test in each testing event is unsatisfactory analyte performance for the testing event. (b) Failure to attain an overall.... (2) For any unacceptable analyte or unsatisfactory testing event score, remedial action must be taken...
A prognostic scoring system for arm exercise stress testing.

PubMed

Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

2016-01-01

Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs-10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.
The Perceptions of Standardized Tests, Academic Self-Efficacy, and Academic Performance of African American Graduate Students: a Correlational and Comparative Analysis

ERIC Educational Resources Information Center

Marrah, Arleezah K.

2012-01-01

The academic performance of African American students continues to be a concern for educators, researchers, and most importantly their community. This issue is particularly prevalent in the standardized test scores of African American students where they score on average one or more standard deviations below their Caucasian and Asian American…
Assessing the Effect of School Days and Absences on Test Score Performance. CEP Discussion Paper No. 1302

ERIC Educational Resources Information Center

Aucejo, Esteban M.; Romano, Teresa Foy

2014-01-01

While instructional time is viewed as crucial to learning, little is known about the effectiveness of reducing absences relative to increasing the number of school days. In this regard, this paper jointly estimates the effect of absences and length of the school calendar on test score performance. Using administrative data from North Carolina…
Using screen-based simulation to improve performance during pediatric resuscitation.

PubMed

Biese, Kevin J; Moro-Sutherland, Donna; Furberg, Robert D; Downing, Brian; Glickman, Larry; Murphy, Alison; Jackson, Cheryl L; Snyder, Graham; Hobgood, Cherri

2009-12-01

To assess the ability of a screen-based simulation-training program to improve emergency medicine and pediatric resident performance in critical pediatric resuscitation knowledge, confidence, and skills. A pre-post, interventional design was used. Three measures of performance were created and assessed before and after intervention: a written pre-course knowledge examination, a self-efficacy confidence score, and a skills-based high-fidelity simulation code scenario. For the high-fidelity skills assessment, independent physician raters recorded and reviewed subject performance. The intervention consisted of eight screen-based pediatric resuscitation scenarios that subjects had 4 weeks to complete. Upon completion of the scenarios, all three measures were repeated. For the confidence assessment, summary pre- and post-test summary confidence scores were compared using a t-test, and for the skills assessment, pre-scores were compared with post-test measures for each individual using McNemar's chi-square test for paired samples. Twenty-six of 35 (71.3%) enrolled subjects completed the institutional review board-approved study. Increases were observed in written test scores, confidence, and some critical interventions in high-fidelity simulation. The mean improvement in cumulative confidence scores for all residents was 10.1 (SD +/-4.9; range 0-19; p < 0.001), with no resident feeling less confident after the intervention. Although overall performance in simulated codes did not change significantly, with average scores of 6.65 (+/-1.76) to 7.04 (+/-1.37) out of 9 possible points (p = 0.58), improvement was seen in the administering of appropriate amounts of IV fluids (59-89%, p = 0.03). In this study, improvements in resident knowledge, confidence, and performance of certain skills in simulated pediatric cardiac arrest scenarios suggest that screen-based simulations may be an effective way to enhance resuscitation skills of pediatric providers. These results should be confirmed using a randomized design with an appropriate control group. (c) 2009 by the Society for Academic Emergency Medicine.
Automated smartphone audiometry: Validation of a word recognition test app.

PubMed

Dewyer, Nicholas A; Jiradejvong, Patpong; Henderson Sabes, Jennifer; Limb, Charles J

2018-03-01

Develop and validate an automated smartphone word recognition test. Cross-sectional case-control diagnostic test comparison. An automated word recognition test was developed as an app for a smartphone with earphones. English-speaking adults with recent audiograms and various levels of hearing loss were recruited from an audiology clinic and were administered the smartphone word recognition test. Word recognition scores determined by the smartphone app and the gold standard speech audiometry test performed by an audiologist were compared. Test scores for 37 ears were analyzed. Word recognition scores determined by the smartphone app and audiologist testing were in agreement, with 86% of the data points within a clinically acceptable margin of error and a linear correlation value between test scores of 0.89. The WordRec automated smartphone app accurately determines word recognition scores. 3b. Laryngoscope, 128:707-712, 2018. © 2017 The American Laryngological, Rhinological and Otological Society, Inc.
The method used to set the pass mark in an objective structured clinical examination defines the performance of candidates for certification as rheumatologists.

PubMed

Pascual-Ramos, Virginia; Guilaisne Bernard-Medina, Ana; Flores-Alvarado, Diana Elsa; Portela-Hernández, Margarita; Maldonado-Velázquez, María Del Rocío; Jara-Quezada, Luis Javier; Amezcua-Guerra, Luis Manuel; Rubio-Judith López-Zepeda, Nadina E; Álvarez-Hernandez, Everardo; Saavedra, Miguel Ángel; Arce-Salinas, César Alejandro

The Mexican Accreditation Council for Rheumatology certifies trainees (TR) on an annual basis using both a multiple-choice question (MCQ) test and an objective structured clinical examination (OSCE). For 2013 and 2014, the OSCE pass mark (PM) was set by criterion referencing as ≥6 (CPM), whereas overall rating of borderline performance method (BPM) was added for 2015 and 2016 accreditations. We compared OSCE TR performance according to CPM and BPM, and examined whether correlations between MCQ and OSCE were affected by PM. Forty-three (2015) and 37 (2016) candidates underwent both tests. Altogether, OSCE were integrated by 15 validated stations; one evaluator per station scored TR performance according to a station-tailored check-list and a Likert scale (fail, borderline, above range) of overall performance. A composite OSCE score was derived for each candidate. Appropriate statistics were used. Mean (±standard derivation [SD]) MCQ test scores were 6.6±0.6 (2015) and 6.4±0.6 (2016) with 5 candidates receiving a failing score each year. Mean (±SD) OSCE scores were 7.4±0.6 (2015) and 7.3±0.6 (2016); no candidate received a failing CPM score in either 2015 or 2016 OSCE, although 21 (49%) and 19 (51%) TR, respectively, received a failing BPM score (calculated as 7.3 and 7.4, respectively). Stations for BPM ranged from 4.5 to 9.5; overall, candidates showed better performance in CPM. In all, MCQ correlated with composite OSCE, r=0.67 (2015) and r=0.53 (2016); P≤.001. Trainees with a passing BPM score in OSCE had higher MCQ scores than those with a failing score. Overall, OSCE-PM selection impacted candidates' performance but had a limited affect on correlation between clinical and practical examinations. Copyright © 2016 Elsevier España, S.L.U. and Sociedad Española de Reumatología y Colegio Mexicano de Reumatología. All rights reserved.
Validation and clinical utility of the executive function performance test in persons with traumatic brain injury.

PubMed

Baum, C M; Wolf, T J; Wong, A W K; Chen, C H; Walker, K; Young, A C; Carlozzi, N E; Tulsky, D S; Heaton, R K; Heinemann, A W

2017-07-01

This study examined the relationships between the Executive Function Performance Test (EFPT), the NIH Toolbox Cognitive Function tests, and neuropsychological executive function measures in 182 persons with traumatic brain injury (TBI) and 46 controls to evaluate construct, discriminant, and predictive validity. Construct validity: There were moderate correlations between the EFPT and the NIH Toolbox Crystallized (r = -.479), Fluid Tests (r = -.420), and Total Composite Scores (r = -.496). Discriminant validity: Significant differences were found in the EFPT total and sequence scores across control, complicated mild/moderate, and severe TBI groups. We found differences in the organisation score between control and severe, and between mild and severe TBI groups. Both TBI groups had significantly lower scores in safety and judgement than controls. Compared to the controls, the severe TBI group demonstrated significantly lower performance on all instrumental activities of daily living (IADL) tasks. Compared to the mild TBI group, the controls performed better on the medication task, the severe TBI group performed worse in the cooking and telephone tasks. Predictive validity: The EFPT predicted the self-perception of independence measured by the TBI-QOL (beta = -0.49, p < .001) for the severe TBI group. Overall, these data support the validity of the EFPT for use in individuals with TBI.
Prediction of true test scores from observed item scores and ancillary data.

PubMed

Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

2015-05-01

In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.
Pharmacy Student Self-Testing as a Predictor of Examination Performance

PubMed Central

Panus, Peter; Hagemeier, Nicholas; Thigpen, Jim; Brooks, Lauren

2014-01-01

Objectives. To determine if student self-testing improves performance during a doctor of pharmacy course. Methods. Students were given access to online quizzes with a large pool of randomly selected questions specific to upcoming examination content. Quizzes were electronically scored immediately upon completion and students were provided corrective feedback. Results. Examination scores following implementation of the practice quizzes were significantly higher in all but the last testing period. The upper fiftieth percentile of students scored higher on both the practice quizzes and subsequent examinations in all but the fourth testing period. Conclusions. Providing pharmacy students with self-testing opportunities could increase their retention of course material and provide feedback to both students and educators regarding learning, as well as provide students with a measure of their metacognition. PMID:24672065
Scoring Systems to Estimate Intracerebral Control and Survival Rates of Patients Irradiated for Brain Metastases;Brain metastases; Radiation therapy; Local control; Survival; Prognostic scores

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rades, Dirk, E-mail: Rades.Dirk@gmx.net; Dziggel, Liesa; Haatanen, Tiina

2011-07-15

Purpose: To create and validate scoring systems for intracerebral control (IC) and overall survival (OS) of patients irradiated for brain metastases. Methods and Materials: In this study, 1,797 patients were randomly assigned to the test (n = 1,198) or the validation group (n = 599). Two scoring systems were developed, one for IC and another for OS. The scores included prognostic factors found significant on multivariate analyses. Age, performance status, extracerebral metastases, interval tumor diagnosis to RT, and number of brain metastases were associated with OS. Tumor type, performance status, interval, and number of brain metastases were associated with IC.more » The score for each factor was determined by dividing the 6-month IC or OS rate (given in percent) by 10. The total score represented the sum of the scores for each factor. The score groups of the test group were compared with the corresponding score groups of the validation group. Results: In the test group, 6-month IC rates were 17% for 14-18 points, 49% for 19-23 points, and 77% for 24-27 points (p < 0.0001). IC rates in the validation group were 19%, 52%, and 77%, respectively (p < 0.0001). In the test group, 6-month OS rates were 9% for 15-19 points, 41% for 20-25 points, and 78% for 26-30 points (p < 0.0001). OS rates in the validation group were 7%, 39%, and 79%, respectively (p < 0.0001). Conclusions: Patients irradiated for brain metastases can be given scores to estimate OS and IC. IC and OS rates of the validation group were similar to the test group demonstrating the validity and reproducibility of both scores.« less
Reasoning and Comprehension Processes of Linguistic Minority Persons Learning from Text

DTIC Science & Technology

1989-08-25

scores for the ESL speakers are typical for this population. Performance on the Test of English as a Foreign Language ( TOEFL ) is the language proficiency...fluctuated around 500 for the past several years. An additional 7 ESL students reported scores on the Test of English as a Foreign Language ( TOEFL ) and 2...students reported both SAT and TOEFL scores. The mean TOEFL was 564.7, with scores ranging from 510 to 630. 0 The mean TOEFL score is representative of
The Short Physical Performance Battery is a discriminative tool for identifying patients with COPD at risk of disability.

PubMed

Bernabeu-Mora, Roberto; Medina-Mirapeix, Françesc; Llamazares-Herrán, Eduardo; García-Guillamón, Gloria; Giménez-Giménez, Luz María; Sánchez-Nieto, Juan Miguel

2015-01-01

Limited mobility is a risk factor for developing chronic obstructive pulmonary disease (COPD)-related disabilities. Little is known about the validity of the Short Physical Performance Battery (SPPB) for identifying mobility limitations in patients with COPD. To determine the clinical validity of the SPPB summary score and its three components (standing balance, 4-meter gait speed, and five-repetition sit-to-stand) for identifying mobility limitations in patients with COPD. This cross-sectional study included 137 patients with COPD, recruited from a hospital in Spain. Muscle strength tests and SPPB were measured; then, patients were surveyed for self-reported mobility limitations. The validity of SPPB scores was analyzed by developing receiver operating characteristic curves to analyze the sensitivity and specificity for identifying patients with mobility limitations; by examining group differences in SPPB scores across categories of mobility activities; and by correlating SPPB scores to strength tests. Only the SPPB summary score and the five-repetition sit-to-stand components showed good discriminative capabilities; both showed areas under the receiver operating characteristic curves greater than 0.7. Patients with limitations had significantly lower SPPB scores than patients without limitations in nine different mobility activities. SPPB scores were moderately correlated with the quadriceps test (r>0.40), and less correlated with the handgrip test (r<0.30), which reinforced convergent and divergent validities. A SPPB summary score cutoff of 10 provided the best accuracy for identifying mobility limitations. This study provided evidence for the validity of the SPPB summary score and the five-repetition sit-to-stand test for assessing mobility in patients with COPD. These tests also showed potential as a screening test for identifying patients with COPD that have mobility limitations.
The Short Physical Performance Battery is a discriminative tool for identifying patients with COPD at risk of disability

PubMed Central

Bernabeu-Mora, Roberto; Medina-Mirapeix, Françesc; Llamazares-Herrán, Eduardo; García-Guillamón, Gloria; Giménez-Giménez, Luz María; Sánchez-Nieto, Juan Miguel

2015-01-01

Background Limited mobility is a risk factor for developing chronic obstructive pulmonary disease (COPD)-related disabilities. Little is known about the validity of the Short Physical Performance Battery (SPPB) for identifying mobility limitations in patients with COPD. Objective To determine the clinical validity of the SPPB summary score and its three components (standing balance, 4-meter gait speed, and five-repetition sit-to-stand) for identifying mobility limitations in patients with COPD. Methods This cross-sectional study included 137 patients with COPD, recruited from a hospital in Spain. Muscle strength tests and SPPB were measured; then, patients were surveyed for self-reported mobility limitations. The validity of SPPB scores was analyzed by developing receiver operating characteristic curves to analyze the sensitivity and specificity for identifying patients with mobility limitations; by examining group differences in SPPB scores across categories of mobility activities; and by correlating SPPB scores to strength tests. Results Only the SPPB summary score and the five-repetition sit-to-stand components showed good discriminative capabilities; both showed areas under the receiver operating characteristic curves greater than 0.7. Patients with limitations had significantly lower SPPB scores than patients without limitations in nine different mobility activities. SPPB scores were moderately correlated with the quadriceps test (r>0.40), and less correlated with the handgrip test (r<0.30), which reinforced convergent and divergent validities. A SPPB summary score cutoff of 10 provided the best accuracy for identifying mobility limitations. Conclusion This study provided evidence for the validity of the SPPB summary score and the five-repetition sit-to-stand test for assessing mobility in patients with COPD. These tests also showed potential as a screening test for identifying patients with COPD that have mobility limitations. PMID:26664110
See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

ERIC Educational Resources Information Center

Blecher-Sass, Hope Sara; Moffitt, Maryellen

2010-01-01

Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…
Academic performance in adolescence after inguinal hernia repair in infancy: a nationwide cohort study.

PubMed

Hansen, Tom G; Pedersen, Jacob K; Henneberg, Steen W; Pedersen, Dorthe A; Murray, Jeffrey C; Morton, Neil S; Christensen, Kaare

2011-05-01

Although animal studies have indicated that general anesthetics may result in widespread apoptotic neurodegeneration and neurocognitive impairment in the developing brain, results from human studies are scarce. We investigated the association between exposure to surgery and anesthesia for inguinal hernia repair in infancy and subsequent academic performance. Using Danish birth cohorts from 1986-1990, we compared the academic performance of all children who had undergone inguinal hernia repair in infancy to a randomly selected, age-matched 5% population sample. Primary analysis compared average test scores at ninth grade adjusting for sex, birth weight, and paternal and maternal age and education. Secondary analysis compared the proportions of children not attaining test scores between the two groups. From 1986-1990 in Denmark, 2,689 children underwent inguinal hernia repair in infancy. A randomly selected, age-matched 5% population sample consists of 14,575 individuals. Although the exposure group performed worse than the control group (average score 0.26 lower; 95% CI, 0.21-0.31), after adjusting for known confounders, no statistically significant difference (-0.04; 95% CI, -0.09 to 0.01) between the exposure and control groups could be demonstrated. However, the odds ratio for test score nonattainment associated with inguinal hernia repair was 1.18 (95% CI, 1.04-1.35). Excluding from analyses children with other congenital malformations, the difference in mean test scores remained nearly unchanged (0.05; 95% CI, 0.00-0.11). In addition, the increased proportion of test score nonattainment within the exposure group was attenuated (odds ratio = 1.13; 95% CI, 0.98-1.31). In the ethnically and socioeconomically homogeneous Danish population, we found no evidence that a single, relatively brief anesthetic exposure in connection with hernia repair in infancy reduced academic performance at age 15 or 16 yr after adjusting for known confounding factors. However, the higher test score nonattainment rate among the hernia group could suggest that a subgroup of these children are developmentally disadvantaged compared with the background population.
Twelve-Week Exercise Influences Memory Complaint but not Memory Performance in Older Adults: A Randomized Controlled Study.

PubMed

Iuliano, Enzo; Fiorilli, Giovanni; Aquino, Giovanna; Di Costanzo, Alfonso; Calcagno, Giuseppe; di Cagno, Alessandra

2017-10-01

This study aimed to evaluate the effects of different types of exercise on memory performance and memory complaint after a 12-week intervention. Eighty community-dwelling volunteers, aged 66.96 ± 11.73 years, were randomly divided into four groups: resistance, cardiovascular, postural, and control groups (20 participants for each group). All participants were tested for their cognitive functions before and after their respective 12-week intervention using Rey memory words test, Prose memory test, and Memory Complaint Questionnaire (MAC-Q). Statistical analysis showed that the three experimental groups significantly improved MAC-Q scores in comparison with the control group (p < .05). The variation of MAC-Q scores and the variations of Rey and Prose memory tests scores were not correlated. These results indicate that the 12-week interventions exclusively influenced memory complaint but not memory performance. Further investigations are needed to understand the relation between memory complaint and memory performance, and the factors that can influence this relationship.

Nursing students' attitudes toward statistics: Effect of a biostatistics course and association with examination performance.

PubMed

Kiekkas, Panagiotis; Panagiotarou, Aliki; Malja, Alvaro; Tahirai, Daniela; Zykai, Rountina; Bakalis, Nick; Stefanopoulos, Nikolaos

2015-12-01

Although statistical knowledge and skills are necessary for promoting evidence-based practice, health sciences students have expressed anxiety about statistics courses, which may hinder their learning of statistical concepts. To evaluate the effects of a biostatistics course on nursing students' attitudes toward statistics and to explore the association between these attitudes and their performance in the course examination. One-group quasi-experimental pre-test/post-test design. Undergraduate nursing students of the fifth or higher semester of studies, who attended a biostatistics course. Participants were asked to complete the pre-test and post-test forms of The Survey of Attitudes Toward Statistics (SATS)-36 scale at the beginning and end of the course respectively. Pre-test and post-test scale scores were compared, while correlations between post-test scores and participants' examination performance were estimated. Among 156 participants, post-test scores of the overall SATS-36 scale and of the Affect, Cognitive Competence, Interest and Effort components were significantly higher than pre-test ones, indicating that the course was followed by more positive attitudes toward statistics. Among 104 students who participated in the examination, higher post-test scores of the overall SATS-36 scale and of the Affect, Difficulty, Interest and Effort components were significantly but weakly correlated with higher examination performance. Students' attitudes toward statistics can be improved through appropriate biostatistics courses, while positive attitudes contribute to higher course achievements and possibly to improved statistical skills in later professional life. Copyright © 2015 Elsevier Ltd. All rights reserved.
Are overreferrals on developmental screening tests really a problem?

PubMed

Glascoe, F P

2001-01-01

Developmental screening tests, even those meeting standards for screening test accuracy, produce numerous false-positive results for 15% to 30% of children. This is thought to produce unnecessary referrals for diagnostic testing or special services and increase the cost of screening programs. To explore whether children who pass screening tests differ in important ways from those who do not and to determine whether children overreferred for testing benefit from the scrutiny of diagnostic testing and treatment planning. Subjects were a national sample of 512 parents and their children (age range of the children, 7 months to 8 years) who participated in validation studies of various screening tests. Psychological examiners adhering to standardized directions obtained informed consent and administered at least 2 developmental screening measures (the Brigance Screens, the Battelle Developmental Inventory Screening Test, the Denver-II, and the Parents' Evaluations of Developmental Status) and a concurrent battery of diagnostic measures, including tests of intelligence, language, and academic achievement (for children aged 2(1/2) years and older). The performance on diagnostic measures of children who failed screening but were not found to have a disability (false positives) was compared with that of children who passed screening and did not have a disability on diagnostic testing (true negatives). Children with false-positive scores performed significantly (P<.001) lower on diagnostic measures than did children with true-negative scores. The false-positive group had scores in adaptive behavior, language, intelligence, and academic achievement that were 9 to 14 points lower than the scores of those in the true-negative group. When viewing the likelihood of scoring below the 25th percentile on diagnostic measures, children with false-positive scores had a relative risk of 2.6 in adaptive behavior (95% confidence interval [CI], 1.67-4.21), 3.1 in language skills (95% CI, 1.90-5.20), 6.7 on intelligence tests (95% CI, 3.28-13.50), and 4.9 on academic measures (95% CI, 2.61-9.28). Overall, 151 (70%) of the children with false-positive results scored below the 25th percentile on 1 or more diagnostic measures (the point at which most children have difficulty benefiting from typical classroom instruction) in contrast with 64 (29%) of the children with true-negative scores (odds ratio, 5.6; 95% CI, 3.73-8.49). Children with false-positive scores were also more likely to be nonwhite and to have parents who had not graduated from high school. Performance differences between children with true-negative scores and children with false-positive scores continued to be significant (P<.001) even after adjusting for sociodemographic differences between groups. Children overreferred for diagnostic testing by developmental screens perform substantially lower than children with true-negative scores on measures of intelligence, language, and academic achievement-the 3 best predictors of school success. These children also carry more psychosocial risk factors, such as limited parental education and minority status. Thus, children with false-positive screening results are an at-risk group for whom diagnostic testing may not be an unnecessary expense but rather a beneficial and needed service that can help focus intervention efforts. Although such testing will not indicate a need for special education placement, it can be useful in identifying children's needs for other programs known to improve language, cognitive, and academic skills, such as Head Start, Title I services, tutoring, private speech-language therapy, and quality day care.
Effect of a Lower Extremity Preventive Training Program on Physical Performance Scores in Military Recruits.

PubMed

Peck, Karen Y; DiStefano, Lindsay J; Marshall, Stephen W; Padua, Darin A; Beutler, Anthony I; de la Motte, Sarah J; Frank, Barnett S; Martinez, Jessica C; Cameron, Kenneth L

2017-11-01

Peck, KY, DiStefano, LJ, Marshall, SW, Padua, DA, Beutler, AI, de la Motte, SJ, Frank, BS, Martinez, JC, and Cameron, KL. Effect of a lower extremity preventive training program on physical performance scores in military recruits. J Strength Cond Res 31(11): 3146-3157, 2017-Exercise-based preventive training programs are designed to improve movement patterns associated with lower extremity injury risk; however, the impact of these programs on general physical fitness has not been evaluated. The purpose of this study was to compare fitness scores between participants in a preventive training program and a control group. One thousand sixty-eight freshmen from a U.S. Service Academy were cluster-randomized into either the intervention or control group during 6 weeks of summer training. The intervention group performed a preventive training program, specifically the Dynamic Integrated Movement Enhancement (DIME), which is designed to improve lower extremity movement patterns. The control group performed the Army Preparation Drill (PD), a warm-up designed to prepare soldiers for training. Main outcome measures were the Army Physical Fitness Test (APFT) raw and scaled (for age and sex) scores. Independent t tests were used to assess between-group differences. Multivariable logistic regression models were used to control for the influence of confounding variables. Dynamic Integrated Movement Enhancement group participants completed the APFT 2-mile run 20 seconds faster compared with the PD group (p < 0.001), which corresponded with significantly higher scaled scores (p < 0.001). Army Physical Fitness Test push-up scores were significantly higher in the DIME group (p = 0.041), but there were no significant differences in APFT sit-up scores. The DIME group had significantly higher total APFT scores compared with the PD group (p < 0.001). Similar results were observed in multivariable models after controlling for sex and body mass index (BMI). Committing time to the implementation of a preventive training program does not appear to negatively affect fitness test scores.
A cross-national study of calculus

NASA Astrophysics Data System (ADS)

Chai, Jun; Friedler, Louis M.; Wolff, Edward F.; Li, Jun; Rhea, Karen

2015-05-01

The results from a cross-national study comparing calculus performance of students at East China Normal University (ECNU) in Shanghai and students at the University of Michigan before and after their first university calculus course are presented. Overall, ECNU significantly outperformed Michigan on both the pre- and post-tests, but the Michigan students showed a larger gain and normalized gain, and hence narrowed the gap. ECNU's superior performance was especially striking on the subset of problems requiring only a pre-calculus background. On those, Michigan's post-test scores were below ECNU's pre-test scores and, indeed, ECNU's higher performance on both the overall pre-test and overall post-test is attributable to its success on these problems.
Comparative neurobehavioral study of a polybrominated biphenyl-exposed population in Michigan and a nonexposed group in Wisconsin.

PubMed Central

Valciukas, J A; Lilis, R; Wolff, M S; Anderson, H A

1978-01-01

An analysis of findings regarding the prevalence and time course of symptoms and the results of neurobehavioral testing among Michigan and Wisconsin dairy farmers, is reported. Reviewed are: (1) differences in the prevalence of neurological symptoms at the time of examination; (2) differences in the incidence and time course of symptoms for the period 1972--1976; (3) differences among populations and subgroups (sex and age) regarding performance test scores; (4) correlations between performance test scores and neurological symptoms; and (5) correlations between serum PBB levels as indicators of exposure and performance tests and neurological symptoms. PMID:209977
The Efficacy of Mammography Boot Camp to Improve the Performance of Radiologists

PubMed Central

Lee, Eun Hye; Jung, Seung Eun; Kim, You Me; Choi, Nami

2014-01-01

Objective To evaluate the efficacy of a mammography boot camp (MBC) to improve radiologists' performance in interpreting mammograms in the National Cancer Screening Program (NCSP) in Korea. Materials and Methods Between January and July of 2013, 141 radiologists were invited to a 3-day educational program composed of lectures and group practice readings using 250 digital mammography cases. The radiologists' performance in interpreting mammograms were evaluated using a pre- and post-camp test set of 25 cases validated prior to the camp by experienced breast radiologists. Factors affecting the radiologists' performance, including age, type of attending institution, and type of test set cases, were analyzed. Results The average scores of the pre- and post-camp tests were 56.0 ± 12.2 and 78.3 ± 9.2, respectively (p < 0.001). The post-camp test scores were higher than the pre-camp test scores for all age groups and all types of attending institutions (p < 0.001). The rate of incorrect answers in the post-camp test decreased compared to the pre-camp test for all suspicious cases, but not for negative cases (p > 0.05). Conclusion The MBC improves radiologists' performance in interpreting mammograms irrespective of age and type of attending institution. Improved interpretation is observed for suspicious cases, but not for negative cases. PMID:25246818
Associations of Adiposity and Aerobic Fitness with Executive Function and Math Performance in Danish Adolescents.

PubMed

Huang, Tao; Tarp, Jakob; Domazet, Sidsel Louise; Thorsen, Anne Kær; Froberg, Karsten; Andersen, Lars Bo; Bugge, Anna

2015-10-01

To examine the associations of adiposity and aerobic fitness with executive function and math performance in Danish adolescents. Cross-sectional analyses were conducted with data on 525 adolescents attending sixth and seventh grades from 14 schools in the 5 main regions of Denmark. A modified Eriksen flanker task was used to assess inhibitory control, a key aspect of executive function. Academic performance was assessed by a customized math test. Aerobic fitness was assessed by an intermittent shuttle-run test (Andersen test). Body mass index (BMI) was negatively associated with accuracy on incongruent trials during the flanker task (P = .005). A higher BMI was associated with a larger accuracy interference score (P = .01). Similarly, waist circumference (WC) was negatively associated with accuracy on incongruent trials (P = .008). A higher WC was associated with a larger reaction time (RT) interference score (P = .02) and accuracy interference score (P = .009). Higher aerobic fitness was associated with a faster RT on congruent trials (P = .009) and incongruent trials (P = .003). Higher aerobic fitness was associated with a smaller RT interference score (P = .04). Aerobic fitness was positively associated with math score (P < .001). BMI and WC were not associated with math score (P > .05). These results suggest that aerobic fitness is positively associated with both inhibitory control and math performance in adolescents. Adiposity is negatively associated with inhibitory control in adolescents. Adiposity is not associated with math performance. Copyright © 2015 Elsevier Inc. All rights reserved.
Further evidence for the increased power of LOD scores compared with nonparametric methods.

PubMed

Durner, M; Vieland, V J; Greenberg, D A

1999-01-01

In genetic analysis of diseases in which the underlying model is unknown, "model free" methods-such as affected sib pair (ASP) tests-are often preferred over LOD-score methods, although LOD-score methods under the correct or even approximately correct model are more powerful than ASP tests. However, there might be circumstances in which nonparametric methods will outperform LOD-score methods. Recently, Dizier et al. reported that, in some complex two-locus (2L) models, LOD-score methods with segregation analysis-derived parameters had less power to detect linkage than ASP tests. We investigated whether these particular models, in fact, represent a situation that ASP tests are more powerful than LOD scores. We simulated data according to the parameters specified by Dizier et al. and analyzed the data by using a (a) single locus (SL) LOD-score analysis performed twice, under a simple dominant and a recessive mode of inheritance (MOI), (b) ASP methods, and (c) nonparametric linkage (NPL) analysis. We show that SL analysis performed twice and corrected for the type I-error increase due to multiple testing yields almost as much linkage information as does an analysis under the correct 2L model and is more powerful than either the ASP method or the NPL method. We demonstrate that, even for complex genetic models, the most important condition for linkage analysis is that the assumed MOI at the disease locus being tested is approximately correct, not that the inheritance of the disease per se is correctly specified. In the analysis by Dizier et al., segregation analysis led to estimates of dominance parameters that were grossly misspecified for the locus tested in those models in which ASP tests appeared to be more powerful than LOD-score analyses.
Cross-validation of the Dot Counting Test in a large sample of credible and non-credible patients referred for neuropsychological testing.

PubMed

McCaul, Courtney; Boone, Kyle B; Ermshar, Annette; Cottingham, Maria; Victor, Tara L; Ziegler, Elizabeth; Zeller, Michelle A; Wright, Matthew

2018-01-18

To cross-validate the Dot Counting Test in a large neuropsychological sample. Dot Counting Test scores were compared in credible (n = 142) and non-credible (n = 335) neuropsychology referrals. Non-credible patients scored significantly higher than credible patients on all Dot Counting Test scores. While the original E-score cut-off of ≥17 achieved excellent specificity (96.5%), it was associated with mediocre sensitivity (52.8%). However, the cut-off could be substantially lowered to ≥13.80, while still maintaining adequate specificity (≥90%), and raising sensitivity to 70.0%. Examination of non-credible subgroups revealed that Dot Counting Test sensitivity in feigned mild traumatic brain injury (mTBI) was 55.8%, whereas sensitivity was 90.6% in patients with non-credible cognitive dysfunction in the context of claimed psychosis, and 81.0% in patients with non-credible cognitive performance in depression or severe TBI. Thus, the Dot Counting Test may have a particular role in detection of non-credible cognitive symptoms in claimed psychiatric disorders. Alternative to use of the E-score, failure on ≥1 cut-offs applied to individual Dot Counting Test scores (≥6.0″ for mean grouped dot counting time, ≥10.0″ for mean ungrouped dot counting time, and ≥4 errors), occurred in 11.3% of the credible sample, while nearly two-thirds (63.6%) of the non-credible sample failed one of more of these cut-offs. An E-score cut-off of 13.80, or failure on ≥1 individual score cut-offs, resulted in few false positive identifications in credible patients, and achieved high sensitivity (64.0-70.0%), and therefore appear appropriate for use in identifying neurocognitive performance invalidity.
Predicting student performance in sonographic scanning using spatial ability as an ability determinent of skill acquisition

NASA Astrophysics Data System (ADS)

Clem, Douglas Wayne

Spatial ability refers to an individual's capacity to visualize and mentally manipulate three dimensional objects. Since sonographers manually manipulate 2D and 3D sonographic images to generate multi-viewed, logical, sequential renderings of an anatomical structure, it can be assumed that spatial ability is central to the perception and interpretation of these medical images. Using Ackerman's theory of ability determinants of skilled performance as a conceptual framework, this study explored the relationship of spatial ability and learning sonographic scanning. Beginning first year sonography students from four different educational institutions were administered a spatial abilities test prior to their initial scanning lab coursework. The students' spatial test scores were compared with their scanning competency performance scores. A significant relationship between the students' spatial ability scores and their scanning performance scores was found. This result suggests that the use of spatial ability tests for admission to sonography programs may improve candidate selection, as well as assist programs in adjusting instruction and curriculum for students who demonstrate low spatial ability.
The relationship between selected standardized test scores and performance in advanced placement math and science exams: Analyzing the differential effectiveness of scores for course identification and placement

NASA Astrophysics Data System (ADS)

Urbina, Josue N.

There is a national need to increase the STEM-related workforce. Among factors leading towards STEM careers include the number of advanced high school mathematics and science courses students complete. Florida's enrollment patterns in STEM-related Advanced Placement (AP) courses, however, reveal that only a small percentage of students enroll into these classes. Therefore, screening tools are needed to find more students for these courses, who are academically ready, yet have not been identified. The purpose of this study was to investigate the extent to which scores from a national standardized test, Preliminary Scholastic Assessment Test/ National Merit Qualifying Test (PSAT/NMSQT), in conjunction with and compared to a state-mandated standardized test, Florida Comprehensive Assessment Test (FCAT), are related to selected AP exam performance in Seminole County Public Schools. An ex post facto correlational study was conducted using 6,189 student records from the 2010 - 2012 academic years. Multiple regression analyses using simultaneous Full Model testing showed differential moderate to strong relationships between scores in eight of the nine AP courses (i.e., Biology, Environmental Science, Chemistry, Physics B, Physics C Electrical, Physics C Mechanical, Statistics, Calculus AB and BC) examined. For example, the significant unique contribution to overall variance in AP scores was a linear combination of PSAT Math (M), Critical Reading (CR) and FCAT Reading (R) for Biology and Environmental Science. Moderate relationships for Chemistry included a linear combination of PSAT M, W (Writing) and FCAT M; a combination of FCAT M and PSAT M was most significantly associated with Calculus AB performance. These findings have implications for both research and practice. FCAT scores, in conjunction with PSAT scores, can potentially be used for specific STEM-related AP courses, as part of a systematic approach towards AP course identification and placement. For courses with moderate to strong relationships, validation studies and development of expectancy tables, which estimate the probability of successful performance on these AP exams, are recommended. Also, findings established a need to examine other related research issues including, but not limited to, extensive longitudinal studies and analyses of other available or prospective standardized test scores.
Evaluating Pekin duck walking ability using a treadmill performance test.

PubMed

Byrd, C J; Main, R P; Makagon, M M

2016-10-01

Gait scoring is the most popular method for assessing the walking ability of poultry species. Although inexpensive and easy to implement, gait scoring systems are often criticized for being subjective. Using a treadmill performance test we assessed whether observable differences in Pekin duck walking ability identified using a gait scoring system translated to differences in walking performance. One hundred and eighty ducks were selected using a three-category gait scoring system (GS0 = smooth gait, n = 55; GS0.5 = labored walk without easily identifiable impediment, n = 56; GS1 = obvious impediment, n = 59) and the amount of time each duck was able to sustain walking on a treadmill at a speed of 0.31 m/s was evaluated. The walking test ended when each duck met one of three elimination criteria: (1) The duck walked for a maximum time of ten minutes, (2) the duck required support from the observer's hand for more than three seconds in order to continue walking on the treadmill, or (3) the duck sat down on the treadmill and made no attempt to stand despite receiving assistance from the observer. Data were analyzed in SAS 9.4 using PROC GLM. Tukey's multiple comparison test was used to compare differences in time spent walking between gait scores. Significant differences were found between all gait scores (P < 0.05). Behavioral correlates of walking performance were investigated. Video recorded during the treadmill test was analyzed for counts of sitting, standing, and leaning behaviors. Data were analyzed in SAS 9.4 using a negative binomial model for count data. No differences were found between gait scores for counts of sitting, standing, and leaning behaviors (P > 0.05). In conclusion, the amount of time spent walking on the treadmill corresponded to gait score and was an effective measurement for quantifying Pekin duck walking ability. The test could be a valuable tool for assessing the development of walking issues or the effectiveness of treatments aimed at promoting leg health. © 2016 Poultry Science Association Inc.
Role of test motivation in intelligence testing.

PubMed

Duckworth, Angela Lee; Quinn, Patrick D; Lynam, Donald R; Loeber, Rolf; Stouthamer-Loeber, Magda

2011-05-10

Intelligence tests are widely assumed to measure maximal intellectual performance, and predictive associations between intelligence quotient (IQ) scores and later-life outcomes are typically interpreted as unbiased estimates of the effect of intellectual ability on academic, professional, and social life outcomes. The current investigation critically examines these assumptions and finds evidence against both. First, we examined whether motivation is less than maximal on intelligence tests administered in the context of low-stakes research situations. Specifically, we completed a meta-analysis of random-assignment experiments testing the effects of material incentives on intelligence-test performance on a collective 2,008 participants. Incentives increased IQ scores by an average of 0.64 SD, with larger effects for individuals with lower baseline IQ scores. Second, we tested whether individual differences in motivation during IQ testing can spuriously inflate the predictive validity of intelligence for life outcomes. Trained observers rated test motivation among 251 adolescent boys completing intelligence tests using a 15-min "thin-slice" video sample. IQ score predicted life outcomes, including academic performance in adolescence and criminal convictions, employment, and years of education in early adulthood. After adjusting for the influence of test motivation, however, the predictive validity of intelligence for life outcomes was significantly diminished, particularly for nonacademic outcomes. Collectively, our findings suggest that, under low-stakes research conditions, some individuals try harder than others, and, in this context, test motivation can act as a third-variable confound that inflates estimates of the predictive validity of intelligence for life outcomes.
Role of test motivation in intelligence testing

PubMed Central

Duckworth, Angela Lee; Quinn, Patrick D.; Lynam, Donald R.; Loeber, Rolf; Stouthamer-Loeber, Magda

2011-01-01

Intelligence tests are widely assumed to measure maximal intellectual performance, and predictive associations between intelligence quotient (IQ) scores and later-life outcomes are typically interpreted as unbiased estimates of the effect of intellectual ability on academic, professional, and social life outcomes. The current investigation critically examines these assumptions and finds evidence against both. First, we examined whether motivation is less than maximal on intelligence tests administered in the context of low-stakes research situations. Specifically, we completed a meta-analysis of random-assignment experiments testing the effects of material incentives on intelligence-test performance on a collective 2,008 participants. Incentives increased IQ scores by an average of 0.64 SD, with larger effects for individuals with lower baseline IQ scores. Second, we tested whether individual differences in motivation during IQ testing can spuriously inflate the predictive validity of intelligence for life outcomes. Trained observers rated test motivation among 251 adolescent boys completing intelligence tests using a 15-min “thin-slice” video sample. IQ score predicted life outcomes, including academic performance in adolescence and criminal convictions, employment, and years of education in early adulthood. After adjusting for the influence of test motivation, however, the predictive validity of intelligence for life outcomes was significantly diminished, particularly for nonacademic outcomes. Collectively, our findings suggest that, under low-stakes research conditions, some individuals try harder than others, and, in this context, test motivation can act as a third-variable confound that inflates estimates of the predictive validity of intelligence for life outcomes. PMID:21518867
An approach to analyzing a single subject's scores obtained in a standardized test with application to the Aachen Aphasia Test (AAT).

PubMed

Willmes, K

1985-08-01

Methods for the analysis of a single subject's test profile(s) proposed by Huber (1973) are applied to the Aachen Aphasia Test (AAT). The procedures are based on the classical test theory model (Lord & Novick, 1968) and are suited for any (achievement) test with standard norms from a large standardization sample and satisfactory reliability estimates. Two test profiles of a Wernicke's aphasic, obtained before and after a 3-month period of speech therapy, are analyzed using inferential comparisons between (groups of) subtest scores on one test application and between two test administrations for single (groups of) subtests. For each of these comparisons, the two aspects of (i) significant (reliable) differences in performance beyond measurement error and (ii) the diagnostic validity of that difference in the reference population of aphasic patients are assessed. Significant differences between standardized subtest scores and a remarkably better preserved reading and writing ability could be found for both test administrations using the multiple test procedure of Holm (1979). Comparison of both profiles revealed an overall increase in performance for each subtest as well as changes in level of performance relations between pairs of subtests.
Performance on the Functional Movement Screen™ is related to hop performance, but not to hip and knee strength in collegiate football players

PubMed Central

Willigenburg, Nienke; Hewett, Timothy E.

2016-01-01

Objective To define the relationship between FMS™ scores and hop performance, hip strength, and knee strength in collegiate football players. Design Cross-sectional cohort. Participants Freshmen of a division I collegiate American football team (n=59). Main Outcome Measures The athletes performed the FMS™, as well as a variety of hop tests, isokinetic knee strength and isometric hip strength tasks. We recorded total FMS™ score, peak strength and hop performance, and we calculated asymmetries between legs on the different tasks. Spearman’s correlation coefficients quantified the relationships these measures, and chi-square analyses compared the number of athletes with asymmetries on the different tasks. Results We observed significant correlations (r=0.38–0.56, p≤0.02) between FMS™ scores and hop distance, but not between FMS™ scores and hip or knee strength (all p≥0.21). The amount of asymmetry on the FMS™ test was significantly correlated to the amount of asymmetry on the timed 6m hop (r=0.44, p<0.01), but not to hip or knee strength asymmetries between limbs (all p≥0.34). Conclusions FMS™ score was positively correlated to hop distance, and limb asymmetry in FMS™ tasks was correlated to limb asymmetry in 6m hop time in football players. No significant correlations were observed between FMS™ score and hip and knee strength, or between FMS™ asymmetry and asymmetries in hip and knee strength between limbs. These results indicate that a simple hop for distance test may be a time and cost efficient alternative to FMS™ testing in athletes and that functional asymmetries between limbs do not coincide with strength asymmetries. PMID:26886801
Performance on the Functional Movement Screen Is Related to Hop Performance But Not to Hip and Knee Strength in Collegiate Football Players.

PubMed

Willigenburg, Nienke; Hewett, Timothy E

2017-03-01

To define the relationship between Functional Movement Screen (FMS) scores and hop performance, hip strength, and knee strength in collegiate football players. Cross-sectional cohort. Freshmen of a Division I collegiate American football team (n = 59). The athletes performed the FMS, and also a variety of hop tests, isokinetic knee strength, and isometric hip strength tasks. We recorded total FMS score, peak strength, and hop performance, and we calculated asymmetries between legs on the different tasks. Spearman correlation coefficients quantified the relationships between these measures, and χ analyses compared the number of athletes with asymmetries on the different tasks. We observed significant correlations (r = 0.38-0.56, P ≤ 0.02) between FMS scores and hop distance but not between FMS scores and hip or knee strength (all P ≥ 0.21). The amount of asymmetry on the FMS test was significantly correlated to the amount of asymmetry on the timed 6-m hop (r = 0.44, P < 0.01) but not to hip or knee strength asymmetries between limbs (all P ≥ 0.34). Functional Movement Screen score was positively correlated to hop distance, and limb asymmetry in FMS tasks was correlated to limb asymmetry in 6-m hop time in football players. No significant correlations were observed between FMS score and hip and knee strength or between FMS asymmetry and asymmetries in hip and knee strength between limbs. These results indicate that a simple hop for distance test may be a time-efficient and cost-efficient alternative to FMS testing in athletes and that functional asymmetries between limbs do not coincide with strength asymmetries.
The Eighth Grade CRCT as a Predictive Measure of Student Success on the Ninth Grade EOCT

ERIC Educational Resources Information Center

Body, Matthew

2013-01-01

Student performance on high stakes testing in secondary education has contributed to the need for students' testing potential to be identified before entering high school. There is evidence to suggest that a greater understanding of how earlier test scores predict later test scores will help educators and school officials increase student…
Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

ERIC Educational Resources Information Center

La Marca, Paul M.

2006-01-01

Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…
Language of administration and neuropsychological test performance in neurologically intact Hispanic American bilingual adults.

PubMed

Gasquoine, Philip Gerard; Croyle, Kristin L; Cavazos-Gonzalez, Cynthia; Sandoval, Omar

2007-11-01

This study compared the performance of Hispanic American bilingual adults on Spanish and English language versions of a neuropsychological test battery. Language achievement test scores were used to divide 36 bilingual, neurologically intact, Hispanic Americans from south Texas into Spanish-dominant, balanced, and English-dominant bilingual groups. They were administered the eight subtests of the Bateria Neuropsicologica and the Matrix Reasoning subtest of the WAIS-III in Spanish and English. Half the participants were tested in Spanish first. Balanced bilinguals showed no significant differences in test scores between Spanish and English language administrations. Spanish and/or English dominant bilinguals showed significant effects of language of administration on tests with higher language compared to visual perceptual weighting (Woodcock-Munoz Language Survey-Revised, Letter Fluency, Story Memory, and Stroop Color and Word Test). Scores on tests with higher visual-perceptual weighting (Matrix Reasoning, Figure Memory, Wisconsin Card Sorting Test, and Spatial Span), were not significantly affected by language of administration, nor were scores on the Spanish/California Verbal Learning Test, and Digit Span. A problem was encountered in comparing false positive rates in each language, as Spanish norms fell below English norms, resulting in a much higher false positive rate in English across all bilingual groupings. Use of a comparison standard (picture vocabulary score) reduced false positive rates in both languages, but the higher false positive rate in English persisted.

The application of soccer performance testing protocols to the non-elite player.

PubMed

Siegler, J; Robergs, R; Weingart, H

2006-03-01

The application of performance testing for the evaluation of non-elite soccer players has received little attention. The purpose of this investigation was to use tests developed for elite soccer players to evaluate performance in non-elite soccer players and compare performance test results between elite (literature) and non-elite (data) players. Thirteen male soccer players volunteered to participate. The tests included a treadmill VO2max test, 20 m sprint, vertical jump (VJ), 30 s Wingate cycle ergometer test, the Loughborough Intermittent Shuttle Test (LIST), and 2 20-m multi-stage shuttle runs to exhaustion (fatigue test). Actual VO2max (absolute and relative) scores were correlated with the estimated VO2max scores (fatigue test), 20 m sprint, VJ, and 30 s Wingate using a Pearson's product-moment correlation. A paired t-test was conducted on the fatigue test trials. Non-significant relationships were observed between actual VO2max scores and estimated VO2max from the fatigue test (absolute and relative terms). Non-significant relationships were also observed between peak and average power output (Wingate), 20 m sprint, and VJ. Mean heart rates (HRs) throughout the LIST was 165+/-7 bpm, which represented 88% of HRmax. The results of this study demonstrate that to elicit physiological differences between elite and non-elite players, assessment must include both an aerobic and anaerobic component.
Performance based on sEMG activity is related to psychosocial components: differences between back and abdominal endurance tests.

PubMed

Van Damme, Benedicte; Stevens, Veerle; Van Tiggelen, Damien; Perneel, Christiaan; Crombez, Geert; Danneels, Lieven

2014-10-01

The influence of psychosocial components on back and abdominal endurance tests in patients with persistent non-specific low back pain should be investigated to ensure the correct interpretation of these measures. Three-hundred and thirty-two patients (291 men and 41 women) from 19 to 63years performed an abdominal and back muscle endurance test after completing some psychosocial questionnaires. During the endurance tests, surface electromyography signals of the internal obliques, the external obliques, the lumbar multifidus and the iliocostalis were recorded. Patients were dichotomized as underperformers and good performers, by comparing their real endurance time, to the expected time of endurance derived from the normalized median frequency slope. Independent t-tests were performed to examine the differences on the outcome of the questionnaires. In the back muscle endurance test, the underperformers had significantly lower (p<0.05) scores on some of the physical subscales of the SF-36. The underperformers group of the AE test scored significantly higher on the DRAM MZDI (p=0.018) and on the PCS scale (p=0.020) and showed also significantly lower scores on the SF-36 (p<0.05). Back muscle endurance tests are influenced by physical components, while abdominal endurance tests seem influenced by psychosocial components. Copyright © 2014 Elsevier Ltd. All rights reserved.
The Relationship of Performance on the Dental Admission Test and Performance on Part I of the National Board Dental Examinations.

ERIC Educational Resources Information Center

De Ball, Suzanne; Sullivan, Kathleen; Horine, Julie; Duncan, William K.; Replogle, William

2002-01-01

Comapred University of Mississippi dental student scores on the Dental Admission Test (DAT) and Part I of the National Board Dental Examinations (NBDE) and found that DAT reading comprehension was a statistically significant predictor of all four subtests of the NBDE. Also found that DAT biology and organic chemistry scores were predictors of NBDE…
Naval Aerospace Medical Research Laboratory. 1993 Command History.

DTIC Science & Technology

1994-04-01

selected student naval aviators score differentially on the test battery and are their scores correlated with flight school performance? 58...Ph.D., attended 3rd Meeting of Accelerated Research Initiative, Nenral Constraints on Cognitive Architecture, Learning Research and Development...Shamma, S.E. and Stanny, R.R,, "Models of Cognitive Performance Assessment Tests," Mathematical Modeling and Scientific Compuiing, Vol. 2, pp. 240-245
Achievement, attributions, self-efficacy, and goal setting by accounting undergraduates.

PubMed

Cheng, Pi-Yueh; Chiou, Wen-Bin

2010-02-01

Correlations were examined between two measures of accounting self-efficacy achievement goal setting, attributions, and scores on the Accounting Practice Achievement Test, obtained 1 yr. apart for 124 freshmen in junior college. Analysis indicated favorable attribution contributed to a higher mean score on accounting self-efficacy. Students with higher perceived self-efficacy performed better on the proficiency tests. Those with higher self-efficacy also set higher goals for subsequent achievement tests. Moreover, students who set higher achievement goals performed better. Goal setting mediated the relation of initial self-efficacy with subsequent test performance. However, the amount of variance accounted for by self-efficacy was small. An effective method for enhancing performance on an accounting achievement test might be to increase beneficial attributions, self-efficacy in accounting, and to encourage setting reasonable achievement goals.
Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success.

PubMed

Niu, Sunny X; Tienda, Marta

2012-04-01

Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success-high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe.
Relationships between spatial activities and scores on the mental rotation test as a function of sex.

PubMed

Ginn, Sheryl R; Pickens, Stefanie J

2005-06-01

Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.
Effects of home and education environments on children's motor performance in China.

PubMed

Hua, Jing; Duan, Tao; Gu, Guixiong; Wo, Da; Zhu, Qinqin; Liu, Jiang-Qin; Liu, Ming; Wu, Zhuochun; Meng, Wei

2016-08-01

The aim of this study was to examine the effects of home and educational environments on children's motor performance in China. We conducted a cross-sectional study of 4001 preschool children selected from 160 classes. The children's motor performance was assessed using the Movement Assessment Battery for Children, 2nd edition (MABC-2). Home and educational environments were evaluated using validated checklists. The effects of home and educational environments on motor performance were analysed using mixed and multilevel logistic regression models. The results showed that one score increase in the outside space of the family home was positively associated with the increase in total test score (0.104) subtest score of aiming and catching (0.037), and balance (0.034) of the MABC-2, after adjusting for potential confounders (each p<0.05). Possession of motor toys at home and parental rearing behaviours were also related to total test score, manual dexterity, and balance (β=0.022-0.104, each p<0.05). Space and furnishings, activity, and interaction in the classroom had a significant positive association with total test score (β=0.069-0.201), and with subtest scores of manual dexterity, aiming and catching, and balance respectively (β=0.115-0.206). Space and furnishings of classrooms and possession of toys in the household were protective factors for 'at risk' or significant poor performance (odds ratio 0.942-0.973, each p<0.05). A permissive and accepting family and educational environment made a positive contribution to children's motor performance. Access to sufficient space and furnishings within the classroom, as well as toys in the family, were protective factors for poor motor performance. Future assistance is needed to support an advantageous environment in early childhood programmes in China. © 2016 Mac Keith Press.
Assessing mNIS+7Ionis and international neurologists' proficiency in a familial amyloidotic polyneuropathy trial.

PubMed

Dyck, Peter J; Kincaid, John C; Dyck, P James B; Chaudhry, Vinay; Goyal, Namita A; Alves, Christina; Salhi, Hayet; Wiesman, Janice F; Labeyrie, Celine; Robinson-Papp, Jessica; Cardoso, Márcio; Laura, Matilde; Ruzhansky, Katherine; Cortese, Andrea; Brannagan, Thomas H; Khoury, Julie; Khella, Sami; Waddington-Cruz, Márcia; Ferreira, João; Wang, Annabel K; Pinto, Marcus V; Ayache, Samar S; Benson, Merrill D; Berk, John L; Coelho, Teresa; Polydefkis, Michael; Gorevic, Peter; Adams, David H; Plante-Bordeneuve, Violaine; Whelan, Carol; Merlini, Giampaolo; Heitner, Stephen; Drachman, Brian M; Conceição, Isabel; Klein, Christopher J; Gertz, Morie A; Ackermann, Elizabeth J; Hughes, Steven G; Mauermann, Michelle L; Bergemann, Rito; Lodermeier, Karen A; Davies, Jenny L; Carter, Rickey E; Litchy, William J

2017-11-01

Polyneuropathy signs (Neuropathy Impairment Score, NIS), neurophysiologic tests (m+7 Ionis ), disability, and health scores were assessed in baseline evaluations of 100 patients entered into an oligonucleotide familial amyloidotic polyneuropathy (FAP) trial. We assessed: (1) Proficiency of grading neurologic signs and correlation with neurophysiologic tests, and (2) clinometric performance of modified NIS+7 neurophysiologic tests (mNIS+7 Ionis ) and its subscores and correlation with disability and health scores. The mNIS+7 Ionis sensitively detected, characterized, and broadly scaled diverse polyneuropathy impairments. Polyneuropathy signs (NIS and subscores) correlated with neurophysiology tests, disability, and health scores. Smart Somatotopic Quantitative Sensation Testing of heat as pain 5 provided a needed measure of small fiber involvement not adequately assessed by other tests. Specially trained neurologists accurately assessed neuropathy signs as compared to referenced neurophysiologic tests. The score, mNIS+7 Ionis , broadly detected, characterized, and scaled polyneuropathy abnormality in FAP, which correlated with disability and health scores. Muscle Nerve 56: 901-911, 2017. © 2017 Wiley Periodicals, Inc.
A Note on the Use of the Hiskey-Nebraska Test of Learning Aptitude with Deaf Children.

ERIC Educational Resources Information Center

Watson, Betty U.; Goldgar, David E.

1985-01-01

Comparing distribution of scores on the Hiskey-Nebraska Test of Learning Aptitude (H-NTLA) with those from the Wechsler Performance Scales for 71 hearing impaired Ss revealed a correlation of .85. However, the H-NTLA yielded more Ss with extreme scores. Findings stress the need for caution in interpreting extreme H-NTLA scores. (CL)
Development and validation of a composite scoring system for robot-assisted surgical training--the Robotic Skills Assessment Score.

PubMed

Chowriappa, Ashirwad J; Shi, Yi; Raza, Syed Johar; Ahmed, Kamran; Stegemann, Andrew; Wilding, Gregory; Kaouk, Jihad; Peabody, James O; Menon, Mani; Hassett, James M; Kesavadas, Thenkurussi; Guru, Khurshid A

2013-12-01

A standardized scoring system does not exist in virtual reality-based assessment metrics to describe safe and crucial surgical skills in robot-assisted surgery. This study aims to develop an assessment score along with its construct validation. All subjects performed key tasks on previously validated Fundamental Skills of Robotic Surgery curriculum, which were recorded, and metrics were stored. After an expert consensus for the purpose of content validation (Delphi), critical safety determining procedural steps were identified from the Fundamental Skills of Robotic Surgery curriculum and a hierarchical task decomposition of multiple parameters using a variety of metrics was used to develop Robotic Skills Assessment Score (RSA-Score). Robotic Skills Assessment mainly focuses on safety in operative field, critical error, economy, bimanual dexterity, and time. Following, the RSA-Score was further evaluated for construct validation and feasibility. Spearman correlation tests performed between tasks using the RSA-Scores indicate no cross correlation. Wilcoxon rank sum tests were performed between the two groups. The proposed RSA-Score was evaluated on non-robotic surgeons (n = 15) and on expert-robotic surgeons (n = 12). The expert group demonstrated significantly better performance on all four tasks in comparison to the novice group. Validation of the RSA-Score in this study was carried out on the Robotic Surgical Simulator. The RSA-Score is a valid scoring system that could be incorporated in any virtual reality-based surgical simulator to achieve standardized assessment of fundamental surgical tents during robot-assisted surgery. Copyright © 2013 Elsevier Inc. All rights reserved.
Performance of high school male athletes on the Functional Movement Screen™.

PubMed

Smith, Laura J; Creps, James R; Bean, Ryan; Rodda, Becky; Alsalaheen, Bara

2017-09-01

(1) Describe the performance of the Functional Movement Screen™ (FMS™) by reporting the proportion of adolescents with a score of ≤14 and the frequency of asymmetries in a cross-sectional sample; (2) explore associations between FMS™ to age and body mass, and explore the construct validity of the FMS™ against common postural stability measures; (3) examine the inter-rater and test-retest reliability of the FMS™ in adolescents. Cross-sectional. Field-setting. 94 male high-school athletes. The FMS™, Y-Balance Test (YBT) and Balance Error Scoring System (BESS). The median FMS™ composite score was 16 (9-21), 33% of participants scored below the suggested injury risk cutoff composite score of ≤14, and 62.8% had at least one asymmetry. No relationship was observed between the FMS™ to common static/dynamic balance tests. The inter-rater reliability of the FMS™ composite score suggested good reliability (ICC = 0.88, CI 95%:0.77, 0.94) and test-retest reliability for FMS™ composite scores was good with ICC = 0.83 (CI 95%:0.56, 0.95). FMS™ results should be interpreted cautiously with attention to the asymmetries identified during the screen, regardless of composite score. The lack of relationship between the FMS™ and other balance measures supports the notion that multiple screening tests should be used in order to provide a comprehensive picture of the adolescent athlete. Copyright © 2017 Elsevier Ltd. All rights reserved.
Character pathology and neuropsychological test performance in remitted opiate dependence

PubMed Central

Prosser, James M; Eisenberg, Daniel; Davey, Emily E; Steinfeld, Matthew; Cohen, Lisa J; London, Edythe D; Galynker, Igor I

2008-01-01

Background Cognitive deficits and personality pathology are prevalent in opiate dependence, even during periods of remission, and likely contribute to relapse. Understanding the relationship between the two in vulnerable, opiate-addicted patients may contribute to the design of better treatment and relapse prevention strategies. Methods The Millon Multiaxial Clinical Inventory (MCMI) and a series of neuropsychological tests were administered to three subject groups: 29 subjects receiving methadone maintenance treatment (MM), 27 subjects in protracted abstinence from methadone maintenance treatment (PA), and 29 healthy non-dependent comparison subjects. Relationships between MCMI scores, neuropsychological test results, and measures of substance use and treatment were examined using bivariate correlation and regression analysis. Results MCMI scores were greater in subjects with a history of opiate dependence than in comparison subjects. A significant negative correlation between MCMI scores and neuropsychological test performance was identified in all subjects. MCMI scores were stronger predictors of neuropsychological test performance than measures of drug use. Conclusion Formerly methadone-treated opiate dependent individuals in protracted opiate abstinence demonstrate a strong relationship between personality pathology and cognitive deficits. The cause of these deficits is unclear and most likely multi-factorial. This finding may be important in understanding and interpreting neuropsychological testing deficiencies in opiate-dependent subjects. PMID:19019247
Relationships between the Comprehensive Osteopathic Medical Achievement Test (COMAT) subject examinations and the COMLEX-USA Level 2-Cognitive Evaluation.

PubMed

Li, Feiming; Kalinowski, Kevin E; Song, Hao; Bates, Bruce P

2014-09-01

The relationship between the Comprehensive Osteopathic Medical Achievement Test (COMAT) series of subject examinations and the Comprehensive Osteopathic Medical Licensing Examination-USA Level 2-Cognitive Evaluation (COMLEX-USA Level 2-CE) has not been thoroughly examined. To investigate the factors associated with performance on COMAT subject examinations and how COMAT scores correlate with COMLEX-USA Level 2-CE scores. We examined scores of participants from 2 COMAT examination cycles in 2011 and 2012. According to surveys, most schools used COMAT scores in clerkship and clinical rotation evaluation, which were classified as being used for "high-stakes" purposes. We matched first-attempt COMAT scores with first-attempt COMLEX-USA Level 2-CE scores, and we conducted correlation analyses between the scores from the 7 COMAT subject examinations, as well as between COMAT and COMLEX-USA Level 2-CE scores. Multiple linear regression analyses were performed to investigate how much variance in COMLEX-USA Level 2-CE scores was explained by COMAT scores. In 2011 and 2012, respectively, 3751 and 3786 COMAT candidates had COMLEX-USA Level 2-CE scores (53.0% and 93.9%, respectively, had ⩾1 high-stakes COMAT score). Intercorrelations between COMAT scores were low to moderate (r=0.27-0.53), as hypothesized. Correlations between COMAT and Level 2-CE scores were moderate to high, with the highest correlations for internal medicine COMAT scores (r=0.63-0.65). All regressions showed internal medicine scores as the strongest predictor of Level 2-CE performance. Groups with high-stakes scores had larger adjusted coefficients of determination than those with low-stakes scores (eg, R(2)=0.63 vs 0.52, respectively, in 2011). For 2012 candidates with high-stakes scores, all predictors were statistically significant. The COMAT subject examination scores were moderately intercorrelated, as hypothesized, with higher correlations between COMAT and COMLEX-USA Level 2-CE scores. The COMAT performance was predictive of COMLEX-USA Level 2-CE performance. © 2014 The American Osteopathic Association.
Establishing pass/fail criteria for bronchoscopy performance.

PubMed

Konge, Lars; Clementsen, Paul; Larsen, Klaus Richter; Arendrup, Henrik; Buchwald, Christian; Ringsted, Charlotte

2012-01-01

Several tools have been created to assess competence in bronchoscopy. However, educational guidelines still use an arbitrary number of performed procedures to decide when basic competency is acquired. The purpose of this study was to define pass/fail scores for two bronchoscopy assessment tools, and investigate how these scores relate to physicians' experience regarding the number of bronchoscopy procedures performed. We studied two assessment tools and used two standard setting methods to create cut scores: the contrasting-groups method and the extended Angoff method. In the first we compared bronchoscopy performance scores of 14 novices with the scores of 14 experienced consultants to find the score that best discriminated between the two groups. In the second we asked an expert group of 7 experienced bronchoscopists to judge how a borderline trainee would perform on each item of the test. Using the contrasting-groups method we found a standard that would fail all novices and pass all consultants. A clear pass related to prior experience of 75 procedures. The consequences of using the extended Angoff method were also acceptable: all trainees who had performed less than 50 bronchoscopies failed the test and all consultants passed. A clear pass related to 80 procedures. Our proposed pass/fail scores for these two methods seem appropriate in terms of consequences. Prior experience with the performance of 75 and 80 bronchoscopies, respectively, seemed to ensure basic competency. In the future objective assessment tools could become an important aid in the certification of physicians performing bronchoscopies. Copyright © 2011 S. Karger AG, Basel.
37: COMPARISON OF TWO METHODS: TBL-BASED AND LECTURE-BASED LEARNING IN NURSING CARE OF PATIENTS WITH DIABETES IN NURSING STUDENTS

PubMed Central

Khodaveisi, Masoud; Qaderian, Khosro; Oshvandi, Khodayar; Soltanian, Ali Reza; Vardanjani, Mehdi molavi

2017-01-01

Background and aims learning plays an important role in developing nursing skills and right care-taking. The Present study aims to evaluate two learning methods based on team –based learning and lecture-based learning in learning care-taking of patients with diabetes in nursing students. Method In this quasi-experimental study, 64 students in term 4 in nursing college of Bukan and Miandoab were included in the study based on knowledge and performance questionnaire including 15 questions based on knowledge and 5 questions based on performance on care-taking in patients with diabetes were used as data collection tool whose reliability was confirmed by cronbach alpha (r=0.83) by the researcher. To compare the mean score of knowledge and performance in each group in pre-test step and post-test step, pair –t test and to compare mean of scores in two groups of control and intervention, the independent t- test was used. Results There was not significant statistical difference between two groups in pre terms of knowledge and performance score (p=0.784). There was significant difference between the mean of knowledge scores and diabetes performance in the post-test in the team-based learning group and lecture-based learning group (p=0.001). There was significant difference between the mean score of knowledge of diabetes care in pre-test and post-test in base learning groups (p=0.001). Conclusion In both methods team-based and lecture-based learning approaches resulted in improvement in learning in students, but the rate of learning in the team-based learning approach is greater compared to that of lecture-based learning and it is recommended that this method be used as a higher education method in the education of students.
Physique and Performance of Young Wheelchair Basketball Players in Relation with Classification

PubMed Central

Zancanaro, Carlo

2015-01-01

The relationships among physical characteristics, performance, and functional ability classification of younger wheelchair basketball players have been barely investigated to date. The purpose of this work was to assess anthropometry, body composition, and performance in sport-specific field tests in a national sample of Italian younger wheelchair basketball players as well as to evaluate the association of these variables with the players’ functional ability classification and game-related statistics. Several anthropometric measurements were obtained for 52 out of 91 eligible players nationwide. Performance was assessed in seven sport-specific field tests (5m sprint, 20m sprint with ball, suicide, maximal pass, pass for accuracy, spot shot and lay-ups) and game-related statistics (free-throw points scored per match, two- and three-point field-goals scored per match, and their sum). Association between variables, and predictivity was assessed by correlation and regression analysis, respectively. Players were grouped into four Classes of increasing functional ability (A-D). One-way ANOVA with Bonferroni’s correction for multiple comparisons was used to assess differences between Classes. Sitting height and functional ability Class especially correlated with performance outcomes, but wheelchair basketball experience and skinfolds did not. Game-related statistics and sport-specific field-test scores all showed significant correlation with each other. Upper arm circumference and/or maximal pass and lay-ups test scores were able to explain 42 to 59% of variance in game-related statistics (P<0.001). A clear difference in performance was only found for functional ability Class A and D. Conclusion: In younger wheelchair basketball players, sitting height positively contributes to performance. The maximal pass and lay-ups test should be carefully considered in younger wheelchair basketball training plans. Functional ability Class reflects to a limited extent the actual differences in performance. PMID:26606681
Relationship between COMLEX-USA scores and performance on the American Osteopathic Board of Emergency Medicine Part I certifying examination.

PubMed

Li, Feiming; Gimpel, John R; Arenson, Ethan; Song, Hao; Bates, Bruce P; Ludwin, Fredric

2014-04-01

Few studies have investigated how well scores from the Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) series predict resident outcomes, such as performance on board certification examinations. To determine how well COMLEX-USA predicts performance on the American Osteopathic Board of Emergency Medicine (AOBEM) Part I certification examination. The target study population was first-time examinees who took AOBEM Part I in 2011 and 2012 with matched performances on COMLEX-USA Level 1, Level 2-Cognitive Evaluation (CE), and Level 3. Pearson correlations were computed between AOBEM Part I first-attempt scores and COMLEX-USA performances to measure the association between these examinations. Stepwise linear regression analysis was conducted to predict AOBEM Part I scores by the 3 COMLEX-USA scores. An independent t test was conducted to compare mean COMLEX-USA performances between candidates who passed and who failed AOBEM Part I, and a stepwise logistic regression analysis was used to predict the log-odds of passing AOBEM Part I on the basis of COMLEX-USA scores. Scores from AOBEM Part I had the highest correlation with COMLEX-USA Level 3 scores (.57) and slightly lower correlation with COMLEX-USA Level 2-CE scores (.53). The lowest correlation was between AOBEM Part I and COMLEX-USA Level 1 scores (.47). According to the stepwise regression model, COMLEX-USA Level 1 and Level 2-CE scores, which residency programs often use as selection criteria, together explained 30% of variance in AOBEM Part I scores. Adding Level 3 scores explained 37% of variance. The independent t test indicated that the 397 examinees passing AOBEM Part I performed significantly better than the 54 examinees failing AOBEM Part I in all 3 COMLEX-USA levels (P<.001 for all 3 levels). The logistic regression model showed that COMLEX-USA Level 1 and Level 3 scores predicted the log-odds of passing AOBEM Part I (P=.03 and P<.001, respectively). The present study empirically supported the predictive and discriminant validities of the COMLEX-USA series in relation to the AOBEM Part I certification examination. Although residency programs may use COMLEX-USA Level 1 and Level 2-CE scores as partial criteria in selecting residents, Level 3 scores, though typically not available at the time of application, are actually the most statistically related to performances on AOBEM Part I.
The Effects of Item by Item Feedback Given during an Ability Test.

ERIC Educational Resources Information Center

Whetton, C.; Childs, R.

1981-01-01

Answer-until-correct (AUC) is a procedure for providing feedback during a multiple-choice test, giving an increased range of scores. The performance of secondary students on a verbal ability test using AUC procedures was compared with a group using conventional instructions. AUC scores considerably enhanced reliability but not validity.…
Automobile driver on-road performance test. Volume 3, Examiner's manual

DOT National Transportation Integrated Search

1981-09-30

This report provides procedures for administering and scoring the Automobile Driver On-Road Performance Test (ADOPT). The ADOPT checks 21 separate driving performances. Performances are checked at pre-determined locations along a 10-minute route and ...

Automobile driver on-road performance test. Volume 2, Administrator's manual

DOT National Transportation Integrated Search

1981-09-30

This report provides procedures for setting up, administering, and scoring the Automobile Driver On-Road Performance Test (ADOPT). The ADOPT checks 21 separate driving performances. Performances are checked at pre-determined locations along a 10-minu...
Pre-Service Identification of Talented Teachers through Non-Traditional Measures: A Study of the Role of Affective Variables as Predictors of Success in Student Teaching.

ERIC Educational Resources Information Center

Basom, Margaret; And Others

1994-01-01

Researchers examined relationships between the SRI Gallup Pre-Professional Teacher Interview and performance-based student teaching evaluations and between SRI Interview and California Student Achievement Test (CAT) scores. A relationship between SRI Interview scores and performance-based student teaching evaluations surfaces. CAT scores did not…
The Validity of Scores from the "GRE"® revised General Test for Forecasting Performance in Business Schools: Phase One. ETS GRE® Board Research Report. ETS GRE®-14-01. ETS Research Report. RR-14-17

ERIC Educational Resources Information Center

Young, John W.; Klieger, David; Bochenek, Jennifer; Li, Chen; Cline, Fred

2014-01-01

Scores from the "GRE"® revised General Test provide important information regarding the verbal and quantitative reasoning abilities and analytical writing skills of applicants to graduate programs. The validity and utility of these scores depend upon the degree to which the scores predict success in graduate and business school in…
Predicting preference-based SF-6D index scores from the SF-8 health survey.

PubMed

Wang, P; Fu, A Z; Wee, H L; Lee, J; Tai, E S; Thumboo, J; Luo, N

2013-09-01

To develop and test functions for predicting the preference-based SF-6D index scores from the SF-8 health survey. This study was a secondary analysis of data collected in a population health survey in which respondents (n = 7,529) completed both the SF-36 and the SF-8 questionnaires. We examined seven ordinary least-square estimators for their performance in predicting SF-6D scores from the SF-8 at both the individual and the group levels. In general, all functions performed similarly well in predicting SF-6D scores, and the predictions at the group level were better than predictions at the individual level. At the individual level, 42.5-51.5% of prediction errors were smaller than the minimally important difference (MID) of the SF-6D scores, depending on the function specifications, while almost all prediction errors of the tested functions were smaller than the MID of SF-6D at the group level. At both individual and group levels, the tested functions predicted lower than actual scores at the higher end of the SF-6D scale. Our study developed functions to generate preference-based SF-6D index scores from the SF-8 health survey, the first of its kind. Further research is needed to evaluate the performance and validity of the prediction functions.
The BioMedical Admissions Test for medical student selection: issues of fairness and bias.

PubMed

Emery, Joanne L; Bell, John F; Vidal Rodeiro, Carmen L

2011-01-01

The BioMedical Admissions Test (BMAT) forms part of the undergraduate medical admission process at the University of Cambridge. The fairness of admissions tests is an important issue. Aims were to investigate the relationships between applicants' background variables and BMAT scores, whether they were offered a place or rejected and, for those admitted, performance on the first year course examinations. Multilevel regression models were employed with data from three combined applicant cohorts. Admission rates for different groups were investigated with and without controlling for BMAT performance. The fairness of the BMAT was investigated by determining, for those admitted, whether scores predicted examination performance equitably. Despite some differences in applicants' BMAT performance (e.g. by school type and gender), BMAT scores predicted mean examination marks equitably for all background variables considered. The probability of achieving a 1st class examination result, however, was slightly under-predicted for those admitted from schools and colleges entering relatively few applicants. Not all differences in admission rates were accounted for by BMAT performance. However, the test constitutes only one part of a compensatory admission system in which other factors, such as interview performance, are important considerations. Results are in support of the equity of the BMAT.
Support vector regression scoring of receptor-ligand complexes for rank-ordering and virtual screening of chemical libraries.

PubMed

Li, Liwei; Wang, Bo; Meroueh, Samy O

2011-09-26

The community structure-activity resource (CSAR) data sets are used to develop and test a support vector machine-based scoring function in regression mode (SVR). Two scoring functions (SVR-KB and SVR-EP) are derived with the objective of reproducing the trend of the experimental binding affinities provided within the two CSAR data sets. The features used to train SVR-KB are knowledge-based pairwise potentials, while SVR-EP is based on physicochemical properties. SVR-KB and SVR-EP were compared to seven other widely used scoring functions, including Glide, X-score, GoldScore, ChemScore, Vina, Dock, and PMF. Results showed that SVR-KB trained with features obtained from three-dimensional complexes of the PDBbind data set outperformed all other scoring functions, including best performing X-score, by nearly 0.1 using three correlation coefficients, namely Pearson, Spearman, and Kendall. It was interesting that higher performance in rank ordering did not translate into greater enrichment in virtual screening assessed using the 40 targets of the Directory of Useful Decoys (DUD). To remedy this situation, a variant of SVR-KB (SVR-KBD) was developed by following a target-specific tailoring strategy that we had previously employed to derive SVM-SP. SVR-KBD showed a much higher enrichment, outperforming all other scoring functions tested, and was comparable in performance to our previously derived scoring function SVM-SP.
A study of time management: the correlation between video game usage and academic performance markers.

PubMed

Anand, Vivek

2007-08-01

This study analyzes the correlation between video game usage and academic performance. Scholastic Aptitude Test (SAT) and grade-point average (GPA) scores were used to gauge academic performance. The amount of time a student spends playing video games has a negative correlation with students' GPA and SAT scores. As video game usage increases, GPA and SAT scores decrease. A chi-squared analysis found a p value for video game usage and GPA was greater than a 95% confidence level (0.005 < p < 0.01). This finding suggests that dependence exists. SAT score and video game usage also returned a p value that was significant (0.01 < p < 0.05). Chi-squared results were not significant when comparing time spent studying and an individual's SAT score. This research suggests that video games may have a detrimental effect on an individual's GPA and possibly on SAT scores. Although these results show statistical dependence, proving cause and effect remains difficult, since SAT scores represent a single test on a given day. The effects of video games maybe be cumulative; however, drawing a conclusion is difficult because SAT scores represent a measure of general knowledge. GPA versus video games is more reliable because both involve a continuous measurement of engaged activity and performance. The connection remains difficult because of the complex nature of student life and academic performance. Also, video game usage may simply be a function of specific personality types and characteristics.
Preliminary Report on a National Cross-Validation of the Computerized Adaptive Screening Test (CAST).

ERIC Educational Resources Information Center

Knapp, Deirdre J.; Pliske, Rebecca M.

A study was conducted to validate the Army's Computerized Adaptive Screening Test (CAST), using data from 2,240 applicants from 60 army recruiting stations across the nation. CAST is a computer-assisted adaptive test used to predict performance on the Armed Forces Qualification Test (AFQT). AFQT scores are computed by adding four subtest scores of…
The effect of age-at-testing on verbal memory among children following severe traumatic brain injury.

PubMed

Silberg, Tamar; Ahonniska-Assa, Jaana; Levav, Miriam; Eliyahu, Roni; Peleg-Pilowsky, Tamar; Brezner, Amichai; Vakil, Eli

2016-01-01

Memory deficits are a common sequelae following childhood traumatic brain injury (TBI), which often have serious implications on age-related academic skills. The current study examined verbal memory performance using the Rey Auditory Verbal Learning Test (RAVLT) in a pediatric TBI sample. Verbal memory abilities as well as the effect of age at-testing on performance were examined. A sample of 67 children following severe TBI (age average = 12.3 ± 2.74) and 67 matched controls were evaluated using the RAVLT. Age effect at assessment was examined using two age groups: above and below 12 years of age during evaluation. Differences between groups were examined via the 9 RAVLT learning trials and the 7 composite scores conducted out of them. Children following TBI recalled significantly less words than controls on all RAVLT trials and had significantly lower scores on all composite scores. However, all of these scores fell within the low average range. Further analysis revealed significantly lower than average performance among the older children (above 12 years), while scores of the younger children following TBI fell within average limits. To conclude, verbal memory deficits among children following severe TBI demonstrate an age-at-testing effect with more prominent problems occurring above 12 years at the time of evaluation. Yet, age-appropriate performance among children below 12 years of age may not accurately describe memory abilities at younger ages following TBI. It is therefore recommended that clinicians address child's age at testing and avoid using a single test as an indicator of verbal memory functioning post TBI.
The impact of hearing loss on language performance in older adults with different stages of cognitive function

PubMed Central

Lodeiro-Fernández, Leire; Lorenzo-López, Laura; Maseda, Ana; Núñez-Naveira, Laura; Rodríguez-Villamil, José Luis; Millán-Calenti, José Carlos

2015-01-01

Purpose The possible relationship between audiometric hearing thresholds and cognitive performance on language tests was analyzed in a cross-sectional cohort of older adults aged ≥65 years (N=98) with different degrees of cognitive impairment. Materials and methods Participants were distributed into two groups according to Reisberg’s Global Deterioration Scale (GDS): a normal/predementia group (GDS scores 1–3) and a moderate/moderately severe dementia group (GDS scores 4 and 5). Hearing loss (pure-tone audiometry) and receptive and production-based language function (Verbal Fluency Test, Boston Naming Test, and Token Test) were assessed. Results Results showed that the dementia group achieved significantly lower scores than the predementia group in all language tests. A moderate negative correlation between hearing loss and verbal comprehension (r=−0.298; P<0.003) was observed in the predementia group (r=−0.363; P<0.007). However, no significant relationship between hearing loss and verbal fluency and naming scores was observed, regardless of cognitive impairment. Conclusion In the predementia group, reduced hearing level partially explains comprehension performance but not language production. In the dementia group, hearing loss cannot be considered as an explanatory factor of poor receptive and production-based language performance. These results are suggestive of cognitive rather than simply auditory problems to explain the language impairment in the elderly. PMID:25914528
Use of Prehire Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) Police Candidate Scores to Predict Supervisor Ratings of Posthire Performance.

PubMed

Tarescavage, Anthony M; Brewster, JoAnne; Corey, David M; Ben-Porath, Yossef S

2015-08-01

We examined associations between prehire Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores and posthire performance ratings for a sample of 131 male police officers. Substantive scale scores in this sample were meaningfully lower than those obtained by the test's normative sample and substantially range restricted, but scores were consistent with those produced by members of the police candidate comparison group (Corey & Ben-Porath). After applying a statistical correction for range restriction, we found several associations between MMPI-2-RF substantive scale scores and supervisor ratings of job-related performance. Findings for scales from the emotional dysfunction and interpersonal functioning domains of the test were particularly strong. For example, scales assessing low positive emotions and social avoidance were associated with several criteria that may be affected by lack of engagement with one's environment and other people, including problems with routine task performance, decision making, assertiveness, conscientiousness, and social competence. Implications of these findings for assessment science and practice are discussed. © The Author(s) 2014.
The association between neuropsychological scores and ethnicity, language, and acculturation variables in a large patient population.

PubMed

Boone, Kyle Brauer; Victor, Tara L; Wen, Johnny; Razani, Jill; Pontón, Marcel

2007-03-01

The relationship between ethnicity and cognitive test performance was examined in a sample of 161 patients referred for evaluation at a public hospital-affiliated neuropsychology clinic; 83 patients were Caucasian (non-Hispanic), 31 were African-American, 30 were Hispanic, and 17 were Asian. Significant group differences were present on some measures of language (Boston Naming Test), attention (Digit Span ACSS), constructional ability (Rey-Osterrieth [RO] copy), nonverbal processing speed (Trails A), and executive skills (Wisconsin Card Sorting Test [WCST]). Comparison of those who spoke English as a first language (or who learned English concurrently with a second language) versus those who spoke English as a second language (ESL) revealed significantly higher performance in the non-ESL group for Digit Span, Boston Naming Test, and FAS, and a higher score in the ESL group for RO copy. Boston Naming Test scores were significantly related to years educated in the United States; Boston Naming Test and Digit Span scores were significantly correlated with age at which conversational English was first learned and number of years in the United States; and finally, FAS scores were also significantly related to number of years in the United States. These findings are consistent with data from published literature on ethnic differences and the effects of acculturation on cognitive test performance in nonpatients, and also indicate that these observations are not attenuated by the presence of psychiatric or neurologic illness. The results further caution that normative data derived on Caucasian samples may not be appropriate for use with other ethnic groups.
Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally.

PubMed

Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R

2018-05-03

We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
42 CFR 493.845 - Standard; Toxicology.

Code of Federal Regulations, 2012 CFR

2012-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.851 - Standard; Hematology.

Code of Federal Regulations, 2014 CFR

2014-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.843 - Standard; Endocrinology.

Code of Federal Regulations, 2013 CFR

2013-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.845 - Standard; Toxicology.

Code of Federal Regulations, 2014 CFR

2014-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.845 - Standard; Toxicology.

Code of Federal Regulations, 2013 CFR

2013-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.851 - Standard; Hematology.

Code of Federal Regulations, 2013 CFR

2013-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.843 - Standard; Endocrinology.

Code of Federal Regulations, 2012 CFR

2012-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...

42 CFR 493.843 - Standard; Endocrinology.

Code of Federal Regulations, 2014 CFR

2014-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
42 CFR 493.851 - Standard; Hematology.

Code of Federal Regulations, 2012 CFR

2012-10-01

... acceptable responses for each analyte in each testing event is unsatisfactory analyte performance for the... testing event. (e)(1) For any unsatisfactory analyte or test performance or testing event for reasons... any unacceptable analyte or testing event score, remedial action must be taken and documented, and the...
Full-scale transmission testing to evaluate advanced lubricants

NASA Technical Reports Server (NTRS)

Lewicki, David G.; Decker, Harry J.; Shimski, John T.

1992-01-01

Experimental tests were performed on the OH-58A helicopter main rotor transmission in the NASA Lewis 500 hp helicopter transmission test stand. The testing was part of a lubrication program. The objectives are to develop and show a separate lubricant for gearboxes with improved performance in life and load carrying capacity. The goal was to develop a testing procedure to fail certain transmission components using a MIL-L-23699 based reference oil and then to run identical tests with improved lubricants and show improved performance. The tests were directed at parts that failed due to marginal lubrication from Navy field experience. These failures included mast shaft bearing micropitting, sun gear and planet bearing fatigue, and spiral bevel gear scoring. A variety of tests were performed and over 900 hrs of total run time accumulated for these tests. Some success was achieved in developing a testing procedure to produce sun gear and planet bearing fatigue failures. Only marginal success was achieved in producing mast shaft bearing micropitting and spiral bevel gear scoring.
The effectiveness of and satisfaction with high-fidelity simulation to teach cardiac surgical resuscitation skills to nurses.

PubMed

McRae, Marion E; Chan, Alice; Hulett, Renee; Lee, Ai Jin; Coleman, Bernice

2017-06-01

There are few reports of the effectiveness or satisfaction with simulation to learn cardiac surgical resuscitation skills. To test the effect of simulation on the self-confidence of nurses to perform cardiac surgical resuscitation simulation and nurses' satisfaction with the simulation experience. A convenience sample of sixty nurses rated their self-confidence to perform cardiac surgical resuscitation skills before and after two simulations. Simulation performance was assessed. Subjects completed the Satisfaction with Simulation Experience scale and demographics. Self-confidence scores to perform all cardiac surgical skills as measured by paired t-tests were significantly increased after the simulation (d=-0.50 to 1.78). Self-confidence and cardiac surgical work experience were not correlated with time to performance. Total satisfaction scores were high (mean 80.2, SD 1.06) indicating satisfaction with the simulation. There was no correlation of the satisfaction scores with cardiac surgical work experience (τ=-0.05, ns). Self-confidence scores to perform cardiac surgical resuscitation procedures were higher after the simulation. Nurses were highly satisfied with the simulation experience. Copyright © 2016 Elsevier Ltd. All rights reserved.
Kernel Equating Under the Non-Equivalent Groups With Covariates Design

PubMed Central

Bränberg, Kenny

2015-01-01

When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests. PMID:29881012
Kernel Equating Under the Non-Equivalent Groups With Covariates Design.

PubMed

Wiberg, Marie; Bränberg, Kenny

2015-07-01

When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests.
Do racial and ethnic group differences in performance on the MCAT exam reflect test bias?

PubMed

Davis, Dwight; Dorsey, J Kevin; Franks, Ronald D; Sackett, Paul R; Searcy, Cynthia A; Zhao, Xiaohui

2013-05-01

The Medical College Admission Test (MCAT) is a standardized examination that assesses fundamental knowledge of scientific concepts, critical reasoning ability, and written communication skills. Medical school admission officers use MCAT scores, along with other measures of academic preparation and personal attributes, to select the applicants they consider the most likely to succeed in medical school. In 2008-2011, the committee charged with conducting a comprehensive review of the MCAT exam examined four issues: (1) whether racial and ethnic groups differ in mean MCAT scores, (2) whether any score differences are due to test bias, (3) how group differences may be explained, and (4) whether the MCAT exam is a barrier to medical school admission for black or Latino applicants. This analysis showed that black and Latino examinees' mean MCAT scores are lower than white examinees', mirroring differences on other standardized admission tests and in the average undergraduate grades of medical school applicants. However, there was no evidence that the MCAT exam is biased against black and Latino applicants as determined by their subsequent performance on selected medical school performance indicators. Among other factors which could contribute to mean differences in MCAT performance, whites, blacks, and Latinos interested in medicine differ with respect to parents' education and income. Admission data indicate that admission committees accept majority and minority applicants at similar rates, which suggests that medical students are selected on the basis of a combination of attributes and competencies rather than on MCAT scores alone.
Sub-classification of Advanced-Stage Hepatocellular Carcinoma: A Cohort Study Including 612 Patients Treated with Sorafenib.

PubMed

Yoo, Jeong-Ju; Chung, Goh Eun; Lee, Jeong-Hoon; Nam, Joon Yeul; Chang, Young; Lee, Jeong Min; Lee, Dong Ho; Kim, Hwi Young; Cho, Eun Ju; Yu, Su Jong; Kim, Yoon Jun; Yoon, Jung-Hwan

2018-04-01

Advanced hepatocellular carcinoma (HCC) is associated with various clinical conditions including major vessel invasion, metastasis, and poor performance status. The aim of this study was to establish a prognostic scoring system and to propose a sub-classification of the Barcelona-Clinic Liver Cancer (BCLC) stage C. This retrospective study included consecutive patientswho received sorafenib for BCLC stage C HCC at a single tertiary hospital in Korea. A Cox proportional hazard model was used to develop a scoring system, and internal validationwas performed by a 5-fold cross-validation. The performance of the model in predicting risk was assessed by the area under the curve and the Hosmer-Lemeshow test. A total of 612 BCLC stage C HCC patients were sub- classified into strata depending on their performance status. Five independent prognostic factors (Child-Pugh score, α-fetoprotein, tumor type, extrahepatic metastasis, and portal vein invasion) were identified and used in the prognostic scoring system. This scoring system showed good discrimination (area under the receiver operating characteristic curve, 0.734 to 0.818) and calibration functions (both p < 0.05 by the Hosmer-Lemeshow test at 1 month and 12 months, respectively). The differences in survival among the different risk groups classified by the total score were significant (p < 0.001 by the log-rank test in both the Eastern Cooperative Oncology Group 0 and 1 strata). The heterogeneity of patientswith BCLC stage C HCC requires sub-classification of advanced HCC. A prognostic scoring system with five independent factors is useful in predicting the survival of patients with BCLC stage C HCC.
Use of Automated Scoring in Spoken Language Assessments for Test Takers with Speech Impairments. Research Report. ETS RR-17-42

ERIC Educational Resources Information Center

Loukina, Anastassia; Buzick, Heather

2017-01-01

This study is an evaluation of the performance of automated speech scoring for speakers with documented or suspected speech impairments. Given that the use of automated scoring of open-ended spoken responses is relatively nascent and there is little research to date that includes test takers with disabilities, this small exploratory study focuses…
Detecting Intervention Effects in a Cluster-Randomized Design Using Multilevel Structural Equation Modeling for Binary Responses.

PubMed

Cho, Sun-Joo; Preacher, Kristopher J; Bottge, Brian A

2015-11-01

Multilevel modeling (MLM) is frequently used to detect group differences, such as an intervention effect in a pre-test-post-test cluster-randomized design. Group differences on the post-test scores are detected by controlling for pre-test scores as a proxy variable for unobserved factors that predict future attributes. The pre-test and post-test scores that are most often used in MLM are summed item responses (or total scores). In prior research, there have been concerns regarding measurement error in the use of total scores in using MLM. To correct for measurement error in the covariate and outcome, a theoretical justification for the use of multilevel structural equation modeling (MSEM) has been established. However, MSEM for binary responses has not been widely applied to detect intervention effects (group differences) in intervention studies. In this article, the use of MSEM for intervention studies is demonstrated and the performance of MSEM is evaluated via a simulation study. Furthermore, the consequences of using MLM instead of MSEM are shown in detecting group differences. Results of the simulation study showed that MSEM performed adequately as the number of clusters, cluster size, and intraclass correlation increased and outperformed MLM for the detection of group differences.
Relationship between college success and employer competency ratings for graduates of a baccalaureate nursing program.

PubMed

Bolin, S E; Hogle, E L

1984-01-01

This expost facto correlational study sought to determine which measures of academic success in one class of BSN graduates predicted their competence as employees one year after graduation, as judged by their employers. The relationship between pre-entrance test scores, clinical experience grades, GPA, State Board Test Pool examination scores, and employer competency ratings were also determined. In keeping with the literature in fields other than nursing, the findings suggest that there may be little relationship between academic performance in a nursing program and subsequent job performance as a nurse, even though verbal ability may be predictive of success in school. While significant positive correlations were found between pre-entrance test data and final grade point averages, as well as pre-entrance test scores and State Board Test Pool examination scores, there was little evidence that pre-entrance test scores were predictive of nursing abilities. Isolated correlations were found between the clinical components of some nursing courses and specific nursing abilities. Using multiple regression analysis, no clinical course grade was found to be a significant predictor of the mean employer competency rating. Significant predictors were found for only four of the individual nursing abilities, with the clinical component of Leadership in Nursing being the most frequent and best predictor.
Comparison of 3 Symptom Classification Methods to Standardize the History Component of the HEART Score.

PubMed

Marchick, Michael R; Setteducato, Michael L; Revenis, Jesse J; Robinson, Matthew A; Weeks, Emily C; Payton, Thomas F; Winchester, David E; Allen, Brandon R

2017-09-01

The History, Electrocardiography, Age, Risk factors, Troponin (HEART) score enables rapid risk stratification of emergency department patients presenting with chest pain. However, the subjectivity in scoring introduced by the history component has been criticized by some clinicians. We examined the association of 3 objective scoring models with the results of noninvasive cardiac testing. Medical records for all patients evaluated in the chest pain center of an academic medical center during a 1-year period were reviewed retrospectively. Each patient's history component score was calculated using 3 models developed by the authors. Differences in the distribution of HEART scores for each model, as well as their degree of agreement with one another, as well as the results of cardiac testing were analyzed. Seven hundred forty nine patients were studied, 58 of which had an abnormal stress test or computed tomography coronary angiography. The mean HEART scores for models 1, 2, and 3 were 2.97 (SD 1.17), 2.57 (SD 1.25), and 3.30 (SD 1.35), respectively, and were significantly different (P < 0.001). However, for each model, the likelihood of an abnormal cardiovascular test did not correlate with higher scores on the symptom component of the HEART score (P = 0.09, 0.41, and 0.86, respectively). While the objective scoring models produced different distributions of HEART scores, no model performed well with regards to identifying patients with abnormal advanced cardiac studies in this relatively low-risk cohort. Further studies in a broader cohort of patients, as well as comparison with the performance of subjective history scoring, is warranted before adoption of any of these objective models.
The Relationship between the Use of Study Strategies and Test Performance.

ERIC Educational Resources Information Center

Nist, Sherrie L.; And Others

1985-01-01

Investigates the relationship between the use of appropriate study strategies and test scores on three different content area exams. Finds a high correlation between the use of positive strategies and test performance. (RS)
The Score-Boosting Game.

ERIC Educational Resources Information Center

Popham, W. James

2000-01-01

Teachers everywhere are playing the score-boosting game to raise scores on mandated standardized achievement tests, although five nationally recognized assessments compare student performance instead of measuring classroom learning. Since curriculum standards are often vague and misaligned with assessments, teachers sprinkle instruction with…
Teaching Children to Relax.

ERIC Educational Resources Information Center

Proeger, Charlene; Myrick, Robert D.

1980-01-01

Many elementary school students perform below their ability levels due to excessive anxiety and stress. Research reveals negative correlations between general anxiety and test anxiety, and scores on intelligence tests. Studies have shown that changes in anxiety level are related to changes in intelligence quotient scores. Further, anxiety affects…
Brain Gym To Increase Academic Performance Of Children Aged 10-12 Years Old ( Experimental Study in Tembalang Elementary School and Pedalangan Elementary School Semarang)

NASA Astrophysics Data System (ADS)

Marpaung, M. G.; Sareharto, T. P.; Purwanti, A.; Hermawati, D.

2017-02-01

Academic performance becomes an important determinant of individual quality. it is determined by the function of affective, cognitive, psychomotor, and intelligence. Brain gym can improve learning processes and integrate all areas that related to the learning process. To prove the effect of brain gym towards academic performance of children aged 10-12 years. This study was a quasy experiment study with one group pre and post test design. Samples (n=18 male=7 and female=11) were taken from five and six grader and conducted in Tembalang and Pedalangan Elementary School, Semarang. Pretest were administered, followed by brain gym, and post test administered in the end of study. The measurement of Intelligence Quotient pre and post test using Culture Fair Intelligence Test Scale 2. Among the 18 subjects (male=7 and female=11) the average of academic performance and IQ score after brain gym showed improvement. The Improvement of IQ score with Culture Fair Test Scale 2 was analyzed by Dependent T test showed significant results (p=0,000). The improvement of Bahasa score was analyzed by Wilcoxon test showed significant results (p=0,001), an unsignificant result were shown in Mathematics p=0,079 and natural sciences p=0,306. Brain gym can increase academic performance of children aged 10-12 years old.
Participation in a coteaching classroom and students' end-of-course test scores

NASA Astrophysics Data System (ADS)

Debro, Ava

General education students consistently perform poorly on standardized science tests. Coteaching is an instructional strategy that improves the achievement of students with disabilities, but very little research exists that examines the effect of coteaching classrooms on the performance of general education students. The purpose of this study was to examine the effect of coteaching classrooms on the performance of general education students. The constructivist theoretical framework provided the foundation for this research. The research question examined the effect that coteaching classrooms had on the performance of general education biology students. In this experimental design utilizing a posttest-only control group, coteaching instructional strategy was the treatment, and student performance was measured using the scores obtained from the biology end-of-course test. Data for this study was analyzed using an independent t-test. The results of this study revealed that there was not a statistically significant difference in student performance on the biology end-of-course test between treatment and control groups. More than half of the general education biology students enrolled in coteaching classrooms failed the end-of-course test. Researchers may use this study as a catalyst to examine other instructional practices that may improve student performance in science courses. The results of this study may be used to persuade coteachers of the importance of attending frequent professional development opportunities that examine a variety of coteaching instructional strategies. Improving the performance of general education students in science may improve standardized test scores, afford more students the opportunity to attend college, and ensure that students are able to compete on a global level.
Effect of education and gender adjustment on the sensitivity and specificity of a cognitive screening battery for dementia: results from the MoVIES Project. Monongahela Valley Independent Elders Survey.

PubMed

Belle, S H; Seaberg, E C; Ganguli, M; Ratcliff, G; DeKosky, S; Kuller, L H

1996-01-01

The Monongahela Valley Independent Elders Survey (MoVIES) used a multiphase process to identify demented persons among 1,366 randomly selected noninstitutionalized individuals 65 years and older. Raw test scores from a cognitive screening battery were used to identify cognitively impaired individuals who were referred for a clinical evaluation. Subsequently, test scores were adjusted for education and gender within age strata. Adjusting test scores affected sensitivity for dementia only among the most educated, increasing sensitivity among younger subjects and decreasing among the older subjects. Specificity increased among the least educated and the oldest subjects. Overall, the adjusted criteria did not perform as well as the unadjusted criteria in this sample. Adjustment for education will not necessarily improve the ability of a screening battery for cognitive function to identify demented persons, particularly if unadjusted scores perform well.
Cross-cultural adaptation and validation of the sino-nasal outcome test (SNOT-22) for Spanish-speaking patients.

PubMed

de los Santos, Gonzalo; Reyes, Pablo; del Castillo, Raúl; Fragola, Claudio; Royuela, Ana

2015-11-01

Our objective was to perform translation, cross-cultural adaptation and validation of the sino-nasal outcome test 22 (SNOT-22) to Spanish language. SNOT-22 was translated, back translated, and a pretest trial was performed. The study included 119 individuals divided into 60 cases, who met diagnostic criteria for chronic rhinosinusitis according to the European Position Paper on Rhinosinusitis 2012; and 59 controls, who reported no sino-nasal disease. Internal consistency was evaluated with Cronbach's alpha test, reproducibility with Kappa coefficient, reliability with intraclass correlation coefficient (ICC), validity with Mann-Whitney U test and responsiveness with Wilcoxon test. In cases, Cronbach's alpha was 0.91 both before and after treatment, as for controls, it was 0.90 at their first test assessment and 0.88 at 3 weeks. Kappa coefficient was calculated for each item, with an average score of 0.69. ICC was also performed for each item, with a score of 0.87 in the overall score and an average among all items of 0.71. Median score for cases was 47, and 2 for controls, finding the difference to be highly significant (Mann-Whitney U test, p < 0.001). Clinical changes were observed among treated patients, with a median score of 47 and 13.5 before and after treatment, respectively (Wilcoxon test, p < 0.001). The effect size resulted in 0.14 in treated patients whose status at 3 weeks was unvarying; 1.03 in those who were better and 1.89 for much better group. All controls were unvarying with an effect size of 0.05. The Spanish version of the SNOT-22 has the internal consistency, reliability, reproducibility, validity and responsiveness necessary to be a valid instrument to be used in clinical practice.
Development and reliability of the rating of compensatory movements in upper limb prosthesis wearers during work-related tasks.

PubMed

van der Laan, Tallie M J; Postema, Sietke G; Reneman, Michiel F; Bongers, Raoul M; van der Sluis, Corry K

2018-02-10

Reliability study. Quantifying compensatory movements during work-related tasks may help to prevent musculoskeletal complaints in individuals with upper limb absence. (1) To develop a qualitative scoring system for rating compensatory shoulder and trunk movements in upper limb prosthesis wearers during the performance of functional capacity evaluation tests adjusted for use by 1-handed individuals (functional capacity evaluation-one handed [FCE-OH]); (2) to examine the interrater and intrarater reliability of the scoring system; and (3) to assess its feasibility. Movement patterns of 12 videotaped upper limb prosthesis wearers and 20 controls were analyzed. Compensatory movements were defined for each FCE-OH test, and a scoring system was developed, pilot tested, and adjusted. During reliability testing, 18 raters (12 FCE experts and 6 physiotherapists/gait analysts) scored videotapes of upper limb prosthesis wearers performing 4 FCE-OH tests 2 times (2 weeks apart). Agreement was expressed in % and kappa value. Feasibility (focus area's "acceptability", "demand," and "implementation") was determined by using a questionnaire. After 2 rounds of pilot testing and adjusting, reliability of a third version was tested. The interrater reliability for the first and second rating sessions were к = 0.54 (confidence interval [CI]: 0.52-0.57) and к = 0.64 (CI: 0.61-0.66), respectively. The intrarater reliability was к = 0.77 (CI: 0.72-0.82). The feasibility was good but could be improved by a training program. It seems possible to identify compensatory movements in upper limb prosthesis wearers during the performance of FCE-OH tests reliably by observation using the developed observational scoring system. Interrater reliability was satisfactory in most instances; intrarater reliability was good. Feasibility was established. Copyright © 2018 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.

Assessing Practical Intelligence in Business School Admissions: A Supplement to the Graduate Management Admissions Test

ERIC Educational Resources Information Center

Hedlund, Jennifer; Wilt, Jeanne M.; Nebel, Kristina L.; Ashford, Susan J.; Sternberg, Robert J.

2006-01-01

The Graduate Management Admission Test (GMAT) is the most widely used measure of managerial potential in MBA admissions. GMAT scores, although predictive of grades in business school, leave much of the variance in graduate school performance unexplained. The GMAT also produces disparities in test scores between groups, generating the potential for…
From Test Scores to Language Use: Emergent Bilinguals Using English to Accomplish Academic Tasks

ERIC Educational Resources Information Center

Rodriguez-Mojica, Claudia

2018-01-01

Prominent discourses about emergent bilinguals' academic abilities tend to focus on performance as measured by test scores and perpetuate the message that emergent bilinguals trail far behind their peers. When we remove the constraints of formal testing situations, what can emergent bilinguals do in English as they engage in naturally occurring…
Predicting the language proficiency of Chinese student pilots within American airspace: Single-task versus dual-task English-language assessment

NASA Astrophysics Data System (ADS)

Noble, Clifford Elliott, II

2002-09-01

The problem. The purpose of this study was to investigate the ability of three single-task instruments---(a) the Test of English as a Foreign Language, (b) the Aviation Test of Spoken English, and (c) the Single Manual-Tracking Test---and three dual-task instruments---(a) the Concurrent Manual-Tracking and Communication Test, (b) the Certified Flight Instructor's Test, and (c) the Simulation-Based English Test---to predict the language performance of 10 Chinese student pilots speaking English as a second language when operating single-engine and multiengine aircraft within American airspace. Method. This research implemented a correlational design to investigate the ability of the six described instruments to predict the mean score of the criterion evaluation, which was the Examiner's Test. This test assessed the oral communication skill of student pilots on the flight portion of the terminal checkride in the Piper Cadet, Piper Seminole, and Beechcraft King Air airplanes. Results. Data from the Single Manual-Tracking Test, as well as the Concurrent Manual-Tracking and Communication Test, were discarded due to performance ceiling effects. Hypothesis 1, which stated that the average correlation between the mean scores of the dual-task evaluations and that of the Examiner's Test would predict the mean score of the criterion evaluation with a greater degree of accuracy than that of single-task evaluations, was not supported. Hypothesis 2, which stated that the correlation between the mean scores of the participants on the Simulation-Based English Test and the Examiner's Test would predict the mean score of the criterion evaluation with a greater degree of accuracy than that of all single- and dual-task evaluations, was also not supported. The findings suggest that single- and dual-task assessments administered after initial flight training are equivalent predictors of language performance when piloting single-engine and multiengine aircraft.
Genome-Wide Polygenic Scores Predict Reading Performance throughout the School Years

ERIC Educational Resources Information Center

Selzam, Saskia; Dale, Philip S.; Wagner, Richard K.; DeFries, John C.; Cederlöf, Martin; O'Reilly, Paul F.; Krapohl, Eva; Plomin, Robert

2017-01-01

It is now possible to create individual-specific genetic scores, called genome-wide polygenic scores (GPS). We used a GPS for years of education ("EduYears") to predict reading performance assessed at UK National Curriculum Key Stages 1 (age 7), 2 (age 12) and 3 (age 14) and on reading tests administered at ages 7 and 12 in a UK sample…
A multinational randomised study comparing didactic lectures with case scenario in a severe sepsis medical simulation course.

PubMed

Li, Chih-Huang; Kuan, Win-Sen; Mahadevan, Malcolm; Daniel-Underwood, Lynda; Chiu, Te-Fa; Nguyen, H Bryant

2012-07-01

Medical simulation has been used to teach critical illness in a variety of settings. This study examined the effect of didactic lectures compared with simulated case scenario in a medical simulation course on the early management of severe sepsis. A prospective multicentre randomised study was performed enrolling resident physicians in emergency medicine from four hospitals in Asia. Participants were randomly assigned to a course that included didactic lectures followed by a skills workshop and simulated case scenario (lecture-first) or to a course that included a skills workshop and simulated case scenario followed by didactic lectures (simulation-first). A pre-test was given to the participants at the beginning of the course, post-test 1 was given after the didactic lectures or simulated case scenario depending on the study group assignment, then a final post-test 2 was given at the end of the course. Performance on the simulated case scenario was evaluated with a performance task checklist. 98 participants were enrolled in the study. Post-test 2 scores were significantly higher than pre-test scores in all participants (80.8 ± 12.0% vs 65.4 ± 12.2%, p<0.01). There was no difference in pre-test scores between the two study groups. The lecture-first group had significantly higher post-test 1 scores than the simulation-first group (78.8 ± 10.6% vs 71.6 ± 12.6%, p<0.01). There was no difference in post-test 2 scores between the two groups. The simulated case scenario task performance completion was 90.8% (95% CI 86.6% to 95.0%) in the lecture-first group compared with 83.8% (95% CI 79.5% to 88.1%) in the simulation-first group (p=0.02). A medical simulation course can improve resident physician knowledge in the early management of severe sepsis. Such a course should include a comprehensive curriculum that includes didactic lectures followed by simulation experience.
Poorer clock draw test scores are associated with greater functional impairment in peripheral artery disease: the Walking and Leg Circulation Study II.

PubMed

Zimmermann, Laura J; Ferrucci, Luigi; Kiang Liu; Lu Tian; Guralnik, Jack M; Criqui, Michael H; Yihua Liao; McDermott, Mary M

2011-06-01

We hypothesized that, in the absence of clinically recognized dementia, cognitive dysfunction measured by the clock draw test (CDT) is associated with greater functional impairment in men and women with peripheral artery disease (PAD). Participants were men and women aged 60 years and older with Mini-Mental Status Examination scores ≥ 24 with PAD (n = 335) and without PAD (n = 234). We evaluated the 6-minute walk test, 4-meter walking velocity at usual and fastest pace, the Short Physical Performance Battery (SPPB), and accelerometer-measured physical activity. CDTs were scored using the Shulman system as follows: Category 1 (worst): CDT score 0-2; Category 2: CDT score 3; Category 3 (best): CDT score 4-5. Results were adjusted for age, sex, race, education, ankle-brachial index (ABI), and comorbidities. In individuals with PAD, lower CDT scores were associated with slower 4-meter usual-paced walking velocity (Category 1: 0.78 meters/second; Category 2: 0.83 meters/second; Category 3: 0.86 meters/second; p-trend = 0.025) and lower physical activity (Category 1: 420 activity units; Category 2: 677 activity units; Category 3: 701 activity units; p-trend = 0.045). Poorer CDT scores were also associated with worse functional performance in individuals without PAD (usual and fast-paced walking velocity and SPPB, p-trend = 0.022, 0.043, and 0.031, respectively). In conclusion, cognitive impairment identified with CDT is independently associated with greater functional impairment in older, dementia-free individuals with and without PAD. Longitudinal studies are necessary to explore whether baseline CDT scores and changes in CDT scores over time can predict long-term decline in functional performance in individuals with and without PAD.
Genome-wide scan of IQ finds significant linkage to a quantitative trait locus on 2q.

PubMed

Luciano, M; Wright, M J; Duffy, D L; Wainwright, M A; Zhu, G; Evans, D M; Geffen, G M; Montgomery, G W; Martin, N G

2006-01-01

A genome-wide linkage scan of 795 microsatellite markers (761 autosomal, 34 X chromosome) was performed on Multidimensional Aptitude Battery subtests and verbal, performance and full scale scores, the WAIS-R Digit Symbol subtest, and two word-recognition tests (Schonell Graded Word Reading Test, Cambridge Contextual Reading Test) highly predictive of IQ. The sample included 361 families comprising 2-5 siblings who ranged in age from 15.7 to 22.2 years; genotype, but not phenotype, data were available for 81% of parents. A variance components analysis which controlled for age and sex effects showed significant linkage for the Cambridge reading test and performance IQ to the same region on chromosome 2, with respective LOD scores of 4.15 and 3.68. Suggestive linkage (LOD score>2.2) for various measures was further supported on chromosomes 6, 7, 11, 14, 21 and 22. Where location of linkage peaks converged for IQ subtests within the same scale, the overall scale score provided increased evidence for linkage to that region over any individual subtest. Association studies of candidate genes, particularly those involved in neural transmission and development, will be directed to genes located under the linkage peaks identified in this study.
Computerized Maze Navigation and On-Road Performance by Drivers With Dementia

PubMed Central

Ott, Brian R.; Festa, Elena K.; Amick, Melissa M.; Grace, Janet; Davis, Jennifer D.; Heindel, William C.

2012-01-01

This study examined the ability of computerized maze test performance to predict the road test performance of cognitively impaired and normal older drivers. The authors examined 133 older drivers, including 65 with probable Alzheimer disease, 23 with possible Alzheimer disease, and 45 control subjects without cognitive impairment. Subjects completed 5 computerized maze tasks employing a touch screen and pointer as well as a battery of standard neuropsychological tests. Parameters measured for mazes included errors, planning time, drawing time, and total time. Within 2 weeks, subjects were examined by a professional driving instructor on a standardized road test modeled after the Washington University Road Test. Road test total score was significantly correlated with total time across the 5 mazes. This maze score was significant for both Alzheimer disease subjects and control subjects. One maze in particular, requiring less than 2 minutes to complete, was highly correlated with driving performance. For the standard neuropsychological tests, highest correlations were seen with Trail Making A (TrailsA) and the Hopkins Verbal Learning Tests Trial 1 (HVLT1). Multiple regression models for road test score using stepwise subtraction of maze and neuropsychological test variables revealed significant independent contributions for total maze time, HVLT1, and TrailsA for the entire group; total maze time and HVLT1 for Alzheimer disease subjects; and TrailsA for normal subjects. As a visual analog of driving, a brief computerized test of maze navigation time compares well to standard neuropsychological tests of psychomotor speed, scanning, attention, and working memory as a predictor of driving performance by persons with early Alzheimer disease and normal elders. Measurement of maze task performance appears to be useful in the assessment of older drivers at risk for hazardous driving. PMID:18287166
Predictive validity of the UKCAT for medical school undergraduate performance: a national prospective cohort study.

PubMed

Tiffin, Paul A; Mwandigha, Lazaro M; Paton, Lewis W; Hesselgreaves, H; McLachlan, John C; Finn, Gabrielle M; Kasim, Adetayo S

2016-09-26

The UK Clinical Aptitude Test (UKCAT) has been shown to have a modest but statistically significant ability to predict aspects of academic performance throughout medical school. Previously, this ability has been shown to be incremental to conventional measures of educational performance for the first year of medical school. This study evaluates whether this predictive ability extends throughout the whole of undergraduate medical study and explores the potential impact of using the test as a selection screening tool. This was an observational prospective study, linking UKCAT scores, prior educational attainment and sociodemographic variables with subsequent academic outcomes during the 5 years of UK medical undergraduate training. The participants were 6812 entrants to UK medical schools in 2007-8 using the UKCAT. The main outcome was academic performance at each year of medical school. A receiver operating characteristic (ROC) curve analysis was also conducted, treating the UKCAT as a screening test for a negative academic outcome (failing at least 1 year at first attempt). All four of the UKCAT scale scores significantly predicted performance in theory- and skills-based exams. After adjustment for prior educational achievement, the UKCAT scale scores remained significantly predictive for most years. Findings from the ROC analysis suggested that, if used as a sole screening test, with the mean applicant UKCAT score as the cut-off, the test could be used to reject candidates at high risk of failing at least 1 year at first attempt. However, the 'number needed to reject' value would be high (at 1.18), with roughly one candidate who would have been likely to pass all years at first sitting being rejected for every higher risk candidate potentially declined entry on this basis. The UKCAT scores demonstrate a statistically significant but modest degree of incremental predictive validity throughout undergraduate training. Whilst the UKCAT could be considered a fairly crude screening tool for future academic performance, it may offer added value when used in conjunction with other selection measures. Future work should focus on the optimum role of such tests within the selection process and the prediction of post-graduate performance.
Visual-Constructional Ability in Individuals with Severe Obesity: Rey Complex Figure Test Accuracy and the Q-Score.

PubMed

Sargénius, Hanna L; Bylsma, Frederick W; Lydersen, Stian; Hestad, Knut

2017-01-01

The aims of this study were to investigate visual-construction and organizational strategy among individuals with severe obesity, as measured by the Rey Complex Figure Test (RCFT), and to examine the validity of the Q-score as a measure for the quality of performance on the RCFT. Ninety-six non-demented morbidly obese (MO) patients and 100 healthy controls (HC) completed the RCFT. Their performance was calculated by applying the standard scoring criteria. The quality of the copying process was evaluated per the directions of the Q-score scoring system. Results revealed that the MO did not perform significantly lower than the HC on Copy accuracy (mean difference -0.302, CI -1.374 to 0.769, p = 0.579). In contrast, the groups did statistically differ from each other, with MO performing poorer than the HC on the Q-score (mean -1.784, CI -3.237 to -0.331, p = 0.016) and the Unit points (mean -1.409, CI -2.291 to -0.528, p = 0.002), but not on the Order points score (mean -0.351, CI -0.994 to 0.293, p = 0.284). Differences on the Unit score and the Q-score were slightly reduced when adjusting for gender, age, and education. This study presents evidence supporting the presence of inefficiency in visuospatial constructional ability among MO patients. We believe we have found an indication that the Q-score captures a wider range of cognitive processes that are not described by traditional scoring methods. Rather than considering accuracy and placement of the different elements only, the Q-score focuses more on how the subject has approached the task.
Mathematical literacy in undergraduates: role of gender, emotional intelligence and emotional self-efficacy

NASA Astrophysics Data System (ADS)

Tariq, Vicki N.; Qualter, Pamela; Roberts, Sian; Appleby, Yvon; Barnes, Lynne

2013-12-01

This empirical study explores the roles that Emotional Intelligence (EI) and Emotional Self-Efficacy (ESE) play in undergraduates' mathematical literacy, and the influence of EI and ESE on students' attitudes towards and beliefs about mathematics. A convenience sample of 93 female and 82 male first-year undergraduates completed a test of mathematical literacy, followed by an online survey designed to measure the students' EI, ESE and factors associated with mathematical literacy. Analysis of the data revealed significant gender differences. Males attained a higher mean test score than females and out-performed the females on most of the individual questions and the associated mathematical tasks. Overall, males expressed greater confidence in their mathematical skills, although both males' and females' confidence outweighed their actual mathematical proficiency. Correlation analyses revealed that males and females attaining higher mathematical literacy test scores were more confident and persistent, exhibited lower levels of mathematics anxiety and possessed higher mathematics qualifications. Correlation analyses also revealed that in male students, aspects of ESE were associated with beliefs concerning the learning of mathematics (i.e. that intelligence is malleable and that persistence can facilitate success), but not with confidence or actual performance. Both EI and ESE play a greater role with regard to test performance and attitudes/beliefs regarding mathematics amongst female undergraduates; higher EI and ESE scores were associated with higher test scores, while females exhibiting higher levels of ESE were also more confident and less anxious about mathematics, believed intelligence to be malleable, were more persistent and were learning goal oriented. Moderated regression analyses confirmed mathematics anxiety as a negative predictor of test performance in males and females, but also revealed that in females EI and ESE moderate the effects of anxiety on test performance, with the relationship between anxiety and test performance linked more to emotional management (EI) than to ESE.
Recovery in Level 7-10 Women's USA Artistic Gymnastics.

PubMed

Buckner, Stephen B; Bacon, Nicholas T; Bishop, Phillip A

2017-01-01

This study assessed physical performance in women's artistic gymnastics following three variable recovery periods. Participants included fifteen female gymnasts (mean age = 13.5 ± 1.1) who had competed at USA Gymnastics (USAG) levels 7 - 10 within at least one year prior to the study. Each testing session consisted of a warm-up followed by four muscular endurance tests and one explosive maximal test. Assessments included pull-ups, leg lifts, handstand push-ups, vertical jump, and push-ups. After the performance assessments, the participants completed a typical practice session. The performance measures were reassessed at the beginning of each of the recovery periods of 24, 48, and 72 hours in a counterbalanced design. Performance assessments were converted into Z-scores and then averaged for a composite session Z-score. The composite session Z-scores were compared to evaluate the recovery duration. Composite Z's were significantly lower (p=0.000), after the 24 (z=-1.10) and the 48 hour (z=-0.71) recovery periods compared to baseline (z=0.00). However, there was no difference in scores (p=1.00) between the baseline and 72 hours (z=0.004) recovery. Full recovery required 72 hours under the conditions of this study.
The Effect of School Poverty on Racial Gaps in Tests Scores: The Case of the Minnesota Basic Standards Tests

ERIC Educational Resources Information Center

Myers, Samuel L.; Kim, Hyeoneui; Mandala, Cheryl

2004-01-01

A data from 1996,1998 and 1999 Minnesota comprehensive statewide testing on eight graders is used to analyze whether African American students perform worse than the white students who attend the poverty schools. The analyses conclude that African American-White test score gap is attributed more to the racial discriminations and racial treatments…
Testing for Bias against Female Test Takers of the Graduate Management Admissions Test and Potential Impact on Admissions to Graduate Programs in Business.

ERIC Educational Resources Information Center

Wright, Robert E.; Bachrach, Daniel G.

2003-01-01

Graduate Management Admission Test (GMAT) scores and grade point average in graduate core courses were compared for 190 male and 144 female business administration students. No significant differences in course performance were found, but males had been admitted with significantly higher GMAT scores, suggesting a bias against women. (Contains 27…
Pretest Scores Uniquely Predict 1-Year-Delayed Performance in a Simulation-Based Mastery Course for Central Line Insertion.

PubMed

Diederich, Emily; Thomas, Laura; Mahnken, Jonathan; Lineberry, Matthew

2018-06-01

Within simulation-based mastery learning (SBML) courses, there is inconsistent inclusion of learner pretesting, which requires considerable resources and is contrary to popular instructional frameworks. However, it may have several benefits, including its direct benefit as a form of deliberate practice and its facilitation of more learner-specific subsequent deliberate practice. We consider an unexplored potential benefit of pretesting: its ability to predict variable long-term learner performance. Twenty-seven residents completed an SBML course in central line insertion. Residents were tested on simulated central line insertion precourse, immediately postcourse, and after between 64 and 82 weeks. We analyzed pretest scores' prediction of delayed test scores, above and beyond prediction by program year, line insertion experiences in the interim, and immediate posttest scores. Pretest scores related strongly to delayed test scores (r = 0.59, P = 0.01; disattenuated ρ = 0.75). The number of independent central lines inserted also related to year-delayed test scores (r = 0.44, P = 0.02); other predictors did not discernibly relate. In a regression model jointly predicting delayed test scores, pretest was a significant predictor (β = 0.487, P = 0.011); number of independent insertions was not (β = 0.234, P = 0.198). This study suggests that pretests can play a major role in predicting learner variance in learning gains from SBML courses, thus facilitating more targeted refresher training. It also exposes a risk in SBML courses that learners who meet immediate mastery standards may be incorrectly assumed to have equal long-term learning gains.
Childhood overweight and academic performance: national study of kindergartners and first-graders.

PubMed

Datar, Ashlesha; Sturm, Roland; Magnabosco, Jennifer L

2004-01-01

To examine the association between children's overweight status in kindergarten and their academic achievement in kindergarten and first grade. The data analyzed consisted of 11,192 first time kindergartners from the Early Childhood Longitudinal Study, a nationally representative sample of kindergartners in the U.S. in 1998. Multivariate regression techniques were used to estimate the independent association of overweight status with children's math and reading standardized test scores in kindergarten and grade 1. We controlled for socioeconomic status, parent-child interaction, birth weight, physical activity, and television watching. Overweight children had significantly lower math and reading test scores compared with non-overweight children in kindergarten. Both groups were gaining similarly on math and reading test scores, resulting in significantly lower test scores among overweight children at the end of grade 1. However, these differences, except for boys' math scores at baseline (difference = 1.22 points, p = 0.001), became insignificant after including socioeconomic and behavioral variables, indicating that overweight is a marker but not a causal factor. Race/ethnicity and mother's education were stronger predictors of test score gains or levels than overweight status. Significant differences in test scores by overweight status at the beginning of kindergarten and the end of grade 1 can be explained by other individual characteristics, including parental education and the home environment. However, overweight is more easily observable by other students compared with socioeconomic characteristics, and its significant (unadjusted) association with worse academic performance can contribute to the stigma of overweight as early as the first years of elementary school.
Stability of an ERP-based measure of brain network activation (BNA) in athletes: A new electrophysiological assessment tool for concussion.

PubMed

Eckner, James T; Rettmann, Ashley; Narisetty, Naveen; Greer, Jacob; Moore, Brandon; Brimacombe, Susan; He, Xuming; Broglio, Steven P

2016-01-01

To determine test-re-test reliabilities of novel Evoked Response Potential (ERP)-based Brain Network Activation (BNA) scores in healthy athletes. Observational, repeated-measures study. Forty-two healthy male and female high school and collegiate athletes completed auditory oddball and go/no-go ERP assessments at baseline, 1 week, 6 weeks and 1 year. The BNA algorithm was applied to the ERP data, considering electrode location, frequency band, peak latency and normalized amplitude to generate seven unique BNA scores for each testing session. Mean BNA scores, intra-class correlation coefficient (ICC) values and reliable change (RC) values were calculated for each of the seven BNA networks. BNA scores ranged from 46.3 ± 34.9 to 69.9 ± 22.8, ICC values ranged from 0.46-0.65 and 95% RC values ranged from 38.3-68.1 across the seven networks. The wide range of BNA scores observed in this population of healthy athletes suggests that a single BNA score or set of BNA scores from a single after-injury test session may be difficult to interpret in isolation without knowledge of the athlete's own baseline BNA score(s) and/or the results of serial tests performed at additional time points. The stability of each BNA network should be considered when interpreting test-re-test BNA score changes.
Procedures for Constructing and Using Criterion-Referenced Performance Tests.

ERIC Educational Resources Information Center

Campbell, Clifton P.; Allender, Bill R.

1988-01-01

Criterion-referenced performance tests (CRPT) provide a realistic method for objectively measuring task proficiency against predetermined attainment standards. This article explains the procedures of constructing, validating, and scoring CRPTs and includes a checklist for a welding test. (JOW)
Prediction of success in FAA air traffic control field training as a function of selection and screening test performance.

DOT National Transportation Integrated Search

1989-05-01

This study compared correlations between Office of Personnel Management (OPM) selection test scores for Air Traffic Control Specialists (ATCSs) and scores from the FAA Academy's second-stage screening program with measures of field training performan...
Assessing pediatrics residents' mathematical skills for prescribing medication: a need for improved training.

PubMed

Glover, Mark L; Sussmane, Jeffrey B

2002-10-01

To evaluate residents' skills in performing basic mathematical calculations used for prescribing medications to pediatric patients. In 2001, a test of ten questions on basic calculations was given to first-, second-, and third-year residents at Miami Children's Hospital in Florida. Four additional questions were included to obtain the residents' levels of training, specific pediatrics intensive care unit (PICU) experience, and whether or not they routinely double-checked doses and adjusted them for each patient's weight. The test was anonymous and calculators were permitted. The overall score and the score for each resident class were calculated. Twenty-one residents participated. The overall average test score and the mean test score of each resident class was less than 70%. Second-year residents had the highest mean test scores, although there was no significant difference between the classes of residents (p =.745) or relationship between the residents' PICU experiences and their exam scores (p =.766). There was no significant difference between residents' levels of training and whether they double-checked their calculations (p =.633) or considered each patient's weight relative to the dose prescribed (p =.869). Seven residents committed tenfold dosing errors, and one resident committed a 1,000-fold dosing error. Pediatrics residents need to receive additional education in performing the calculations needed to prescribe medications. In addition, residents should be required to demonstrate these necessary mathematical skills before they are allowed to prescribe medications.

To what extent does the Health Professions Admission Test-Ireland predict performance in early undergraduate tests of communication and clinical skills? An observational cohort study.

PubMed

Kelly, Maureen E; Regan, Daniel; Dunne, Fidelma; Henn, Patrick; Newell, John; O'Flynn, Siun

2013-05-10

Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity.A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks.In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the predictive power was modest. We found that none of our selection criteria strongly predict clinical and communication skills. The HPAT- Ireland appears to measures ability in domains different to those assessed by the Leaving Certificate Examination. While some significant associations did emerge in Year 2 between HPAT Ireland and total OSCE scores further evaluation is required to establish if this pattern continues during the senior years of the medical course.
To what extent does the Health Professions Admission Test-Ireland predict performance in early undergraduate tests of communication and clinical skills? – An observational cohort study

PubMed Central

2013-01-01

Background Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity. A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Method Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Results Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks. In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the predictive power was modest. Conclusion We found that none of our selection criteria strongly predict clinical and communication skills. The HPAT- Ireland appears to measures ability in domains different to those assessed by the Leaving Certificate Examination. While some significant associations did emerge in Year 2 between HPAT Ireland and total OSCE scores further evaluation is required to establish if this pattern continues during the senior years of the medical course. PMID:23663266
Is the NIHSS Certification Process Too Lenient?

PubMed Central

Hills, Nancy K.; Josephson, S. Andrew; Lyden, Patrick D.; Johnston, S. Claiborne

2009-01-01

Background and Purpose The National Institutes of Health Stroke Scale (NIHSS) is a widely used measure of neurological function in clinical trials and patient assessment; inter-rater scoring variability could impact communications and trial power. The manner in which the rater certification test is scored yields multiple correct answers that have changed over time. We examined the range of possible total NIHSS scores from answers given in certification tests by over 7,000 individual raters who were certified. Methods We analyzed the results of all raters who completed one of two standard multiple-patient videotaped certification examinations between 1998 and 2004. The range for the correct score, calculated using NIHSS ‘correct answers’, was determined for each patient. The distribution of scores derived from those who passed the certification test then was examined. Results A total of 6,268 raters scored 5 patients on Test 1; 1,240 scored 6 patients on Test 2. Using a National Stroke Association (NSA) answer key, we found that correct total scores ranged from 2 correct scores to as many as 12 different correct total scores. Among raters who achieved a passing score and were therefore qualified to administer the NIHSS, score distributions were even wider, with 1 certification patient receiving 18 different correct total scores. Conclusions Allowing multiple acceptable answers for questions on the NIHSS certification test introduces scoring variability. It seems reasonable to assume that the wider the range of acceptable answers in the certification test, the greater the variability in the performance of the test in trials and clinical practice by certified examiners. Greater consistency may be achieved by deriving a set of ‘best’ answers through expert consensus on all questions where this is possible, then teaching raters how to derive these answers using a required interactive training module. PMID:19295205
Supervision and computerized neurocognitive baseline test performance in high school athletes: an initial investigation.

PubMed

Kuhn, Andrew Warren; Solomon, Gary S

2014-01-01

Computerized neuropsychological testing batteries have provided a time-efficient and cost-efficient way to assess and manage the neurocognitive aspects of patients with sport-related concussion. These tests are straightforward and mostly self-guided, reducing the degree of clinician involvement required by traditional clinical neuropsychological paper-and-pencil tests. To determine if self-reported supervision status affected computerized neurocognitive baseline test performance in high school athletes. Retrospective cohort study. Supervised testing took place in high school computer libraries or sports medicine clinics. Unsupervised testing took place at the participant's home or another location with computer access. From 2007 to 2012, high school athletes across middle Tennessee (n = 3771) completed computerized neurocognitive baseline testing (Immediate Post-Concussion Assessment and Cognitive Testing [ImPACT]). They reported taking the test either supervised by a sports medicine professional or unsupervised. These athletes (n = 2140) were subjected to inclusion and exclusion criteria and then matched based on age, sex, and number of prior concussions. We extracted demographic and performance-based data from each de-identified baseline testing record. Paired t tests were performed between the self-reported supervised and unsupervised groups, comparing the following ImPACT baseline composite scores: verbal memory, visual memory, visual motor (processing) speed, reaction time, impulse control, and total symptom score. For differences that reached P < .05, the Cohen d was calculated to measure the effect size. Lastly, a χ(2) analysis was conducted to compare the rate of invalid baseline testing between the groups. All statistical tests were performed at the 95% confidence interval level. Self-reported supervised athletes demonstrated better visual motor (processing) speed (P = .004; 95% confidence interval [0.28, 1.52]; d = 0.12) and faster reaction time (P < .001; 95% confidence interval [-0.026, -0.014]; d = 0.21) composite scores than self-reported unsupervised athletes. Speed-based tasks were most affected by self-reported supervision status, although the effect sizes were relatively small. These data lend credence to the hypothesis that supervision status may be a factor in the evaluation of ImPACT baseline test scores.
Motion perception and driving: predicting performance through testing and shortening braking reaction times through training.

PubMed

Wilkins, Luke; Gray, Rob; Gaska, James; Winterbottom, Marc

2013-12-30

A driving simulator was used to examine the relationship between motion perception and driving performance. Although motion perception test scores have been shown to be related to driving safety, it is not clear which combination of tests are the best predictors and whether motion perception training can improve driving performance. In experiment 1, 60 younger drivers (22.4 ± 2.5 years) completed three motion perception tests (2-dimensional [2D] motion-defined letter [MDL] identification, 3D motion in depth sensitivity [MID], and dynamic visual acuity [DVA]) followed by two driving tests (emergency braking [EB] and hazard perception [HP]). In experiment 2, 20 drivers (21.6 ± 2.1 years) completed 6 weeks of motion perception training (using the MDL, MID, and DVA tests), while 20 control drivers (22.0 ± 2.7 years) completed an online driving safety course. The EB performance was measured before and after training. In experiment 1, MDL (r = 0.34) and MID (r = 0.46) significantly correlated with EB score. The change in DVA score as a function of target speed (i.e., "velocity susceptibility") was correlated most strongly with HP score (r = -0.61). In experiment 2, the motion perception training group had a significant decrease in brake reaction time on the EB test from pre- to posttreatment, while there was no significant change for the control group: t(38) = 2.24, P = 0.03. Tests of 3D motion perception are the best predictor of EB, while DVA velocity susceptibility is the best predictor of hazard perception. Motion perception training appears to result in faster braking responses.
Chance performance and floor effects: threats to the validity of the Wechsler Memory Scale--fourth edition designs subtest.

PubMed

Martin, Phillip K; Schroeder, Ryan W

2014-06-01

The Designs subtest allows for accumulation of raw score points by chance alone, creating the potential for artificially inflated performances, especially in older patients. A random number generator was used to simulate the random selection and placement of cards by 100 test naive participants, resulting in a mean raw score of 36.26 (SD = 3.86). This resulted in relatively high-scaled scores in the 45-54, 55-64, and 65-69 age groups on Designs II. In the latter age group, in particular, the mean simulated performance resulted in a scaled score of 7, with scores 1 SD below and above the performance mean translating to scaled scores of 5 and 8, respectively. The findings indicate that clinicians should use caution when interpreting Designs II performance in these age groups, as our simulations demonstrated that low average to average range scores occur frequently when patients are relying solely on chance performance. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Visuospatial Aptitude Testing Differentially Predicts Simulated Surgical Skill.

PubMed

Hinchcliff, Emily; Green, Isabel; Destephano, Christopher; Cox, Mary; Smink, Douglas; Kumar, Amanika; Hokenstad, Erik; Bengtson, Joan; Cohen, Sarah

2018-02-05

To determine if visuospatial perception (VSP) testing is correlated to simulated or intraoperative surgical performance as rated by the American College of Graduate Medical Education (ACGME) milestones. Classification II-2 SETTING: Two academic training institutions PARTICIPANTS: 41 residents, including 19 Brigham and Women's Hospital and 22 Mayo Clinic residents from three different specialties (OBGYN, general surgery, urology). Participants underwent three different tests: visuospatial perception testing (VSP), Fundamentals of Laparoscopic Surgery (FLS®) peg transfer, and DaVinci robotic simulation peg transfer. Surgical grading from the ACGME milestones tool was obtained for each participant. Demographic and subject background information was also collected including specialty, year of training, prior experience with simulated skills, and surgical interest. Standard statistical analysis using Student's t test were performed, and correlations were determined using adjusted linear regression models. In univariate analysis, BWH and Mayo training programs differed in both times and overall scores for both FLS® peg transfer and DaVinci robotic simulation peg transfer (p<0.05 for all). Additionally, type of residency training impacted time and overall score on robotic peg transfer. Familiarity with tasks correlated with higher score and faster task completion (p= 0.05 for all except VSP score). There was no difference in VSP scores by program, specialty, or year of training. In adjusted linear regression modeling, VSP testing was correlated only to robotic peg transfer skills (average time p=0.006, overall score p=0.001). Milestones did not correlate to either VSP or surgical simulation testing. VSP score was correlated with robotic simulation skills but not with FLS skills or ACGME milestones. This suggests that the ability of VSP score to predict competence differs between tasks. Therefore, further investigation is required into aptitude testing, especially prior to its integration as an entry examination into a surgical subspecialty. Copyright © 2018. Published by Elsevier Inc.
Evaluation of affective temperament and anxiety-depression levels of patients with polycystic ovary syndrome.

PubMed

Asik, Mehmet; Altinbas, Kursat; Eroglu, Mustafa; Karaahmet, Elif; Erbag, Gokhan; Ertekin, Hulya; Sen, Hacer

2015-10-01

Women with polycystic ovary syndrome (PCOS) are reported to experience depressive episodes at a higher rate than healthy controls (HC). Affective temperament features are psychiatric markers that may help to predict and identify vulnerability to depression in women with PCOS. Our aim was to evaluate the affective temperaments of women with PCOS and to investigate the association with depression and anxiety levels and laboratory variables in comparison with HC. The study included 71 women with PCOS and 50 HC. Hormonal evaluations were performed for women with PCOS. Physical examination, clinical history, Hospital Anxiety and Depression Scale (HADS) and TEMPS-A were performed for all subjects. Differences between groups were evaluated using Student's t-tests and Mann-Whitney U tests. Correlations and logistic regression tests were performed. All temperament subtype scores, except hyperthymic, and HADS anxiety, depression, and total scores were significantly higher in patients with PCOS compared to HC. A statistically significant positive correlation was found between BMI and irritable temperament, and insulin and HADS depression scores in patients with PCOS. Additionally, hirsutism score and menstrual irregularity were correlated with HADS depression, anxiety and total scores in PCOS patients. In logistic regression analysis, depression was not affected by PCOS, hirsutism score or menstrual irregularity. However, HADS anxiety score was associated with hirsutism score. Our study is the first to evaluate the affective temperament features of women with PCOS. Consequently, establishing affective temperament properties for women with PCOS may help clinicians predict those patients with PCOS who are at risk for depressive and anxiety disorders. Copyright © 2015 Elsevier B.V. All rights reserved.
Estimates of genetic parameters and environmental effects for measures of hunting performance in Finnish hounds.

PubMed

Liinamo, A E; Karjalainen, L; Ojala, M; Vilva, V

1997-03-01

Data from field trials of Finnish Hounds between 1988 and 1992 in Finland were used to estimate genetic parameters and environmental effects for measures of hunting performance using REML procedures and an animal model. The original data set included 28,791 field trial records from 5,666 dogs. Males and females had equal hunting performance, whereas experience acquired by age improved trial results compared with results for young dogs (P < .001). Results were mostly better on snow than on bare ground (P < .001), and testing areas, years, months, and their interactions affected results (P < .001). Estimates of heritabilities and repeatabilities were low for most of the 28 measures, mainly due to large residual variances. The highest heritabilities were for frequency of tonguing (h2 = .15), pursuit score (h2 = .13), tongue score (h2 = .13), ghost trailing score (h2 = .12), and merit and final score (both h2 = .11). Estimates of phenotypic and genetic correlations were positive and moderate or high for search scores, pursuit scores, and final scores but lower for other studied measures. The results suggest that, due to low heritabilities, evaluation of breeding values for Finnish Hounds with respect to their hunting ability should be based on animal model BLUP methods instead of mere performance testing. The evaluation system of field trials should also be revised for more reliability.
Maintenance of Wakefulness Test scores and driving performance in sleep disorder patients and controls.

PubMed

Philip, Pierre; Chaufton, Cyril; Taillard, Jacques; Sagaspe, Patricia; Léger, Damien; Raimondi, Monika; Vakulin, Andrew; Capelli, Aurore

2013-08-01

Sleepiness at the wheel is a risk factor for traffic accidents. Past studies have demonstrated the validity of the Maintenance of Wakefulness Test (MWT) scores as a predictor of driving impairment in untreated patients with obstructive sleep apnea syndrome (OSAS), but there is limited information on the validity of the maintenance of wakefulness test by MWT in predicting driving impairment in patients with hypersomnias of central origin (narcolepsy or idiopathic hypersomnia). The aim of this study was to compare the MWT scores with driving performance in sleep disorder patients and controls. 19 patients suffering from hypersomnias of central origin (9 narcoleptics and 10 idiopathic hypersomnia), 17 OSAS patients and 14 healthy controls performed a MWT (4×40-minute trials) and a 40-minute driving session on a real car driving simulator. Participants were divided into 4 groups defined by their MWT sleep latency scores. The groups were pathological (sleep latency 0-19 min), intermediate (20-33 min), alert (34-40 min) and control (>34 min). The main driving performance outcome was the number of inappropriate line crossings (ILCs) during the 40 minute drive test. Patients with pathological MWT sleep latency scores (0-19 min) displayed statistically significantly more ILC than patients from the intermediate, alert and control groups (F (3, 46)=7.47, p<0.001). Pathological sleep latencies on the MWT predicted driving impairment in patients suffering from hypersomnias of central origin as well as in OSAS patients. MWT is an objective measure of daytime sleepiness that appears to be useful in estimating the driving performance in sleepy patients. Copyright © 2013 Elsevier B.V. All rights reserved.
The Michigan Public High School Context and Performance Report Card

ERIC Educational Resources Information Center

Van Beek, Michael; Bowen, Daniel; Mills, Jonathan

2012-01-01

Assessing a high school's effectiveness is not straightforward. Comparing a school's standardized test scores to those of other schools is one approach to measuring effectiveness, but a major objection to this method is that students' test scores tend to be related to students' "socioeconomic" status--family household income, for…
Imperfect practice makes perfect: error management training improves transfer of learning.

PubMed

Dyre, Liv; Tabor, Ann; Ringsted, Charlotte; Tolsgaard, Martin G

2017-02-01

Traditionally, trainees are instructed to practise with as few errors as possible during simulation-based training. However, transfer of learning may improve if trainees are encouraged to commit errors. The aim of this study was to assess the effects of error management instructions compared with error avoidance instructions during simulation-based ultrasound training. Medical students (n = 60) with no prior ultrasound experience were randomised to error management training (EMT) (n = 32) or error avoidance training (EAT) (n = 28). The EMT group was instructed to deliberately make errors during training. The EAT group was instructed to follow the simulator instructions and to commit as few errors as possible. Training consisted of 3 hours of simulation-based ultrasound training focusing on fetal weight estimation. Simulation-based tests were administered before and after training. Transfer tests were performed on real patients 7-10 days after the completion of training. Primary outcomes were transfer test performance scores and diagnostic accuracy. Secondary outcomes included performance scores and diagnostic accuracy during the simulation-based pre- and post-tests. A total of 56 participants completed the study. On the transfer test, EMT group participants attained higher performance scores (mean score: 67.7%, 95% confidence interval [CI]: 62.4-72.9%) than EAT group members (mean score: 51.7%, 95% CI: 45.8-57.6%) (p < 0.001; Cohen's d = 1.1, 95% CI: 0.5-1.7). There was a moderate improvement in diagnostic accuracy in the EMT group compared with the EAT group (16.7%, 95% CI: 10.2-23.3% weight deviation versus 26.6%, 95% CI: 16.5-36.7% weight deviation [p = 0.082; Cohen's d = 0.46, 95% CI: -0.06 to 1.0]). No significant interaction effects between group and performance improvements between the pre- and post-tests were found in either performance scores (p = 0.25) or diagnostic accuracy (p = 0.09). The provision of error management instructions during simulation-based training improves the transfer of learning to the clinical setting compared with error avoidance instructions. Rather than teaching to avoid errors, the use of errors for learning should be explored further in medical education theory and practice. © 2016 John Wiley & Sons Ltd and The Association for the Study of Medical Education.
Lumbopelvic control and pitching performance of professional baseball pitchers.

PubMed

Chaudhari, Ajit M W; McKenzie, Christopher S; Borchers, James R; Best, Thomas M

2011-08-01

This study assessed the correlation between lumbopelvic control during a single-leg balancing task and in-game pitching performance in Minor-League baseball pitchers. Seventy-five healthy professional baseball pitchers performed a standing lumbopelvic control test during the last week of spring training for the 2008 and 2009 seasons while wearing a custom-designed testing apparatus, the "Level Belt." With the Level Belt secured to the waist, subjects attempted to transition from a 2-leg to a single-leg pitching stance and balance while maintaining a stable pelvic position. Subjects were graded on the maximum sagittal pelvic tilt from a neutral position during the motion. Pitching performance, number of innings pitched (IP), and injuries were compared for all subjects who pitched at least 50 innings during a season. The median Level Belt score for the study group was 7°. Two-sample t-tests with equal variances were used to determine if pitchers with a Level Belt score <7° or ≥7° were more likely to perform differently during the baseball season, and chi-square analysis was used to compare injuries between groups. Subjects scoring <7° on the Level Belt test had significantly fewer walks plus hits per inning than subjects scoring ≥7° (walks plus hits per inning pitched, 1.352 ± 0.251 vs. 1.584 ± 0.360, p = 0.013) and significantly more IP during the season (IP, 78.89 ± 38.67 vs. 53.38 ± 42.47, p = 0.043). There was no significant difference in the number of pitchers injured between groups. These data suggest that lumbopelvic control influences overall performance for baseball pitchers and that a simple test of lumbopelvic control can potentially identify individuals who have a better chance of pitching success.
Improved perceptual-motor performance measurement system

NASA Technical Reports Server (NTRS)

Parker, J. F., Jr.; Reilly, R. E.

1969-01-01

Battery of tests determines the primary dimensions of perceptual-motor performance. Eighteen basic measures range from simple tests to sophisticated electronic devices. Improved system has one unit for the subject containing test display and response elements, and one for the experimenter where test setups, programming, and scoring are accomplished.
The relation of functional visual acuity measurement methodology to tear functions and ocular surface status.

PubMed

Kaido, Minako; Ishida, Reiko; Dogru, Murat; Tsubota, Kazuo

2011-09-01

To investigate the relation of functional visual acuity (FVA) measurements with dry eye test parameters and to compare the testing methods with and without blink suppression and anesthetic instillation. A prospective comparative case series. Thirty right eyes of 30 dry eye patients and 25 right eyes of 25 normal subjects seen at Keio University School of Medicine, Department of Ophthalmology were studied. FVA testing was performed using a FVA measurement system with two different approaches, one in which measurements were made under natural blinking conditions without topical anesthesia (FVA-N) and the other in which the measurements were made under the blink suppression condition with topical anesthetic eye drops (FVA-BS). Tear function examinations, such as the Schirmer test, tear film break-up time, and fluorescein and Rose Bengal vital staining as ocular surface evaluation, were performed. The mean logMAR FVA-N scores and logMAR Landolt visual acuity scores were significantly lower in the dry eye subjects than in the healthy controls (p < 0.05), while there were no statistical differences between the logMAR FVA-BS scores of the dry eye subjects and those of the healthy controls. There was a significant correlation between the logMAR Landolt visual acuities and the logMAR FVA-N and logMAR FVA-BS scores. The FVA-N scores correlated significantly with tear quantities, tear stability and, especially, the ocular surface vital staining scores. FVA measurements performed under natural blinking significantly reflected the tear functions and ocular surface status of the eye and would appear to be a reliable method of FVA testing. FVA measurement is also an accurate predictor of dry eye status.
A comparison of the technique of the football quarterback pass between high school and university athletes.

PubMed

Toffan, Adam; Alexander, Marion J L; Peeler, Jason

2017-07-28

The purpose of the study was to compare the most effective joint movements, segment velocities and body positions to perform the fastest and most accurate pass of high school and university football quarterbacks. Secondary purposes were to develop a quarterback throwing test to assess skill level, to determine which kinematic variables were different between high school and university athletes as well as to determine which variables were significant predictors of quarterback throwing test performance. Ten high school and ten university athletes were filmed for the study, performing nine passes at a target and two passes for maximum distance. Thirty variables were measured using Dartfish Team Pro 4.5.2 video analysis system, and Microsoft Excel was used for statistical analysis. University athletes scored slightly higher than the high school athletes on the throwing test, however this result was not statistically significant. Correlation analysis and forward stepwise multiple regression analysis was performed on both the high school players and the university players in order to determine which variables were significant predictors of throwing test score. Ball velocity was determined to have the strongest predictive effect on throwing test score (r = 0.900) for the high school athletes, however, position of the back foot at release was also determined to be important (r = 0.661) for the university group. Several significant differences in throwing technique between groups were noted during the pass, however, body position at release showed the greatest differences between the two groups. High school players could benefit from more complete weight transfer and decreased throw time to increase throwing test score. University athletes could benefit from increased throw time and greater range of motion in external shoulder rotation and trunk rotation to increase their throwing test score. Coaches and practitioners will be able to use the findings of this research to help improve these and related throwing variables in their high school and university quarterbacks.
Adult Learners: Relationships of Reading, MCAT, and USMLE Step 1 Test Results for Medical Students.

ERIC Educational Resources Information Center

Haught, Patricia A.; Walls, Richard T.

This study examined the possible relationship between scores on the Nelson-Denny Reading Test (current forms G and H) and performance on the Medical College Admissions Test (MCAT) and the United States Medical Licensing Examination (USMLE) Step 1 examination scores. Participants were 730 medical students at a mid-Atlantic university, and for 572…
Do School-Based Tutoring Programs Significantly Improve Student Performance on Standardized Tests?

ERIC Educational Resources Information Center

Rothman, Terri; Henderson, Mary

2011-01-01

This study used a pre-post, nonequivalent control group design to examine the impact of an in-district, after-school tutoring program on eighth grade students' standardized test scores in language arts and mathematics. Students who had scored in the near-passing range on either the language arts or mathematics aspect of a standardized test at the…
Correlations between cerebral glucose metabolism and neuropsychological test performance in nonalcoholic cirrhotics.

PubMed

Lockwood, Alan H; Weissenborn, Karin; Bokemeyer, Martin; Tietge, U; Burchert, Wolfgang

2002-03-01

Many cirrhotics have abnormal neuropsychological test scores. To define the anatomical-physiological basis for encephalopathy in nonalcoholic cirrhotics, we performed resting-state fluorodeoxyglucose positron emission tomographic scans and administered a neuropsychological test battery to 18 patients and 10 controls. Statistical parametric mapping correlated changes in regional glucose metabolism with performance on the individual tests and a composite battery score. In patients without overt encephalopathy, poor performance correlated with reductions in metabolism in the anterior cingulate. In all patients, poor performance on the battery was positively correlated (p < 0.001) with glucose metabolism in bifrontal and biparietal regions of the cerebral cortex and negatively correlated with metabolism in hippocampal, lingual, and fusiform gyri and the posterior putamen. Similar patterns of abnormal metabolism were found when comparing the patients to 10 controls. Metabolic abnormalities in the anterior attention system and association cortices mediating executive and integrative function form the pathophysiological basis for mild hepatic encephalopathy.
The King-Devick (K-D) test of rapid eye movements: a bedside correlate of disability and quality of life in MS.

PubMed

Moster, Stephen; Wilson, James A; Galetta, Steven L; Balcer, Laura J

2014-08-15

We investigated the King-Devick (K-D) test of rapid number naming as a visual performance measure in a cohort of patients with multiple sclerosis (MS). In this cross-sectional study, 81 patients with MS and 20 disease-free controls from an ongoing study of visual outcomes underwent K-D testing. A test of rapid number naming, K-D requires saccadic eye movements as well as intact vision, attention and concentration. To perform the K-D test, participants are asked to read numbers aloud as quickly as possible from three test cards; the sum of the three test card times in seconds constitutes the summary score. High-contrast visual acuity (VA), low-contrast letter acuity (1.25% and 2.5% levels), retinal nerve fiber layer (RNFL) thickness by optical coherence tomography (OCT), MS Functional Composite (MSFC) and vision-specific quality of life (QOL) measures (25-Item NEI Visual Functioning Questionnaire [NEI-VFQ-25] and 10-Item Neuro-Ophthalmic Supplement) were also assessed. K-D time scores in the MS cohort (total time to read the three test cards) were significantly higher (worse) compared to those for disease-free controls (P=0.003, linear regression, accounting for age). Within the MS cohort, higher K-D scores were associated with worse scores for the NEI-VFQ-25 composite (P<0.001), 10-Item Neuro-Ophthalmic Supplement (P<0.001), binocular low-contrast acuity (2.5%, 1.25%, P<0.001, and high-contrast VA (P=0.003). Monocular low-contrast vision scores (P=0.001-0.009) and RNFL thickness (P=0.001) were also reduced in eyes of patients with worse K-D scores (GEE models accounting for age and within-patient, inter-eye correlations). Patients with a history of optic neuritis (ON) had increased (worse) K-D scores. Patients who classified their work disability status as disabled (receiving disability pension) did worse on K-D testing compared to those working full-time (P=0.001, accounting for age). The K-D test, a <2 minute bedside test of rapid number naming, is associated with visual dysfunction, neurologic impairment, and reduced vision-specific QOL in patients with MS. Scores reflect work disability as well as structural changes as measured by OCT imaging. History of ON and abnormal binocular acuities were associated with worse K-D scores, suggesting that abnormalities detected by K-D may go along with afferent dysfunction in MS patients. A brief test that requires saccadic eye movements, K-D should be considered for future MS trials as a rapid visual performance measure. Copyright © 2014 Elsevier B.V. All rights reserved.

Learning and Study Strategies Inventory subtests and factors as predictors of National Board of Chiropractic Examiners Part 1 examination performance.

PubMed

Schutz, Christine M; Dalton, Leanne; Tepe, Rodger E

2013-01-01

This study was designed to extend research on the relationship between chiropractic students' learning and study strategies and national board examination performance. Sixty-nine first trimester chiropractic students self-administered the Learning and Study Strategies Inventory (LASSI). Linear trends tests (for continuous variables) and Mantel-Haenszel trend tests (for categorical variables) were utilized to determine if the 10 LASSI subtests and 3 factors predicted low, medium and high levels of National Board of Chiropractic Examiners (NBCE) Part 1 scores. Multiple regression was performed to predict overall mean NBCE examination scores using the 3 LASSI factors as predictor variables. Four LASSI subtests (Anxiety, Concentration, Selecting Main Ideas, Test Strategies) and one factor (Goal Orientation) were significantly associated with NBCE examination levels. One factor (Goal Orientation) was a significant predictor of overall mean NBCE examination performance. Learning and study strategies are predictive of NBCE Part 1 examination performance in chiropractic students. The current study found LASSI subtests Anxiety, Concentration, Selecting Main Ideas, and Test Strategies, and the Goal-Orientation factor to be significant predictors of NBCE scores. The LASSI may be useful to educators in preparing students for academic success. Further research is warranted to explore the effects of learning and study strategies training on GPA and NBCE performance.
Detecting Intervention Effects in a Cluster-Randomized Design Using Multilevel Structural Equation Modeling for Binary Responses

PubMed Central

Cho, Sun-Joo; Preacher, Kristopher J.; Bottge, Brian A.

2015-01-01

Multilevel modeling (MLM) is frequently used to detect group differences, such as an intervention effect in a pre-test–post-test cluster-randomized design. Group differences on the post-test scores are detected by controlling for pre-test scores as a proxy variable for unobserved factors that predict future attributes. The pre-test and post-test scores that are most often used in MLM are summed item responses (or total scores). In prior research, there have been concerns regarding measurement error in the use of total scores in using MLM. To correct for measurement error in the covariate and outcome, a theoretical justification for the use of multilevel structural equation modeling (MSEM) has been established. However, MSEM for binary responses has not been widely applied to detect intervention effects (group differences) in intervention studies. In this article, the use of MSEM for intervention studies is demonstrated and the performance of MSEM is evaluated via a simulation study. Furthermore, the consequences of using MLM instead of MSEM are shown in detecting group differences. Results of the simulation study showed that MSEM performed adequately as the number of clusters, cluster size, and intraclass correlation increased and outperformed MLM for the detection of group differences. PMID:29881032
Verbal Serial List Learning in Mild Cognitive Impairment: A Profile Analysis of Interference, Forgetting, and Errors

PubMed Central

Libon, David J.; Bondi, Mark W.; Price, Catherine C.; Lamar, Melissa; Eppig, Joel; Wambach, Denene M.; Nieves, Christine; Delano-Wood, Lisa; Giovannetti, Tania; Lippa, Carol; Kabasakalian, Anahid; Cosentino, Stephanie; Swenson, Rod; Penney, Dana L.

2012-01-01

Using cluster analysis Libon et al. (2010) found three verbal serial list-learning profiles involving delay memory test performance in patients with mild cognitive impairment (MCI). Amnesic MCI (aMCI) patients presented with low scores on delay free recall and recognition tests; mixed MCI (mxMCI) patients scored higher on recognition compared to delay free recall tests; and dysexecutive MCI (dMCI) patients generated relatively intact scores on both delay test conditions. The aim of the current research was to further characterize memory impairment in MCI by examining forgetting/savings, interference from a competing word list, intrusion errors/perseverations, intrusion word frequency, and recognition foils in these three statistically determined MCI groups compared to normal control (NC) participants. The aMCI patients exhibited little savings, generated more highly prototypic intrusion errors, and displayed indiscriminate responding to delayed recognition foils. The mxMCI patients exhibited higher saving scores, fewer and less prototypic intrusion errors, and selectively endorsed recognition foils from the interference list. dMCI patients also selectively endorsed recognition foils from the interference list but performed similarly compared to NC participants. These data suggest the existence of distinct memory impairments in MCI and caution against the routine use of a single memory test score to operationally define MCI. PMID:21880171
An Investigation of Calculator Use on Employment Tests of Mathematical Ability: Effects on Reliability, Validity, Test Scores, and Speed of Completion

ERIC Educational Resources Information Center

Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl

2009-01-01

Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…
Face recognition performance of individuals with Asperger syndrome on the Cambridge Face Memory Test.

PubMed

Hedley, Darren; Brewer, Neil; Young, Robyn

2011-12-01

Although face recognition deficits in individuals with Autism Spectrum Disorder (ASD), including Asperger syndrome (AS), are widely acknowledged, the empirical evidence is mixed. This in part reflects the failure to use standardized and psychometrically sound tests. We contrasted standardized face recognition scores on the Cambridge Face Memory Test (CFMT) for 34 individuals with AS with those for 42, IQ-matched non-ASD individuals, and age-standardized scores from a large Australian cohort. We also examined the influence of IQ, autistic traits, and negative affect on face recognition performance. Overall, participants with AS performed significantly worse on the CFMT than the non-ASD participants and when evaluated against standardized test norms. However, while 24% of participants with AS presented with severe face recognition impairment (>2 SDs below the mean), many individuals performed at or above the typical level for their age: 53% scored within +/- 1 SD of the mean and 9% demonstrated superior performance (>1 SD above the mean). Regression analysis provided no evidence that IQ, autistic traits, or negative affect significantly influenced face recognition: diagnostic group membership was the only significant predictor of face recognition performance. In sum, face recognition performance in ASD is on a continuum, but with average levels significantly below non-ASD levels of performance. Copyright © 2011, International Society for Autism Research, Wiley-Liss, Inc.
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats.

PubMed

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions.
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats

PubMed Central

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Background Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. Objective To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. Methods This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. Results 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Conclusions Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions. PMID:27900181
The Relationship Between Soldier Performance on the Two-Mile Run and the 20-m Shuttle Run Test.

PubMed

Canino, Maria C; Cohen, Bruce S; Redmond, Jan E; Sharp, Marilyn A; Zambraski, Edward J; Foulis, Stephen A

2018-05-01

The 20-m shuttle run test (MSRT) is a common field test used to measure aerobic fitness in controlled environments. The U.S. Army currently assesses aerobic fitness with the two-mile run (TMR), but external factors may impact test performance. The aim of this study is to examine the relationship between the Army Physical Fitness Test TMR performance and the MSRT in military personnel. A group of 531 (403 males and 128 females) active duty soldiers (age: 24.0 ± 4.1 years) performed the MSRT in an indoor facility. Heart rate was monitored for the duration of the test. Post-heart rate and age-predicted maximal heart rate were utilized to determine near-maximal performance on the MSRT. The soldiers provided their most recent Army Physical Fitness Test TMR time (min). A Pearson correlation and multiple linear regression analyses were performed to examine the relationship between TMR time (min) and MSRT score (total number of shuttles completed). The study was approved by the Human Use Review Committee at the U.S. Army Research Institute of Environmental Medicine, Natick, Massachusetts. A significant, negative correlation exists between TMR time and MSRT score (r = -0.75, p < 0.001). Sex and MSRT score significantly predicted TMR time (adjusted R2 = 0.65, standard error of estimate = 0.97, p < 0.001) with a 95% ratio limits of agreement of ±12.6%. The resulting equation is: TMR = 17.736-2.464 × (sex) - 0.050 × (MSRT) - 0.026 × (MSRT × sex) for predicted TMR time. Males equal zero, females equal one, and MSRT score is the total number of shuttles completed. The MSRT is a strong predictor of the TMR and should be considered as a diagnostic tool when assessing aerobic fitness in active duty soldiers.
On the Performance of the Marginal Homogeneity Test to Detect Rater Drift.

PubMed

Sgammato, Adrienne; Donoghue, John R

2018-06-01

When constructed response items are administered repeatedly, "trend scoring" can be used to test for rater drift. In trend scoring, raters rescore responses from the previous administration. Two simulation studies evaluated the utility of Stuart's Q measure of marginal homogeneity as a way of evaluating rater drift when monitoring trend scoring. In the first study, data were generated based on trend scoring tables obtained from an operational assessment. The second study tightly controlled table margins to disentangle certain features present in the empirical data. In addition to Q , the paired t test was included as a comparison, because of its widespread use in monitoring trend scoring. Sample size, number of score categories, interrater agreement, and symmetry/asymmetry of the margins were manipulated. For identical margins, both statistics had good Type I error control. For a unidirectional shift in margins, both statistics had good power. As expected, when shifts in the margins were balanced across categories, the t test had little power. Q demonstrated good power for all conditions and identified almost all items identified by the t test. Q shows substantial promise for monitoring of trend scoring.
Admissions Criteria as Predictors of Academic Performance in a Three-Year Pharmacy Program at a Historically Black Institution

PubMed Central

Parmar, Jayesh R.; Purnell, Miriam; Lang, Lynn A.

2016-01-01

Objective. To determine the ability of University of Maryland Eastern Shore School of Pharmacy’s admissions criteria to predict students’ academic performance in a 3-year pharmacy program and to analyze transferability to African-American students. Methods. Statistical analyses were conducted on retrospective data for 174 students. Didactic and experiential scores were used as measures of academic performance. Results. Pharmacy College Admission Test (PCAT), grade point average (GPA), interview, and observational scores combined with previous pharmacy experience and biochemistry coursework predicted the students' academic performance except second-year (P2) experiential performance. For African-American students, didactic performance positively correlated with PCAT writing subtests, while the experiential performance positively correlated with previous pharmacy experience and observational score. For nonAfrican-American students, didactic performance positively correlated with PCAT multiple-choice subtests, and experiential performance with interview score. The prerequisite GPA positively correlated with both of the student subgroups’ didactic performance. Conclusion. Both PCAT and GPA were predictors of didactic performance, especially in nonAfrican-Americans. Pharmacy experience and observational scores were predictors of experiential performance, especially in African-Americans. PMID:26941432
A healthy Nordic diet and physical performance in old age: findings from the longitudinal Helsinki Birth Cohort Study.

PubMed

Perälä, Mia-Maria; von Bonsdorff, Mikaela; Männistö, Satu; Salonen, Minna K; Simonen, Mika; Kanerva, Noora; Pohjolainen, Pertti; Kajantie, Eero; Rantanen, Taina; Eriksson, Johan G

2016-03-14

Epidemiological studies have shown that a number of nutrients are associated with better physical performance. However, little is still known about the role of the whole diet, particularly a healthy Nordic diet, in relation to physical performance. Therefore, we examined whether a healthy Nordic diet was associated with measures of physical performance 10 years later. We studied 1072 participants from the Helsinki Birth Cohort Study. Participants' diet was assessed using a validated 128-item FFQ at the mean age of 61 years, and a priori-defined Nordic diet score (NDS) was calculated. The score included Nordic fruits and berries, vegetables, cereals, PUFA:SFA and trans-fatty acids ratio, low-fat milk, fish, red and processed meat, total fat and alcohol. At the mean age of 71 years, participants' physical performance was measured using the Senior Fitness Test (SFT), and an overall SFT score was calculated. Women in the highest fourth of the NDS had on average 5 points higher SFT score compared with those in the lowest fourth (P for trend 0·005). No such association was observed in men. Women with the highest score had 17% better result in the 6-min walk test, 16% better arm curl and 20% better chair stand results compared with those with the lowest score (all P values<0·01). In conclusion, a healthy Nordic diet was associated with better overall physical performance among women and might help decrease the risk of disability in old age.
Race, Socioeconomic Status, and Implicit Bias: Implications for Closing the Achievement Gap

NASA Astrophysics Data System (ADS)

Schlosser, Elizabeth Auretta Cox

This study accessed the relationship between race, socioeconomic status, age and the race implicit bias held by middle and high school science teachers in Mobile and Baldwin County Public School Systems. Seventy-nine participants were administered the race Implicit Association Test (race IAT), created by Greenwald, A. G., Nosek, B. A., & Banaji, M. R., (2003) and a demographic survey. Quantitative analysis using analysis of variances, ANOVA and t-tests were used in this study. An ANOVA was performed comparing the race IAT scores of African American science teachers and their Caucasian counterparts. A statically significant difference was found (F = .4.56, p = .01). An ANOVA was also performed using the race IAT scores comparing the age of the participants; the analysis yielded no statistical difference based on age. A t-test was performed comparing the race IAT scores of African American teachers who taught at either Title I or non-Title I schools; no statistical difference was found between groups (t = -17.985, p < .001). A t-test was also performed comparing the race IAT scores of Caucasian teachers who taught at either Title I or non-Title I schools; a statistically significant difference was found between groups ( t = 2.44, p > .001). This research examines the implications of the achievement gap among African American and Caucasian students in science.
Baseline neurocognitive testing in sports-related concussions: the importance of a prior night's sleep.

PubMed

McClure, D Jake; Zuckerman, Scott L; Kutscher, Scott J; Gregory, Andrew J; Solomon, Gary S

2014-02-01

The management of sports-related concussions (SRCs) utilizes serial neurocognitive assessments and self-reported symptom inventories to assess recovery and safety for return to play (RTP). Because postconcussive RTP goals include symptom resolution and a return to neurocognitive baseline levels, clinical decisions rest in part on understanding modifiers of this baseline. Several studies have reported age and sex to influence baseline neurocognitive performance, but few have assessed the potential effect of sleep. We chose to investigate the effect of reported sleep duration on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) performance and the number of patient-reported symptoms. We hypothesized that athletes receiving less sleep before baseline testing would perform worse on neurocognitive metrics and report more symptoms. Cross-sectional study; Level of evidence, 3. We retrospectively reviewed 3686 nonconcussed athletes (2371 male, 1315 female; 3305 high school, 381 college) with baseline symptom and ImPACT neurocognitive scores. Patients were stratified into 3 groups based on self-reported sleep duration the night before testing: (1) short, <7 hours; (2) intermediate, 7-9 hours; and (3) long, ≥9 hours. A multivariate analysis of covariance (MANCOVA) with an α level of .05 was used to assess the influence of sleep duration on baseline ImPACT performance. A univariate ANCOVA was performed to investigate the influence of sleep on total self-reported symptoms. When controlling for age and sex as covariates, the MANCOVA revealed significant group differences on ImPACT reaction time, verbal memory, and visual memory scores but not visual-motor (processing) speed scores. An ANCOVA also revealed significant group differences in total reported symptoms. For baseline symptoms and ImPACT scores, subsequent pairwise comparisons revealed these associations to be most significant when comparing the short and intermediate sleep groups. Our results indicate that athletes sleeping fewer than 7 hours before baseline testing perform worse on 3 of 4 ImPACT scores and report more symptoms. Because SRC management and RTP decisions hinge on the comparison with a reliable baseline evaluation, clinicians should consider sleep duration before baseline neurocognitive testing as a potential factor in the assessment of athletes' recovery.
Does household access to improved water and sanitation in infancy and childhood predict better vocabulary test performance in Ethiopian, Indian, Peruvian and Vietnamese cohort studies?

PubMed Central

Dearden, Kirk A; Brennan, Alana T; Behrman, Jere R; Schott, Whitney; Crookston, Benjamin T; Humphries, Debbie L; Penny, Mary E; Fernald, Lia C H

2017-01-01

Objective Test associations between household water and sanitation (W&S) and children's concurrent and subsequent Peabody Picture Vocabulary Test (PPVT) scores. Design Prospective cohort study. Setting Ethiopia, India, Peru, Vietnam. Participants 7269 children. Primary outcome measures PPVT scores at 5 and 8 years. Key exposure variables were related to W&S, and collected at 1, 5 and 8 years, including ‘improved’ water (eg, piped, public tap or standpipe) and ‘improved’ toilets (eg, collection, storage, treatment and recycling of human excreta). Results Access to improved water at 1 year was associated with higher language scores at 5 years (3/4 unadjusted associations) and 8 years (4/4 unadjusted associations). Ethiopian children with access to improved water at 1 year had test scores that were 0.26 SD (95% CI 0.17 to 0.36) higher at 5 years than children without access. Access to improved water at 5 years was associated with higher concurrent PPVT scores (in 3/4 unadjusted associations), but not later scores (in 1/4 unadjusted associations). 5-year-old Peruvian children with access to improved water had better concurrent performance on the PPVT (0.44 SD, 95% CI 0.30 to 0.59) than children without access to improved water. Toilet access at 1 year was also associated with better PPVT scores at 5 years (3/4 unadjusted associations) and sometimes associated with test results at 8 years (2/4 unadjusted associations). Toilet access at 5 years was associated with concurrent PPVT scores (3/4 unadjusted associations). More than half of all associations in unadjusted models (water and toilets) persisted in adjusted models, particularly for toilets in India, Peru and Vietnam. Conclusions Access to ‘improved’ water and toilets had independent associations with children's PPVT scores that often persisted with adjustment for covariates. Our findings suggest that effects of W&S may go beyond subacute and acute infections and physical growth to include children's language performance, a critical component of cognitive development. PMID:28270388
Is Cognitive Test-Taking Anxiety Associated With Academic Performance Among Nursing Students?

PubMed

Duty, Susan M; Christian, Ladonna; Loftus, Jocelyn; Zappi, Victoria

2016-01-01

The cognitive component of test anxiety was correlated with academic performance among nursing students. Modest but statistically significant lower examination grade T scores were observed for students with high compared with low levels of cognitive test anxiety (CTA). High levels of CTA were associated with reduced academic performance.
Evaluation of BLAST-based edge-weighting metrics used for homology inference with the Markov Clustering algorithm.

PubMed

Gibbons, Theodore R; Mount, Stephen M; Cooper, Endymion D; Delwiche, Charles F

2015-07-10

Clustering protein sequences according to inferred homology is a fundamental step in the analysis of many large data sets. Since the publication of the Markov Clustering (MCL) algorithm in 2002, it has been the centerpiece of several popular applications. Each of these approaches generates an undirected graph that represents sequences as nodes connected to each other by edges weighted with a BLAST-based metric. MCL is then used to infer clusters of homologous proteins by analyzing these graphs. The various approaches differ only by how they weight the edges, yet there has been very little direct examination of the relative performance of alternative edge-weighting metrics. This study compares the performance of four BLAST-based edge-weighting metrics: the bit score, bit score ratio (BSR), bit score over anchored length (BAL), and negative common log of the expectation value (NLE). Performance is tested using the Extended CEGMA KOGs (ECK) database, which we introduce here. All metrics performed similarly when analyzing full-length sequences, but dramatic differences emerged as progressively larger fractions of the test sequences were split into fragments. The BSR and BAL successfully rescued subsets of clusters by strengthening certain types of alignments between fragmented sequences, but also shifted the largest correct scores down near the range of scores generated from spurious alignments. This penalty outweighed the benefits in most test cases, and was greatly exacerbated by increasing the MCL inflation parameter, making these metrics less robust than the bit score or the more popular NLE. Notably, the bit score performed as well or better than the other three metrics in all scenarios. The results provide a strong case for use of the bit score, which appears to offer equivalent or superior performance to the more popular NLE. The insight that MCL-based clustering methods can be improved using a more tractable edge-weighting metric will greatly simplify future implementations. We demonstrate this with our own minimalist Python implementation: Porthos, which uses only standard libraries and can process a graph with 25 m + edges connecting the 60 k + KOG sequences in half a minute using less than half a gigabyte of memory.
Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

PubMed

Liu, Zhihai; Su, Minyi; Han, Li; Liu, Jie; Yang, Qifan; Li, Yan; Wang, Renxiao

2017-02-21

In structure-based drug design, scoring functions are widely used for fast evaluation of protein-ligand interactions. They are often applied in combination with molecular docking and de novo design methods. Since the early 1990s, a whole spectrum of protein-ligand interaction scoring functions have been developed. Regardless of their technical difference, scoring functions all need data sets combining protein-ligand complex structures and binding affinity data for parametrization and validation. However, data sets of this kind used to be rather limited in terms of size and quality. On the other hand, standard metrics for evaluating scoring function used to be ambiguous. Scoring functions are often tested in molecular docking or even virtual screening trials, which do not directly reflect the genuine quality of scoring functions. Collectively, these underlying obstacles have impeded the invention of more advanced scoring functions. In this Account, we describe our long-lasting efforts to overcome these obstacles, which involve two related projects. On the first project, we have created the PDBbind database. It is the first database that systematically annotates the protein-ligand complexes in the Protein Data Bank (PDB) with experimental binding data. This database has been updated annually since its first public release in 2004. The latest release (version 2016) provides binding data for 16 179 biomolecular complexes in PDB. Data sets provided by PDBbind have been applied to many computational and statistical studies on protein-ligand interaction and various subjects. In particular, it has become a major data resource for scoring function development. On the second project, we have established the Comparative Assessment of Scoring Functions (CASF) benchmark for scoring function evaluation. Our key idea is to decouple the "scoring" process from the "sampling" process, so scoring functions can be tested in a relatively pure context to reflect their quality. In our latest work on this track, i.e. CASF-2013, the performance of a scoring function was quantified in four aspects, including "scoring power", "ranking power", "docking power", and "screening power". All four performance tests were conducted on a test set containing 195 high-quality protein-ligand complexes selected from PDBbind. A panel of 20 standard scoring functions were tested as demonstration. Importantly, CASF is designed to be an open-access benchmark, with which scoring functions developed by different researchers can be compared on the same grounds. Indeed, it has become a popular choice for scoring function validation in recent years. Despite the considerable progress that has been made so far, the performance of today's scoring functions still does not meet people's expectations in many aspects. There is a constant demand for more advanced scoring functions. Our efforts have helped to overcome some obstacles underlying scoring function development so that the researchers in this field can move forward faster. We will continue to improve the PDBbind database and the CASF benchmark in the future to keep them as useful community resources.
The Effects of Material and Task Variations on a Brief Cognitive Learning Strategies Training Program

DTIC Science & Technology

1980-08-01

EFFECTS OF MATERIAL AND TASK VARIATIONS ON A BRIEF COGNITIVE LEARNING STRATEGIES TRAINING PROGRAM Introduction As scholastic achievement scores continue to...variance of the test scores revealed no significant dif- ferences among the three treatment conditions on any of the tests. Although these results...tentative because of the group performance patterns. On the first (easier) passage, group means indicated nearly perfect scores for all three of the
[Relationship between unipedal stance test score and center of pressure velocity in elderly].

PubMed

Rodrigo Antonio, Guzmán; Rony, Silvestre; Francisco Aniceto, Rodríguez; David Andrés, Arriagada; Pablo Andrés, Ortega

2011-01-01

Frequent falls are one of the most important health problems in the elderly population. The unipedal stance test (UPST), asses postural stability and is used in fall risk measures. Despite this, there is little information about its relationship with posturographic parameters (PP) that characterizes postural stability. Center of pressure velocity (CoPV) is one of the best PP that describes postural stability. The aim of this study was to analyze the relation between UST score and CoPV in elderly population. A sample of 38 healthy elderly subjects where divided in two groups according to their UPST score, low performance (LP, n=11) and high performance (HP, n=27). The correlation between UPST score and COP mean velocity (CoPmV), recorded from a posturographic test, was analyzed between both groups. An inverse correlation between UPST score and CoPmV was found in both groups. However, this was higher in the LP group (r=-0.69, P=.02) compared to the HP (r=-0.39, P=.04). Based on the results of this investigation, it may be concluded that the achievement on UPST has an inverse relationship with CoPmV, especially in subjects with low performance in the UPST. Copyright © 2010 SEGG. Published by Elsevier Espana. All rights reserved.
Sentence level auditory comprehension treatment program for aphasic adults.

PubMed

Naeser, M A; Haas, G; Mazurski, P; Laughlin, S

1986-06-01

The purpose of this study was to investigate whether a newly developed sentence level auditory comprehension (SLAC) treatment program could be used to improve language comprehension test scores in adults with chronic aphasia. Results indicate that the SLAC treatment program can be used with chronic patients; performance on a standardized test (the Token Test) was improved after treatment; and improved performance could not be predicted from either anatomic CT scan lesion sites or pretreatment test scores. One advantage to the SLAC treatment program is that the patient can practice listening independently with a tape recorder device (Language Master) and earphones either in the hospital or at home.

Development of a full-scale transmission testing procedure to evaluate advanced lubricants

NASA Technical Reports Server (NTRS)

Lewicki, David G.; Decker, Harry J.; Shimski, John T.

1992-01-01

Experimental tests were performed on the OH-58A helicopter main rotor transmission in the NASA Lewis 500-hp Helicopter Transmission Test Stand. The testing was part of a joint Navy/NASA/Army lubrication program. The objective of the program was to develop a separate lubricant for gearboxes and demonstrate an improved performance in life and load-carrying capacity. The goal of the experiments was to develop a testing procedure to fail certain transmission components using a MIL-L-23699 base reference oil, then run identical tests with improved lubricants and demonstrate performance. The tests were directed at failing components that the Navy has had problems with due to marginal lubrication. These failures included mast shaft bearing micropitting, sun gear and planet bearing fatigue, and spiral bevel gear scoring. A variety of tests were performed and over 900 hours of total run time accumulated for these tests. Some success was achieved in developing a testing procedure to produce sun gear and planet bearing fatigue failures. Only marginal success was achieved in producing mast shaft bearing micropitting and spiral bevel gear scoring.
Collaborative Test Reviews: Student Performance

ERIC Educational Resources Information Center

Bhatia, Anuradha; Makela, Carole J.

2010-01-01

A group study method proved helpful in improving senior-level students' performance on unit tests through collaborative learning. Students of a History of Textiles course voluntarily attended study sessions to review course content and prepare for unit tests. The students who attended the group reviews scored better on tests than those who did…
Predicting neuropsychological test performance on the basis of temporal orientation.

PubMed

Ryan, Joseph J; Glass, Laura A; Bartels, Jared M; Bergner, CariAnn M; Paolo, Anthony M

2009-05-01

Temporal orientation is often disrupted in the context of psychiatric or neurological disease; tests assessing this function are included in most mental status examinations. The present study examined the relationship between scores on the Temporal Orientation Scale (TOS) and performance on a battery of tests that assess memory, language, and cognitive functioning in a sample of patients with Alzheimer's disease (N = 55). Pearson-product moment correlations showed that, in all but two instances, the TOS was significantly correlated with each neuropsychological measure, p values < or = .05. Also, severely disoriented (i.e., TOS score < or = -8) patients were consistently 'impaired' on memory tests but not on tests of language and general cognitive functioning.
Investigating the Correlation Between Pharmacy Student Performance on the Health Science Reasoning Test and a Critical Thinking Assignment.

PubMed

Nornoo, Adwoa O; Jackson, Jonathan; Axtell, Samantha

2017-03-25

Objective. To determine whether there is a correlation between pharmacy students' scores on the Health Science Reasoning Test (HSRT) and their grade on a package insert assignment designed to assess critical thinking. Methods. The HSRT was administered to first-year pharmacy students during a critical-thinking course in the spring semester. In the same semester, a required package insert assignment was completed in a pharmacokinetics course. To determine whether there was a relationship between HSRT scores and grades on the assignment, a Spearman's rho correlation test was performed. Results. A very weak but significant positive correlation was found between students' grades on the assignment and their overall HSRT score (r=0.19, p <0.05), as well as deduction (a scale score of the HSRT; r=0.26, p <0.01). Conclusion. Based on a very weak but significant correlation to HSRT scores, this study demonstrated the potential of a package insert assignment to be used as one of the components to measure critical-thinking skills in pharmacy students.
Investigating the Correlation Between Pharmacy Student Performance on the Health Science Reasoning Test and a Critical Thinking Assignment

PubMed Central

Jackson, Jonathan; Axtell, Samantha

2017-01-01

Objective. To determine whether there is a correlation between pharmacy students’ scores on the Health Science Reasoning Test (HSRT) and their grade on a package insert assignment designed to assess critical thinking. Methods. The HSRT was administered to first-year pharmacy students during a critical-thinking course in the spring semester. In the same semester, a required package insert assignment was completed in a pharmacokinetics course. To determine whether there was a relationship between HSRT scores and grades on the assignment, a Spearman’s rho correlation test was performed. Results. A very weak but significant positive correlation was found between students’ grades on the assignment and their overall HSRT score (r=0.19, p<0.05), as well as deduction (a scale score of the HSRT; r=0.26, p<0.01). Conclusion. Based on a very weak but significant correlation to HSRT scores, this study demonstrated the potential of a package insert assignment to be used as one of the components to measure critical-thinking skills in pharmacy students. PMID:28381884
Links between global and local shape perception, coloured backgrounds, colour discrimination, and non-verbal IQ.

PubMed

Dore, Patricia; Dumani, Ardian; Wyatt, Geddes; Shepherd, Alex J

2018-03-16

This study explored associations between local and global shape perception on coloured backgrounds, colour discrimination, and non-verbal IQ (NVIQ). Five background colours were chosen for the local and global shape tasks that were tailored for the cone-opponent pathways early in the visual system (cardinal colour directions: L-M, loosely, reddish-greenish; and S-(L + M), or tritan colours, loosely, blueish-yellowish; where L, M and S refer to the long, middle and short wavelength sensitive cones). Participants also completed the Farnsworth-Munsell 100-hue test (FM100) to determine whether performance on the local and global shape tasks correlated with colour discrimination overall, or with performance on the L-M and tritan subsets of the FM100 test. Overall performance on the local and global shape tasks did correlate with scores on the FM100 tests, despite the colour of the background being irrelevant to the shape tasks. There were also significantly larger associations between scores for the L-M subset of the FM100 test, compared to the tritan subset, and accuracy on some of the shape tasks on the reddish, greenish and neutral backgrounds. Participants also completed the non-verbal components of the WAIS and the SPM+ version of Raven's progressive matrices, to determine whether performance on the FM100 test, and on the local and global shape tasks, correlated with NVIQ. FM100 scores correlated significantly with both WAIS and SPM+ scores. These results extend previous work that has indicated FM100 performance is not purely a measure of colour discrimination, but also involves aspects of each participant's NVIQ, such as the ability to attend to local and global aspects of the test, part-whole relationships, perceptual organisation and good visuomotor skills. Overall performance on the local and global shape tasks correlated only with the WAIS scores, not the SPM+. These results indicate that those aspects of NVIQ that engage spatial comprehension of local-global relationships and manual manipulation (WAIS), rather than more abstract reasoning (SPM+), are related to performance on the local and global shape tasks. Links are presented between various measures of NVIQ and performance on visual tasks, but they are currently seldom addressed in studies of either shape or colour perception. Further studies to explore these issues are recommended. Copyright © 2018 Elsevier Ltd. All rights reserved.
Structural and Sequence Similarity Makes a Significant Impact on Machine-Learning-Based Scoring Functions for Protein-Ligand Interactions.

PubMed

Li, Yang; Yang, Jianyi

2017-04-24

The prediction of protein-ligand binding affinity has recently been improved remarkably by machine-learning-based scoring functions. For example, using a set of simple descriptors representing the atomic distance counts, the RF-Score improves the Pearson correlation coefficient to about 0.8 on the core set of the PDBbind 2007 database, which is significantly higher than the performance of any conventional scoring function on the same benchmark. A few studies have been made to discuss the performance of machine-learning-based methods, but the reason for this improvement remains unclear. In this study, by systemically controlling the structural and sequence similarity between the training and test proteins of the PDBbind benchmark, we demonstrate that protein structural and sequence similarity makes a significant impact on machine-learning-based methods. After removal of training proteins that are highly similar to the test proteins identified by structure alignment and sequence alignment, machine-learning-based methods trained on the new training sets do not outperform the conventional scoring functions any more. On the contrary, the performance of conventional functions like X-Score is relatively stable no matter what training data are used to fit the weights of its energy terms.
Predicting Performance in Higher Education Using Proximal Predictors.

PubMed

Niessen, A Susan M; Meijer, Rob R; Tendeiro, Jorge N

2016-01-01

We studied the validity of two methods for predicting academic performance and student-program fit that were proximal to important study criteria. Applicants to an undergraduate psychology program participated in a selection procedure containing a trial-studying test based on a work sample approach, and specific skills tests in English and math. Test scores were used to predict academic achievement and progress after the first year, achievement in specific course types, enrollment, and dropout after the first year. All tests showed positive significant correlations with the criteria. The trial-studying test was consistently the best predictor in the admission procedure. We found no significant differences between the predictive validity of the trial-studying test and prior educational performance, and substantial shared explained variance between the two predictors. Only applicants with lower trial-studying scores were significantly less likely to enroll in the program. In conclusion, the trial-studying test yielded predictive validities similar to that of prior educational performance and possibly enabled self-selection. In admissions aimed at student-program fit, or in admissions in which past educational performance is difficult to use, a trial-studying test is a good instrument to predict academic performance.
Is there inter-procedural transfer of skills in intraocular surgery? A randomized controlled trial.

PubMed

Thomsen, Ann Sofia Skou; Kiilgaard, Jens Folke; la Cour, Morten; Brydges, Ryan; Konge, Lars

2017-12-01

To investigate how experience in simulated cataract surgery impacts and transfers to the learning curves for novices in vitreoretinal surgery. Twelve ophthalmology residents without previous experience in intraocular surgery were randomized to (1) intensive training in cataract surgery on a virtual-reality simulator until passing a test with predefined validity evidence (cataract trainees) or to (2) no cataract surgery training (novices). Possible skill transfer was assessed using a test consisting of all 11 vitreoretinal modules on the EyeSi virtual-reality simulator. All participants repeated the test of vitreoretinal surgical skills until their performance curve plateaued. Three experienced vitreoretinal surgeons also performed the test to establish validity evidence. Analysis with independent samples t-tests was performed. The vitreoretinal test on the EyeSi simulator demonstrated evidence of validity, given statistically significant differences in mean test scores for the first repetition; experienced surgeons scored higher than novices (p = 0.023) and cataract trainees (p = 0.003). Internal consistency for the 11 modules of the test was acceptable (Cronbach's α = 0.73). Our findings did not indicate a transfer effect with no significant differences found between cataract trainees and novices in their starting scores (mean ± SD 381 ± 129 points versus 455 ± 82 points, p = 0.262), time to reach maximum performance level (10.7 ± 3.0 hr versus 8.7 ± 2.8 hr, p = 0.265), or maximum scores (785 ± 162 points versus 805 ± 73 points, p = 0.791). Pretraining in cataract surgery did not demonstrate any measurable effect on vitreoretinal procedural performance. The results of this study indicate that we should not anticipate extensive transfer of surgical skills when planning training programmes in intraocular surgery. © 2017 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.
Relationship between concussion history and neurocognitive test performance in National Football League draft picks.

PubMed

Solomon, Gary S; Kuhn, Andrew

2014-04-01

There are limited empirical data available regarding the relationship between concussion history and neurocognitive functioning in active National Football League (NFL) players in general and NFL draft picks in particular. Potential NFL draft picks undergo 2 neurocognitive tests at the National Invitational Camp (Scouting Combine) every year: the Wonderlic and, since 2011, the Immediate Post-concussion Assessment and Cognitive Testing (ImPACT). After conclusion of the combine and before the draft, NFL teams invite potential draft picks to their headquarters for individual visits where further assessment may occur. To examine the relationship between concussion history and neurocognitive performance (ImPACT and Wonderlic) in a sample of elite NFL draft picks. Cohort study; Level of evidence, 3. Over 7 years, 226 potential draft picks were invited to visit a specific NFL team's headquarters after the combine. The athletes were divided into 3 groups based on self-reported concussion history: no prior concussions, 1 prior concussion, and 2 or more prior concussions. Neurocognitive measures of interest included Wonderlic scores (provided by the NFL team) and ImPACT composite scores (administered either at the combine or during a visit to the team headquarters). The relationship between concussion history and neurocognitive scores was assessed, as were the relationships among the 2 neurocognitive tests. Concussion history had no relationship to neurocognitive performance on either the Wonderlic or ImPACT. Concussion history did not affect performance on either neurocognitive test, suggesting that for this cohort, a history of concussion may not have adverse effects on neurocognitive functioning as measured by these 2 tests. This study reveals no correlation between concussion history and neurocognitive test scores (ImPACT, Wonderlic) in soon-to-be active NFL athletes.
Why women perform better in college than admission scores would predict: Exploring the roles of conscientiousness and course-taking patterns.

PubMed

Keiser, Heidi N; Sackett, Paul R; Kuncel, Nathan R; Brothen, Thomas

2016-04-01

Women typically obtain higher subsequent college GPAs than men with the same admissions test score. A common reaction is to attribute this to a flaw in the admissions test. We explore the possibility that this underprediction of women's performance reflects gender differences in conscientiousness and college course-taking patterns. In Study 1, we focus on using the ACT to predict performance in a single, large course where performance is decomposed into cognitive (exam and quiz scores) and less cognitive, discretionary components (discussion and extra credit points). The ACT does not underpredict female's cognitive performance, but it does underpredict female performance on the less cognitive, discretionary components of academic performance, because it fails to measure and account for the personality trait of conscientiousness. In Study 2, we create 2 course-difficulty indices (Course Challenge and Mean Aptitude in Course) and add them to an HLM regression model to see if they reduce the degree to which SAT scores underpredict female performance. Including Course Challenge does result in a modest reduction of the gender coefficient; however, including Mean Aptitude in Course does not. Thus, differences in course-taking patterns is a partial (albeit small) explanation for the common finding of differential prediction by gender. (c) 2016 APA, all rights reserved).
Relationship between procrastination and academic performance among a group of undergraduate dental students in India.

PubMed

Lakshminarayan, Nagesh; Potdar, Shrudha; Reddy, Siddana Goud

2013-04-01

Procrastination, generally defined as a voluntary, irrational delay of behavior, is a prevalent phenomenon among college students throughout the world and occurs at alarmingly high rates. For this study, a survey was conducted of 209 second-, third-, and fourth-year undergraduate dental students of Bapuji Dental College and Hospital, Davangere, India, to identify the relationship between their level of procrastination and academic performance. A sixteen-item questionnaire was used to assess the level of procrastination among these students. Data related to their academic performance were also collected. Spearman's correlation coefficient test was used to assess the relationship between procrastination and academic performance. It showed a negative correlation of -0.63 with a significance level of p<0.01 (two-tailed test), indicating that students who showed high procrastination scores performed below average in their academics. In addition, analysis with the Mann-Whitney U test found a significant difference in procrastination scores between the two gender groups (p<0.05). Hence, among the Indian undergraduate dental students evaluated in this study, it appeared that individuals with above average and average academic performance had lower scores of procrastination and vice versa.
Effort, symptom validity testing, performance validity testing and traumatic brain injury.

PubMed

Bigler, Erin D

2014-01-01

To understand the neurocognitive effects of brain injury, valid neuropsychological test findings are paramount. This review examines the research on what has been referred to a symptom validity testing (SVT). Above a designated cut-score signifies a 'passing' SVT performance which is likely the best indicator of valid neuropsychological test findings. Likewise, substantially below cut-point performance that nears chance or is at chance signifies invalid test performance. Significantly below chance is the sine qua non neuropsychological indicator for malingering. However, the interpretative problems with SVT performance below the cut-point yet far above chance are substantial, as pointed out in this review. This intermediate, border-zone performance on SVT measures is where substantial interpretative challenges exist. Case studies are used to highlight the many areas where additional research is needed. Historical perspectives are reviewed along with the neurobiology of effort. Reasons why performance validity testing (PVT) may be better than the SVT term are reviewed. Advances in neuroimaging techniques may be key in better understanding the meaning of border zone SVT failure. The review demonstrates the problems with rigidity in interpretation with established cut-scores. A better understanding of how certain types of neurological, neuropsychiatric and/or even test conditions may affect SVT performance is needed.
CaPTHUS scoring model in primary hyperparathyroidism: can it eliminate the need for ioPTH testing?

PubMed

Elfenbein, Dawn M; Weber, Sara; Schneider, David F; Sippel, Rebecca S; Chen, Herbert

2015-04-01

The CaPTHUS model was reported to have a positive predictive value of 100 % to correctly predict single-gland disease in patients with primary hyperparathyroidism, thus obviating the need for intraoperative parathyroid hormone (ioPTH) testing. We sought to apply the CaPTHUS scoring model in our patient population and assess its utility in predicting long-term biochemical cure. We retrospective reviewed all parathyroidectomies for primary hyperparathyroidism performed at our university hospital from 2003 to 2012. We routinely perform ioPTH testing. Biochemical cure was defined as a normal calcium level at 6 months. A total of 1,421 patients met the inclusion criteria: 78 % of patients had a single adenoma at the time of surgery, 98 % had a normal serum calcium at 1 week postoperatively, and 96 % had a normal serum calcium level 6 months postoperatively. Using the CaPTHUS scoring model, 307 patients (22.5 %) had a score of ≥ 3, with a positive predictive value of 91 % for single adenoma. A CaPTHUS score of ≥ 3 had a positive predictive value of 98 % for biochemical cure at 1 week as well as at 6 months. In our population, where ioPTH testing is used routinely to guide use of bilateral exploration, patients with a preoperative CaPTHUS score of ≥ 3 had good long-term biochemical cure rates. However, the model only predicted adenoma in 91 % of cases. If minimally invasive parathyroidectomy without ioPTH testing had been done for these patients, the cure rate would have dropped from 98 % to an unacceptable 89 %. Even in these patients with high CaPTHUS scores, multigland disease is present in almost 10 %, and ioPTH testing is necessary.
Handbook for Development of Skill Qualification Tests

DTIC Science & Technology

1977-11-01

PERFORMANCE CERTIFICATION COMPONENT SQT 2, MOSC 16J10 SCORING INSTRUCTIONS TO SUPERVISORS A-l B. SQT NOTICE B-l...P-77-5 1 APPENDIX A HEADQUARTERS, DEPARTMENT OF THE ARMY WASHINGTON, DC, 20310 PERFORMANCE CERTIFICATION COMPONENT SQT 2, MOSC 16JI0 SCORING...soldier will not be penalized for a score of "N." 3. a. SQT 2, MOSC 16J10 consist of a written component and a per- formance certification component
The diagnostic performance of the Mass Restricted (MR) score in the identification of microbial invasion of the amniotic cavity or intra-amniotic inflammation is not superior to amniotic fluid interleukin-6

PubMed Central

Romero, Roberto; Kadar, Nicholas; Miranda, Jezid; Korzeniewski, Steven J.; Schwartz, Alyse G.; Chaemsaithong, Piya; Rogers, Wade; Soto, Eleazar; Gotsch, Francesca; Yeo, Lami; Hassan, Sonia S.; Chaiworapongsa, Tinnakorn

2018-01-01

Objective Intra-amniotic infection/inflammation are major causes of spontaneous preterm labor and delivery. However, diagnosis of intra-amniotic infection is challenging because most are subclinical and amniotic fluid (AF) cultures take several days before results are available. Several tests have been proposed for the rapid diagnosis of microbial invasion of the amniotic cavity (MIAC) or intra-amniotic inflammation. The aim of this study was to examine the diagnostic performance of the AF Mass Restricted (MR) score in comparison with interleukin-6 (IL-6) and matrix metalloproteinase-8 (MMP-8) for the identification of MIAC or inflammation. Methods AF samples were collected from patients with singleton gestations and symptoms of preterm labor (n = 100). Intra-amniotic inflammation was defined as >100 white blood cells/mm3 (WBCs) in AF; MIAC was defined as a positive AF culture. AF IL-6 and MMP-8 were determined using ELISA. The MR score was obtained using the Surface-Enhanced Laser Desorption Ionization Time of Flight (SELDI-TOF) mass spectrometry. Sensitivity and specificity were calculated and logistic regression models were fit to construct receiver-operating characteristic (ROC) curves for the identification of each outcome. The McNemar’s test and paired sample non-parametric statistical techniques were used to test for differences in diagnostic performance metrics. Results (1) The prevalence of MIAC and intra-amniotic inflammation was 34% (34/100) and 40% (40/100), respectively; (2) there were no significant differences in sensitivity of the three tests under study (MR score, IL-6 or MMP-8) in the identification of either MIAC or intra-amniotic inflammation (using the following cutoffs: MR score >2, IL-6 >11.4 ng/mL, and MMP-8 >23 ng/mL); (3) there was no significant difference in the sensitivity among the three tests for the same outcomes when the false positive rate was fixed at 15%; (4) the specificity for IL-6 was not significantly different from that of the MR score in identifying either MIAC or intra-amniotic inflammation when using previously reported thresholds; and (5) there were no significant differences in the area under the ROC curve when comparing the MR score, IL-6 or MMP-8 in the identification of these outcomes. Conclusions IL-6 and the MR score have equivalent diagnostic performance in the identification of MIAC or intra-amniotic inflammation. Selection from among these three tests (MR score, IL-6 and MMP-8) for diagnostic purposes should be based on factors such as availability, reproducibility, and cost. The MR score requires a protein chip and a SELDI-TOF instrument which are not widely available or considered “state of the art”. In contrast, immunoassays for IL-6 can be performed in the majority of clinical laboratories. PMID:24028673
The diagnostic performance of the Mass Restricted (MR) score in the identification of microbial invasion of the amniotic cavity or intra-amniotic inflammation is not superior to amniotic fluid interleukin-6.

PubMed

Romero, Roberto; Kadar, Nicholas; Miranda, Jezid; Korzeniewski, Steven J; Schwartz, Alyse G; Chaemsaithong, Piya; Rogers, Wade; Soto, Eleazar; Gotsch, Francesca; Yeo, Lami; Hassan, Sonia S; Chaiworapongsa, Tinnakorn

2014-05-01

Intra-amniotic infection/inflammation are major causes of spontaneous preterm labor and delivery. However, diagnosis of intra-amniotic infection is challenging because most are subclinical and amniotic fluid (AF) cultures take several days before results are available. Several tests have been proposed for the rapid diagnosis of microbial invasion of the amniotic cavity (MIAC) or intra-amniotic inflammation. The aim of this study was to examine the diagnostic performance of the AF Mass Restricted (MR) score in comparison with interleukin-6 (IL-6) and matrix metalloproteinase-8 (MMP-8) for the identification of MIAC or inflammation. AF samples were collected from patients with singleton gestations and symptoms of preterm labor (n = 100). Intra-amniotic inflammation was defined as >100 white blood cells/mm(3) (WBCs) in AF; MIAC was defined as a positive AF culture. AF IL-6 and MMP-8 were determined using ELISA. The MR score was obtained using the Surface-Enhanced Laser Desorption Ionization Time of Flight (SELDI-TOF) mass spectrometry. Sensitivity and specificity were calculated and logistic regression models were fit to construct receiver-operating characteristic (ROC) curves for the identification of each outcome. The McNemar's test and paired sample non-parametric statistical techniques were used to test for differences in diagnostic performance metrics. (1) The prevalence of MIAC and intra-amniotic inflammation was 34% (34/100) and 40% (40/100), respectively; (2) there were no significant differences in sensitivity of the three tests under study (MR score, IL-6 or MMP-8) in the identification of either MIAC or intra-amniotic inflammation (using the following cutoffs: MR score >2, IL-6 >11.4 ng/mL, and MMP-8 >23 ng/mL); (3) there was no significant difference in the sensitivity among the three tests for the same outcomes when the false positive rate was fixed at 15%; (4) the specificity for IL-6 was not significantly different from that of the MR score in identifying either MIAC or intra-amniotic inflammation when using previously reported thresholds; and (5) there were no significant differences in the area under the ROC curve when comparing the MR score, IL-6 or MMP-8 in the identification of these outcomes. IL-6 and the MR score have equivalent diagnostic performance in the identification of MIAC or intra-amniotic inflammation. Selection from among these three tests (MR score, IL-6 and MMP-8) for diagnostic purposes should be based on factors such as availability, reproducibility, and cost. The MR score requires a protein chip and a SELDI-TOF instrument which are not widely available or considered "state of the art". In contrast, immunoassays for IL-6 can be performed in the majority of clinical laboratories.
Hematoma Shape, Hematoma Size, Glasgow Coma Scale Score and ICH Score: Which Predicts the 30-Day Mortality Better for Intracerebral Hematoma?

PubMed Central

Wang, Chih-Wei; Liu, Yi-Jui; Lee, Yi-Hsiung; Hueng, Dueng-Yuan; Fan, Hueng-Chuen; Yang, Fu-Chi; Hsueh, Chun-Jen; Kao, Hung-Wen; Juan, Chun-Jung; Hsu, Hsian-He

2014-01-01

Purpose To investigate the performance of hematoma shape, hematoma size, Glasgow coma scale (GCS) score, and intracerebral hematoma (ICH) score in predicting the 30-day mortality for ICH patients. To examine the influence of the estimation error of hematoma size on the prediction of 30-day mortality. Materials and Methods This retrospective study, approved by a local institutional review board with written informed consent waived, recruited 106 patients diagnosed as ICH by non-enhanced computed tomography study. The hemorrhagic shape, hematoma size measured by computer-assisted volumetric analysis (CAVA) and estimated by ABC/2 formula, ICH score and GCS score was examined. The predicting performance of 30-day mortality of the aforementioned variables was evaluated. Statistical analysis was performed using Kolmogorov-Smirnov tests, paired t test, nonparametric test, linear regression analysis, and binary logistic regression. The receiver operating characteristics curves were plotted and areas under curve (AUC) were calculated for 30-day mortality. A P value less than 0.05 was considered as statistically significant. Results The overall 30-day mortality rate was 15.1% of ICH patients. The hematoma shape, hematoma size, ICH score, and GCS score all significantly predict the 30-day mortality for ICH patients, with an AUC of 0.692 (P = 0.0018), 0.715 (P = 0.0008) (by ABC/2) to 0.738 (P = 0.0002) (by CAVA), 0.877 (P<0.0001) (by ABC/2) to 0.882 (P<0.0001) (by CAVA), and 0.912 (P<0.0001), respectively. Conclusion Our study shows that hematoma shape, hematoma size, ICH scores and GCS score all significantly predict the 30-day mortality in an increasing order of AUC. The effect of overestimation of hematoma size by ABC/2 formula in predicting the 30-day mortality could be remedied by using ICH score. PMID:25029592
Advanced clinical interpretation of the Delis-Kaplan Executive Function System: multivariate base rates of low scores.

PubMed

Karr, Justin E; Garcia-Barrera, Mauricio A; Holdnack, James A; Iverson, Grant L

2018-01-01

Multivariate base rates allow for the simultaneous statistical interpretation of multiple test scores, quantifying the normal frequency of low scores on a test battery. This study provides multivariate base rates for the Delis-Kaplan Executive Function System (D-KEFS). The D-KEFS consists of 9 tests with 16 Total Achievement scores (i.e. primary indicators of executive function ability). Stratified by education and intelligence, multivariate base rates were derived for the full D-KEFS and an abbreviated four-test battery (i.e. Trail Making, Color-Word Interference, Verbal Fluency, and Tower Test) using the adult portion of the normative sample (ages 16-89). Multivariate base rates are provided for the full and four-test D-KEFS batteries, calculated using five low score cutoffs (i.e. ≤25th, 16th, 9th, 5th, and 2nd percentiles). Low scores occurred commonly among the D-KEFS normative sample, with 82.6 and 71.8% of participants obtaining at least one score ≤16th percentile for the full and four-test batteries, respectively. Intelligence and education were inversely related to low score frequency. The base rates provided herein allow clinicians to interpret multiple D-KEFS scores simultaneously for the full D-KEFS and an abbreviated battery of commonly administered tests. The use of these base rates will support clinicians when differentiating between normal variations in cognitive performance and true executive function deficits.
MANUSCRIPT IN PRESS: DEMENTIA & GERIATRIC COGNITIVE DISORDERS

PubMed Central

O’Bryant, Sid E.; Xiao, Guanghua; Barber, Robert; Cullum, C. Munro; Weiner, Myron; Hall, James; Edwards, Melissa; Grammas, Paula; Wilhelmsen, Kirk; Doody, Rachelle; Diaz-Arrastia, Ramon

2015-01-01

Background Prior work on the link between blood-based biomarkers and cognitive status has largely been based on dichotomous classifications rather than detailed neuropsychological functioning. The current project was designed to create serum-based biomarker algorithms that predict neuropsychological test performance. Methods A battery of neuropsychological measures was administered. Random forest analyses were utilized to create neuropsychological test-specific biomarker risk scores in a training set that were entered into linear regression models predicting the respective test scores in the test set. Serum multiplex biomarker data were analyzed on 108 proteins from 395 participants (197 AD cases and 198 controls) from the Texas Alzheimer’s Research and Care Consortium. Results The biomarker risk scores were significant predictors (p<0.05) of scores on all neuropsychological tests. With the exception of premorbid intellectual status (6.6%), the biomarker risk scores alone accounted for a minimum of 12.9% of the variance in neuropsychological scores. Biomarker algorithms (biomarker risk scores + demographics) accounted for substantially more variance in scores. Review of the variable importance plots indicated differential patterns of biomarker significance for each test, suggesting the possibility of domain-specific biomarker algorithms. Conclusions Our findings provide proof-of-concept for a novel area of scientific discovery, which we term “molecular neuropsychology.” PMID:24107792

Correcting Two-Sample "z" and "t" Tests for Correlation: An Alternative to One-Sample Tests on Difference Scores

ERIC Educational Resources Information Center

Zimmerman, Donald W.

2012-01-01

In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student "t" test on difference scores. That procedure entails some loss of power, because it employs N - 1 degrees of freedom instead of the 2N - 2 degrees of freedom of the…
The effect of instructional methodology on high school students natural sciences standardized tests scores

NASA Astrophysics Data System (ADS)

Powell, P. E.

Educators have recently come to consider inquiry based instruction as a more effective method of instruction than didactic instruction. Experience based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness on preparing students to perform well on standardized tests. The purpose of the study to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi experimental quantitative study was comprised of two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.
Relationship Between Speech Intelligibility and Speech Comprehension in Babble Noise.

PubMed

Fontan, Lionel; Tardieu, Julien; Gaillard, Pascal; Woisard, Virginie; Ruiz, Robert

2015-06-01

The authors investigated the relationship between the intelligibility and comprehension of speech presented in babble noise. Forty participants listened to French imperative sentences (commands for moving objects) in a multitalker babble background for which intensity was experimentally controlled. Participants were instructed to transcribe what they heard and obey the commands in an interactive environment set up for this purpose. The former test provided intelligibility scores and the latter provided comprehension scores. Collected data revealed a globally weak correlation between intelligibility and comprehension scores (r = .35, p < .001). The discrepancy tended to grow as noise level increased. An analysis of standard deviations showed that variability in comprehension scores increased linearly with noise level, whereas higher variability in intelligibility scores was found for moderate noise level conditions. These results support the hypothesis that intelligibility scores are poor predictors of listeners' comprehension in real communication situations. Intelligibility and comprehension scores appear to provide different insights, the first measure being centered on speech signal transfer and the second on communicative performance. Both theoretical and practical implications for the use of speech intelligibility tests as indicators of speakers' performances are discussed.
Scaling: An Items Module

ERIC Educational Resources Information Center

Tong, Ye; Kolen, Michael J.

2010-01-01

"Scaling" is the process of constructing a score scale that associates numbers or other ordered indicators with the performance of examinees. Scaling typically is conducted to aid users in interpreting test results. This module describes different types of raw scores and scale scores, illustrates how to incorporate various sources of…
Performance of Blind Children on Digit-Span Tests.

ERIC Educational Resources Information Center

Hull, T.; Mason, H.

1995-01-01

This article reports the results of digit-span tests administered to 314 children who were visually impaired. Results found that gender, first language, and educational setting had no effect on the children's scores and that the congenitally totally blind children scored higher than did sighted children, whereas those who had had some sight did…
Experiential Awareness of the Effects of Test Score Reports.

ERIC Educational Resources Information Center

Bender, Robert C.

Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…
Academic Self-Perception and Its Relationship to Academic Performance

ERIC Educational Resources Information Center

Stringer, Ronald W.; Heath, Nancy

2008-01-01

One hundred and fifty-five students (average age, 10 years 7 months) were initially tested on reading, arithmetic, and academic self-perception. One year later they were tested again. Initial academic scores accounted for a large proportion of the variance in later academic scores. The children's self-perceptions of academic competence accounted…
The Effect of Stakes on Accountability Test Scores and Pass Rates

ERIC Educational Resources Information Center

Steedle, Jeffrey T.; Grochowalski, Joseph

2017-01-01

Students may not fully demonstrate their knowledge and skills on accountability tests if there are no stakes attached to individual performance. In that case, assessment results may not accurately reflect student achievement, so the validity of score interpretations and uses suffers. For this study, matched samples of students taking state…
Benefits of Coaching on Test Scores Seen as Negligible.

ERIC Educational Resources Information Center

Report on Education Research, 1983

1983-01-01

THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write…
Allele-sharing models: LOD scores and accurate linkage tests.

PubMed

Kong, A; Cox, N J

1997-11-01

Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.
Allele-sharing models: LOD scores and accurate linkage tests.

PubMed Central

Kong, A; Cox, N J

1997-01-01

Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested. PMID:9345087
Quantification of the sit-to-stand movement for monitoring age-related motor deterioration using the Nintendo Wii Balance Board.

PubMed

Yamako, Go; Chosa, Etsuo; Totoribe, Koji; Fukao, Yuu; Deng, Gang

2017-01-01

Simple methods for quantitative evaluations of individual motor performance are crucial for the early detection of motor deterioration. Sit-to-stand movement from a chair is a mechanically demanding component of activities of daily living. Here, we developed a novel method using the ground reaction force and center of pressure measured from the Nintendo Wii Balance Board to quantify sit-to-stand movement (sit-to-stand score) and investigated the age-related change in the sit-to-stand score as a method to evaluate reduction in motor performance. The study enrolled 503 participants (mean age ± standard deviation, 51.0 ± 19.7 years; range, 20-88 years; male/female ratio, 226/277) without any known musculoskeletal conditions that limit sit-to-stand movement, which were divided into seven 10-year age groups. The participants were instructed to stand up as quickly as possible, and the sit-to-stand score was calculated as the combination of the speed and balance indices, which have a tradeoff relationship. We also performed the timed up and go test, a well-known clinical test used to evaluate an individual's mobility. There were significant differences in the sit-to-stand score and timed up and go time among age groups. The mean sit-to-stand score for 60s, 70s, and 80s were 77%, 68%, and 53% of that for the 20s, respectively. The timed up and go test confirmed the age-related decrease in mobility of the participants. In addition, the sit-to-stand score measured using the Wii Balance Board was compared with that from a laboratory-graded force plate using the Bland-Altman plot (bias = -3.1 [ms]-1, 95% limit of agreement: -11.0 to 3.9 [ms]-1). The sit-to-stand score has good inter-device reliability (intraclass correlation coefficient = 0.87). Furthermore, the test-retest reliability is substantial (intraclass correlation coefficient = 0.64). Thus, the proposed STS score will be useful to detect the early deterioration of motor performance.
Prediction of Osteopathic Medical School Performance on the basis of MCAT score, GPA, sex, undergraduate major, and undergraduate institution.

PubMed

Dixon, Donna

2012-04-01

The relationships of students' preadmission academic variables, sex, undergraduate major, and undergraduate institution to academic performance in medical school have not been thoroughly examined. To determine the ability of students' preadmission academic variables to predict osteopathic medical school performance and whether students' sex, undergraduate major, or undergraduate institution influence osteopathic medical school performance. The study followed students who graduated from New York College of Osteopathic Medicine of New York Institute of Technology in Old Westbury between 2003 and 2006. Student preadmission data were Medical College Admission Test (MCAT) scores, undergraduate grade point averages (GPAs), sex, undergraduate major, and undergraduate institutional selectivity. Medical school performance variables were GPAs, clinical performance (ie, clinical subject examinations and clerkship evaluations), and scores on the Comprehensive Osteopathic Medical Licensing Examination-USA (COMLEX-USA) Level 1 and Level 2-Clinical Evaluation (CE). Data were analyzed with Pearson product moment correlation coefficients and multivariate linear regression analyses. Differences between student groups were compared with the independent-samples, 2-tailed t test. A total of 737 students were included. All preadmission academic variables, except nonscience undergraduate GPA, were statistically significant predictors of performance on COMLEX-USA Level 1, and all preadmission academic variables were statistically significant predictors of performance on COMLEX-USA Level 2-CE. The MCAT score for biological sciences had the highest correlation among all variables with COMLEX-USA Level 1 performance (Pearson r=0.304; P<.001) and Level 2-CE performance (Pearson r=0.272; P<.001). All preadmission variables were moderately correlated with the mean clinical subject examination scores. The mean clerkship evaluation score was moderately correlated with mean clinical examination results (Pearson r=0.267; P<.001) and COMLEX-USA Level 2-CE performance (Pearson r=0.301; P<.001). Clinical subject examination scores were highly correlated with COMLEX-USA Level 2-CE scores (Pearson r=0.817; P<.001). No statistically significant difference in medical school performance was found between students with science and nonscience undergraduate majors, nor was undergraduate institutional selectivity a factor influencing performance. Students' preadmission academic variables were predictive of osteopathic medical school performance, including GPAs, clinical performance, and COMLEX-USA Level 1 and Level 2-CE results. Clinical performance was predictive of COMLEX-USA Level 2-CE performance.
Assessment of numeracy in sports and exercise science students at an Australian university

NASA Astrophysics Data System (ADS)

Green, Simon; McGlynn, Susan; Stuart, Deidre; Fahey, Paul; Pettigrew, Jim; Clothier, Peter

2018-05-01

The effect of high school study of mathematics on numeracy performance of sports and exercise science (SES) students is not clear. To investigate this further, we tested the numeracy skills of 401 students enrolled in a Bachelor of Health Sciences degree in SES using a multiple-choice survey consisting of four background questions and 39 numeracy test questions. Background questions (5-point scale) focused on highest level of mathematics studied at high school, self-perception of mathematics proficiency, perceived importance of mathematics to SES and likelihood of seeking help with mathematics. Numeracy questions focused on rational number, ratios and rates, basic algebra and graph interpretation. Numeracy performance was based on answers to these questions (1 mark each) and represented by the total score (maximum = 39). Students from first (n = 212), second (n = 78) and third (n = 111) years of the SES degree completed the test. The distribution of numeracy test scores for the entire cohort was negatively skewed with a median (IQR) score of 27(11). We observed statistically significant associations between test scores and the highest level of mathematics studied (P < 0.05), being lowest in students who studied Year 10 Mathematics (20 (9)), intermediate in students who studied Year 12 General Mathematics (26 (8)) and highest in two groups of students who studied higher-level Year 12 Mathematics (31 (9), 31 (6)). There were statistically significant associations between test scores and level of self-perception of mathematics proficiency and also likelihood of seeking help with mathematics (P < 0.05) but not with perceived importance of mathematics to SES. These findings reveal that the level of mathematics studied in high school is a critical factor determining the level of numeracy performance in SES students.
Prepharmacy predictors of success in pharmacy school: grade point averages, pharmacy college admissions test, communication abilities, and critical thinking skills.

PubMed

Allen, D D; Bond, C A

2001-07-01

Good admissions decisions are essential for identifying successful students and good practitioners. Various parameters have been shown to have predictive power for academic success. Previous academic performance, the Pharmacy College Admissions Test (PCAT), and specific prepharmacy courses have been suggested as academic performance indicators. However, critical thinking abilities have not been evaluated. We evaluated the connection between academic success and each of the following predictive parameters: the California Critical Thinking Skills Test (CCTST) score, PCAT score, interview score, overall academic performance prior to admission at a pharmacy school, and performance in specific prepharmacy courses. We confirmed previous reports but demonstrated intriguing results in predicting practice-based skills. Critical thinking skills predict practice-based course success. Also, the CCTST and PCAT scores (Pearson correlation [pc] = 0.448, p < 0.001) were closely related in our students. The strongest predictors of practice-related courses and clerkship success were PCAT (pc=0.237, p<0.001) and CCTST (pc = 0.201, p < 0.001). These findings and other analyses suggest that PCAT may predict critical thinking skills in pharmacy practice courses and clerkships. Further study is needed to confirm this finding and determine which PCAT components predict critical thinking abilities.
Color discrimination performance in patients with Alzheimer's disease.

PubMed

Salamone, Giovanna; Di Lorenzo, Concetta; Mosti, Serena; Lupo, Federica; Cravello, Luca; Palmer, Katie; Musicco, Massimo; Caltagirone, Carlo

2009-01-01

Visual deficits are frequent in Alzheimer's disease (AD), yet little is known about the nature of these disturbances. The aim of the present study was to investigate color discrimination in patients with AD to determine whether impairment of this visual function is a cognitive or perceptive/sensory disturbance. A cross-sectional clinical study was conducted in a specialized dementia unit on 20 patients with mild/moderate AD and 21 age-matched normal controls. Color discrimination was measured by the Farnsworth-Munsell 100 hue test. Cognitive functioning was measured with the Mini-Mental State Examination (MMSE) and a comprehensive battery of neuropsychological tests. The scores obtained on the color discrimination test were compared between AD patients and controls adjusting for global and domain-specific cognitive performance. Color discrimination performance was inversely related to MMSE score. AD patients had a higher number of errors in color discrimination than controls (mean +/- SD total error score: 442.4 +/- 84.5 vs. 304.1 +/- 45.9). This trend persisted even after adjustment for MMSE score and cognitive performance on specific cognitive domains. A specific reduction of color discrimination capacity is present in AD patients. This deficit does not solely depend upon cognitive impairment, and involvement of the primary visual cortex and/or retinal ganglionar cells may be contributory.
External Validation of European System for Cardiac Operative Risk Evaluation II (EuroSCORE II) for Risk Prioritization in an Iranian Population

PubMed Central

Atashi, Alireza; Amini, Shahram; Tashnizi, Mohammad Abbasi; Moeinipour, Ali Asghar; Aazami, Mathias Hossain; Tohidnezhad, Fariba; Ghasemi, Erfan; Eslami, Saeid

2018-01-01

Introduction The European System for Cardiac Operative Risk Evaluation II (EuroSCORE II) is a prediction model which maps 18 predictors to a 30-day post-operative risk of death concentrating on accurate stratification of candidate patients for cardiac surgery. Objective The objective of this study was to determine the performance of the EuroSCORE II risk-analysis predictions among patients who underwent heart surgeries in one area of Iran. Methods A retrospective cohort study was conducted to collect the required variables for all consecutive patients who underwent heart surgeries at Emam Reza hospital, Northeast Iran between 2014 and 2015. Univariate and multivariate analysis were performed to identify covariates which significantly contribute to higher EuroSCORE II in our population. External validation was performed by comparing the real and expected mortality using area under the receiver operating characteristic curve (AUC) for discrimination assessment. Also, Brier Score and Hosmer-Lemeshow goodness-of-fit test were used to show the overall performance and calibration level, respectively. Results Two thousand five hundred eight one (59.6% males) were included. The observed mortality rate was 3.3%, but EuroSCORE II had a prediction of 4.7%. Although the overall performance was acceptable (Brier score=0.047), the model showed poor discriminatory power by AUC=0.667 (sensitivity=61.90, and specificity=66.24) and calibration (Hosmer-Lemeshow test, P<0.01). Conclusion Our study showed that the EuroSCORE II discrimination power is less than optimal for outcome prediction and less accurate for resource allocation programs. It highlights the need for recalibration of this risk stratification tool aiming to improve post cardiac surgery outcome predictions in Iran. PMID:29617500
Construct Validity and Scoring Methods of the World Health Organization: Health and Work Performance Questionnaire Among Workers With Arthritis and Rheumatological Conditions.

PubMed

AlHeresh, Rawan; LaValley, Michael P; Coster, Wendy; Keysor, Julie J

2017-06-01

To evaluate construct validity and scoring methods of the world health organization-health and work performance questionnaire (HPQ) for people with arthritis. Construct validity was examined through hypothesis testing using the recommended guidelines of the consensus-based standards for the selection of health measurement instruments (COSMIN). The HPQ using the absolute scoring method showed moderate construct validity as four of the seven hypotheses were met. The HPQ using the relative scoring method had weak construct validity as only one of the seven hypotheses were met. The absolute scoring method for the HPQ is superior in construct validity to the relative scoring method in assessing work performance among people with arthritis and related rheumatic conditions; however, more research is needed to further explore other psychometric properties of the HPQ.
Investigating the mental abilities of rural Zulu primary school children in South Africa.

PubMed

Jinabhai, C C; Taylor, M; Rangongo, M F; Mkhize, N J; Anderson, S; Pillay, B J; Sullivan, K R

2004-02-01

Maximising the full potential of health and educational interventions in South African schools requires assessment of the current level of mental abilities of the school children as measured by cognitive and scholastic tests and the identification of any barriers to improved performance. This study reports on the application and interpretation of a selected battery of mental ability tests among Zulu school children and the methodological and analytical issues that need to be addressed. The test scores of 806 primary school children from a rural community are presented, based on four tests: Raven's Coloured Progressive Matrices (CPM), an Auditory Verbal Learning Test (AVLT), the Symbol Digit Modalities Test (SDMT) and Young's Group Mathematics Test (GMT). Significant gender differences were found in the test scores, and the mean scores of Zulu children in this study were lower than those reported in other studies. The results of this selected test battery provide data for the further development of appropriate test instruments for South African conditions. These results can contribute towards the development of a test battery for South African children that can be used to assess and improve their school performance.
PERFORMANCE OF TWO DIFFERENT CLINICAL SCORING SYSTEMS IN DIAGNOSING DISTAL SENSORY POLYNEUROPATHY IN PATIENTS WITH TYPE-2 DIABETES.

PubMed

Khan, Fehmeda Farrukh; Numan, Ahsan; Khawaja, Khadija Irfan; Atif, Ali; Fatima, Aziz; Masud, Faisal

2015-01-01

Early diagnosis of distal peripheral neuropathy (DSPN) the commonest diabetes complications, helps prevent significant morbidity. Clinical parameters are useful for detection, but subjectivity and lack of operator proficiency often results in inaccuracies. Comparative diagnostic accuracy of Diabetic Neuropathy Symptom (DNS) score and Diabetic Neuropathy Examination (DNE) score in detecting DSPN confirmed by nerve conduction studies (NCS) has not been evaluated. This study compares the performance of these scores in predicting the presence of electro physiologically proven DSPN. The objective of this, study was to compare the diagnostic accuracy of DNS and DNE scores in detecting NCS proven DSPN in type-2 diabetics, and to determine the frequency of sub-clinical DSPN among type-2 diabetics. In this cross-sectional study the DNS score and DNE score were determined in 110 diagnosed type-2 diabetic patients. NCS were carried out and amplitudes, velocities and latencies of sensory and motor nerves in lower limb were recorded. Comparison between the two clinical diagnostic modalities and NCS using Pearson's chi square test showed a significant association between NCS and DNE scores (p-value =.003, specificity 93%). The DNS score performed poorly in comparison (p-value = .068, specificity 77%). When the two scores were taken in combination the specificity in diagnosing DSPN was greater (p-value = .018, specificity 96%) than either alone. 33% of patients had subclinical neuropathy. DNE score alone and in combination with DNS score is reliable in predicting DSPN and is more specific than DNS score in evaluating DSPN. Both tests lack sensitivity. Patients without any evidence of clinical neuropathy manifest abnormalities on NCS.

Exploration of Analysis Methods for Diagnostic Imaging Tests: Problems with ROC AUC and Confidence Scores in CT Colonography

PubMed Central

Mallett, Susan; Halligan, Steve; Collins, Gary S.; Altman, Doug G.

2014-01-01

Background Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. Methods In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Results Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. Conclusions The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests. PMID:25353643
Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

PubMed

Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

2014-01-01

Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.
Intuitive Sense of Number Correlates With Math Scores on College-Entrance Examination

PubMed Central

Libertus, Melissa E.; Odic, Darko; Halberda, Justin

2012-01-01

Many educated adults possess exact mathematical abilities in addition to an approximate, intuitive sense of number, often referred to as the Approximate Number System (ANS). Here we investigate the link between ANS precision and mathematics performance in adults by testing participants on an ANS-precision test and collecting their scores on the Scholastic Aptitude Test (SAT), a standardized college-entrance exam in the USA. In two correlational studies, we found that ANS precision correlated with SAT-Quantitative (i.e., mathematics) scores. This relationship remained robust even when controlling for SAT-Verbal scores, suggesting a small but specific relationship between our primitive sense for number and formal mathematical abilities. PMID:23098904
PubMed

de Quadros, Ronice Müller; Cruz, Carina Rebello; Pizzio, Aline Lemos

2012-01-01

This study compared the performance in phonological memory tasks of bimodal bilíngual hearing children (children of deaf parents) and deaf children with cochlear implant (children of deaf parents and hearing parents), with different contexts of access to Brazilian Sign Language (Libras). We used two tests: Portuguêse Pseudowords (Santos and Bueno, 2003) and Libras Pseudosigns (developed by researchers from Development Bimodal Bilíngual Project). Moreover, we included two control groups, one of deaf children, growing up with Libras, with deaf parents, and the other of hearing adults Codas, bimodal bilínguals, with deaf parents. In the analysis of the results, initially, in regard to the performance among the groups tested, it was found that the bimodal bilíngual children had higher scores in both tests. However, when we analyzed the performance of the deaf child with cochlear implant, with deaf parents, with full access to sign language, compared to the other children with cochlear implant, with restricted access to Libras, we found that this child has a similar performance to the Coda children. The cochlear-implanted children with restricted access to Libras, therefore with more access to Portuguêse, had lower scores in both tests, being the worst score for the Portuguêse test. The results shown that children with cochlear implant can have benefits when they have access to Libras, having similar performances to hearing bimodal bilíngual children.
Test anxiety and performance-avoidance goals explain gender differences in SAT-V, SAT-M, and overall SAT scores.

PubMed

Hannon, Brenda

2012-11-01

This study uses analysis of co-variance in order to determine which cognitive/learning (working memory, knowledge integration, epistemic belief of learning) or social/personality factors (test anxiety, performance-avoidance goals) might account for gender differences in SAT-V, SAT-M, and overall SAT scores. The results revealed that none of the cognitive/learning factors accounted for gender differences in SAT performance. However, the social/personality factors of test anxiety and performance-avoidance goals each separately accounted for all of the significant gender differences in SAT-V, SAT-M, and overall SAT performance. Furthermore, when the influences of both of these factors were statistically removed simultaneously, all non-significant gender differences reduced further to become trivial by Cohen's (1988) standards. Taken as a whole, these results suggest that gender differences in SAT-V, SAT-M, and overall SAT performance are a consequence of social/learning factors.
Clock Drawing as a Screen for Impaired Driving in Aging and Dementia: Is It Worth the Time?

PubMed Central

Manning, Kevin J.; Davis, Jennifer D.; Papandonatos, George D.; Ott, Brian R.

2014-01-01

Clock drawing is recommended by medical and transportation authorities as a screening test for unsafe drivers. The objective of the present study was to assess the usefulness of different clock drawing systems as screening measures of driving performance in 122 healthy and cognitively impaired older drivers. Clock drawing was measured using four different scoring systems. Driving outcomes included global ratings of safety and the error rate on a standardized on-road test. Findings revealed that clock drawing was significantly correlated with the driving score on the road test for each of the scoring systems. However, receiver operator curve analyses showed limited clinical utility for clock drawing as a screening instrument for impaired on-road driving performance with the area under the curve ranging from 0.53 to 0.61. Results from this study indicate that clock drawing has limited utility as a solitary screening measure of on-road driving, even when considering a variety of scoring approaches. PMID:24296110
Clock drawing as a screen for impaired driving in aging and dementia: is it worth the time?

PubMed

Manning, Kevin J; Davis, Jennifer D; Papandonatos, George D; Ott, Brian R

2014-02-01

Clock drawing is recommended by medical and transportation authorities as a screening test for unsafe drivers. The objective of the present study was to assess the usefulness of different clock drawing systems as screening measures of driving performance in 122 healthy and cognitively impaired older drivers. Clock drawing was measured using four different scoring systems. Driving outcomes included global ratings of safety and the error rate on a standardized on-road test. Findings revealed that clock drawing was significantly correlated with the driving score on the road test for each of the scoring systems. However, receiver operator curve analyses showed limited clinical utility for clock drawing as a screening instrument for impaired on-road driving performance with the area under the curve ranging from 0.53 to 0.61. Results from this study indicate that clock drawing has limited utility as a solitary screening measure of on-road driving, even when considering a variety of scoring approaches.
Relationships between Continuous Performance Task Scores and Other Cognitive Measures: Causality or Commonality?

ERIC Educational Resources Information Center

Aylward, Glen P.; Gordon, Michael; Verhulst, Steven J.

1997-01-01

Relationships among continuous performance test (CPT), IQ, achievement, and memory/learning scores were explored for 1,280 children about 9 years old. Associations among the CPT measures and various cognitive/academic tasks suggest that all require attention and inhibition. The importance of assessing attention and disinhibition in psychological…
Assessing students' conceptual knowledge of electricity and magnetism

NASA Astrophysics Data System (ADS)

McColgan, Michele W.; Finn, Rose A.; Broder, Darren L.; Hassel, George E.

2017-12-01

We present the Electricity and Magnetism Conceptual Assessment (EMCA), a new assessment aligned with second-semester introductory physics courses. Topics covered include electrostatics, electric fields, circuits, magnetism, and induction. We have two motives for writing a new assessment. First, we find other assessments such as the Brief Electricity and Magnetism Assessment and the Conceptual Survey on Electricity and Magnetism not well aligned with the topics and content depth of our courses. We want to test introductory physics content at a level appropriate for our students. Second, we want the assessment to yield scores and gains comparable to the widely used Force Concept Inventory (FCI). After five testing and revision cycles, the assessment was finalized in early 2015 and is available online. We present performance results for a cohort of 225 students at Siena College who were enrolled in our algebra- and calculus-based physics courses during the spring 2015 and 2016 semesters. We provide pretest, post-test, and gain analyses, as well as individual question and whole test statistics to quantify difficulty and reliability. In addition, we compare EMCA and FCI scores and gains, and we find that students' FCI scores are strongly correlated with their performance on the EMCA. Finally, the assessment was piloted in an algebra-based physics course at George Washington University (GWU). We present performance results for a cohort of 130 GWU students and we find that their EMCA scores are comparable to the scores of students in our calculus-based physics course.
Comparison of virtual patient simulation with mannequin-based simulation for improving clinical performances in assessing and managing clinical deterioration: randomized controlled trial.

PubMed

Liaw, Sok Ying; Chan, Sally Wai-Chi; Chen, Fun-Gee; Hooi, Shing Chuan; Siau, Chiang

2014-09-17

Virtual patient simulation has grown substantially in health care education. A virtual patient simulation was developed as a refresher training course to reinforce nursing clinical performance in assessing and managing deteriorating patients. The objective of this study was to describe the development of the virtual patient simulation and evaluate its efficacy, by comparing with a conventional mannequin-based simulation, for improving the nursing students' performances in assessing and managing patients with clinical deterioration. A randomized controlled study was conducted with 57 third-year nursing students who were recruited through email. After a baseline evaluation of all participants' clinical performance in a simulated environment, the experimental group received a 2-hour fully automated virtual patient simulation while the control group received 2-hour facilitator-led mannequin-based simulation training. All participants were then re-tested one day (first posttest) and 2.5 months (second posttest) after the intervention. The participants from the experimental group completed a survey to evaluate their learning experiences with the newly developed virtual patient simulation. Compared to their baseline scores, both experimental and control groups demonstrated significant improvements (P<.001) in first and second post-test scores. While the experimental group had significantly lower (P<.05) second post-test scores compared with the first post-test scores, no significant difference (P=.94) was found between these two scores for the control group. The scores between groups did not differ significantly over time (P=.17). The virtual patient simulation was rated positively. A virtual patient simulation for a refreshing training course on assessing and managing clinical deterioration was developed. Although the randomized controlled study did not show that the virtual patient simulation was superior to mannequin-based simulation, both simulations have demonstrated to be effective refresher learning strategies for improving nursing students' clinical performance. Given the greater resource requirements of mannequin-based simulation, the virtual patient simulation provides a more promising alternative learning strategy to mitigate the decay of clinical performance over time.
Validity of the Test of Infant Motor Performance for prediction of 6-, 9- and 12-month scores on the Alberta Infant Motor Scale.

PubMed

Campbell, Suzann K; Kolobe, Thubi H A; Wright, Benjamin D; Linacre, John Michael

2002-04-01

The Test of Infant Motor Performance (TIMP) is a test of functional movement in infants from 32 weeks' post-conceptional age to 4 months postterm. The purpose of this study was to assess in 96 infants (44 females, 52 males) with varying risk, the relation between measures on the TIMP at 7, 30, 60, and 90 days after term age and percentile ranks (PR) on the Alberta Infant Motor Scale (AIMS). Correlation between scores on the TIMP and the AIMS was highest for TIMP tests at 90 days and AIMS testing at 6 months (r=0.67, p=0.0001), but all comparisons were statistically significant except those between the TIMP at 7 days and AIMS PR at 9 months. In a multiple regression analysis combining a perinatal risk score and 7-day TIMP measures to predict 12-month AIMS PR, risk, but not TIMP, predicted outcome (21% of variance explained). At older ages TIMP measures made increasing contributions to prediction of 12-month AIMS PR (30% of variance explained by 90-day TIMP). The best TIMP score to maximize specificity and correctly identify 84% of the infants above versus below the 10th PR at 6 months was a cut-off point of 1 SD below the mean. The same cut-off point correctly identified 88% of the infants at 12 months. A cut-off of -0.5 SD, however, maximized sensitivity at 92%. A negative test result, i.e. score above -0.5 SD at 3 months, carried only a 2% probability of a poor 12-month outcome. We conclude that TIMP scores significantly predict AIMS PR 6 to 12 months later, but the TIMP at 3 months of age has the greatest degree of validity for predicting motor performance on the AIMS at 12 months and can be used clinically to identify infants likely to benefit from intervention.
Influence of Movement Quality on Heart Rate While Performing the Dance-Specific Aerobic Fitness Test (DAFT) in Preprofessional Contemporary Dancers.

PubMed

Tiemens, Annemiek; van Rijn, Rogier M; Wyon, Matthew A; Redding, Emma; Stubbe, Janine H

2018-06-01

To explore whether movement quality has influence on heart rate (HR) frequency during the dance-specific aerobic fitness test (DAFT). Thirteen contemporary university dance students (age 19 ± 1.46 yrs) underwent two trials performing the DAFT while wearing a Polar HR monitor (Kempele, Finland). During the first trial, dancers were asked to perform the movements as if they were performing on stage, whereas during the second trial, standardized verbal instructions were given to reduce the quality of movement (e.g., no need to perform technically correct pliés). The variables measured at each trial were HR for all five stages of the DAFT and HR recovery (1 and 2 min after finishing the DAFT), movement quality (MQ) score, and rate of perceived exertion score (RPE). There were significant differences in HR between Trial 1 and Trial 2. For all stages and the resting period, HR was lower during Trial 2 (p<0.001). Also, the RPE score was significantly lower and the MQ score was significantly higher, indicating a poorer performance, during Trial 2 (both p<0.001). The results suggest that DAFT performance with lower movement quality elicits lower HR frequency and RPE during the DAFT. We recommend that specific instructions be given to participants about executing the movement sequence during the DAFT before testing commences. Also, movement quality must be taken into account when interpreting HR results from the DAFT in order to distinguish if a dancer's low HR results from good aerobic fitness or from poor performance of the movement sequence.
The impact of prematurity and maternal socioeconomic status and education level on achievement-test scores up to 8th grade.

PubMed

ElHassan, Nahed O; Bai, Shasha; Gibson, Neal; Holland, Greg; Robbins, James M; Kaiser, Jeffrey R

2018-01-01

The relative influence of prematurity vs. maternal social factors (socioeconomic status and education level) on academic performance has rarely been examined. To examine the impact of prematurity and maternal social factors on academic performance from 3rd through 8th grade. We conducted a retrospective cohort study of infants born in 1998 at the University of Arkansas for Medical Sciences. The study sample included 58 extremely low gestational age newborns (ELGANs, 23‒<28 weeks), 171 preterm (≥28‒<34 weeks), 228 late preterm (≥34‒<37 weeks), and 967 term ((≥37‒<42 weeks) infants. Neonatal and maternal variables were collected including maternal insurance status (proxy measure for socioeconomic status) and education level. The primary outcomes were literacy and mathematics achievement-test scores from 3rd through 8th grade. Linear mixed models were used to identify significant predictors of academic performance. All two-way interactions between grade level, gestational-age (GA) groups, and social factors were tested for statistical significance. Prematurity, social factors, gender, race, gravidity, and Apgar score at one minute were critical determinants of academic performance. Favorable social factors were associated with a significant increase in both literacy and mathematic scores, while prematurity was associated with a significant decrease in mathematic scores. Examination of GA categories and social factors interaction suggested that the impact of social factors on test scores was similar for all GA groups. Furthermore, the impact of social factors varied from grade to grade for literacy, while the influence of either GA groups or social factors was constant across grades for mathematics. For example, an ELGAN with favorable social factors had a predicted literacy score 104.1 (P <.001), 98.2 (P <.001), and 76.4 (P <.01) points higher than an otherwise similar disadvantaged term infant at grades 3, 5, and 8, respectively. The difference in their predicted mathematic scores was 33.4 points for all grades (P <.05). While there were significant deficits in academic performance for ELGANs compared to PT, LPT, and term infants, the deficit could be offset by higher SES and better-educated mothers. These favorable social factors were critical to a child's academic achievement. The role of socioeconomic factors should be incorporated in discussions on outcome with families of preterm infants.
Health Behaviors and Standardized Test Scores: The Impact of School Health Climate on Performance

ERIC Educational Resources Information Center

Gunter, Whitney D.; Daly, Kevin

2013-01-01

Research has found that many characteristics are related to performance on standardized tests. Many of these are not necessarily "academic" attributes. One area of this research is on the connection between physical health or lifestyles and test performance. The research that exists in this area is often disconnected with each other and…
Predicting better performance on a college preparedness test from narrative comprehension at the age of 6 years: An fMRI study.

PubMed

Horowitz-Kraus, Tzipi; Eaton, Kenneth; Farah, Rola; Hajinazarian, Ardag; Vannest, Jennifer; Holland, Scott K

2015-12-10

To investigate whether high performance on college preparedness tests at 18 years of age can be predicted from brain activation patterns during narrative comprehension at 5-7 years of age. In this longitudinal study, functional MRI data during an auditory narrative-comprehension task were acquired from 15 children (5-7 years of age) who also provided their American College Testing (ACT) scores at the age of 18 years. Active voxels during the narrative-comprehension task were correlated with both composite ACT scores and the reading-comprehension component of the exam. Higher composite ACT scores and behavioral scores for reading comprehension were positively correlated with greater activation in frontal and anterior brain regions during the narrative-comprehension task. Our results suggest that neural circuits supporting higher ACT performance are predictable from a narrative-comprehension task at the age of 5-7 years. This supports a critical role for the anterior cingulate cortex, which is a part of the cingulo-opercular cognitive-control network early in development, as a facilitator for better ACT scores. This study highlights that shared neural circuits that support overall ACT performance and neural circuits that support reading comprehension both rely on neural circuits related to narrative comprehension in childhood, suggesting that interventions involving narrative comprehension should be considered for individuals with reading and other academic difficulties. Copyright © 2015 Elsevier B.V. All rights reserved.
Effects of Concept Map Extraction and a Test-Based Diagnostic Environment on Learning Achievement and Learners' Perceptions

ERIC Educational Resources Information Center

Lin, Yu-Shih; Chang, Yi-Chun; Liew, Keng-Hou; Chu, Chih-Ping

2016-01-01

Computerised testing and diagnostics are critical challenges within an e-learning environment, where the learners can assess their learning performance through tests. However, a test result based on only a single score is insufficient information to provide a full picture of learning performance. In addition, because test results implicitly…
Predicting clinical concussion measures at baseline based on motivation and academic profile.

PubMed

Trinidad, Katrina J; Schmidt, Julianne D; Register-Mihalik, Johna K; Groff, Diane; Goto, Shiho; Guskiewicz, Kevin M

2013-11-01

The purpose of this study was to predict baseline neurocognitive and postural control performance using a measure of motivation, high school grade point average (hsGPA), and Scholastic Aptitude Test (SAT) score. Cross-sectional. Clinical research center. Eighty-eight National Collegiate Athletic Association Division I incoming student-athletes (freshman and transfers). Participants completed baseline clinical concussion measures, including a neurocognitive test battery (CNS Vital Signs), a balance assessment [Sensory Organization Test (SOT)], and motivation testing (Rey Dot Counting). Participants granted permission to access hsGPA and SAT total score. Standard scores for each CNS Vital Signs domain and SOT composite score. Baseline motivation, hsGPA, and SAT explained a small percentage of the variance of complex attention (11%), processing speed (12%), and composite SOT score (20%). Motivation, hsGPA, and total SAT score do not explain a significant amount of the variance in neurocognitive and postural control measures but may still be valuable to consider when interpreting neurocognitive and postural control measures.
Validation of the UCSD Performance-based Skills Assessment (UPSA) in Hispanics with and without schizophrenia.

PubMed

Mausbach, Brent T; Tiznado, Denisse; Cardenas, Veronica; Jeste, Dilip V; Patterson, Thomas L

2016-10-30

The UCSD Performance-based Skills Assessment (UPSA) is a widely used measure of functional capacity with strong reliability and validity. However there is a lack of psychometric data on Hispanics. The purpose of this study was to determine the impact of acculturation and education on UPSA performance among 62 Hispanic participants with schizophrenia or schizoaffective disorder and 46 healthy comparison subjects. Functional capacity was measured using the UPSA. Acculturation was measured using the Acculturation Rating Scale for Mexican Americans (ARSMA). Independent t-tests indicated that participants with schizophrenia had significantly lower UPSA total scores and scored lower on all UPSA sub-scales relative to the comparison group. Multiple regression also indicated that education and acculturation were significant predictors of UPSA total scores. These data provide a better understanding of UPSA scores in Hispanics with and without schizophrenia, and suggest that education and acculturation adjustments may be required to improve interpretation of test results. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Conceptual Scoring and Classification Accuracy of Vocabulary Testing in Bilingual Children

ERIC Educational Resources Information Center

Anaya, Jissel B.; Peña, Elizabeth D.; Bedore, Lisa M.

2018-01-01

Purpose: This study examined the effects of single-language and conceptual scoring on the vocabulary performance of bilingual children with and without specific language impairment. We assessed classification accuracy across 3 scoring methods. Method: Participants included Spanish-English bilingual children (N = 247) aged 5;1 (years;months) to…
Cognitive performance and psychosocial functioning in patients with bipolar disorder, unaffected siblings, and healthy controls.

PubMed

Vasconcelos-Moreno, Mirela P; Bücker, Joana; Bürke, Kelen P; Czepielewski, Leticia; Santos, Barbara T; Fijtman, Adam; Passos, Ives C; Kunz, Mauricio; Bonnín, Caterina Del Mar; Vieta, Eduard; Kapczinski, Flavio; Rosa, Adriane R; Kauer-Sant'Anna, Marcia

2016-01-01

To assess cognitive performance and psychosocial functioning in patients with bipolar disorder (BD), in unaffected siblings, and in healthy controls. Subjects were patients with BD (n=36), unaffected siblings (n=35), and healthy controls (n=44). Psychosocial functioning was accessed using the Functioning Assessment Short Test (FAST). A sub-group of patients with BD (n=21), unaffected siblings (n=14), and healthy controls (n=22) also underwent a battery of neuropsychological tests: California Verbal Learning Test (CVLT), Stroop Color and Word Test, and Wisconsin Card Sorting Test (WCST). Clinical and sociodemographic characteristics were analyzed using one-way analysis of variance or the chi-square test; multivariate analysis of covariance was used to examine differences in neuropsychological variables. Patients with BD showed higher FAST total scores (23.90±11.35) than healthy controls (5.86±5.47; p < 0.001) and siblings (12.60±11.83; p 0.001). Siblings and healthy controls also showed statistically significant differences in FAST total scores (p = 0.008). Patients performed worse than healthy controls on all CVLT sub-tests (p < 0.030) and in the number of correctly completed categories on WCST (p = 0.030). Siblings did not differ from healthy controls in cognitive tests. Unaffected siblings of patients with BD may show poorer functional performance compared to healthy controls. FAST scores may contribute to the development of markers of vulnerability and endophenotypic traits in at-risk populations.

Testing to the Top: Everything But the Kitchen Sink?

ERIC Educational Resources Information Center

Dietel, Ron

2011-01-01

Two tests intended to measure student achievement of the Common Core State Standards will face intense scrutiny, but the test makers say they will include performance assessments and other items that are not multiple-choice questions. Incorporating performance items on this tests will bring up issues over scoring, costs, and validity.
On-road driving impairments and associated cognitive deficits after stroke.

PubMed

Devos, Hannes; Tant, Mark; Akinwuntan, Abiodun E

2014-01-01

Little is known about the critical on-road driving skills that get affected after a stroke. The purpose of this study was to investigate the key on-road driving impairments and their associated cognitive deficits after a stroke. A second aim was to investigate if lateralization of stroke impacts results of the cognitive and on-road driving tests. In this cross-sectional study, 99 participants with a first-ever stroke who were actively driving prior to stroke underwent a cognitive battery and a standardized road test that evaluated 13 specific on-road driving skills. These on-road driving skills were mapped onto an existing, theoretical framework that categorized the on-road items into hierarchic clusters of operational, tactical, visuo-integrative, and mixed driving skills. The total score on the road test and the on-road decision, made by a certified fitness-to-drive expert, decided the main outcome. The critical on-road driving skills predicting the on-road decision were identified using logistic regression analysis. Linear regression analysis was employed to determine the cognitive impairments leading to poor total on-road scores. Analyses were repeated for right- and left-sided strokes. In all, 37 persons scored poorly on the road test. These participants performed worse in all hierarchic clusters of on-road driving. Performances on the operational cluster and the visuo-integrative cluster best predicted on-road decisions (R(2) = 0.60). 'Lane changing' and 'understanding, insight, and quality of traffic participation' were the critical skill deficits leading to poor performance on the road test (R(2) = 0.65). Divided attention was the main determinant of on-road scores in the total group (R(2) = 0.06). Participants with right-sided stroke performed worse on visual field, visual neglect, visual scanning, visuo-constructive skills, and divided attention compared with those with left-sided stroke. Divided attention was the main determinant of total on-road scores in the right-sided stroke group (R(2) = 0.10). A combination of visual scanning, speed of processing, and executive dysfunction yielded the best model to predict on-road scores in left-sided strokes (R(2) = 0.46). Poor performance in the road test after stroke is determined by critical operational and visuo-integrative driving impairments. Specific and different driving evaluation and training programs are needed for right- and left-sided strokes. © 2014 S. Karger AG, Basel.
Relationships between postural orientation and self reported function, hop performance and muscle power in subjects with anterior cruciate ligament injury.

PubMed

Trulsson, Anna; Roos, Ewa M; Ageberg, Eva; Garwicz, Martin

2010-07-01

Injury to the anterior cruciate ligament (ACL) is associated not only with knee instability and impaired neuromuscular control, but also with altered postural orientation manifested as observable "substitution patterns". However, tests currently used to evaluate knee function in subjects with ACL injury are not designed to assess postural orientation. Therefore, we are in the process of developing an observational test set that measures postural orientation in terms of the ability to stabilize body segments in relation to each other and to the environment. The aim of the present study was to characterise correlations between this novel test set, called the Test for Substitution Patterns (TSP) and commonly used tests of knee function. In a blinded set-up, 53 subjects (mean age 30 years, range 20-39, with 2-5 years since ACL injury) were assessed using the TSP, the Knee Injury and Osteoarthritis Outcome Score subscale sport/recreation (KOOS sport/rec), 3 hop tests and 3 muscle power tests. Correlations between the scores of the TSP and the other tests were determined. Moderate correlations were found between TSP scores and KOOS sport/rec (rs = -0.43; p = 0.001) and between TSP scores and hop test results (rs = -0.40 to -0.46; p < or = 0.003), indicating that altered postural orientation was associated with worse self-reported KOOS sport/rec function and worse hop performance. No significant correlations were found between TSP scores and muscle power results. Subjects had higher TSP scores on their injured side than on their uninjured side (median 4 and 1 points; interquartile range 2-6 and 0-1.5, respectively; p < 0.0001). We conclude that the Test for Substitution Patterns is of relevance to the patient and measures a specific aspect of neuromuscular control not quantified by the other tests investigated. We suggest that the TSP may be a valuable complement in the assessment of neuromuscular control in the rehabilitation of subjects with ACL injury.
Men and Women: Equal in Accounting?

ERIC Educational Resources Information Center

Park, L. Jane; And Others

1994-01-01

Data from 131 male and 177 female accounting students were derived from test scores, student estimation of performance, California Psychological Inventory, and State-Trait Anxiety Inventory. No significant differences between the sexes appeared in test performance or grade point average. Psychological tests showed male self-perceptions to be…
The King-Devick test and sports-related concussion: study of a rapid visual screening tool in a collegiate cohort.

PubMed

Galetta, Kristin M; Brandes, Lauren E; Maki, Karl; Dziemianowicz, Mark S; Laudano, Eric; Allen, Megan; Lawler, Kathy; Sennett, Brian; Wiebe, Douglas; Devick, Steve; Messner, Leonard V; Galetta, Steven L; Balcer, Laura J

2011-10-15

Concussion, defined as an impulse blow to the head or body resulting in transient neurologic signs or symptoms, has received increasing attention in sports at all levels. The King-Devick (K-D) test is based on the time to perform rapid number naming and captures eye movements and other correlates of suboptimal brain function. In a study of boxers and mixed martial arts (MMA) fighters, the K-D test was shown to have high degrees of test-retest and inter-rater reliability and to be an accurate method for rapidly identifying boxers and mixed martial arts fighters with concussion. We performed a study of the K-D test as a rapid sideline screening tool in collegiate athletes to determine the effect of concussion on K-D scores compared to a pre-season baseline. In this longitudinal study, athletes from the University of Pennsylvania varsity football, sprint football, and women's and men's soccer and basketball teams underwent baseline K-D testing prior to the start of the 2010-11 playing season. Post-season testing was also performed. For athletes who had concussions during the season, K-D testing was administered immediately on the sidelines and changes in score from baseline were determined. Among 219 athletes tested at baseline, post-season K-D scores were lower (better) than the best pre-season scores (35.1 vs. 37.9s, P=0.03, Wilcoxon signed-rank test), reflecting mild learning effects in the absence of concussion. For the 10 athletes who had concussions, K-D testing on the sidelines showed significant worsening from baseline (46.9 vs. 37.0s, P=0.009), with all except one athlete demonstrating worsening from baseline (median 5.9s). This study of collegiate athletes provides initial evidence in support of the K-D test as a strong candidate rapid sideline visual screening tool for concussion. Data show worsening of scores following concussion, and ongoing follow-up in this study with additional concussion events and different athlete populations will further examine the effectiveness of the K-D test. Copyright © 2011 Elsevier B.V. All rights reserved.
Accounting for estimated IQ in neuropsychological test performance with regression-based techniques.

PubMed

Testa, S Marc; Winicki, Jessica M; Pearlson, Godfrey D; Gordon, Barry; Schretlen, David J

2009-11-01

Regression-based normative techniques account for variability in test performance associated with multiple predictor variables and generate expected scores based on algebraic equations. Using this approach, we show that estimated IQ, based on oral word reading, accounts for 1-9% of the variability beyond that explained by individual differences in age, sex, race, and years of education for most cognitive measures. These results confirm that adding estimated "premorbid" IQ to demographic predictors in multiple regression models can incrementally improve the accuracy with which regression-based norms (RBNs) benchmark expected neuropsychological test performance in healthy adults. It remains to be seen whether the incremental variance in test performance explained by estimated "premorbid" IQ translates to improved diagnostic accuracy in patient samples. We describe these methods, and illustrate the step-by-step application of RBNs with two cases. We also discuss the rationale, assumptions, and caveats of this approach. More broadly, we note that adjusting test scores for age and other characteristics might actually decrease the accuracy with which test performance predicts absolute criteria, such as the ability to drive or live independently.
A Study on the Impact of Fatigue on Human Raters When Scoring Speaking Responses

ERIC Educational Resources Information Center

Ling, Guangming; Mollaun, Pamela; Xi, Xiaoming

2014-01-01

The scoring of constructed responses may introduce construct-irrelevant factors to a test score and affect its validity and fairness. Fatigue is one of the factors that could negatively affect human performance in general, yet little is known about its effects on a human rater's scoring quality on constructed responses. In this study, we compared…
Performance Comparison of Student-Athletes and General College Students on the Functional Movement Screen and the Y Balance Test.

PubMed

Engquist, Katherine D; Smith, Craig A; Chimera, Nicole J; Warren, Meghan

2015-08-01

Although various studies have assessed performance of athletes on the Functional Movement Screen (FMS) and the Y Balance Test (YBT), no study to date has directly evaluated a comparison of performance between athletes and members of the general population. Thus, to better understand the application of the FMS and the YBT to general college students, this study examined whether or not general college students performed similarly to student-athletes on the FMS (composite and movement pattern scores) and the YBT (composite and reach directions). This study evaluated 167 Division I student-athletes and 103 general college students from the same university on the FMS and the YBT. No difference was found in FMS composite scores between student-athletes and general college students. For FMS movement patterns, female student-athletes scored higher than general college students in the deep squat. No difference was found for men in any FMS movement pattern. Female student-athletes scored higher than female general college students in YBT composite scores; no difference was found for men in YBT composite scores. In analysis of YBT reach directions, female student-athletes scored higher than female general college students in all reach directions, whereas no difference was found in men. Existing research on the FMS composite score in athletic populations may apply to a general college population for the purposes of preparticipation screening, injury prediction, etc. Existing research on the YBT in male athletic populations is expected to apply equally to general college males for the purposes of preparticipation screening, injury prediction, etc.
Counting-backward test for executive function in idiopathic normal pressure hydrocephalus.

PubMed

Kanno, S; Saito, M; Hayashi, A; Uchiyama, M; Hiraoka, K; Nishio, Y; Hisanaga, K; Mori, E

2012-10-01

The aim of this study was to develop and validate a bedside test for executive function in patients with idiopathic normal pressure hydrocephalus (INPH). Twenty consecutive patients with INPH and 20 patients with Alzheimer's disease (AD) were enrolled in this study. We developed the counting-backward test for evaluating executive function in patients with INPH. Two indices that are considered to be reflective of the attention deficits and response suppression underlying executive dysfunction in INPH were calculated: the first-error score and the reverse-effect index. Performance on both the counting-backward test and standard neuropsychological tests for executive function was assessed in INPH and AD patients. The first-error score, reverse-effect index and the scores from the standard neuropsychological tests for executive function were significantly lower for individuals in the INPH group than in the AD group. The two indices for the counting-backward test in the INPH group were strongly correlated with the total scores for Frontal Assessment Battery and Phonemic Verbal Fluency. The first-error score was also significantly correlated with the error rate of the Stroop colour-word test and the score of the go/no-go test. In addition, we found that the first-error score highly distinguished patients with INPH from those with AD using these tests. The counting-backward test is useful for evaluating executive dysfunction in INPH and for differentiating between INPH and AD patients. In particular, the first-error score may reflect deficits in the response suppression related to executive dysfunction in INPH. © 2012 John Wiley & Sons A/S.
[Lack of correlation between performances in a simulator and in reality].

PubMed

Konge, Lars; Bitsch, Mikael

2010-12-13

Simulation-based training provides obvious benefits for patients and doctors in education. Frequently, virtual reality simulators are expensive and evidence for their efficacy is poor, particularly as a result of studies with poor methodology and few test participants. In medical simulated training- and evaluation programmes it is always a question of transfer to the real clinical world. To illustrate this problem a study comparing the test performance of persons on a bowling simulator with their performance in a real bowling alley was conducted. Twenty-five test subjects played two rounds of bowling on a Nintendo Wii and 25 days later on a real bowling alley. Correlations of the scores in the first and second round (test-retest-reliability) and of the scores on the simulator and in reality (criterion validation) were studied and there was tested for any difference between female and male performance. The intraclass correlation coefficient equalled 0.76, i.e. the simulator fairly accurately measured participant performance. In contrast to this there was absolutely no correlation between participants' real bowling abilities and their scores on the simulator (Pearson's r = 0.06). There was no significant difference between female and male abilities. Simulation-based testing and training must be based on evidence. More studies are needed to include an adequate number of subjects. Bowling competence should not be based on Nintendo Wii measurements. Simulated training- and evaluation programmes should be validated before introduction, to ensure consistency with the real world.
Qualitative and quantitative outcomes of audience response systems as an educational tool in a plastic surgery residency program.

PubMed

Arneja, Jugpal S; Narasimhan, Kailash; Bouwman, David; Bridge, Patrick D

2009-12-01

In-training evaluations in graduate medical education have typically been challenging. Although the majority of standardized examination delivery methods have become computer-based, in-training examinations generally remain pencil-paper-based, if they are performed at all. Audience response systems present a novel way to stimulate and evaluate the resident-learner. The purpose of this study was to assess the outcomes of audience response systems testing as compared with traditional testing in a plastic surgery residency program. A prospective 1-year pilot study of 10 plastic surgery residents was performed using audience response systems-delivered testing for the first half of the academic year and traditional pencil-paper testing for the second half. Examination content was based on monthly "Core Quest" curriculum conferences. Quantitative outcome measures included comparison of pretest and posttest and cumulative test scores of both formats. Qualitative outcomes from the individual participants were obtained by questionnaire. When using the audience response systems format, pretest and posttest mean scores were 67.5 and 82.5 percent, respectively; using traditional pencil-paper format, scores were 56.5 percent and 79.5 percent. A comparison of the cumulative mean audience response systems score (85.0 percent) and traditional pencil-paper score (75.0 percent) revealed statistically significantly higher scores with audience response systems (p = 0.01). Qualitative outcomes revealed increased conference enthusiasm, greater enjoyment of testing, and no user difficulties with the audience response systems technology. The audience response systems modality of in-training evaluation captures participant interest and reinforces material more effectively than traditional pencil-paper testing does. The advantages include a more interactive learning environment, stimulation of class participation, immediate feedback to residents, and immediate tabulation of results for the educator. Disadvantages include start-up costs and lead-time preparation.
Relationship Between Cognitive Assessment and Balance Measures in Adolescents Referred for Vestibular Physical Therapy After Concussion

PubMed Central

Alsalaheen, Bara A.; Whitney, Susan L.; Marchetti, Gregory F.; Furman, Joseph M.; Kontos, Anthony P.; Collins, Michael W.; Sparto, Patrick J.

2016-01-01

Objective To examine the relationship between cognitive and balance performance in adolescents with concussion. Design Retrospective case series. Setting Tertiary. Patients Sixty patients. Interventions Correlation analyses were performed to describe the relationship between symptoms, cognitive measure, and balance measure at the time of initiation of vestibular physical therapy. Main Outcome Measures Cognitive performance was assessed using the Immediate Post-concussion Assessment and Cognitive Testing (ImPACT). The dizziness and balance function measures included dizziness severity rating, Activities-specific Balance Confidence scale (ABC), Dizziness Handicap Inventory (DHI), Functional Gait Assessment, gait speed, Timed “UP and GO,” Five Times Sit to Stand, and Sensory Organization Test (SOT). To account for multiple comparisons, the False Discovery Rate method was used. Results Performance measures of balance were significantly correlated with cognitive measures. Greater total symptom scores were related to greater impairment in the ABC and DHI (r = 0.35-0.39, P ≤ 0.008) and worse performance in condition 2 of the SOT (r = −0.48, P = 0.004). Among the ImPACT composite scores, lower memory scores were correlated with impaired balance performance measures (r = 0.37-0.59, P ≤ 0.012). Lower visual memory was also correlated with worse ABC scores. Conclusions The significant relationships reported between the cognitive performance scores and balance measures may reflect that similar levels of functioning exist across domains in individuals with protracted recovery who receive vestibular physical therapy. PMID:25706663
What can eye movements tell us about Symbol Digit substitution by patients with schizophrenia?

PubMed

Elahipanah, Ava; Christensen, Bruce K; Reingold, Eyal M

2011-04-01

Substitution tests are sensitive to cognitive impairment and reliably discriminate patients with schizophrenia from healthy individuals better than most other neuropsychological instruments. However, due to their multifaceted nature, substitution test scores cannot pinpoint the specific cognitive deficits that lead to poor performance. The current study investigated eye movements during performance on a substitution test in order to better understand what aspect of substitution test performance underlies schizophrenia-related impairment. Twenty-five patients with schizophrenia and 25 healthy individuals performed a computerized version of the Symbol Digit Modalities Test while their eye movements were monitored. As expected, patients achieved lower overall performance scores. Moreover, analysis of participants' eye movements revealed that patients spent more time searching for the target symbol every time they visited the key area. Patients also made more visits to the key area for each response that they made. Regression analysis suggested that patients' impaired performance on substitution tasks is primarily related to a less efficient visual search and, secondarily, to impaired memory. Copyright © 2010 Elsevier B.V. All rights reserved.
Can dual processing theory explain physics students' performance on the Force Concept Inventory?

NASA Astrophysics Data System (ADS)

Wood, Anna K.; Galloway, Ross K.; Hardy, Judy

2016-12-01

According to dual processing theory there are two types, or modes, of thinking: system 1, which involves intuitive and nonreflective thinking, and system 2, which is more deliberate and requires conscious effort and thought. The Cognitive Reflection Test (CRT) is a widely used and robust three item instrument that measures the tendency to override system 1 thinking and to engage in reflective, system 2 thinking. Each item on the CRT has an intuitive (but wrong) answer that must be rejected in order to answer the item correctly. We therefore hypothesized that performance on the CRT may give useful insights into the cognitive processes involved in learning physics, where success involves rejecting the common, intuitive ideas about the world (often called misconceptions) and instead carefully applying physical concepts. This paper presents initial results from an ongoing study examining the relationship between students' CRT scores and their performance on the Force Concept Inventory (FCI), which tests students' understanding of Newtonian mechanics. We find that a higher CRT score predicts a higher FCI score for both precourse and postcourse tests. However, we also find that the FCI normalized gain is independent of CRT score. The implications of these results are discussed.
A comparative study of students' performance in preclinical physiology assessed by multiple choice and short essay questions.

PubMed

Oyebola, D D; Adewoye, O E; Iyaniwura, J O; Alada, A R; Fasanmade, A A; Raji, Y

2000-01-01

This study was designed to compare the performance of medical students in physiology when assessed by multiple choice questions (MCQs) and short essay questions (SEQs). The study also examined the influence of factors such as age, sex, O/level grades and JAMB scores on performance in the MCQs and SEQs. A structured questionnaire was administered to 264 medical students' four months before the Part I MBBS examination. Apart from personal data of each student, the questionnaire sought information on the JAMB scores and GCE O' Level grades of each student in English Language, Biology, Chemistry, Physics and Mathematics. The physiology syllabus was divided into five parts and the students were administered separate examinations (tests) on each part. Each test consisted of MCQs and SEQs. The performance in MCQs and SEQs were compared. Also, the effects of JAMB scores and GCE O/level grades on the performance in both the MCQs and SEQs were assessed. The results showed that the students performed better in all MCQ tests than in the SEQs. JAMB scores and O' level English Language grade had no significant effect on students' performance in MCQs and SEQs. However O' level grades in Biology, Chemistry, Physics and Mathematics had significant effects on performance in MCQs and SEQs. Inadequate knowledge of physiology and inability to present information in a logical sequence are believed to be major factors contributing to the poorer performance in the SEQs compared with MCQs. In view of the finding of significant association between performance in MCQs and SEQs and GCE O/level grades in science subjects and mathematics, it was recommended that both JAMB results and the GCE results in the four O/level subjects above may be considered when selecting candidates for admission into the medical schools.
Interactive laboratory classes enhance neurophysiological knowledge in Thai medical students.

PubMed

Wongjarupong, Nicha; Niyomnaitham, Danai; Vilaisaktipakorn, Pitchamol; Suksiriworaboot, Tanawin; Qureshi, Shaun Peter; Bongsebandhu-Phubhakdi, Saknan

2018-03-01

Interactive laboratory class (ILC) is a two-way communication teaching method that encourages students to correlate laboratory findings with materials from lectures. In Thai medical education, active learning methods are uncommon. This paper aims to establish 1) if ILCs would effectively promote physiology learning; 2) if effectiveness would be found in both previously academically high-performing and low-performing students; and 3) the acceptability of ILCs to Thai medical students as a novel learning method. Two hundred seventy-eight second-year medical students were recruited to this study. We conducted three ILC sessions, which followed corresponding lectures. We carried out multiple-choice pre- and post-ILC assessments of knowledge and compared by repeated-measures ANOVA and unpaired t-test. Subgroup analysis was performed to compare high-performance (HighP) and low-performance (LowP) students. After the ILCs, participants self-rated their knowledge and satisfaction. Post-ILC test scores increased significantly compared with pre-ILC test scores in all three sessions. Mean scores of each post-ILC test increased significantly from pre-ILC test in both LowP and HighP groups. More students self-reported a "very high" and "high" level of knowledge after ILCs. Most students agreed that ILCs provided more discussion opportunity, motivated their learning, and made lessons more enjoyable. As an adjunct to lectures, ILCs can enhance knowledge in medical students, regardless of previous academic performance. Students perceived ILC as useful and acceptable. This study supports the active learning methods in physiology education, regardless of cultural context.
Modulatory effects of psychopathy on Wisconsin Card Sorting Test performance in male offenders with Antisocial Personality Disorder.

PubMed

Pera-Guardiola, Vanessa; Batalla, Iolanda; Bosque, Javier; Kosson, David; Pifarré, Josep; Hernández-Ribas, Rosa; Goldberg, Ximena; Contreras-Rodríguez, Oren; Menchón, José M; Soriano-Mas, Carles; Cardoner, Narcís

2016-01-30

Neuropsychological deficits in executive functions (EF) have been linked to antisocial behavior and considered to be cardinal to the onset and persistence of severe antisocial and aggressive behavior. However, when psychopathy is present, prior evidence suggests that the dorsolateral prefrontal cortex is unaffected leading to intact EF. Ninety-one male offenders with Antisocial Personality Disorder (ASPD) and 24 controls completed the Wisconsin Card Sorting Test (WCST). ASPD individuals were grouped in three categories according to Psychopathy Checklist-Revised (PCL-R) scores (low, medium and high). We hypothesized that ASPD offenders with high PCL-R scores will not differ from healthy controls in EF and will show better EF performance in comparison with subjects with low PCL-R scores. Results showed that ASPD offenders with low PCL-R scores committed more perseverative errors and responses than controls and offenders with high PCL-R scores, which did not differ from healthy controls. Moreover, scores on Factor 1 and the interpersonal facet of the PCL-R were predictors of better WCST performance. Our results suggest a modulatory role of psychopathy in the cognitive performance of ASPD offenders, and provide further evidence supporting that offenders with ASPD and psychopathy are characterized by a cognitive profile different from those with ASPD without psychopathy. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Effect of teaching mathematics using GeoGebra on students' with dissimilar spatial visualisation

NASA Astrophysics Data System (ADS)

Bakar, Kamariah Abu; Ayub, Ahmad Fauzi Mohd; Tarmizi, Rohani Ahmad; Luan, Wong Su

2015-10-01

This study examined the effects of GeoGebra on mathematics performance of students with different spatial visualization. A qusai-experimental, pretest-posttest control group design was conducted. A total of 71 students from two intact groups were involved in the study. They were in two groups and each group was randonly assigned to the experimental group (36 students) and control group (35 students). A spatial visual test to identify students with high or low visualization, and a mathematics performance pre-test were administered at the initial stage of this study. A post-test was administered after 12 weeks of treatment using GeoGebra. Analyses of Covarion (ANCOVA) was used to adjust for the pre-test score. Findings showed that the group with access to GeoGebra achieved significantly better test scores in the posttest as compared to the group which followed the traditional teaching method. A two-way ANCOVA used to analyse the effect of students' spatial visualization on post-test performance showed that there was no effect. The results from this study suggested that using GeoGebra had helped the students to score better in the posttest. However, there is no significance difference on mathematics performances on students with difference types of spatial visualisastion. This study indicates that GeoGebra is useful in enhancing the teaching and learning of mathematics.
[Validity criteria of a short test to assess speech and language competence in 4-year-olds].

PubMed

Euler, H A; Holler-Zittlau, I; Minnen, S; Sick, U; Dux, W; Zaretsky, Y; Neumann, K

2010-11-01

A psychometrically constructed short test as a prerequisite for screening was developed on the basis of a revision of the Marburger Speech Screening to assess speech/language competence among children in Hessen (Germany). A total of 257 children (age 4.0 to 4.5 years) performed the test battery for speech/language competence; 214 children repeated the test 1 year later. Test scores correlated highly with scores of two competing language screenings (SSV, HASE) and with a combined score from four diagnostic tests of individual speech/language competences (Reynell III, patholinguistic diagnostics in impaired language development, PLAKSS, AWST-R). Validity was demonstrated by three comparisons: (1) Children with German family language had higher scores than children with another language. (2) The 3-month-older children achieved higher scores than younger children. (3) The difference between the children with German family language and those with another language was higher for the 3-month-older than for the younger children. The short test assesses the speech/language competence of 4-year-olds quickly, validly, and comprehensively.
Associations of medical student personality and health/wellness characteristics with their medical school performance across the curriculum.

PubMed

Haight, Scott J; Chibnall, John T; Schindler, Debra L; Slavin, Stuart J

2012-04-01

To assess the relationships of cognitive and noncognitive performance predictors to medical student preclinical and clinical performance indicators across medical school years 1 to 3 and to evaluate the association of psychological health/wellness factors with performance. In 2010, the authors conducted a cross-sectional, correlational, retrospective study of all 175 students at the Saint Louis University School of Medicine who had just completed their third (first clinical) year. Students were asked to complete assessments of personality, stress, anxiety, depression, social support, and community cohesion. Performance measures included total Medical College Admission Test (MCAT) score, preclinical academic grades, National Board of Medical Examiners subject exam scores, United States Medical Licensing Examination Step 1 score, clinical evaluations, and Humanism in Medicine Honor Society nominations. A total of 152 students (87%) participated. MCAT scores predicted cognitive performance indicators (academic tests), whereas personality variables (conscientiousness, extraversion, empathy) predicted noncognitive indicators (clinical evaluations, humanism nominations). Conscientiousness predicted all clinical skills, extraversion predicted clinical skills reflecting interpersonal behavior, and empathy predicted motivation. Health/wellness variables had limited associations with performance. In multivariate analyses that included control for shelf exam scores, conscientiousness predicted clinical evaluations, and extraversion and empathy predicted humanism nominations. This study identified two sets of skills (cognitive, noncognitive) used during medical school, with minimal overlap across the types of performance (e.g., exam performance versus clinical interpersonal skills) they predict. Medical school admission and evaluation efforts may need to be modified to reflect the importance of personality and other noncognitive factors.

Critical thinking skills in nursing students: comparison of simulation-based performance with metrics.

PubMed

Fero, Laura J; O'Donnell, John M; Zullo, Thomas G; Dabbs, Annette DeVito; Kitutu, Julius; Samosky, Joseph T; Hoffman, Leslie A

2010-10-01

This paper is a report of an examination of the relationship between metrics of critical thinking skills and performance in simulated clinical scenarios. Paper and pencil assessments are commonly used to assess critical thinking but may not reflect simulated performance. In 2007, a convenience sample of 36 nursing students participated in measurement of critical thinking skills and simulation-based performance using videotaped vignettes, high-fidelity human simulation, the California Critical Thinking Disposition Inventory and California Critical Thinking Skills Test. Simulation-based performance was rated as 'meeting' or 'not meeting' overall expectations. Test scores were categorized as strong, average, or weak. Most (75.0%) students did not meet overall performance expectations using videotaped vignettes or high-fidelity human simulation; most difficulty related to problem recognition and reporting findings to the physician. There was no difference between overall performance based on method of assessment (P = 0.277). More students met subcategory expectations for initiating nursing interventions (P ≤ 0.001) using high-fidelity human simulation. The relationship between videotaped vignette performance and critical thinking disposition or skills scores was not statistically significant, except for problem recognition and overall critical thinking skills scores (Cramer's V = 0.444, P = 0.029). There was a statistically significant relationship between overall high-fidelity human simulation performance and overall critical thinking disposition scores (Cramer's V = 0.413, P = 0.047). Students' performance reflected difficulty meeting expectations in simulated clinical scenarios. High-fidelity human simulation performance appeared to approximate scores on metrics of critical thinking best. Further research is needed to determine if simulation-based performance correlates with critical thinking skills in the clinical setting. © 2010 The Authors. Journal of Advanced Nursing © 2010 Blackwell Publishing Ltd.
Critical thinking skills in nursing students: comparison of simulation-based performance with metrics

PubMed Central

Fero, Laura J.; O’Donnell, John M.; Zullo, Thomas G.; Dabbs, Annette DeVito; Kitutu, Julius; Samosky, Joseph T.; Hoffman, Leslie A.

2018-01-01

Aim This paper is a report of an examination of the relationship between metrics of critical thinking skills and performance in simulated clinical scenarios. Background Paper and pencil assessments are commonly used to assess critical thinking but may not reflect simulated performance. Methods In 2007, a convenience sample of 36 nursing students participated in measurement of critical thinking skills and simulation-based performance using videotaped vignettes, high-fidelity human simulation, the California Critical Thinking Disposition Inventory and California Critical Thinking Skills Test. Simulation- based performance was rated as ‘meeting’ or ‘not meeting’ overall expectations. Test scores were categorized as strong, average, or weak. Results Most (75·0%) students did not meet overall performance expectations using videotaped vignettes or high-fidelity human simulation; most difficulty related to problem recognition and reporting findings to the physician. There was no difference between overall performance based on method of assessment (P = 0·277). More students met subcategory expectations for initiating nursing interventions (P ≤ 0·001) using high-fidelity human simulation. The relationship between video-taped vignette performance and critical thinking disposition or skills scores was not statistically significant, except for problem recognition and overall critical thinking skills scores (Cramer’s V = 0·444, P = 0·029). There was a statistically significant relationship between overall high-fidelity human simulation performance and overall critical thinking disposition scores (Cramer’s V = 0·413, P = 0·047). Conclusion Students’ performance reflected difficulty meeting expectations in simulated clinical scenarios. High-fidelity human simulation performance appeared to approximate scores on metrics of critical thinking best. Further research is needed to determine if simulation-based performance correlates with critical thinking skills in the clinical setting. PMID:20636471
Two baselines are better than one: Improving the reliability of computerized testing in sports neuropsychology.

PubMed

Bruce, Jared; Echemendia, Ruben; Tangeman, Lindy; Meeuwisse, Willem; Comper, Paul; Hutchison, Michael; Aubry, Mark

2016-01-01

Computerized neuropsychological tests are frequently used to assist in return-to-play decisions following sports concussion. However, due to concerns about test reliability, the Centers for Disease Control and Prevention recommends yearly baseline testing. The standard practice that has developed in baseline/postinjury comparisons is to examine the difference between the most recent baseline test and postconcussion performance. Drawing from classical test theory, the present study investigated whether temporal stability could be improved by taking an alternate approach that uses the aggregate of 2 baselines to more accurately estimate baseline cognitive ability. One hundred fifteen English-speaking professional hockey players with 3 consecutive Immediate Postconcussion Assessment and Testing (ImPACT) baseline tests were extracted from a clinical program evaluation database overseen by the National Hockey League and National Hockey League Players' Association. The temporal stability of ImPACT composite scores was significantly increased by aggregating test performance during Sessions 1 and 2 to predict performance during Session 3. Using this approach, the 2-factor Memory (r = .72) and Speed (r = .79) composites of ImPACT showed acceptable long-term reliability. Using the aggregate of 2 baseline scores significantly improves temporal stability and allows for more accurate predictions of cognitive change following concussion. Clinicians are encouraged to estimate baseline abilities by taking into account all of an athlete's previous baseline scores.
Risk score for first-screening of prevalent undiagnosed chronic kidney disease in Peru: the CRONICAS-CKD risk score.

PubMed

Carrillo-Larco, Rodrigo M; Miranda, J Jaime; Gilman, Robert H; Medina-Lezama, Josefina; Chirinos-Pacheco, Julio A; Muñoz-Retamozo, Paola V; Smeeth, Liam; Checkley, William; Bernabe-Ortiz, Antonio

2017-11-29

Chronic Kidney Disease (CKD) represents a great burden for the patient and the health system, particularly if diagnosed at late stages. Consequently, tools to identify patients at high risk of having CKD are needed, particularly in limited-resources settings where laboratory facilities are scarce. This study aimed to develop a risk score for prevalent undiagnosed CKD using data from four settings in Peru: a complete risk score including all associated risk factors and another excluding laboratory-based variables. Cross-sectional study. We used two population-based studies: one for developing and internal validation (CRONICAS), and another (PREVENCION) for external validation. Risk factors included clinical- and laboratory-based variables, among others: sex, age, hypertension and obesity; and lipid profile, anemia and glucose metabolism. The outcome was undiagnosed CKD: eGFR < 60 ml/min/1.73m 2 . We tested the performance of the risk scores using the area under the receiver operating characteristic (ROC) curve, sensitivity, specificity, positive/negative predictive values and positive/negative likelihood ratios. Participants in both studies averaged 57.7 years old, and over 50% were females. Age, hypertension and anemia were strongly associated with undiagnosed CKD. In the external validation, at a cut-off point of 2, the complete and laboratory-free risk scores performed similarly well with a ROC area of 76.2% and 76.0%, respectively (P = 0.784). The best assessment parameter of these risk scores was their negative predictive value: 99.1% and 99.0% for the complete and laboratory-free, respectively. The developed risk scores showed a moderate performance as a screening test. People with a score of ≥ 2 points should undergo further testing to rule out CKD. Using the laboratory-free risk score is a practical approach in developing countries where laboratories are not readily available and undiagnosed CKD has significant morbidity and mortality.
Construct Validity of Fresh Frozen Human Cadaver as a Training Model in Minimal Access Surgery

PubMed Central

Macafee, David; Pranesh, Nagarajan; Horgan, Alan F.

2012-01-01

Background: The construct validity of fresh human cadaver as a training tool has not been established previously. The aims of this study were to investigate the construct validity of fresh frozen human cadaver as a method of training in minimal access surgery and determine if novices can be rapidly trained using this model to a safe level of performance. Methods: Junior surgical trainees, novices (<3 laparoscopic procedure performed) in laparoscopic surgery, performed 10 repetitions of a set of structured laparoscopic tasks on fresh frozen cadavers. Expert laparoscopists (>100 laparoscopic procedures) performed 3 repetitions of identical tasks. Performances were scored using a validated, objective Global Operative Assessment of Laparoscopic Skills scale. Scores for 3 consecutive repetitions were compared between experts and novices to determine construct validity. Furthermore, to determine if the novices reached a safe level, a trimmed mean of the experts score was used to define a benchmark. Mann-Whitney U test was used for construct validity analysis and 1-sample t test to compare performances of the novice group with the benchmark safe score. Results: Ten novices and 2 experts were recruited. Four out of 5 tasks (nondominant to dominant hand transfer; simulated appendicectomy; intracorporeal and extracorporeal knot tying) showed construct validity. Novices’ scores became comparable to benchmark scores between the eighth and tenth repetition. Conclusion: Minimal access surgical training using fresh frozen human cadavers appears to have construct validity. The laparoscopic skills of novices can be accelerated through to a safe level within 8 to 10 repetitions. PMID:23318058
The value of the UK Clinical Aptitude Test in predicting pre-clinical performance: a prospective cohort study at Nottingham Medical School.

PubMed

Yates, Janet; James, David

2010-07-28

The UK Clinical Aptitude Test (UKCAT) was introduced in 2006 as an additional tool for the selection of medical students. It tests mental ability in four distinct domains (Quantitative Reasoning, Verbal Reasoning, Abstract Reasoning, and Decision Analysis), and the results are available to students and admissions panels in advance of the selection process. As yet the predictive validity of the test against course performance is largely unknown.The study objective was to determine whether UKCAT scores predict performance during the first two years of the 5-year undergraduate medical course at Nottingham. We studied a single cohort of students, who entered Nottingham Medical School in October 2007 and had taken the UKCAT. We used linear regression analysis to identify independent predictors of marks for different parts of the 2-year preclinical course. Data were available for 204/260 (78%) of the entry cohort. The UKCAT total score had little predictive value. Quantitative Reasoning was a significant independent predictor of course marks in Theme A ('The Cell'), (p = 0.005), and Verbal Reasoning predicted Theme C ('The Community') (p < 0.001), but otherwise the effects were slight or non-existent. This limited study from a single entry cohort at one medical school suggests that the predictive value of the UKCAT, particularly the total score, is low. Section scores may predict success in specific types of course assessment.The ultimate test of validity will not be available for some years, when current cohorts of students graduate. However, if this test of mental ability does not predict preclinical performance, it is arguably less likely to predict the outcome in the clinical years. Further research from medical schools with different types of curriculum and assessment is needed, with longitudinal studies throughout the course.
Predictors of student performance on the Pharmacy Curriculum Outcomes Assessment at a new school of pharmacy using admissions and demographic data.

PubMed

Gillette, Chris; Rudolph, Michael; Rockich-Winston, Nicole; Blough, Eric R; Sizemore, James A; Hao, Jinsong; Booth, Chris; Broedel-Zaugg, Kimberly; Peterson, Megan; Anderson, Stephanie; Riley, Brittany; Train, Brian C; Stanton, Robert B; Anderson, H Glenn

To characterize student performance on the Pharmacy Curriculum Outcomes Assessment (PCOA) and to determine the significance of specific admissions criteria and pharmacy school performance to predict student performance on the PCOA during the first through third professional years. Multivariate linear regression models were developed to study the relationships between various independent variables and students' PCOA total scores during the first through third professional years. To date, four cohorts have successfully taken the PCOA examination. Results indicate that the Pharmacy College Admissions Test (PCAT), the Health Science Reasoning Test (HSRT), and cumulative pharmacy grade point average were the only consistent significant predictors of higher PCOA total scores across all students who have taken the exam at our school of pharmacy. The school should examine and clarify the role of PCOA within its curricular assessment program. Results suggest that certain admissions criteria and performance in pharmacy school are associated with higher PCOA scores. Copyright © 2016 Elsevier Inc. All rights reserved.
The UK Clinical Aptitude Test and clinical course performance at Nottingham: a prospective cohort study.

PubMed

Yates, Janet; James, David

2013-02-26

The UK Clinical Aptitude Test (UKCAT) was introduced in 2006 as an additional tool for the selection of medical students. It tests mental ability in four distinct domains (Verbal Reasoning, Quantitative Reasoning, Abstract Reasoning, and Decision Analysis), and the results are available to students and admission panels in advance of the selection process. Our first study showed little evidence of any predictive validity for performance in the first two years of the Nottingham undergraduate course.The study objective was to determine whether the UKCAT scores had any predictive value for the later parts of the course, largely delivered via clinical placements. Students entering the course in 2007 and who had taken the UKCAT were asked for permission to use their anonymised data in research. The UKCAT scores were incorporated into a database with routine pre-admission socio-demographics and subsequent course performance data. Correlation analysis was followed by hierarchical multivariate linear regression. The original study group comprised 204/254 (80%) of the full entry cohort. With attrition over the five years of the course this fell to 185 (73%) by Year 5. The Verbal Reasoning score and the UKCAT Total score both demonstrated some univariate correlations with clinical knowledge marks, and slightly less with clinical skills. No parts of the UKCAT proved to be an independent predictor of clinical course marks, whereas prior attainment was a highly significant predictor (p <0.001). This study of one cohort of Nottingham medical students showed that UKCAT scores at admission did not independently predict subsequent performance on the course. Whilst the test adds another dimension to the selection process, its fairness and validity in selecting promising students remains unproven, and requires wider investigation and debate by other schools.
Scoring severity in trauma: comparison of prehospital scoring systems in trauma ICU patients.

PubMed

Llompart-Pou, J A; Chico-Fernández, M; Sánchez-Casado, M; Salaberria-Udabe, R; Carbayo-Górriz, C; Guerrero-López, F; González-Robledo, J; Ballesteros-Sanz, M Á; Herrán-Monge, R; Servià-Goixart, L; León-López, R; Val-Jordán, E

2017-06-01

We evaluated the predictive ability of mechanism, Glasgow coma scale, age and arterial pressure (MGAP), Glasgow coma scale, age and systolic blood pressure (GAP), and triage-revised trauma Score (T-RTS) scores in patients from the Spanish trauma ICU registry using the trauma and injury severity score (TRISS) as a reference standard. Patients admitted for traumatic disease in the participating ICU were included. Quantitative data were reported as median [interquartile range (IQR), categorical data as number (percentage)]. Comparisons between groups with quantitative variables and categorical variables were performed using Student's T Test and Chi Square Test, respectively. We performed receiving operating curves (ROC) and evaluated the area under the curve (AUC) with its 95 % confidence interval (CI). Sensitivity, specificity, positive predictive and negative predictive values and accuracy were evaluated in all the scores. A value of p < 0.05 was considered significant. The final sample included 1361 trauma ICU patients. Median age was 45 (30-61) years. 1092 patients (80.3 %) were male. Median ISS was 18 (13-26) and median T-RTS was 11 (10-12). Median GAP was 20 (15-22) and median MGAP 24 (20-27). Observed mortality was 17.7 % whilst predicted mortality using TRISS was 16.9 %. The AUC in the scores evaluated was: TRISS 0.897 (95 % CI 0.876-0.918), MGAP 0.860 (95 % CI 0.835-0.886), GAP 0.849 (95 % CI 0.823-0.876) and T-RTS 0.796 (95 % CI 0.762-0.830). Both MGAP and GAP scores performed better than the T-RTS in the prediction of hospital mortality in Spanish trauma ICU patients. Since these are easy-to-perform scores, they should be incorporated in clinical practice as a triaging tool.
Medial temporal lobe atrophy ratings in a large 75-year-old population-based cohort: gender-corrected and education-corrected normative data.

PubMed

Velickaite, V; Ferreira, D; Cavallin, L; Lind, L; Ahlström, H; Kilander, L; Westman, E; Larsson, E-M

2018-04-01

To find cut-off values for different medial temporal lobe atrophy (MTA) measures (right, left, average, and highest), accounting for gender and education, investigate the association with cognitive performance, and to compare with decline of cognitive function over 5 years in a large population-based cohort. Three hundred and ninety 75-year-old individuals were examined with magnetic resonance imaging of the brain and cognitive testing. The Scheltens's scale was used to assess visually MTA scores (0-4) in all subjects. Cognitive tests were repeated in 278 of them after 5 years. Normal MTA cut-off values were calculated based on the 10th percentile. Most 75-year-old individuals had MTA score ≤2. Men had significantly higher MTA scores than women. Scores for left and average MTA were significantly higher in highly educated individuals. Abnormal MTA was associated with worse results in cognitive test and individuals with abnormal right MTA had faster cognitive decline. At age 75, gender and education are confounders for MTA grading. A score of ≥2 is abnormal for low-educated women and a score of ≥2.5 is abnormal for men and high-educated women. Subjects with abnormal right MTA, but normal MMSE scores had developed worse MMSE scores 5 years later. • Gender and education are confounders for MTA grading. • We suggest cut-off values for 75-year-olds, taking gender and education into account. • Males have higher MTA scores than women. • Higher MTA scores are associated with worse cognitive performance.
Childhood Fitness and Academic Performance: An Investigation into the Effect of Aerobic Capacity on Academic Test Scores

ERIC Educational Resources Information Center

Hobbs, Mark

2014-01-01

The purpose of this quantitate ve study was to determine whether or not students in fifth grade who meet the healthy fitness zone (HFZ) for aerobic capacity on the fall 2013 FITNESSGRAM® Test scored higher on the math portion of the 2013 fall Measures of Academic Progress (MAP) test, than students that failed to reach the HFZ for aerobic capacity…
Stunting, poor iron status and parasite infection are significant risk factors for lower cognitive performance in Cambodian school-aged children.

PubMed

Perignon, Marlene; Fiorentino, Marion; Kuong, Khov; Burja, Kurt; Parker, Megan; Sisokhom, Sek; Chamnan, Chhoun; Berger, Jacques; Wieringa, Frank T

2014-01-01

Nutrition is one of many factors affecting the cognitive development of children. In Cambodia, 55% of children <5 y were anemic and 40% stunted in 2010. Currently, no data exists on the nutritional status of Cambodian school-aged children, or on how malnutrition potentially affects their cognitive development. To assess the anthropometric and micronutrient status (iron, vitamin A, zinc, iodine) of Cambodian schoolchildren and their associations with cognitive performance. School children aged 6-16 y (n = 2443) from 20 primary schools in Cambodia were recruited. Anthropometry, hemoglobin, serum ferritin, transferrin receptors, retinol-binding protein and zinc concentrations, inflammation status, urinary iodine concentration and parasite infection were measured. Socio-economic data were collected in a sub-group of children (n = 616). Cognitive performance was assessed using Raven's Colored Progressive Matrices (RCPM) and block design and picture completion, two standardized tests from the Wechsler Intelligence Scale for Children (WISC-III). The prevalence of anemia, iron, zinc, iodine and vitamin A deficiency were 15.7%; 51.2%, 92.8%, 17.3% and 0.7% respectively. The prevalence of stunting was 40.0%, including 10.9% of severe stunting. Stunted children scored significantly lower than non-stunted children on all tests. In RCPM test, boys with iron-deficiency anemia had lower scores than boys with normal iron status (-1.46, p<0.05). In picture completion test, children with normal iron status tended to score higher than iron-deficient children with anemia (-0.81; p = 0.067) or without anemia (-0.49; p = 0.064). Parasite infection was associated with an increase in risk of scoring below the median value in block design test (OR = 1.62; p<0.05), and with lower scores in other tests, for girls only (both p<0.05). Poor cognitive performance of Cambodian school-children was multifactorial and significantly associated with long-term (stunting) and current nutritional status indicators (iron status), as well as parasite infection. A life-cycle approach with programs to improve nutrition in early life and at school-age could contribute to optimal cognitive performance.
Relationships between the yo-yo intermittent recovery test and anaerobic performance tests in adolescent handball players.

PubMed

Hermassi, Souhail; Aouadi, Ridha; Khalifa, Riadh; van den Tillaar, Roland; Shephard, Roy J; Chelly, Mohamed Souhaiel

2015-03-29

The aim of the present study was to investigate relationships between a performance index derived from the Yo-Yo Intermittent Recovery Test level 1 (Yo-Yo IR1) and other measures of physical performance and skill in handball players. The other measures considered included peak muscular power of the lower limbs (Wpeak), jumping ability (squat and counter-movement jumps (SJ, CMJ), a handball skill test and the average sprinting velocities over the first step (VS) and the first 5 m (V5m). Test scores for 25 male national-level adolescent players (age: 17.2 ± 0.7 years) averaged 4.83 ± 0.34 m·s(-1) (maximal velocity reached at the Yo-Yo IR1); 917 ± 105 Watt, 12.7 ± 3 W·kg(-1) (Wpeak); 3.41 ± 0.5 m·s(-1) and 6.03 ± 0.6 m·s(-1) (sprint velocities for Vs and V5m respectively) and 10.3 ± 1 s (handball skill test). Yo-Yo IR1 test scores showed statistically significant correlations with all of the variables examined: Wpeak (W and W·kg(-1)) r = 0.80 and 0.65, respectively, p≤0.001); sprinting velocities (r = 0.73 and 0.71 for VS and V5m respectively; p≤0.001); jumping performance (SJ: r = 0.60, p≤0.001; CMJ: r= 0.66, p≤0.001) and the handball skill test (r = 0.71; p≤0.001). We concluded that the Yo-Yo test score showed a sufficient correlation with other potential means of assessing handball players, and that intra-individual changes of Yo-Yo IR1 score could provide a useful composite index of the response to training or rehabilitation, although correlations lack sufficient precision to help in players' selection.
Relationships Between the Yo-Yo Intermittent Recovery Test and Anaerobic Performance Tests in Adolescent Handball Players

PubMed Central

Hermassi, Souhail; Aouadi, Ridha; Khalifa, Riadh; van den Tillaar, Roland; Shephard, Roy J.; Chelly, Mohamed Souhaiel

2015-01-01

The aim of the present study was to investigate relationships between a performance index derived from the Yo-Yo Intermittent Recovery Test level 1 (Yo-Yo IR1) and other measures of physical performance and skill in handball players. The other measures considered included peak muscular power of the lower limbs (Wpeak), jumping ability (squat and counter-movement jumps (SJ, CMJ), a handball skill test and the average sprinting velocities over the first step (VS) and the first 5 m (V5m). Test scores for 25 male national-level adolescent players (age: 17.2 ± 0.7 years) averaged 4.83 ± 0.34 m·s−1 (maximal velocity reached at the Yo-Yo IR1); 917 ± 105 Watt, 12.7 ± 3 W·kg−1 (Wpeak); 3.41 ± 0.5 m·s−1 and 6.03 ± 0.6 m·s−1 (sprint velocities for Vs and V5m respectively) and 10.3 ± 1 s (handball skill test). Yo-Yo IR1 test scores showed statistically significant correlations with all of the variables examined: Wpeak (W and W·kg−1) r = 0.80 and 0.65, respectively, p≤0.001); sprinting velocities (r = 0.73 and 0.71 for VS and V5m respectively; p≤0.001); jumping performance (SJ: r = 0.60, p≤0.001; CMJ: r= 0.66, p≤0.001) and the handball skill test (r = 0.71; p≤0.001). We concluded that the Yo-Yo test score showed a sufficient correlation with other potential means of assessing handball players, and that intra-individual changes of Yo-Yo IR1 score could provide a useful composite index of the response to training or rehabilitation, although correlations lack sufficient precision to help in players’ selection. PMID:25964822
New reference values for the Alberta Infant Motor Scale need to be established.

PubMed

Fleuren, K M W; Smit, L S; Stijnen, Th; Hartman, A

2007-03-01

The Alberta Infant Motor Scale (AIMS) is an infant developmental test, which can be used to evaluate motor performance from birth to independent walking. Between 1990 and 1992 Piper and Darrah determined reference values in a cohort in Canada. To our knowledge no study has been carried out to determine whether the Canadian data are representative for other countries. In the present study we aimed to establish whether the AIMS test needs new reference values for Dutch children. Motor performance of 100 Dutch children, aged 0-12 months, was measured using the AIMS test. The mean percentile score of the Dutch children was 28.8 (+/-22.9, range 1-85). The percentile scores of the group were significantly lower than scores of the Canadian norm population (p < 0.001), whereby 75% of the Dutch children scored below the 50th percentile. These lower scores were not be explained by sex, racial differences or congenital disorders and were seen in all age groups. We conclude that new reference values on the AIMS test for the age group of 0-12 months need to be established for Dutch children. It is recommended that the need for new normative data is also determined in all other European countries.
Are WISC IQ scores in children with mathematical learning disabilities underestimated? The influence of a specialized intervention on test performance.

PubMed

Lambert, Katharina; Spinath, Birgit

2018-01-01

Intelligence measures play a pivotal role in the diagnosis of mathematical learning disabilities (MLD). Probably as a result of math-related material in IQ tests, children with MLD often display reduced IQ scores. However, it remains unclear whether the effects of math remediation extend to IQ scores. The present study investigated the impact of a special remediation program compared to a control group receiving private tutoring (PT) on the WISC IQ scores of children with MLD. We included N=45 MLD children (7-12 years) in a study with a pre- and post-test control group design. Children received remediation for two years on average. The analyses revealed significantly greater improvements in the experimental group on the Full-Scale IQ, and the Verbal Comprehension, Perceptual Reasoning, and Working Memory indices, but not Processing Speed, compared to the PT group. Children in the experimental group showed an average WISC IQ gain of more than ten points. Results indicate that the WISC IQ scores of MLD children might be underestimated and that an effective math intervention can improve WISC IQ test performance. Taking limitations into account, we discuss the use of IQ measures more generally for defining MLD in research and practice. Copyright © 2017 Elsevier Ltd. All rights reserved.
Predicting Fatigue and Psychophysiological Test Performance from Speech for Safety-Critical Environments.

PubMed

Baykaner, Khan Richard; Huckvale, Mark; Whiteley, Iya; Andreeva, Svetlana; Ryumin, Oleg

2015-01-01

Automatic systems for estimating operator fatigue have application in safety-critical environments. A system which could estimate level of fatigue from speech would have application in domains where operators engage in regular verbal communication as part of their duties. Previous studies on the prediction of fatigue from speech have been limited because of their reliance on subjective ratings and because they lack comparison to other methods for assessing fatigue. In this paper, we present an analysis of voice recordings and psychophysiological test scores collected from seven aerospace personnel during a training task in which they remained awake for 60 h. We show that voice features and test scores are affected by both the total time spent awake and the time position within each subject's circadian cycle. However, we show that time spent awake and time-of-day information are poor predictors of the test results, while voice features can give good predictions of the psychophysiological test scores and sleep latency. Mean absolute errors of prediction are possible within about 17.5% for sleep latency and 5-12% for test scores. We discuss the implications for the use of voice as a means to monitor the effects of fatigue on cognitive performance in practical applications.
Predicting Fatigue and Psychophysiological Test Performance from Speech for Safety-Critical Environments

PubMed Central

Baykaner, Khan Richard; Huckvale, Mark; Whiteley, Iya; Andreeva, Svetlana; Ryumin, Oleg

2015-01-01

Automatic systems for estimating operator fatigue have application in safety-critical environments. A system which could estimate level of fatigue from speech would have application in domains where operators engage in regular verbal communication as part of their duties. Previous studies on the prediction of fatigue from speech have been limited because of their reliance on subjective ratings and because they lack comparison to other methods for assessing fatigue. In this paper, we present an analysis of voice recordings and psychophysiological test scores collected from seven aerospace personnel during a training task in which they remained awake for 60 h. We show that voice features and test scores are affected by both the total time spent awake and the time position within each subject’s circadian cycle. However, we show that time spent awake and time-of-day information are poor predictors of the test results, while voice features can give good predictions of the psychophysiological test scores and sleep latency. Mean absolute errors of prediction are possible within about 17.5% for sleep latency and 5–12% for test scores. We discuss the implications for the use of voice as a means to monitor the effects of fatigue on cognitive performance in practical applications. PMID:26380259
Balance Performance Is Task Specific in Older Adults.

PubMed

Dunsky, Ayelet; Zeev, Aviva; Netz, Yael

2017-01-01

Balance ability among the elderly is a key component in the activities of daily living and is divided into two types: static and dynamic. For clinicians who wish to assess the risk of falling among their elderly patients, it is unclear if more than one type of balance test can be used to measure their balance impairment. In this study, we examined the association between static balance measures and two dynamic balance field tests. One hundred and twelve community-dwelling older adults (mean age 74.6) participated in the study. They underwent the Tetrax static postural assessment and then performed the Timed Up and Go (TUG) and the Functional Reach (FR) Test as dynamic balance tests. In general, low-moderate correlations were found between the two types of balance tests. For women, age and static balance parameters explained 28.1-40.4% of the variance of TUG scores and 14.6-24% of the variance of FR scores. For men, age and static balance parameters explained 9.5-31.2% of the variance of TUG scores and 23.9-41.7% of the variance of FR scores. Based on our findings, it is suggested that a combination of both static and dynamic tests be used for assessing postural balance ability.
Performance of the PEdiatric Logistic Organ Dysfunction-2 score in critically ill children requiring plasma transfusions.

PubMed

Karam, Oliver; Demaret, Pierre; Duhamel, Alain; Shefler, Alison; Spinella, Philip C; Stanworth, Simon J; Tucci, Marisa; Leteurtre, Stéphane

2016-12-01

Organ dysfunction scores, based on physiological parameters, have been created to describe organ failure. In a general pediatric intensive care unit (PICU) population, the PEdiatric Logistic Organ Dysfunction-2 score (PELOD-2) score had both a good discrimination and calibration, allowing to describe the clinical outcome of critically ill children throughout their stay. This score is increasingly used in clinical trials in specific subpopulation. Our objective was to assess the performance of the PELOD-2 score in a subpopulation of critically ill children requiring plasma transfusions. This was an ancillary study of a prospective observational study on plasma transfusions over a 6-week period, in 101 PICUs in 21 countries. All critically ill children who received at least one plasma transfusion during the observation period were included. PELOD-2 scores were measured on days 1, 2, 5, 8, and 12 after plasma transfusion. Performance of the score was assessed by the determination of the discrimination (area under the ROC curve: AUC) and the calibration (Hosmer-Lemeshow test). Four hundred and forty-three patients were enrolled in the study (median age and weight: 1 year and 9.1 kg, respectively). Observed mortality rate was 26.9 % (119/443). For PELOD-2 on day 1, the AUC was 0.76 (95 % CI 0.71-0.81) and the Hosmer-Lemeshow test was p = 0.76. The serial evaluation of the changes in the daily PELOD-2 scores from day 1 demonstrated a significant association with death, adjusted for the PELOD-2 score on day 1. In a subpopulation of critically ill children requiring plasma transfusion, the PELOD-2 score has a lower but acceptable discrimination than in an entire population. This score should therefore be used cautiously in this specific subpopulation.

Relationship between relaxation by guided imagery and performance of working memory.

PubMed

Hudetz, J A; Hudetz, A G; Klayman, J

2000-02-01

This study tested the hypothesis that relaxation by guided imagery improves working-memory performance of healthy participants. 30 volunteers (both sexes, ages 17-56 years) were randomly assigned to one of three groups and administered the WAIS-III Letter-Number Sequencing Test before and after 10-min. treatment with guided imagery or popular music. The control group received no treatment. Groups' test scores were not different before treatment. The mean increased after relaxation by guided imagery but not after music or no treatment. This result supports the hypothesis that working-memory scores on the test are enhanced by guided imagery and implies that human information processing may be enhanced by prior relaxation.
The feasibility of automated eye tracking with the Early Childhood Vigilance Test of attention in younger HIV-exposed Ugandan children.

PubMed

Boivin, Michael J; Weiss, Jonathan; Chhaya, Ronak; Seffren, Victoria; Awadu, Jorem; Sikorskii, Alla; Giordani, Bruno

2017-07-01

Tobii eye tracking was compared with webcam-based observer scoring on an animation viewing measure of attention (Early Childhood Vigilance Test; ECVT) to evaluate the feasibility of automating measurement and scoring. Outcomes from both scoring approaches were compared with the Mullen Scales of Early Learning (MSEL), Color-Object Association Test (COAT), and Behavior Rating Inventory of Executive Function for preschool children (BRIEF-P). A total of 44 children 44 to 65 months of age were evaluated with the ECVT, COAT, MSEL, and BRIEF-P. Tobii ×2-30 portable infrared cameras were programmed to monitor pupil direction during the ECVT 6-min animation and compared with observer-based PROCODER webcam scoring. Children watched 78% of the cartoon (Tobii) compared with 67% (webcam scoring), although the 2 measures were highly correlated (r = .90, p = .001). It is possible for 2 such measures to be highly correlated even if one is consistently higher than the other (Bergemann et al., 2012). Both ECVT Tobii and webcam ECVT measures significantly correlated with COAT immediate recall (r = .37, p = .02 vs. r = .38, p = .01, respectively) and total recall (r = .33, p = .06 vs. r = .42, p = .005) measures. However, neither the Tobii eye tracking nor PROCODER webcam ECVT measures of attention correlated with MSEL composite cognitive performance or BRIEF-P global executive composite. ECVT scoring using Tobii eye tracking is feasible with at-risk very young African children and consistent with webcam-based scoring approaches in their correspondence to one another and other neurocognitive performance-based measures. By automating measurement and scoring, eye tracking technologies can improve the efficiency and help better standardize ECVT testing of attention in younger children. This holds promise for other neurodevelopmental tests where eye movements, tracking, and gaze length can provide important behavioral markers of neuropsychological and neurodevelopmental processes associated with such tests. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Migraine and cognitive function: Baseline findings from the Brazilian Longitudinal Study of Adult Health: ELSA-Brasil.

PubMed

Pellegrino Baena, Cristina; Goulart, Alessandra Carvalho; Santos, Itamar de Souza; Suemoto, Claudia Kimie; Lotufo, Paulo Andrade; Bensenor, Isabela Judith

2017-01-01

Background The association between migraine and cognitive performance is unclear. We analyzed whether migraine is associated with cognitive performance among participants of the Brazilian Longitudinal Study of Adult Health, ELSA-Brasil. Methods Cross-sectional analysis, including participants with complete information about migraine and aura at baseline. Headache status (no headaches, non-migraine headaches, migraine without aura and migraine with aura), based on the International Headache Society classification, was used as the dependent variable in the multilinear regression models, using the category "no headache" as reference. Cognitive performance was measured with the Consortium to Establish a Registry for Alzheimer's Disease word list memory test (CERAD-WLMT), the semantic fluency test (SFT), and the Trail Making Test version B (TMTB). Z-scores for each cognitive test and a composite global score were created and analyzed as dependent variables. Multivariate models were adjusted for age, gender, education, race, coronary heart disease, heart failure, hypertension, diabetes, dyslipidemia, body mass index, smoking, alcohol use, physical activity, depression, and anxiety. In women, the models were further adjusted for hormone replacement therapy. Results We analyzed 4208 participants. Of these, 19% presented migraine without aura and 10.3% presented migraine with aura. All migraine headaches were associated with poor cognitive performance (linear coefficient β; 95% CI) at TMTB -0.083 (-0.160; -0.008) and poorer global z-score -0.077 (-0.152; -0.002). Also, migraine without aura was associated with poor cognitive performance at TMTB -0.084 (-0.160, -0.008 and global z-score -0.077 (-0.152; -0.002). Conclusion In participants of the ELSA-study, all migraine headaches and migraine without aura were significantly and independently associated with poorer cognitive performance.
Spatial perception predicts laparoscopic skills on virtual reality laparoscopy simulator.

PubMed

Hassan, I; Gerdes, B; Koller, M; Dick, B; Hellwig, D; Rothmund, M; Zielke, A

2007-06-01

This study evaluates the influence of visual-spatial perception on laparoscopic performance of novices with a virtual reality simulator (LapSim(R)). Twenty-four novices completed standardized tests of visual-spatial perception (Lameris Toegepaste Natuurwetenschappelijk Onderzoek [TNO] Test(R) and Stumpf-Fay Cube Perspectives Test(R)) and laparoscopic skills were assessed objectively, while performing 1-h practice sessions on the LapSim(R), comprising of coordination, cutting, and clip application tasks. Outcome variables included time to complete the tasks, economy of motion as well as total error scores, respectively. The degree of visual-spatial perception correlated significantly with laparoscopic performance on the LapSim(R) scores. Participants with a high degree of spatial perception (Group A) performed the tasks faster than those (Group B) who had a low degree of spatial perception (p = 0.001). Individuals with a high degree of spatial perception also scored better for economy of motion (p = 0.021), tissue damage (p = 0.009), and total error (p = 0.007). Among novices, visual-spatial perception is associated with manual skills performed on a virtual reality simulator. This result may be important for educators to develop adequate training programs that can be individually adapted.
Evaluating the accuracy of the Wechsler Memory Scale-Fourth Edition (WMS-IV) logical memory embedded validity index for detecting invalid test performance.

PubMed

Soble, Jason R; Bain, Kathleen M; Bailey, K Chase; Kirton, Joshua W; Marceaux, Janice C; Critchfield, Edan A; McCoy, Karin J M; O'Rourke, Justin J F

2018-01-08

Embedded performance validity tests (PVTs) allow for continuous assessment of invalid performance throughout neuropsychological test batteries. This study evaluated the utility of the Wechsler Memory Scale-Fourth Edition (WMS-IV) Logical Memory (LM) Recognition score as an embedded PVT using the Advanced Clinical Solutions (ACS) for WAIS-IV/WMS-IV Effort System. This mixed clinical sample was comprised of 97 total participants, 71 of whom were classified as valid and 26 as invalid based on three well-validated, freestanding criterion PVTs. Overall, the LM embedded PVT demonstrated poor concordance with the criterion PVTs and unacceptable psychometric properties using ACS validity base rates (42% sensitivity/79% specificity). Moreover, 15-39% of participants obtained an invalid ACS base rate despite having a normatively-intact age-corrected LM Recognition total score. Receiving operating characteristic curve analysis revealed a Recognition total score cutoff of < 61% correct improved specificity (92%) while sensitivity remained weak (31%). Thus, results indicated the LM Recognition embedded PVT is not appropriate for use from an evidence-based perspective, and that clinicians may be faced with reconciling how a normatively intact cognitive performance on the Recognition subtest could simultaneously reflect invalid performance validity.
Relationships among Testing Medium, Test Performance, and Testing Time of High School Students Who Are Visually Impaired

ERIC Educational Resources Information Center

Erin, Jane N.; Hong, Sunggye; Schoch, Christina; Kuo, YaJu

2006-01-01

This study compared the test scores and time required by high school students who are blind, sighted, or have low vision to complete tests administered in written and oral formats. The quantitative results showed that the blind students performed better on multiple-choice tests in braille and needed more time while taking tests in braille. The…
"Spreading the Wealth": How Principals Use Performance Data to Populate Classrooms

ERIC Educational Resources Information Center

Osborne-Lampkin, La'Tara; Cohen-Vogel, Lora

2014-01-01

There is evidence that school leaders are using test score data for decisions about everything from the curriculum to what is served for lunch. Research suggests that staffing too is data-driven, with principals using test score data to hire, assign, and develop their teachers. Semi-structured interviews with principals and other school actors in…
Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

ERIC Educational Resources Information Center

McEnroe, James D.

2010-01-01

The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…
READING PERFORMANCE OF ELEMENTARY STUDENT TEACHERS IN A DEVELOPING INSTITUTION.

ERIC Educational Resources Information Center

ADAMS, EFFIE KAYE

A STUDY WAS CONDUCTED AT BISHOP COLLEGE, DALLAS, TEXAS, TO EXAMINE THE READING NEEDS OF PROSPECTIVE ELEMENTARY TEACHERS. SCORES ON THE NELSON DENNY READING TESTS, ADVANCED FORM A, ON THE OTIS QUICK SCORING TESTS OF MENTAL ABILITY, GAMMA FORM BM, AND GRADE POINT AVERAGES COVERING 4 YEARS OF COLLEGE WORK WERE ANALYZED FOR 29 NEGRO ELEMENTARY STUDENT…
Relationship of Friends, Physical Education, and State Test Scores: Implications for School Counselors

ERIC Educational Resources Information Center

Hollingsworth, Mary Ann

2010-01-01

This study examined the relationship between dimensions of wellness and academic performance for 634 third through fifth grade students in Title One schools in rural Mississippi, using composites of the Five Factor Wellness Inventory for Elementary Children and Reading, Language, and Math Scores of the Mississippi Curriculum Test (a state level…
Falling Behind: New Evidence on the Black-White Achievement Gap

ERIC Educational Resources Information Center

Levitt, Steven D.; Fryer, Roland G.

2004-01-01

On average, black students typically score one standard deviation below white students on standardized tests--roughly the difference in performance between the average 4th grader and the average 8th grader. Historically, what has come to be known as the black-white test-score gap has emerged before children enter kindergarten and has tended to…
Race, Poverty and SAT Scores: Modeling the Influences of Family Income on Black and White High School Students' SAT Performance

ERIC Educational Resources Information Center

Dixon-Roman, Ezekiel J.; Everson, Howard T.; McArdle, John J.

2013-01-01

Background: Educational policy makers and test critics often assert that standardized test scores are strongly influenced by factors beyond individual differences in academic achievement such as family income and wealth. Unfortunately, few empirical studies consider the simultaneous and related influences of family income, parental education, and…
Multiple Imputation of Item Scores in Test and Questionnaire Data, and Influence on Psychometric Results

ERIC Educational Resources Information Center

van Ginkel, Joost R.; van der Ark, L. Andries; Sijtsma, Klaas

2007-01-01

The performance of five simple multiple imputation methods for dealing with missing data were compared. In addition, random imputation and multivariate normal imputation were used as lower and upper benchmark, respectively. Test data were simulated and item scores were deleted such that they were either missing completely at random, missing at…
Pretraining and posttraining assessment of residents' performance in the fourth accreditation council for graduate medical education competency: patient communication skills.

PubMed

Chandawarkar, Rajiv Y; Ruscher, Kimberly A; Krajewski, Aleksandra; Garg, Manish; Pfeiffer, Carol; Singh, Rekha; Longo, Walter E; Kozol, Robert A; Lesnikoski, Beth; Nadkarni, Prakash

2011-08-01

Structured communication curricula will improve surgical residents' ability to communicate effectively with patients. A prospective study approved by the institutional review board involved 44 University of Connecticut general surgery residents. Residents initially completed a written baseline survey to assess general communication skills awareness. In step 1 of the study, residents were randomized to 1 of 2 simulations using standardized patient instructors to mimic patients receiving a diagnosis of either breast or rectal cancer. The standardized patient instructors scored residents' communication skills using a case-specific content checklist and Master Interview Rating Scale. In step 2 of the study, residents attended a 3-part interactive program that comprised (1) principles of patient communication; (2) experiences of a surgeon (role as physician, patient, and patient's spouse); and (3) role-playing (3-resident groups played patient, physician, and observer roles and rated their own performance). In step 3, residents were retested as in step 1, using a crossover case design. Scores were analyzed using Wilcoxon signed rank test with a Bonferroni correction. Case-specific performance improved significantly, from a pretest content checklist median score of 8.5 (65%) to a posttest median of 11.0 (84%) (P = .005 by Wilcoxon signed rank test for paired ordinal data)(n = 44). Median Master Interview Rating Scale scores changed from 58.0 before testing (P = .10) to 61.5 after testing (P = .94). Difference between overall rectal cancer scores and breast cancer scores also were not significant. Patient communication skills need to be taught as part of residency training. With limited training, case-specific skills (herein, involving patients with cancer) are likely to improve more than general communication skills.
Poor performances of EuroSCORE and CARE score for prediction of perioperative mortality in octogenarians undergoing aortic valve replacement for aortic stenosis.

PubMed

Chhor, Vibol; Merceron, Sybille; Ricome, Sylvie; Baron, Gabriel; Daoud, Omar; Dilly, Marie-Pierre; Aubier, Benjamin; Provenchere, Sophie; Philip, Ivan

2010-08-01

Although results of cardiac surgery are improving, octogenarians have a higher procedure-related mortality and more complications with increased length of stay in ICU. Consequently, careful evaluation of perioperative risk seems necessary. The aims of our study were to assess and compare the performances of EuroSCORE and CARE score in the prediction of perioperative mortality among octogenarians undergoing aortic valve replacement for aortic stenosis and to compare these predictive performances with those obtained in younger patients. This retrospective study included all consecutive patients undergoing cardiac surgery in our institution between November 2005 and December 2007. For each patient, risk assessment for mortality was performed using logistic EuroSCORE, additive EuroSCORE and CARE score. The main outcome measure was early postoperative mortality. Predictive performances of these scores were assessed by calibration and discrimination using goodness-of-fit test and area under the receiver operating characteristic curve, respectively. During this 2-year period, we studied 2117 patients, among whom 134/211 octogenarians and 335/1906 nonoctogenarians underwent an aortic valve replacement for aortic stenosis. When considering patients with aortic stenosis, discrimination was poor in octogenarians and the difference from nonoctogenarians was significant for each score (0.58, 0.59 and 0.56 vs. 0.82, 0.81 and 0.77 for additive EuroSCORE, logistic EuroSCORE and CARE score in octogenarians and nonoctogenarians, respectively, P < 0.05). Moreover, in the whole cohort, logistic EuroSCORE significantly overestimated mortality among octogenarians. Predictive performances of these scores are poor in octogenarians undergoing cardiac surgery, especially aortic valve replacement. Risk assessment and therapeutic decisions in octogenarians should not be made with these scoring systems alone.
Education plays a greater role than age in cognitive test performance among participants of the Brazilian Longitudinal Study of Adult Health (ELSA-Brasil).

PubMed

de Azeredo Passos, Valéria Maria; Giatti, Luana; Bensenor, Isabela; Tiemeier, Henning; Ikram, M Arfan; de Figueiredo, Roberta Carvalho; Chor, Dora; Schmidt, Maria Inês; Barreto, Sandhi Maria

2015-10-09

Brazil has gone through fast demographic, epidemiologic and nutritional transitions and, despite recent improvements in wealth distribution, continues to present a high level of social and economic inequality. The ELSA-Brasil, a cohort study, aimed at investigating cardiovascular diseases and diabetes, offers a great opportunity to assess cognitive decline in this aging population through time-sequential analyses drawn from the same battery of tests over time. The purpose of this study is to analyze the influence of sex, age and education on cognitive tests performance of the participants at baseline. Analyses pertain to 14,594 participants with aged 35 to 74 years, who were functionally independent and had no history of stroke or use of neuroleptics, anticonvulsants, cholinesterase inhibitors or antiparkinsonian agents. Mean age was 52.0 ± 9.0 years and 54.2% of participants were women. Cognitive tests included the word memory tests (retention, recall and recognition), verbal fluency tests (VFT, animals and letter F) and Trail Making Test B. Multivariable linear regression analysis was used to determine the influence of sociodemographic characteristics on the distribution of the final score of each test. Women had significant and slightly higher scores than men in all memory tests and VFT, but took more time to perform Trail B. Reduced performance in all tests was seen with an increase age and, more importantly, with decrease level of education. The word list and VFT scores decreased at about one word for every 10 years of age; whereas higher-educated participants scored four words more on the word list test, and six or seven more correct words on VFT, when compared to lower-educated participants. Additionally, the oldest and less educated participants showed significant lower response rates in all tests. The higher influence of education than age in this Brazilian population reinforce the need for caution in analyzing and diagnosing cognitive impairments based on traditional cognitive tests and the importance of searching for education-free cognitive tests, especially in low and middle-income countries.
77 FR 22306 - State Personnel Development Grants; Proposed Priorities and Definitions; CFDA Number 84.323A

Federal Register 2010, 2011, 2012, 2013, 2014

2012-04-13

...: alternative measures of student learning and performance, such as student scores on pre-tests and end-of-course tests; student performance on English language proficiency assessments; and other measures of...
Psychomotor testing predicts rate of skill acquisition for proficiency-based laparoscopic skills training.

PubMed

Stefanidis, Dimitrios; Korndorffer, James R; Black, F William; Dunne, J Bruce; Sierra, Rafael; Touchard, Cheri L; Rice, David A; Markert, Ronald J; Kastl, Peter R; Scott, Daniel J

2006-08-01

Laparoscopic simulator training translates into improved operative performance. Proficiency-based curricula maximize efficiency by tailoring training to meet the needs of each individual; however, because rates of skill acquisition vary widely, such curricula may be difficult to implement. We hypothesized that psychomotor testing would predict baseline performance and training duration in a proficiency-based laparoscopic simulator curriculum. Residents (R1, n = 20) were enrolled in an IRB-approved prospective study at the beginning of the academic year. All completed the following: a background information survey, a battery of 12 innate ability measures (5 motor, and 7 visual-spatial), and baseline testing on 3 validated simulators (5 videotrainer [VT] tasks, 12 virtual reality [minimally invasive surgical trainer-virtual reality, MIST-VR] tasks, and 2 laparoscopic camera navigation [LCN] tasks). Participants trained to proficiency, and training duration and number of repetitions were recorded. Baseline test scores were correlated to skill acquisition rate. Cutoff scores for each predictive test were calculated based on a receiver operator curve, and their sensitivity and specificity were determined in identifying slow learners. Only the Cards Rotation test correlated with baseline simulator ability on VT and LCN. Curriculum implementation required 347 man-hours (6-person team) and 795,000 dollars of capital equipment. With an attendance rate of 75%, 19 of 20 residents (95%) completed the curriculum by the end of the academic year. To complete training, a median of 12 hours (range, 5.5-21), and 325 repetitions (range, 171-782) were required. Simulator score improvement was 50%. Training duration and repetitions correlated with prior video game and billiard exposure, grooved pegboard, finger tap, map planning, Rey Figure Immediate Recall score, and baseline performance on VT and LCN. The map planning cutoff score proved most specific in identifying slow learners. Proficiency-based laparoscopic simulator training provides improvement in performance and can be effectively implemented as a routine part of resident education, but may require significant resources. Although psychomotor testing may be of limited value in the prediction of baseline laparoscopic performance, its importance may lie in the prediction of the rapidity of skill acquisition. These tests may be useful in optimizing curricular design by allowing the tailoring of training to individual needs.
A Randomized Controlled Trial of Team-Based Learning Versus Lectures with Break-Out Groups on Knowledge Retention.

PubMed

Thrall, Grace C; Coverdale, John H; Benjamin, Sophiya; Wiggins, Anna; Lane, Christianne Joy; Pato, Michele T

2016-10-01

This goal of this study was to evaluate the efficacy of team-based learning (TBL) on knowledge retention compared to traditional lectures with small break-out group discussion (teaching as usual (TAU)) using a randomized controlled trial. This randomized controlled trial was conducted during a daylong conference for psychiatric educators on attention-deficit hyperactivity disorder and the research literacy topic of efficacy versus effectiveness trials. Learners (n = 115) were randomized with concealed allocation to either TBL or TAU. Knowledge was measured prior to the intervention, immediately afterward, and 2 months later via multiple-choice tests. Participants were necessarily unblinded. Data enterers, data analysts, and investigators were blinded to group assignment in data analysis. Per-protocol analyses of test scores were performed using change in knowledge from baseline. The primary endpoint was test scores at 2 months. At baseline, there were no statistically significant differences between groups in pre-test knowledge. At immediate post-test, both TBL and TAU groups showed improved knowledge scores compared with their baseline scores. The TBL group performed better statistically on the immediate post-test than the TAU group (Cohen's d = 0.73; p < 0.001), although the differences in knowledge scores were not educationally meaningful, averaging just one additional test question correct (out of 15). On the 2-month remote post-test, there were no group differences in knowledge retention among the 42 % of participants who returned the 2-month test. Both TBL and TAU learners acquired new knowledge at the end of the intervention and retained knowledge over 2 months. At the end of the intervention day and after 2 months, knowledge test scores were not meaningfully different between TBL and TAU completers. In conclusion, this study failed to demonstrate the superiority of TBL over TAU on the primary outcome of knowledge retention at 2 months post-intervention.
Sex is not everything: the role of gender in early performance of a fundamental laparoscopic skill.

PubMed

Kolozsvari, Nicoleta O; Andalib, Amin; Kaneva, Pepa; Cao, Jiguo; Vassiliou, Melina C; Fried, Gerald M; Feldman, Liane S

2011-04-01

Existing literature on the acquisition of surgical skills suggests that women generally perform worse than men. This literature is limited by looking at an arbitrary number of trials and not adjusting for potential confounders. The objective of this study was to evaluate the impact of gender on the learning curve for a fundamental laparoscopic task. Thirty-two medical students performed the FLS peg transfer task and their scores were plotted to generate a learning curve. Nonlinear regression was used to estimate learning plateau and learning rate. Variables that may affect performance were assessed using a questionnaire. Innate visual-spatial abilities were evaluated using tests for spatial orientation, spatial scanning, and perceptual abilities. Score on first peg transfer attempt, learning plateau, and learning rate were compared for men and women using Student's t test. Innate abilities were correlated to simulator performance using Pearson's coefficient. Multivariate linear regression was used to investigate the effect of gender on early laparoscopic performance after adjusting for factors found significant on univariate analysis. Statistical significance was defined as P < 0.05. Nineteen men and 13 women participated in the study; 30 were right-handed, 12 reported high interest in surgery, and 26 had video game experience. There were no differences between men and women in initial peg transfer score, learning plateau, or learning rate. Initial peg transfer score and learning rate were higher in subjects who reported having a high interest in surgery (P = 0.02, P = 0.03). Initial score also correlated with perceptual ability score (P = 0.03). In multivariate analysis, only surgical interest remained a significant predictor of score on first peg transfer (P = 0.03) and learning rate (P = 0.02), while gender had no significant relationship to early performance. Gender did not affect the learning curve for a fundamental laparoscopic task, while interest in surgery and perceptual abilities did influence early performance.

National trends in safety performance of electronic health record systems in children's hospitals.

PubMed

Chaparro, Juan D; Classen, David C; Danforth, Melissa; Stockwell, David C; Longhurst, Christopher A

2017-03-01

To evaluate the safety of computerized physician order entry (CPOE) and associated clinical decision support (CDS) systems in electronic health record (EHR) systems at pediatric inpatient facilities in the US using the Leapfrog Group's pediatric CPOE evaluation tool. The Leapfrog pediatric CPOE evaluation tool, a previously validated tool to assess the ability of a CPOE system to identify orders that could potentially lead to patient harm, was used to evaluate 41 pediatric hospitals over a 2-year period. Evaluation of the last available test for each institution was performed, assessing performance overall as well as by decision support category (eg, drug-drug, dosing limits). Longitudinal analysis of test performance was also carried out to assess the impact of testing and the overall trend of CPOE performance in pediatric hospitals. Pediatric CPOE systems were able to identify 62% of potential medication errors in the test scenarios, but ranged widely from 23-91% in the institutions tested. The highest scoring categories included drug-allergy interactions, dosing limits (both daily and cumulative), and inappropriate routes of administration. We found that hospitals with longer periods since their CPOE implementation did not have better scores upon initial testing, but after initial testing there was a consistent improvement in testing scores of 4 percentage points per year. Pediatric computerized physician order entry (CPOE) systems on average are able to intercept a majority of potential medication errors, but vary widely among implementations. Prospective and repeated testing using the Leapfrog Group's evaluation tool is associated with improved ability to intercept potential medication errors. © The Author 2016. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com
Performance and blood monitoring in sports: the artificial intelligence evoking target testing in antidoping (AR.I.E.T.T.A.) project.

PubMed

Manfredini, A F; Malagoni, A M; Litmanen, H; Zhukovskaja, L; Jeannier, P; Dal Follo, D; Felisatti, M; Besseberg, A; Geistlinger, M; Bayer, P; Carrabre, J E

2011-03-01

Substances and methods used to increase oxygen blood transport and physical performance can be detected in the blood, but the screening of the athletes to be tested remains a critical issue for the International Federations. This project, AR.I.E.T.T.A., aimed to develop a software capable of analysing athletes' hematological and performance profiles to detect abnormal patterns. One-hundred eighty athletes belonging to the International Biathlon Union gave written informed consent to have their hematological data, previously collected according to anti-doping rules, used to develop the AR.I.E.T.T.A. software. Software was developed with the included sections: 1) log-in; 2) data-entry: where data are loaded, stored and grouped; 3) analysis: where data are analysed, validated scores are calculated, and parameters are simultaneously displayed as statistics, tables and graphs, and individual or subpopulation profiles; 4) screening: where an immediate evaluation of the risk score of the present sample and/or the athlete under study is obtained. The sample risk score or AR.I.E.T.T.A. score is calculated by a simple computational system combining different parameters (absolute values and intra-individual variations) considered concurrently. The AR.I.E.T.T.A. score is obtained by the sum of the deviation units derived from each parameter, considering the shift of the present value from the reference values, based on the number of standard deviations. AR.I.E.T.T.A. enables a quick evaluation of blood results assisting surveillance programs and perform timely target testing controls on athletes by the International Federations. Future studies aiming to validate the AR.I.E.T.T.A. score and improve the diagnostic accuracy will improve the system.
Lifetime Occupation and Late-Life Cognitive Performance Among Women.

PubMed

Ribeiro, Pricila Cristina Correa; Lourenço, Roberto Alves

2015-01-01

We examined whether women who had regular jobs throughout life performed better cognitively than older adult housewives. Linear regression was used to compare global cognitive performance scores of housewives (G1) and women exposed to work of low (G2) and high (G3) complexity. The sample comprised 477 older adult Brazilian women, 430 (90.4%) of whom had performed lifelong jobs. In work with data, the G2 group's cognitive performance scores were 1.73 points higher (p =.03), and the G3 group scored 1.76 points (p =.02) higher, than the G1. In work with things and with people, the G3 scored, respectively, 2.04 (p <.01) and 2.21 (p <.01) cognitive test points higher than the G1. Based on our findings we suggest occupation of greater complexity is associated with better cognitive performance in women later in life.
Objective assessment of operator performance during ultrasound-guided procedures.

PubMed

Tabriz, David M; Street, Mandie; Pilgram, Thomas K; Duncan, James R

2011-09-01

Simulation permits objective assessment of operator performance in a controlled and safe environment. Image-guided procedures often require accurate needle placement, and we designed a system to monitor how ultrasound guidance is used to monitor needle advancement toward a target. The results were correlated with other estimates of operator skill. The simulator consisted of a tissue phantom, ultrasound unit, and electromagnetic tracking system. Operators were asked to guide a needle toward a visible point target. Performance was video-recorded and synchronized with the electromagnetic tracking data. A series of algorithms based on motor control theory and human information processing were used to convert raw tracking data into different performance indices. Scoring algorithms converted the tracking data into efficiency, quality, task difficulty, and targeting scores that were aggregated to create performance indices. After initial feasibility testing, a standardized assessment was developed. Operators (N = 12) with a broad spectrum of skill and experience were enrolled and tested. Overall scores were based on performance during ten simulated procedures. Prior clinical experience was used to independently estimate operator skill. When summed, the performance indices correlated well with estimated skill. Operators with minimal or no prior experience scored markedly lower than experienced operators. The overall score tended to increase according to operator's clinical experience. Operator experience was linked to decreased variation in multiple aspects of performance. The aggregated results of multiple trials provided the best correlation between estimated skill and performance. A metric for the operator's ability to maintain the needle aimed at the target discriminated between operators with different levels of experience. This study used a highly focused task model, standardized assessment, and objective data analysis to assess performance during simulated ultrasound-guided needle placement. The performance indices were closely related to operator experience.
How Do Raters from India Perform in Scoring the TOEFL iBT[TM] Speaking Section and What Kind of Training Helps? TOEFL iBT[TM] Research Report. RR-09-31

ERIC Educational Resources Information Center

Xi, Xiaoming; Mollaun, Pam

2009-01-01

This study investigated the scoring of the Test of English as a Foreign Language[TM] Internet-based Test (TOEFL iBT[TM]) Speaking section by bilingual or multilingual speakers of English and 1 or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the Speaking section for…
The Harrington-O'Shea Career Decision-Making System (CDM) and the Kaufman Adolescent and Adult Intelligence Test (KAIT): Relationship of Interest Scale Scores to Fluid and Crystallized IQs at Ages 12 to 22 Years.

ERIC Educational Resources Information Center

McLean, James E.; Kaufman, Alan S.

1995-01-01

The six Holland-based Interest Scale scores yielded by the Harrington-O'Shea Career Decision-Making System (CDM) (T. Harrington and A. O'Shea, 1982) were related to sex, race, and performance on the Kaufman Adolescent and Adult Intelligence Test for 254 adolescents and young adults. CDM scores did not relate to most of the variables studied, and…
Planning or something else? Examining neuropsychological predictors of Zoo Map performance.

PubMed

Oosterman, Joukje M; Wijers, Marijn; Kessels, Roy P C

2013-01-01

The Zoo Map Test of the Behavioral Assessment of the Dysexecutive Syndrome battery is often applied to measure planning ability as part of executive function. Successful performance on this test is, however, dependent on various cognitive functions, and deficient Zoo Map performance does therefore not necessarily imply selectively disrupted planning abilities. To address this important issue, we examined whether planning is still the most important predictor of Zoo Map performance in a heterogeneous sample of neurologic and psychiatric outpatients (N = 71). In addition to the Zoo Map Test, the patients completed other neuropsychological tests of planning, inhibition, processing speed, and episodic memory. Planning was the strongest predictor of the total raw score and inappropriate places visited, and no additional contribution of other cognitive scores was found. One exception to this was the total time, which was associated with processing speed. Overall, our findings indicate that the Zoo Map Test is a valid indicator of planning ability in a heterogeneous patient sample.
The Test of Logical Thinking as a predictor of first-year pharmacy students' performance in required first-year courses.

PubMed

Etzler, Frank M; Madden, Michael

2014-08-15

To investigate the correlation of scores on the Test of Logical Thinking (TOLT) with first-year pharmacy students' performance in selected courses. The TOLT was administered to 130 first-year pharmacy students. The examination was administered during the first quarter in a single session. The TOLT scores correlated with grades earned in Pharmaceutical Calculations, Physical Pharmacy, and Basic Pharmacokinetics courses. Performance on the TOLT has been correlated to performance in courses that required the ability to use quantitative reasoning to complete required tasks. In the future, it may be possible to recommend remediation, retention, and/or admission based in part on the results from the TOLT.
Approval Motive and Academic Behaviors: The Self Reinforcement Hypothesis

ERIC Educational Resources Information Center

Matell, Michael S.; Smith, Ronald E.

1970-01-01

Testing of college students in differing conditions as to performance being relevant to academic achievement goals revealed that under hgih relevance conditions scores on the Marlowe Crowne Social Desirability Scale were unrelated to test performance. Under low relevant conditions, the need for approval was highly related to performance in high…
A comparison of hands-on inquiry instruction to lectureinstruction with special needs high school biology students

NASA Astrophysics Data System (ADS)

Jensen-Ruopp, Helga Spitko

A comparison of hands-on inquiry instruction with lecture instruction was presented to 134 Patterns and Process Biology students. Students participated in seven biology lessons that were selected from Biology Survey of Living Things (1992). A pre and post paper and pencil assessment was used as the data collecting instrument. The treatment group was taught using hands-on inquiry strategies while the non-treatment group was taught in the lecture method of instruction. The team teaching model was used as the mode of presentation to the treatment group and the non-treatment group. Achievement levels using specific criterion; novice (0% to 50%), developing proficiency (51% to 69%), accomplished (70% to 84) and exceptional or mastery level (85% to 100%) were used as a guideline to tabulate the results of the pre and post assessment. Rubric tabulation was done to interpret the testing results. The raw data was plotted using percentage change in test score totals versus reading level score by gender as well as percentage change in test score totals versus auditory vocabulary score by gender. Box Whisker plot comparative descriptive of individual pre and post test scores for the treatment and non-treatment group was performed. Analysis of covariance (ANCOVA) using MINITAB Statistical Software version 14.11 was run on data of the seven lessons, as well as on gender (male results individual and combined, and female results individual and combined) results. Normal Probability Plots for total scores as well as individual test scores were performed. The results suggest that hands-on inquiry based instruction when presented to special needs students including; at-risk; English as a second language limited, English proficiency and special education inclusive students' learning may enhance individual student achievement.
A simple bedside blood test (Fibrofast; FIB-5) is superior to FIB-4 index for the differentiation between non-significant and significant fibrosis in patients with chronic hepatitis C.

PubMed

Shiha, G; Seif, S; Eldesoky, A; Elbasiony, M; Soliman, R; Metwally, A; Zalata, K; Mikhail, N

2017-05-01

A simple non-invasive score (Fibrofast, FIB-5) was developed using five routine laboratory tests (ALT, AST, alkaline phosphatase, albumin and platelets count) for the detection of significant hepatic fibrosis in patients with chronic hepatitis C. The FIB-4 index is a non-invasive test for the assessment of liver fibrosis, and a score of ≤1.45 enables the correct identification of patients who have non-significant (F0-1) from significant fibrosis (F2-4), and could avoid liver biopsy. The aim of this study was to compare the performance characteristics of FIB-5 and FIB-4 to differentiate between non-significant and significant fibrosis. A cross-sectional study included 604 chronic HCV patients. All liver biopsies were scored using the METAVIR system. Both FIB-5 and FIB-4 scores were measured and the performance characteristics were calculated using the ROC curve. The performance characteristics of FIB-5 at ≥7.5 and FIB-4 at ≤1.45 for the differentiation between non-significant fibrosis and significant fibrosis were: specificity 94.4%, PPV 85.7%, and specificity 54.9%, PPV 55.7% respectively. FIB-5 score at the new cutoff is superior to FIB-4 index for the differentiation between non-significant and significant fibrosis.
Changes in Study Strategies of Medical Students between Basic Science Courses and Clerkships Are Associated with Performance

ERIC Educational Resources Information Center

Ensminger, David C.; Hoyt, Amy E.; Chandrasekhar, Arcot J.; McNulty, John A.

2013-01-01

We tested the hypothesis that medical students change their study strategies when transitioning from basic science courses to clerkships, and that their study practices are associated with performance scores. Factor scores for three approaches to studying (construction, rote, and review) generated from student (n = 150) responses to a…
Alternative Methods for Estimating Achievement Trends and School Effects: When Is Simple Good Enough?

ERIC Educational Resources Information Center

Warkentien, Siri; Silver, David

2016-01-01

Public schools with impressive records of serving lower-performing students are often overlooked because their average test scores, even when students are growing quickly, are lower than scores in schools that serve higher-performing students. Schools may appear to be doing poorly either because baseline achievement is not easily accounted for or…
The Performance of Latinos in Rural Public Schools: A Comparative Analysis of Test Scores in Grades 3, 6, and 12.

ERIC Educational Resources Information Center

Hampton, Steve; And Others

1995-01-01

Examines effects of socioeconomic status, school funding, English proficiency, and Latino population concentration on achievement scores of students in grades 3, 6, and 12 in 66 rural California school districts. Performance on the California Assessment Program was predicted primarily by parental socioeconomic status, and, unexpectedly, improved…
AN EXAMINATION OF DATA ON IOWA SCHOOL CHILDREN TO DETERMINE PATTERNS OF PERFORMANCE AND "DOWNSTREAM EFFECTS" OF EARLY DEPRESSED SCORES.

ERIC Educational Resources Information Center

FITZSIMMONS, STEPHEN J.

VARIOUS PERFORMANCE PATTERNS WERE STUDIED TO DETERMINE IF EARLY LIMITED FAILURE LEADS TO GENERALIZED FAILURE IN A NUMBER OF AREAS. THE SUBJECTS, 258 DISADVANTAGED URBAN CHILDREN FROM FOUR SCHOOL DISTRICTS IN IOWA, HAD ONE OR MORE SCORES ON THE IOWA TEST OF BASIC SKILLS (ITBS) AT OR BELOW THE 33D PERCENTILE ON NATIONAL NORMS. THEIR PERFORMANCES ON…
The Impact of Linking Distinct Achievement Test Scores on the Interpretation of Student Growth in Achievement

ERIC Educational Resources Information Center

Airola, Denise Tobin

2011-01-01

Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…
Our Students Suffer from Both Lack of Knowledge and Consistency: A PPT (Potential Performance Theory) Analysis of Test-Taking

ERIC Educational Resources Information Center

Rice, Stephen; Geels, Kasha; Trafimow, David; Hackett, Holly

2011-01-01

Test scores are used to assess one's general knowledge of a specific area. Although strategies to improve test performance have been previously identified, the consistency with which one uses these strategies has not been analyzed in such a way that allows assessment of how much consistency affects overall performance. Participants completed one…
Biases and power for groups comparison on subjective health measurements.

PubMed

Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique

2012-01-01

Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald's test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative.
Intelligence is in the eye of the beholder: investigating repeated IQ measurements in forensic psychiatry.

PubMed

Habets, Petra; Jeandarme, Inge; Uzieblo, Kasia; Oei, Karel; Bogaerts, Stefan

2015-05-01

A stable assessment of cognition is of paramount importance for forensic psychiatric patients (FPP). The purpose of this study was to compare repeated measures of IQ scores in FPPs with and without intellectual disability. Repeated measurements of IQ scores in FPPs (n = 176) were collected. Differences between tests were computed, and each IQ score was categorized. Additionally, t-tests and regression analyses were performed. Differences of 10 points or more were found in 66% of the cases comparing WAIS-III with RAVEN scores. Fisher's exact test revealed differences between two WAIS-III scores and the WAIS categories. The WAIS-III did not predict other IQs (WAIS or RAVEN) in participants with intellectual disability. This study showed that stability or interchangeability of scores is lacking, especially in individuals with intellectual disability. Caution in interpreting IQ scores is therefore recommended, and the use of the unitary concept of IQ should be discouraged. © 2014 John Wiley & Sons Ltd.
Uncovering curvilinear relationships between conscientiousness and job performance: how theoretically appropriate measurement makes an empirical difference.

PubMed

Carter, Nathan T; Dalal, Dev K; Boyce, Anthony S; O'Connell, Matthew S; Kung, Mei-Chuan; Delgado, Kristin M

2014-07-01

The personality trait of conscientiousness has seen considerable attention from applied psychologists due to its efficacy for predicting job performance across performance dimensions and occupations. However, recent theoretical and empirical developments have questioned the assumption that more conscientiousness always results in better job performance, suggesting a curvilinear link between the 2. Despite these developments, the results of studies directly testing the idea have been mixed. Here, we propose this link has been obscured by another pervasive assumption known as the dominance model of measurement: that higher scores on traditional personality measures always indicate higher levels of conscientiousness. Recent research suggests dominance models show inferior fit to personality test scores as compared to ideal point models that allow for curvilinear relationships between traits and scores. Using data from 2 different samples of job incumbents, we show the rank-order changes that result from using an ideal point model expose a curvilinear link between conscientiousness and job performance 100% of the time, whereas results using dominance models show mixed results, similar to the current state of the literature. Finally, with an independent cross-validation sample, we show that selection based on predicted performance using ideal point scores results in more favorable objective hiring outcomes. Implications for practice and future research are discussed.

Height for age z score and cognitive function are associated with Academic performance among school children aged 8-11 years old.

PubMed

Haile, Demewoz; Nigatu, Dabere; Gashaw, Ketema; Demelash, Habtamu

2016-01-01

Academic achievement of school age children can be affected by several factors such as nutritional status, demographics, and socioeconomic factors. Though evidence about the magnitude of malnutrition is well established in Ethiopia, there is a paucity of evidence about the association of nutritional status with academic performance among the nation's school age children. Hence, this study aimed to determine how nutritional status and cognitive function are associated with academic performance of school children in Goba town, South East Ethiopia. An institution based cross-sectional study was conducted among 131 school age students from primary schools in Goba town enrolled during the 2013/2014 academic year. The nutritional status of students was assessed by anthropometric measurement, while the cognitive assessment was measured by the Kaufman Assessment Battery for Children (KABC-II) and Ravens colored progressive matrices (Raven's CPM) tests. The academic performance of the school children was measured by collecting the preceding semester academic result from the school record. Descriptive statistics, bivariate and multivariable linear regression were used in the statistical analysis. This study found a statistically significant positive association between all cognitive test scores and average academic performance except for number recall (p = 0.12) and hand movements (p = 0.08). The correlation between all cognitive test scores and mathematics score was found positive and statistically significant (p < 0.05). In the multivariable linear regression model, better wealth index was significantly associated with higher mathematics score (ß = 0.63; 95 % CI: 0.12-0.74). Similarly a unit change in height for age z score resulted in 2.11 unit change in mathematics score (ß = 2.11; 95 % CI: 0.002-4.21). A single unit change of wealth index resulted 0.53 unit changes in average score of all academic subjects among school age children (ß = 0.53; 95 % CI: 0.11-0.95). A single unit change of age resulted 3.23 unit change in average score of all academic subjects among school age children (ß = 3.23; 95 % CI: 1.20-5.27). Nutritional status (height for age Z score) and wealth could be modifiable factors to improve academic performance of school age children. Moreover, interventions to improve nutrition for mothers and children may be an important contributor to academic success and national economic growth in Ethiopia. Further study with strong design and large sample size is needed.
An Investigation of the Gender Differential Performance on a High-Stakes Language Proficiency Test in Iran

ERIC Educational Resources Information Center

Karami, Hossein

2013-01-01

There has been a growing consensus among the educational measurement experts and psychometricians that test taker characteristics may unduly affect the performance on tests. This may lead to construct-irrelevant variance in the scores and thus render the test biased. Hence, it is incumbent on test developers and users alike to provide evidence…
The effects of short-term and long-term pulmonary rehabilitation on functional capacity, perceived dyspnea, and quality of life.

PubMed

Verrill, David; Barton, Cole; Beasley, Will; Lippard, W Michael

2005-08-01

The purposes of this study were as follows: (1) to determine whether physical performance, quality of life, and dyspnea with activities of daily living improved following both short-term and long-term pulmonary rehabilitation (PR) across multiple hospital outpatient programs; (2) to examine the differences in these parameters between men and women; and (3) to determine what relationships existed between the psychosocial parameters and the results of the 6-min walk (6MW) test performance across programs. Non-experimental, prospective, and comparative. Seven outpatient hospital PR programs from urban and rural settings across North Carolina. Three hundred nine women and 281 men who were 20 to 93 years of age (mean [+/- SD] age, 66.7 +/- 11.1 years) with chronic lung disease. All 6MW tests and health surveys were administered prior to and immediately following 12 and 24 weeks of supervised PR participation. Scores from the 6MW tests, the Ferrans and Powers quality of life index-pulmonary version III (QLI), the Medical Outcomes Study 36-item short form (SF-36), and the University of California at San Diego shortness of breath questionnaire (SOBQ) were compared at PR entry, at 12 weeks, and at 24 weeks for differences by gender with repeated-measures analysis of variance. The study entry and follow-up SF-36 physical and mental component summary scores, the QLI health/function and overall scores, and the SOBQ scores were also compared to the 6MW test scores with Pearson correlation coefficient analysis. The mean summary scores on the SF-36 and the QLI increased after 12 weeks of PR (p < 0.05), and improvements were maintained by 24 weeks of PR participation (p < 0.05). Scores on the SOBQ improved after 12 weeks (p < 0.001) among the short-term participants, but not until after 24 weeks among the long-term participants (p = 0.009). The 6MW test performance improved after 12 weeks (p < 0.001) and again from 12 to 24 weeks (p = 0.002) in the long-term participants. No relevant correlational relationships were found between 6MW scores and the summary scores of the administered surveys (r = -0.43 to 0.36). Physical performance, as measured by the 6MW test, continued to improve with up to 24 weeks of PR participation. Quality-of-life measures and the perception of dyspnea improved after 12 weeks of PR participation, with improvements maintained by 24 weeks of PR participation. It is recommended that PR patients participate in supervised PR for at least 24 weeks to gain and maintain optimal health benefits.
ADAMTS13 test and/or PLASMIC clinical score in management of acquired thrombotic thrombocytopenic purpura: a cost-effective analysis.

PubMed

Kim, Chong H; Simmons, Sierra C; Williams, Lance A; Staley, Elizabeth M; Zheng, X Long; Pham, Huy P

2017-11-01

The ADAMTS13 test distinguishes thrombotic thrombocytopenic purpura (TTP) from other thrombotic microangiopathies (TMAs). The PLASMIC score helps determine the pretest probability of ADAMTS13 deficiency. Due to inherent limitations of both tests, and potential adverse effects and cost of unnecessary treatments, we performed a cost-effectiveness analysis (CEA) investigating the benefits of incorporating an in-hospital ADAMTS13 test and/or PLASMIC score into our clinical practice. A CEA model was created to compare four scenarios for patients with TMAs, utilizing either an in-house or a send-out ADAMTS13 assay with or without prior risk stratification using PLASMIC scoring. Model variables, including probabilities and costs, were gathered from the medical literature, except for the ADAMTS13 send-out and in-house tests, which were obtained from our institutional data. If only the cost is considered, in-house ADAMTS13 test for patients with intermediate- to high-risk PLASMIC score is the least expensive option ($4,732/patient). If effectiveness is assessed as measured by the number of averted deaths, send-out ADAMTS13 test is the most effective. Considering the cost/effectiveness ratio, the in-house ADAMTS13 test in patients with intermediate- to high-risk PLASMIC score is the best option, followed by the in-house ADAMTS13 test without the PLASMIC score. In patients with clinical presentations of TMAs, having an in-hospital ADAMTS13 test to promptly establish the diagnosis of TTP appears to be cost-effective. Utilizing the PLASMIC score further increases the cost-effectiveness of the in-house ADAMTS13 test. Our findings indicate the benefit of having a rapid and reliable in-house ADAMTS13 test, especially in the tertiary medical center. © 2017 AABB.
Situational judgment test as an additional tool in a medical admission test: an observational investigation.

PubMed

Luschin-Ebengreuth, Marion; Dimai, Hans P; Ithaler, Daniel; Neges, Heide M; Reibnegger, Gilbert

2015-03-14

In the framework of medical university admission procedures the assessment of non-cognitive abilities is increasingly demanded. As tool for assessing personal qualities or the ability to handle theoretical social constructs in complex situations, the Situational Judgment Test (SJT), among other measurement instruments, is discussed in the literature. This study focuses on the development and the results of the SJT as part of the admission test for the study of human medicine and dentistry at one medical university in Austria. Observational investigation focusing on the results of the SJT. 4741 applicants were included in the study. To yield comparable results for the different test parts, "relative scores" for each test part were calculated. Performance differences between women and men in the various test parts are analyzed using effect sizes based on comparison of mean values (Cohen's d). The associations between the relative scores achieved in the various test parts were assessed by computing pairwise linear correlation coefficients between all test parts and visualized by bivariate scatterplots. Among successful candidates, men consistently outperform women. Men perform better in physics and mathematics. Women perform better in the SJT part. The least discriminatory test part was the SJT. A strong correlation between biology and chemistry and moderate correlations between the other test parts except SJT is obvious. The relative scores are not symmetrically distributed. The cognitive loading of the performed SJTs points to the low correlation between the SJTs and cognitive abilities. Adding the SJT part into the admission test, in order to cover more than only knowledge and understanding of natural sciences among the applicants has been quite successful.
The serial use of child neurocognitive tests: development versus practice effects.

PubMed

Slade, Peter D; Townes, Brenda D; Rosenbaum, Gail; Martins, Isabel P; Luis, Henrique; Bernardo, Mario; Martin, Michael D; Derouen, Timothy A

2008-12-01

When serial neurocognitive assessments are performed, 2 main factors are of importance: test-retest reliability and practice effects. With children, however, there is a third, developmental factor, which occurs as a result of maturation. Child tests recognize this factor through the provision of age-corrected scaled scores. Thus, a ready-made method for estimating the relative contribution of developmental versus practice effects is the comparison of raw (developmental and practice) and scaled (practice only) scores. Data from a pool of 507 Portuguese children enrolled in a study of dental amalgams (T. A. DeRouen, B. G. Leroux, et al., 2002; T. A. DeRouen, M. D. Martin, et al., 2006) showed that practice effects over a 5-year period varied on 8 neurocognitive tests. Simple regression equations are provided for calculating individual retest scores from initial test scores. (c) 2008 APA, all rights reserved.
Evaluating Maintenance Performance: The Development and Tryout of Criterion Referenced Job Task Performance Tests for Electronic Maintenance. Final Report for Period January 1969-May 1974.

ERIC Educational Resources Information Center

Shriver, Edgar L.; Foley, John P., Jr.

A battery of criterion referenced job task performance tests (JIPT) for typical electronic maintenance activities were developed. The construction of a battery of such tests together with an appropriate scoring for reporting the results is detailed. The development of a Test Administrators Handbook also is described. This battery is considered to…
CK-MM Polymorphism is Associated With Physical Fitness Test Scores in Military Recruits.

PubMed

Sprouse, Courtney; Tosi, Laura L; Gordish-Dressman, Heather; Abdel-Ghani, Mai S; Panchapakesan, Karuna; Niederberger, Brenda; Devaney, Joseph M; Kelly, Karen R

2015-09-01

Muscle-specific creatine kinase is thought to play an integral role in maintaining energy homeostasis by providing a supply of creatine phosphate. The genetic variant, rs8111989, contributes to individual differences in physical performance, and thus the purpose of this study was to determine if rs8111989 variant is predictive of Physical Fitness Test (PFT) scores in male, military infantry recruits. DNA was extracted from whole blood, and genotyping was performed in 176 Marines. Relationships between PFT measures (run, sit-ups, and pull-ups) and genotype were determined. Participants with 2 copies of the T allele for rs8111989 variant had higher PFT scores for run time, pull-ups, and total PFT score. Specifically, participants with 2 copies of the TT allele (variant) (n = 97) demonstrated an overall higher total PFT score as compared with those with one copy of the C allele (n = 79) (TT: 250 ± 31 vs. 238 ± 31; p = 0.02), run score (TT: 82 ± 10 vs. 78 ± 11; p = 0.04) and pull-up score (TT: 78 ± 11 vs. 65 ± 21; p = 0.04) or those with the CC/CT genotype. These results demonstrate an association between physical performance measures and genetic variation in the muscle-specific creatine kinase gene (rs8111989). Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
Physical Environment in Relation to Creativity and Intelligence.

ERIC Educational Resources Information Center

Gupta, Ram K.; Mohan, Madan

Research was performed to determine whether: (1) highly creative subjects would obtain higher scores on tests of crativity in an enriched environment, (2) subjects who are poor in creativity will not obtain higher scores because of low perceptual curiosity, and (3) high- and low-intelligence subjects would score equally well on creativity. The…
Validity of the Optometry Admission Test in Predicting Performance in Schools and Colleges of Optometry.

ERIC Educational Resources Information Center

Kramer, Gene A.; Johnston, JoElle

1997-01-01

A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…
Factors related to student performance in statistics courses in Lebanon

NASA Astrophysics Data System (ADS)

Naccache, Hiba Salim

The purpose of the present study was to identify factors that may contribute to business students in Lebanese universities having difficulty in introductory and advanced statistics courses. Two statistics courses are required for business majors at Lebanese universities. Students are not obliged to be enrolled in any math courses prior to taking statistics courses. Drawing on recent educational research, this dissertation attempted to identify the relationship between (1) students’ scores on Lebanese university math admissions tests; (2) students’ scores on a test of very basic mathematical concepts; (3) students’ scores on the survey of attitude toward statistics (SATS); (4) course performance as measured by students’ final scores in the course; and (5) their scores on the final exam. Data were collected from 561 students enrolled in multiple sections of two courses: 307 students in the introductory statistics course and 260 in the advanced statistics course in seven campuses across Lebanon over one semester. The multiple regressions results revealed four significant relationships at the introductory level: between students’ scores on the math quiz with their (1) final exam scores; (2) their final averages; (3) the Cognitive subscale of the SATS with their final exam scores; and (4) their final averages. These four significant relationships were also found at the advanced level. In addition, two more significant relationships were found between students’ final average and the two subscales of Effort (5) and Affect (6). No relationship was found between students’ scores on the admission math tests and both their final exam scores and their final averages in both the introductory and advanced level courses. On the other hand, there was no relationship between students’ scores on Lebanese admissions tests and their final achievement. Although these results were consistent across course formats and instructors, they may encourage Lebanese universities to assess the effectiveness of prerequisite math courses. Moreover, these findings may lead the Lebanese Ministry of Education to make changes to the admissions exams, course prerequisites, and course content. Finally, to enhance the attitude of students, new learning techniques, such as group work during class meetings can be helpful, and future research should aim to test the effectiveness of these pedagogical techniques on students’ attitudes toward statistics.
Derivation and Cross-Validation of Cutoff Scores for Patients With Schizophrenia Spectrum Disorders on WAIS-IV Digit Span-Based Performance Validity Measures.

PubMed

Glassmire, David M; Toofanian Ross, Parnian; Kinney, Dominique I; Nitch, Stephen R

2016-06-01

Two studies were conducted to identify and cross-validate cutoff scores on the Wechsler Adult Intelligence Scale-Fourth Edition Digit Span-based embedded performance validity (PV) measures for individuals with schizophrenia spectrum disorders. In Study 1, normative scores were identified on Digit Span-embedded PV measures among a sample of patients (n = 84) with schizophrenia spectrum diagnoses who had no known incentive to perform poorly and who put forth valid effort on external PV tests. Previously identified cutoff scores resulted in unacceptable false positive rates and lower cutoff scores were adopted to maintain specificity levels ≥90%. In Study 2, the revised cutoff scores were cross-validated within a sample of schizophrenia spectrum patients (n = 96) committed as incompetent to stand trial. Performance on Digit Span PV measures was significantly related to Full Scale IQ in both studies, indicating the need to consider the intellectual functioning of examinees with psychotic spectrum disorders when interpreting scores on Digit Span PV measures. © The Author(s) 2015.
Factor structure of the functional movement screen in marine officer candidates.

PubMed

Kazman, Josh B; Galecki, Jeffrey M; Lisman, Peter; Deuster, Patricia A; OʼConnor, Francis G

2014-03-01

Functional movement screening (FMS) is a musculoskeletal assessment that is intended to fill a gap between preparticipation examinations and performance tests. Functional movement screening consists of 7 standardized movements involving multiple muscle groups that are rated 0-3 during performance; scores are combined into a final score, which is intended to predict injury risk. This use of a sum-score in this manner assumes that the items are unidimensional and scores are internally consistent, which are measures of internal reliability. Despite research into the FMS' predictive value and interrater reliability, research has not assessed its psychometric properties. The present study is a standard psychometric analysis of the FMS and is the first to assess the internal consistency and factor structure of the FMS, using Cronbach's alpha and exploratory factor analysis (EFA). Using a cohort of 877 male and 57 female Marine officer candidates who performed the FMS, EFA of polychoric correlations with varimax rotation was conducted to explore the structure of the FMS. Tests were repeated on the original scores, which integrated feelings of pain during movement (0-3), and then on scores discounting the pain instruction and based only on the performance (1-3), to determine whether pain ratings affected the factor structure. The average FMS score was 16.7 ± 1.8. Cronbach's alpha was 0.39. Exploratory factor analysis availed 2 components accounting for 21 and 17% and consisting of separate individual movements (shoulder mobility and deep squat, respectively). Analysis on scores discounting pain showed similar results. The factor structures were not interpretable, and the low Cronbach's alpha suggests a lack of internal consistency in FMS sum scores. Results do not offer support for validity of the FMS sum score as a unidimensional construct. In the absence of additional psychometric research, caution is warranted when using the FMS sum score.
Comparison and integration of deleteriousness prediction methods for nonsynonymous SNVs in whole exome sequencing studies

PubMed Central

Dong, Chengliang; Wei, Peng; Jian, Xueqiu; Gibbs, Richard; Boerwinkle, Eric; Wang, Kai; Liu, Xiaoming

2015-01-01

Accurate deleteriousness prediction for nonsynonymous variants is crucial for distinguishing pathogenic mutations from background polymorphisms in whole exome sequencing (WES) studies. Although many deleteriousness prediction methods have been developed, their prediction results are sometimes inconsistent with each other and their relative merits are still unclear in practical applications. To address these issues, we comprehensively evaluated the predictive performance of 18 current deleteriousness-scoring methods, including 11 function prediction scores (PolyPhen-2, SIFT, MutationTaster, Mutation Assessor, FATHMM, LRT, PANTHER, PhD-SNP, SNAP, SNPs&GO and MutPred), 3 conservation scores (GERP++, SiPhy and PhyloP) and 4 ensemble scores (CADD, PON-P, KGGSeq and CONDEL). We found that FATHMM and KGGSeq had the highest discriminative power among independent scores and ensemble scores, respectively. Moreover, to ensure unbiased performance evaluation of these prediction scores, we manually collected three distinct testing datasets, on which no current prediction scores were tuned. In addition, we developed two new ensemble scores that integrate nine independent scores and allele frequency. Our scores achieved the highest discriminative power compared with all the deleteriousness prediction scores tested and showed low false-positive prediction rate for benign yet rare nonsynonymous variants, which demonstrated the value of combining information from multiple orthologous approaches. Finally, to facilitate variant prioritization in WES studies, we have pre-computed our ensemble scores for 87 347 044 possible variants in the whole-exome and made them publicly available through the ANNOVAR software and the dbNSFP database. PMID:25552646
Negatively-marked MCQ assessments that reward partial knowledge do not introduce gender bias yet increase student performance and satisfaction and reduce anxiety.

PubMed

Bond, A Elizabeth; Bodger, Owen; Skibinski, David O F; Jones, D Hugh; Restall, Colin J; Dudley, Edward; van Keulen, Geertje

2013-01-01

Multiple-choice question (MCQ) examinations are increasingly used as the assessment method of theoretical knowledge in large class-size modules in many life science degrees. MCQ-tests can be used to objectively measure factual knowledge, ability and high-level learning outcomes, but may also introduce gender bias in performance dependent on topic, instruction, scoring and difficulty. The 'Single Answer' (SA) test is often used in which students choose one correct answer, in which they are unable to demonstrate partial knowledge. Negatively marking eliminates the chance element of guessing but may be considered unfair. Elimination testing (ET) is an alternative form of MCQ, which discriminates between all levels of knowledge, while rewarding demonstration of partial knowledge. Comparisons of performance and gender bias in negatively marked SA and ET tests have not yet been performed in the life sciences. Our results show that life science students were significantly advantaged by answering the MCQ test in elimination format compared to single answer format under negative marking conditions by rewarding partial knowledge of topics. Importantly, we found no significant difference in performance between genders in either cohort for either MCQ test under negative marking conditions. Surveys showed that students generally preferred ET-style MCQ testing over SA-style testing. Students reported feeling more relaxed taking ET MCQ and more stressed when sitting SA tests, while disagreeing with being distracted by thinking about best tactics for scoring high. Students agreed ET testing improved their critical thinking skills. We conclude that appropriately-designed MCQ tests do not systematically discriminate between genders. We recommend careful consideration in choosing the type of MCQ test, and propose to apply negative scoring conditions to each test type to avoid the introduction of gender bias. The student experience could be improved through the incorporation of the elimination answering methods in MCQ tests via rewarding partial and full knowledge.
Negatively-Marked MCQ Assessments That Reward Partial Knowledge Do Not Introduce Gender Bias Yet Increase Student Performance and Satisfaction and Reduce Anxiety

PubMed Central

Bond, A. Elizabeth; Bodger, Owen; Skibinski, David O. F.; Jones, D. Hugh; Restall, Colin J.; Dudley, Edward; van Keulen, Geertje

2013-01-01

Multiple-choice question (MCQ) examinations are increasingly used as the assessment method of theoretical knowledge in large class-size modules in many life science degrees. MCQ-tests can be used to objectively measure factual knowledge, ability and high-level learning outcomes, but may also introduce gender bias in performance dependent on topic, instruction, scoring and difficulty. The ‘Single Answer’ (SA) test is often used in which students choose one correct answer, in which they are unable to demonstrate partial knowledge. Negatively marking eliminates the chance element of guessing but may be considered unfair. Elimination testing (ET) is an alternative form of MCQ, which discriminates between all levels of knowledge, while rewarding demonstration of partial knowledge. Comparisons of performance and gender bias in negatively marked SA and ET tests have not yet been performed in the life sciences. Our results show that life science students were significantly advantaged by answering the MCQ test in elimination format compared to single answer format under negative marking conditions by rewarding partial knowledge of topics. Importantly, we found no significant difference in performance between genders in either cohort for either MCQ test under negative marking conditions. Surveys showed that students generally preferred ET-style MCQ testing over SA-style testing. Students reported feeling more relaxed taking ET MCQ and more stressed when sitting SA tests, while disagreeing with being distracted by thinking about best tactics for scoring high. Students agreed ET testing improved their critical thinking skills. We conclude that appropriately-designed MCQ tests do not systematically discriminate between genders. We recommend careful consideration in choosing the type of MCQ test, and propose to apply negative scoring conditions to each test type to avoid the introduction of gender bias. The student experience could be improved through the incorporation of the elimination answering methods in MCQ tests via rewarding partial and full knowledge. PMID:23437081
Measurement properties of continuous text reading performance tests.

PubMed

Brussee, Tamara; van Nispen, Ruth M A; van Rens, Ger H M B

2014-11-01

Measurement properties of tests to assess reading acuity or reading performance have not been extensively evaluated. This study aims to provide an overview of the literature on available continuous text reading tests and their measurement properties. A literature search was performed in PubMed, Embase and PsycInfo. Subsequently, information on design and content of reading tests, study design and measurement properties were extracted using consensus-based standards for selection of health measurement instruments. Quality of studies, reading tests and measurement properties were systematically assessed using pre-specified criteria. From 2334 identified articles, 20 relevant articles were found on measurement properties of three reading tests in various languages: IReST, MNread Reading Test and Radner Reading Charts. All three reading tests scored high on content validity. Reproducibility studies (repeated measurements between different testing sessions) of the IReST and MNread of commercially available reading tests in different languages were missing. The IReST scored best on inter-language comparison, the MNread scored well in repeatability studies (repeated measurements under the same conditions) and the Radner showed good reproducibility in studies. Although in daily practice there are other continuous text reading tests available meeting the criteria of this review, measurement properties were described in scientific studies for only three of them. Of the few available studies, the quality and content of study design and methodology used varied. For testing existing reading tests and the development of new ones, for example in other languages, we make several recommendations, including careful description of patient characteristics, use of objective and subjective lighting levels, good control of working distance, documentation of the number of raters and their training, careful documentation of scoring rules and the use of Bland-Altman analyses or similar for reproducibility and repeatability studies. © 2014 The Authors Ophthalmic & Physiological Optics © 2014 The College of Optometrists.
Validation of "laboratory-supported" criteria for functional (psychogenic) tremor.

PubMed

Schwingenschuh, Petra; Saifee, Tabish A; Katschnig-Winter, Petra; Macerollo, Antonella; Koegl-Wallner, Mariella; Culea, Valeriu; Ghadery, Christine; Hofer, Edith; Pendl, Tamara; Seiler, Stephan; Werner, Ulrike; Franthal, Sebastian; Maurits, Natasha M; Tijssen, Marina A; Schmidt, Reinhold; Rothwell, John C; Bhatia, Kailash P; Edwards, Mark J

2016-04-01

In a small group of patients, we have previously shown that a combination of electrophysiological tests was able to distinguish functional (psychogenic) tremor and organic tremor with excellent sensitivity and specificity. This study aims to validate an electrophysiological test battery as a tool to diagnose patients with functional tremor with a "laboratory-supported" level of certainty. For this prospective data collection study, we recruited 38 new patients with functional tremor (mean age 37.9 ± 24.5 years; mean disease duration 5.9 ± 9.0 years) and 73 new patients with organic tremor (mean age 55.4 ± 25.4 years; mean disease duration 15.8 ± 17.7 years). Tremor was recorded at rest, posture (with and without loading), action, while performing tapping tasks (1, 3, and 5 Hz), and while performing ballistic movements with the less-affected hand. Electrophysiological tests were performed by raters blinded to the clinical diagnosis. We calculated a sum score for all performed tests (maximum of 10 points) and used a previously suggested cut-off score of 3 points for a diagnosis of laboratory-supported functional tremor. We demonstrated good interrater reliability and test-retest reliability. Patients with functional tremor had a higher average score on the test battery when compared with patients with organic tremor (3.6 ± 1.4 points vs 1.0 ± 0.8 points; P < .001), and the predefined cut-off score for laboratory-supported functional tremor yielded a test sensitivity of 89.5% and a specificity of 95.9%. We now propose this test battery as the basis of laboratory-supported criteria for the diagnosis of functional tremor, and we encourage its use in clinical and research practice. © 2016 International Parkinson and Movement Disorder Society.
Performance of Simplified Acute Physiology Score 3 In Predicting Hospital Mortality In Emergency Intensive Care Unit.

PubMed

Ma, Qing-Bian; Fu, Yuan-Wei; Feng, Lu; Zhai, Qiang-Rong; Liang, Yang; Wu, Meng; Zheng, Ya-An

2017-07-05

Since the 1980s, severity of illness scoring systems has gained increasing popularity in Intensive Care Units (ICUs). Physicians used them for predicting mortality and assessing illness severity in clinical trials. The objective of this study was to assess the performance of Simplified Acute Physiology Score 3 (SAPS 3) and its customized equation for Australasia (Australasia SAPS 3, SAPS 3 [AUS]) in predicting clinical prognosis and hospital mortality in emergency ICU (EICU). A retrospective analysis of the EICU including 463 patients was conducted between January 2013 and December 2015 in the EICU of Peking University Third Hospital. The worst physiological data of enrolled patients were collected within 24 h after admission to calculate SAPS 3 score and predicted mortality by regression equation. Discrimination between survivals and deaths was assessed by the area under the receiver operator characteristic curve (AUC). Calibration was evaluated by Hosmer-Lemeshow goodness-of-fit test through calculating the ratio of observed-to-expected numbers of deaths which is known as the standardized mortality ratio (SMR). A total of 463 patients were enrolled in the study, and the observed hospital mortality was 26.1% (121/463). The patients enrolled were divided into survivors and nonsurvivors. Age, SAPS 3 score, Acute Physiology and Chronic Health Evaluation Score II (APACHE II), and predicted mortality were significantly higher in nonsurvivors than survivors (P < 0.05 or P < 0.01). The AUC (95% confidence intervals [CI s]) for SAPS 3 score was 0.836 (0.796-0.876). The maximum of Youden's index, cutoff, sensitivity, and specificity of SAPS 3 score were 0.526%, 70.5 points, 66.9%, and 85.7%, respectively. The Hosmer-Lemeshow goodness-of-fit test for SAPS 3 demonstrated a Chi-square test score of 10.25, P = 0.33, SMR (95% CI) = 0.63 (0.52-0.76). The Hosmer-Lemeshow goodness-of-fit test for SAPS 3 (AUS) demonstrated a Chi-square test score of 9.55, P = 0.38, SMR (95% CI) = 0.68 (0.57-0.81). Univariate and multivariate analyses were conducted for biochemical variables that were probably correlated to prognosis. Eventually, blood urea nitrogen (BUN), albumin,lactate and free triiodothyronine (FT3) were selected as independent risk factors for predicting prognosis. The SAPS 3 score system exhibited satisfactory performance even superior to APACHE II in discrimination. In predicting hospital mortality, SAPS 3 did not exhibit good calibration and overestimated hospital mortality, which demonstrated that SAPS 3 needs improvement in the future.
Evaluation of an external quality assessment program for HIV testing in Haiti, 2006-2011.

PubMed

Louis, Frantz Jean; Anselme, Renette; Ndongmo, Clement; Buteau, Josiane; Boncy, Jacques; Dahourou, Georges; Vertefeuille, John; Marston, Barbara; Balajee, S Arunmozhi

2013-12-01

To evaluate an external quality assessment (EQA) program for human immunodeficiency virus (HIV) rapid diagnostics testing by the Haitian National Public Health Laboratory (French acronym: LNSP). Acceptable performance was defined as any proficiency testing (PT) score more than 80%. The PT database was reviewed and analyzed to assess the testing performance of the participating laboratories and the impact of the program over time. A total of 242 laboratories participated in the EQA program from 2006 through 2011; participation increased from 70 laboratories in 2006 to 159 in 2011. In 2006, 49 (70%) laboratories had a PT score of 80% or above; by 2011, 145 (97.5%) laboratories were proficient (P < .05). The EQA program for HIV testing ensures quality of testing and allowed the LNSP to document improvements in the quality of HIV rapid testing over time.

People with Parkinson Disease and Normal MMSE Score Have a Broad Range of Cognitive Performance

PubMed Central

Burdick, DJ; Cholerton, B; Watson, GS; Siderowf, A; Trojanowski, JQ; Weintraub, D; Ritz, B; Rhodes, SL; Rausch, R; Factor, SA; Wood-Siverio, C; Quinn, JF; Chung, KA; Srivatsal, S; Edwards, KL; Montine, TJ; Zabetian, CP; Leverenz, JB

2014-01-01

Background Cognitive impairment, including dementia, is common in Parkinson disease (PD). The Mini-Mental State Examination (MMSE) has been recommended as a screening tool for PDD, with values below 26 indicative of possible dementia. Using a detailed neuropsychological battery, we examined the range of cognitive impairment in PD patients with a MMSE score ≥ 26. Methods In this multi-center, cross-sectional, observational study, we performed neuropsychological testing in a sample of 788 PD patients with MMSE ≥ 26. Evaluation included tests of global cognition, executive function, language, memory, and visuospatial skills. A consensus panel reviewed results for 342 subjects and assigned a diagnosis of no cognitive impairment, mild cognitive impairment, or dementia. Results 67% of the 788 subjects performed 1.5 standard deviations below the normative mean on at least one test. On eight of the 15 tests, more than 20% of subjects scored 1.5 standard deviations or more below the normative mean. Greatest impairments were found on Hopkins Verbal Learning and Digit Symbol Coding tests. The sensitivity of the MMSE to detect dementia was 45% in a subset of participants who underwent clinical diagnostic procedures. Conclusions A remarkably wide range of cognitive impairment can be found in PD patients with a relatively high score on the MMSE, including a level of cognitive impairment consistent with dementia. Given these findings, clinicians must be aware of the limitations of the MMSE in detecting cognitive impairment, including dementia, in PD. PMID:25073717
A weighted generalized score statistic for comparison of predictive values of diagnostic tests

PubMed Central

Kosinski, Andrzej S.

2013-01-01

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343
The Relationship of Laboratory Performance Ratings, Information Achievement and Pencil-Paper Performance Test Scores in College-Level Electricity.

ERIC Educational Resources Information Center

Francis, Charles E.

In this study, a pencil paper performance test (PPPT) was developed and administered to an experimental group of 46 students and a control group of 48 students to determine: (1) the difference between laboratory performance and the successful completion of a laboratory course in electricity, (2) the relationship between laboratory performance as…
Comparing Student Performance on the Old vs New Versions of the NAPLEX.

PubMed

Welch, Adam C; Karpen, Samuel C

2018-04-01

Objective. To determine if the new 2016 version of the North American Pharmacy Licensure Examination (NAPLEX) affected scores when controlling for student performance on other measures using data from one institution. Methods. There were 201 records from the classes of 2014-2016. Doubly robust estimation using weighted propensity scores was used to compare NAPLEX scaled scores and pass rates while considering student performance on other measures. Of the potential controllers of student performance: Pharmacy Curricular Outcomes Assessment (PCOA), scaled composite scores from the Pharmacy College Admission Test (PCAT), and P3 Grade Point Average (GPA). Only PCOA and P3 GPA were found to be appropriate for propensity scoring. Results. The weighted NAPLEX scaled scores did not significantly drop from the old (2014-2015) to the new (2016) version of NAPLEX. The change in pass rates between the new and old versions of NAPLEX were also non-significant. Conclusion. Using data from one institution, the new version itself of the NAPLEX did not have a significant effect on NAPLEX scores or first-time pass rates when controlling for student performance on other measures. Colleges are encouraged to repeat this analysis with pooled data and larger sample sizes.
Does household access to improved water and sanitation in infancy and childhood predict better vocabulary test performance in Ethiopian, Indian, Peruvian and Vietnamese cohort studies?

PubMed

Dearden, Kirk A; Brennan, Alana T; Behrman, Jere R; Schott, Whitney; Crookston, Benjamin T; Humphries, Debbie L; Penny, Mary E; Fernald, Lia C H

2017-03-07

Test associations between household water and sanitation (W&S) and children's concurrent and subsequent Peabody Picture Vocabulary Test (PPVT) scores. Prospective cohort study. Ethiopia, India, Peru, Vietnam. 7269 children. PPVT scores at 5 and 8 years. Key exposure variables were related to W&S, and collected at 1, 5 and 8 years, including 'improved' water (eg, piped, public tap or standpipe) and 'improved' toilets (eg, collection, storage, treatment and recycling of human excreta). Access to improved water at 1 year was associated with higher language scores at 5 years (3/4 unadjusted associations) and 8 years (4/4 unadjusted associations). Ethiopian children with access to improved water at 1 year had test scores that were 0.26 SD (95% CI 0.17 to 0.36) higher at 5 years than children without access. Access to improved water at 5 years was associated with higher concurrent PPVT scores (in 3/4 unadjusted associations), but not later scores (in 1/4 unadjusted associations). 5-year-old Peruvian children with access to improved water had better concurrent performance on the PPVT (0.44 SD, 95% CI 0.30 to 0.59) than children without access to improved water. Toilet access at 1 year was also associated with better PPVT scores at 5 years (3/4 unadjusted associations) and sometimes associated with test results at 8 years (2/4 unadjusted associations). Toilet access at 5 years was associated with concurrent PPVT scores (3/4 unadjusted associations). More than half of all associations in unadjusted models (water and toilets) persisted in adjusted models, particularly for toilets in India, Peru and Vietnam. Access to 'improved' water and toilets had independent associations with children's PPVT scores that often persisted with adjustment for covariates. Our findings suggest that effects of W&S may go beyond subacute and acute infections and physical growth to include children's language performance, a critical component of cognitive development. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.
Cheating in OSCEs: The Impact of Simulated Security Breaches on OSCE Performance.

PubMed

Gotzmann, Andrea; De Champlain, André; Homayra, Fahmida; Fotheringham, Alexa; de Vries, Ingrid; Forgie, Melissa; Pugh, Debra

2017-01-01

Construct: Valid score interpretation is important for constructs in performance assessments such as objective structured clinical examinations (OSCEs). An OSCE is a type of performance assessment in which a series of standardized patients interact with the student or candidate who is scored by either the standardized patient or a physician examiner. In high-stakes examinations, test security is an important issue. Students accessing unauthorized test materials can create an unfair advantage and lead to examination scores that do not reflect students' true ability level. The purpose of this study was to assess the impact of various simulated security breaches on OSCE scores. Seventy-six 3rd-year medical students participated in an 8-station OSCE and were randomized to either a control group or to 1 of 2 experimental conditions simulating test security breaches: station topic (i.e., providing a list of station topics prior to the examination) or egregious security breach (i.e., providing detailed content information prior to the examination). Overall total scores were compared for the 3 groups using both a one-way between-subjects analysis of variance and a repeated measure analysis of variance to compare the checklist, rating scales, and oral question subscores across the three conditions. Overall total scores were highest for the egregious security breach condition (81.8%), followed by the station topic condition (73.6%), and they were lowest for the control group (67.4%). This trend was also found with checklist subscores only (79.1%, 64.9%, and 60.3%, respectively for the security breach, station topic, and control conditions). Rating scale subscores were higher for both the station topic and egregious security breach conditions compared to the control group (82.6%, 83.1%, and 77.6%, respectively). Oral question subscores were significantly higher for the egregious security breach condition (88.8%) followed by the station topic condition (64.3%), and they were the lowest for the control group (48.6%). This simulation of different OSCE security breaches demonstrated that student performance is greatly advantaged by having prior access to test materials. This has important implications for medical educators as they develop policies and procedures regarding the safeguarding and reuse of test content.
The comparison of performances of preschool children on two motor assessments.

PubMed

Logan, S Wood; Robinson, Leah E; Getchell, Nancy

2011-12-01

Understanding children's motor performance on different assessments is important for researchers. The Test of Gross Motor Development-2 (TGMD-2) and the Movement Assessment Battery for Children-2 (MABC-2) are motor assessments that use either a process- or product-oriented scoring approach. However, no studies have examined how performances are related to these two types of assessment. This study compared the performance of preschool children on the TGMD-2 and the MABC-2. 32 children (M age = 4.2 yr., SD = 9) completed each test to assess whether each described motor performance similarly. Significant low to moderate Spearman's rank correlations (r2 range = .13-.40) were found between the subscales of the assessments. A related-samples Wilcoxon signed rank test was not significant between total performances on the TGMD-2 and MABC-2. From a practical standpoint, each assessment provides a similar overall description of motor competence in preschool children. However, each assessment results in scores that present different information about motor performance.
Understanding pretest and posttest reactions to cognitive ability and personality tests.

PubMed

Chan, D; Schmitt, N; Sacco, J M; DeShon, R P

1998-06-01

To understand the nature of test reactions and their relationship to test performance, the relationships among belief in tests, pretest reactions, test performance, and posttest reactions were modeled for cognitive ability and personality tests. Results from structural equation models that were fitted to responses from 197 undergraduate examinees supported the hypothesized relationships. On the cognitive ability test, pretest reactions affected test performance and mediated the relationship between belief in tests and test performance. Test performance affected posttest reactions even after taking into account the effect of pretest reactions. On the personality test, belief in tests affected pretest and posttest reactions, but the three variables were unrelated to test performance (Conscientiousness scores). Conceptual, methodological, and practical implications of the findings are discussed in the context of research on test reactions and test performance.
Do personality traits assessed on medical school admission predict exit performance? A UK-wide longitudinal cohort study.

PubMed

MacKenzie, R K; Dowell, J; Ayansina, D; Cleland, J A

2017-05-01

Traditional methods of assessing personality traits in medical school selection have been heavily criticised. To address this at the point of selection, "non-cognitive" tests were included in the UK Clinical Aptitude Test, the most widely-used aptitude test in UK medical education (UKCAT: http://www.ukcat.ac.uk/ ). We examined the predictive validity of these non-cognitive traits with performance during and on exit from medical school. We sampled all students graduating in 2013 from the 30 UKCAT consortium medical schools. Analysis included: candidate demographics, UKCAT non-cognitive scores, medical school performance data-the Educational Performance Measure (EPM) and national exit situational judgement test (SJT) outcomes. We examined the relationships between these variables and SJT and EPM scores. Multilevel modelling was used to assess the relationships adjusting for confounders. The 3343 students who had taken the UKCAT non-cognitive tests and had both EPM and SJT data were entered into the analysis. There were four types of non-cognitive test: (1) libertariancommunitarian, (2) NACE-narcissism, aloofness, confidence and empathy, (3) MEARS-self-esteem, optimism, control, self-discipline, emotional-nondefensiveness (END) and faking, (4) an abridged version of 1 and 2 combined. Multilevel regression showed that, after correcting for demographic factors, END predicted SJT and EPM decile. Aloofness and empathy in NACE were predictive of SJT score. This is the first large-scale study examining the relationship between performance on non-cognitive selection tests and medical school exit assessments. The predictive validity of these tests was limited, and the relationships revealed do not fit neatly with theoretical expectations. This study does not support their use in selection.
A score card for upper GI endoscopy: Evaluation of interobserver variability in examiners with various levels of experience.

PubMed

Neumann, M; Friedl, S; Meining, A; Egger, K; Heldwein, W; Rey, J F; Hochberger, J; Classen, M; Hohenberger, W; Rösch, T

2002-10-01

In most European countries, training in GI endoscopy has largely been based on hands-on acquisition of experience in patients rather than on a structured training programme. With the development of training models systematic hands-on training in a variety of diagnostic and therapeutic endoscopy techniques was achieved. Little, however, is known about methods of objectively assessing trainees' performance. We therefore developed an assessment 'score card' for upper GI endoscopy and tested it in endoscopists with various levels of experience. The aim of the study was therefore to assess interobserver variations in the evaluation of trainees. On the basis of textbook and expert opinions a consensus group of eight experienced endoscopists developed a score card for diagnostic upper GI endoscopy with biopsy. The score card includes an assessment of the single steps of the procedure as well as of the times needed to complete each step. This score card was then evaluated in a further conference including ten experts who blindly assessed videotapes of 15 endoscopists performing upper GI endoscopy in a training bio-simulation model (the 'Erlangen Endo-Trainer'). On the basis of their previous experience (i. e. the number of endoscopies performed) these 15 endoscopists were classified into four groups: very experienced, experienced, having some experience and inexperienced. Interobserver variability (IOV) was tested for the various score card parameters (Kendall's rank-correlation coefficient 0.0-0.5 poor, 0.5-1.0 good agreement). In addition, the correlation between the score card assessment and the examiners' experience levels was analysed. Despite poor IOV results for all the parameters tested (Kendall coefficient < 0.3), the assessment parameters correlated well when the examiners' different experience levels were taken into account (correlation coefficient 0.59-0.89, p < 0.05). The score card parameters were suitable for differentiating between the four groups of examiners with different levels of endoscopic experience. As expected with scores involving subjective assessment of performance, the variability between reviewers was substantial. Nevertheless, the assessment score was capable of distinguishing reliably between different experience levels in terms of a good individual observer consistency. The score card can therefore be used to document both training status and progress during endoscopy training courses using bio-simulation models, and this might be able to provide improved quality assurance in GI endoscopy training.
MODIFIED FUNCTIONAL MOVEMENT SCREENING AS A PREDICTOR OF TACTICAL PERFORMANCE POTENTIAL IN RECREATIONALLY ACTIVE ADULTS.

PubMed

Glass, Stephen M; Ross, Scott E

2015-10-01

Failure to meet minimum performance standards is a leading cause of attrition from basic combat training. A standardized assessment such as the Functional Movement Screen™ (FMS™) could help identify movement behaviors relevant to physical performance in tactical occupations. Previous work has demonstrated only marginal association between FMS™ tests and performance outcomes, but adding a load challenge to this movement assessment may help highlight performance-limiting behaviors. The purposes of this investigation were to quantify the effect of load on FMS™ tests and determine the extent to which performance outcomes could be predicted using scores from both loaded and unloaded FMS™ conditions. Crossover Trial. Thirteen female and six male recreationally active college students (21 ± 1.37 years, 168 ± 9.8 cm, 66 ± 12.25 kg) completed the FMS™ under (1) a control condition (FMS™C), and (2) an 18.10kg weight vest condition (FMS™W). Balance was assessed using a force plate in double-legged stance and tactical physical performance was evaluated via completion times in a battery of field tests. For each condition, penalized regression was used to select models from the seven FMS™ component tests to predict balance and performance outcomes. Data were collected during a single session lasting approximately three hours per participant. For balance, significant predictors were identified from both conditions but primarily predicted poorer balance with increasing FMS™ scores. For tactical performance, models were retained almost exclusively from FMS™W and generally predicted better performance with higher item scores. The current results suggest that FMS™ screening with an external load could help predict performance relevant to tactical occupations. Sports medicine and fitness professionals interested in performance outcomes may consider assessing movement behaviors under a load. 3.
Comparison of credible patients of very low intelligence and non-credible patients on neurocognitive performance validity indicators.

PubMed

Smith, Klayton; Boone, Kyle; Victor, Tara; Miora, Deborah; Cottingham, Maria; Ziegler, Elizabeth; Zeller, Michelle; Wright, Matthew

2014-01-01

The purpose of this archival study was to identify performance validity tests (PVTs) and standard IQ and neurocognitive test scores, which singly or in combination, differentiate credible patients of low IQ (FSIQ ≤ 75; n = 55) from non-credible patients. We compared the credible participants against a sample of 74 non-credible patients who appeared to have been attempting to feign low intelligence specifically (FSIQ ≤ 75), as well as a larger non-credible sample (n = 383) unselected for IQ. The entire non-credible group scored significantly higher than the credible participants on measures of verbal crystallized intelligence/semantic memory and manipulation of overlearned information, while the credible group performed significantly better on many processing speed and memory tests. Additionally, credible women showed faster finger-tapping speeds than non-credible women. The credible group also scored significantly higher than the non-credible subgroup with low IQ scores on measures of attention, visual perceptual/spatial tasks, processing speed, verbal learning/list learning, and visual memory, and credible women continued to outperform non-credible women on finger tapping. When cut-offs were selected to maintain approximately 90% specificity in the credible group, sensitivity rates were highest for verbal and visual memory measures (i.e., TOMM trials 1 and 2; Warrington Words correct and time; Rey Word Recognition Test total; RAVLT Effort Equation, Trial 5, total across learning trials, short delay, recognition, and RAVLT/RO discriminant function; and Digit Symbol recognition), followed by select attentional PVT scores (i.e., b Test omissions and time to recite four digits forward). When failure rates were tabulated across seven most sensitive scores, a cut-off of ≥ 2 failures was associated with 85.4% specificity and 85.7% sensitivity, while a cut-off of ≥ 3 failures resulted in 95.1% specificity and 66.0% sensitivity. Results are discussed in light of extant literature and directions for future research.
Deficits in Physical Function Among Young Childhood Cancer Survivors

PubMed Central

Hoffman, Megan C.; Mulrooney, Daniel A.; Steinberger, Julia; Lee, Jill; Baker, K. Scott; Ness, Kirsten K.

2013-01-01

Purpose Childhood cancer survivors (CCSs) are at risk for physical disability. The aim of this investigation was to characterize and compare physical performance among CCSs and a group of siblings age < 18 years and determine if diagnosis, treatment, and physical activity levels were associated with lower performance scores. Methods CCSs ≥ 5 years from diagnosis and a sibling comparison group were recruited and evaluated for strength, mobility, and fitness. Physical performance measures were compared in regression models between survivors and siblings by diagnosis and among survivors by treatment exposures and physical activity levels. Results CCSs (n = 183; mean age ± standard deviation [SD], 13.5 ± 2.5 years; 53% male) scored lower than siblings (n = 147; mean age ± SD, 13.4 ± 2.4 years; 50% male) on lower-extremity strength testing, the timed up-and-go (TUG) test, and the 6-minute walk (6MW) test, despite reporting similar levels and types of habitual physical activity. The lowest scores were prevalent among survivors of CNS tumors and bone and soft tissue sarcomas on strength testing (score ± SD: CNS tumors, 76.5 ± 4.7; sarcoma 67.1 ± 7.2 v siblings, 87.3 ± 2.4 Newton-meters quadricep strength at 90° per second; P = .04 and .01, respectively) and among CNS tumor survivors on the TUG (score ± SD: 5.1 ± 0.1 v siblings, 4.4 ± 0.1 seconds; P < .001) and 6MW tests (score ± SD: 533.3 ± 15.6 v siblings, 594.1 ± 8.3 m; P < .001). Conclusion CCSs may have underlying physiologic deficits that interfere with function that cannot be completely overcome by participation in regular physical activity. These survivors may need referral for specialized exercise interventions in addition to usual counseling to remain physically active. PMID:23796992
Detectable changes in physical performance measures in elderly African Americans.

PubMed

Mangione, Kathleen Kline; Craik, Rebecca L; McCormick, Alyson A; Blevins, Heather L; White, Meaghan B; Sullivan-Marx, Eileen M; Tomlinson, James D

2010-06-01

African American older adults have higher rates of self-reported disability and lower physical performance scores compared with white older adults. Measures of physical performance are used to predict future morbidity and to determine the effect of exercise. Characteristics of performance measures are not known for African American older adults. The purpose of this study was to estimate the standard error of measurement (SEM) and minimal detectable change (MDC) for the Short Physical Performance Battery (SPPB), Timed "Up & Go" Test (TUG) time, free gait speed, fast gait speed, and Six-Minute Walk Test (6MWT) distance in frail African American adults. This observational measurement study used a test-retest design. Individuals were tested 2 times over a 1-week period. Demographic data collected included height, weight, number of medications, assistive device use, and Mini-Mental Status Examination (MMSE) scores. Participants then completed the 5 physical performance tests. Fifty-two participants (mean age=78 years) completed the study. The average MMSE score was 25 points, and the average body mass index was 29.4 kg/m(2). On average, participants took 7 medications, and the majority used assistive devices. Intraclass correlation coefficients (ICC [2,1]) were greater than .90, except for the SPPB score (ICC=.81). The SEMs were 1.2 points for the SPPB, 1.7 seconds for the TUG, 0.08 m/s for free gait speed, 0.09 m/s for fast gait speed, and 28 m for 6MWT distance. The MDC values were 2.9 points for the SPPB, 4 seconds for the TUG, 0.19 m/s for free gait speed, 0.21 m/s for fast gait speed, and 65 m for 6MWT distance. The entire sample was from an urban area. The SEMs were similar to previously reported values and can be used when working with African American and white older adults. Estimates of MDC were calculated to assist in clinical interpretation.
Impairment of perception and recognition of faces, mimic expression and gestures in schizophrenic patients.

PubMed

Berndl, K; von Cranach, M; Grüsser, O J

1986-01-01

The perception and recognition of faces, mimic expression and gestures were investigated in normal subjects and schizophrenic patients by means of a movie test described in a previous report (Berndl et al. 1986). The error scores were compared with results from a semi-quantitative evaluation of psychopathological symptoms and with some data from the case histories. The overall error scores found in the three groups of schizophrenic patients (paranoic, hebephrenic, schizo-affective) were significantly increased (7-fold) over those of normals. No significant difference in the distribution of the error scores in the three different patient groups was found. In 10 different sub-tests following the movie the deficiencies found in the schizophrenic patients were analysed in detail. The error score for the averbal test was on average higher in paranoic patients than in the two other groups of patients, while the opposite was true for the error scores found in the verbal tests. Age and sex had some impact on the test results. In normals, female subjects were somewhat better than male. In schizophrenic patients the reverse was true. Thus female patients were more affected by the disease than male patients with respect to the task performance. The correlation between duration of the disease and error score was small; less than 10% of the error scores could be attributed to factors related to the duration of illness. Evaluation of psychopathological symptoms indicated that the stronger the schizophrenic defect, the higher the error score, but again this relationship was responsible for not more than 10% of the errors. The estimated degree of acute psychosis and overall sum of psychopathological abnormalities as scored in a semi-quantitative exploration did not correlate with the error score, but with each other. Similarly, treatment with psychopharmaceuticals, previous misuse of drugs or of alcohol had practically no effect on the outcome of the test data. The analysis of performance and test data of schizophrenic patients indicated that our findings are most likely not due to a "non-specific" impairment of cognitive function in schizophrenia, but point to a fairly selective defect in elementary cognitive visual functions necessary for averbal social communication. Some possible explanations of the data are discussed in relation to neuropsychological and neurophysiological findings on "face-specific" cortical areas located in the primate temporal lobe.
Impaired consciousness in partial seizures is bimodally distributed

PubMed Central

Cunningham, Courtney; Chen, William C.; Shorten, Andrew; McClurkin, Michael; Choezom, Tenzin; Schmidt, Christian P.; Chu, Victoria; Bozik, Anne; Best, Cameron; Chapman, Melissa; Furman, Moran; Detyniecki, Kamil; Giacino, Joseph T.

2014-01-01

Objective: To investigate whether impaired consciousness in partial seizures can usually be attributed to specific deficits in the content of consciousness or to a more general decrease in the overall level of consciousness. Methods: Prospective testing during partial seizures was performed in patients with epilepsy using the Responsiveness in Epilepsy Scale (n = 83 partial seizures, 30 patients). Results were compared with responsiveness scores in a cohort of patients with severe traumatic brain injury evaluated with the JFK Coma Recovery Scale–Revised (n = 552 test administrations, 184 patients). Results: Standardized testing during partial seizures reveals a bimodal scoring distribution, such that most patients were either fully impaired or relatively spared in their ability to respond on multiple cognitive tests. Seizures with impaired performance on initial test items remained consistently impaired on subsequent items, while other seizures showed spared performance throughout. In the comparison group, we found that scores of patients with brain injury were more evenly distributed across the full range in severity of impairment. Conclusions: Partial seizures can often be cleanly separated into those with vs without overall impaired responsiveness. Results from similar testing in a comparison group of patients with brain injury suggest that the bimodal nature of Responsiveness in Epilepsy Scale scores is not a result of scale bias but may be a finding unique to partial seizures. These findings support a model in which seizures either propagate or do not propagate to key structures that regulate overall arousal and thalamocortical function. Future investigations are needed to relate these behavioral findings to the physiology underlying impaired consciousness in partial seizures. PMID:24727311
Impaired consciousness in partial seizures is bimodally distributed.

PubMed

Cunningham, Courtney; Chen, William C; Shorten, Andrew; McClurkin, Michael; Choezom, Tenzin; Schmidt, Christian P; Chu, Victoria; Bozik, Anne; Best, Cameron; Chapman, Melissa; Furman, Moran; Detyniecki, Kamil; Giacino, Joseph T; Blumenfeld, Hal

2014-05-13

To investigate whether impaired consciousness in partial seizures can usually be attributed to specific deficits in the content of consciousness or to a more general decrease in the overall level of consciousness. Prospective testing during partial seizures was performed in patients with epilepsy using the Responsiveness in Epilepsy Scale (n = 83 partial seizures, 30 patients). Results were compared with responsiveness scores in a cohort of patients with severe traumatic brain injury evaluated with the JFK Coma Recovery Scale-Revised (n = 552 test administrations, 184 patients). Standardized testing during partial seizures reveals a bimodal scoring distribution, such that most patients were either fully impaired or relatively spared in their ability to respond on multiple cognitive tests. Seizures with impaired performance on initial test items remained consistently impaired on subsequent items, while other seizures showed spared performance throughout. In the comparison group, we found that scores of patients with brain injury were more evenly distributed across the full range in severity of impairment. Partial seizures can often be cleanly separated into those with vs without overall impaired responsiveness. Results from similar testing in a comparison group of patients with brain injury suggest that the bimodal nature of Responsiveness in Epilepsy Scale scores is not a result of scale bias but may be a finding unique to partial seizures. These findings support a model in which seizures either propagate or do not propagate to key structures that regulate overall arousal and thalamocortical function. Future investigations are needed to relate these behavioral findings to the physiology underlying impaired consciousness in partial seizures.
A new instrument to assess physician skill at thoracic ultrasound, including pleural effusion markup.

PubMed

Salamonsen, Matthew; McGrath, David; Steiler, Geoff; Ware, Robert; Colt, Henri; Fielding, David

2013-09-01

To reduce complications and increase success, thoracic ultrasound is recommended to guide all chest drainage procedures. Despite this, no tools currently exist to assess proceduralist training or competence. This study aims to validate an instrument to assess physician skill at performing thoracic ultrasound, including effusion markup, and examine its validity. We developed an 11-domain, 100-point assessment sheet in line with British Thoracic Society guidelines: the Ultrasound-Guided Thoracentesis Skills and Tasks Assessment Test (UGSTAT). The test was used to assess 22 participants (eight novices, seven intermediates, seven advanced) on two occasions while performing thoracic ultrasound on a pleural effusion phantom. Each test was scored by two blinded expert examiners. Validity was examined by assessing the ability of the test to stratify participants according to expected skill level (analysis of variance) and demonstrating test-retest and intertester reproducibility by comparison of repeated scores (mean difference [95% CI] and paired t test) and the intraclass correlation coefficient. Mean scores for the novice, intermediate, and advanced groups were 49.3, 73.0, and 91.5 respectively, which were all significantly different (P < .0001). There were no significant differences between repeated scores. Procedural training on mannequins prior to unsupervised performance on patients is rapidly becoming the standard in medical education. This study has validated the UGSTAT, which can now be used to determine the adequacy of thoracic ultrasound training prior to clinical practice. It is likely that its role could be extended to live patients, providing a way to document ongoing procedural competence.
Examining Critical Thinking Skills in Family Medicine Residents.

PubMed

Ross, David; Schipper, Shirley; Westbury, Chris; Linh Banh, Hoan; Loeffler, Kim; Allan, G Michael; Ross, Shelley

2016-02-01

Our objective was to determine the relationship between critical thinking skills and objective measures of academic success in a family medicine residency program. This prospective observational cohort study was set in a large Canadian family medicine residency program. Intervention was the California Critical Thinking Skills Test (CCTST), administered at three points in residency: upon entry, at mid-point, and at graduation. Results from the CCTST, Canadian Residency Matching Service file, and interview scores were compared to other measures of academic performance (Medical Colleges Admission Test [MCAT] and College of Family Physicians of Canada [CCFP] certification examination results). For participants (n=60), significant positive correlations were found between critical thinking skills and performance on tests of knowledge. For the MCAT, CCTST scores correlated positively with full scores (n=24, r=0.57) as well as with each section score (verbal reasoning: r=0.59; physical sciences: r=0.64; biological sciences: r=0.54). For CCFP examination, CCTST correlated reliably with both sections (n=49, orals: r=0.34; short answer: r=0.47). Additionally, CCTST was a better predictor of performance on the CCFP exam than was the interview score at selection into the residency program (Fisher's r-to-z test, z=2.25). Success on a critical thinking skills exam was found to predict success on family medicine certification examinations. Given that critical thinking skills appear to be stable throughout residency training, including an assessment of critical thinking in the selection process may help identify applicants more likely to be successful on final certification exam.
Older Children Have a Greater Chance to Be Accepted to Gifted Student Programmes

ERIC Educational Resources Information Center

Segev, Elad; Cahan, Sorel

2014-01-01

Selection to programmes for gifted students in Israel, performed in the second grade, relies on raw ability and achievement test scores, irrespective of age, thereby ignoring the well-known effect of within-grade age differences on test scores. Employing the entire cohort of third graders of legal age (67,366 students, 1.4% of whom were enrolled…

Some links on this page may take you to non-federal websites. Their policies may differ from this site.