ability test scores: Topics by Science.gov

Sample records for ability test scores

A Review of Scoring Algorithms for Ability and Aptitude Tests.

ERIC Educational Resources Information Center

Chevalier, Shirley A.

In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…
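
The dichotomous right-wrong scoring described above can be sketched next to one classical alternative that adjusts for guessing. This is only an illustration. The abstract is cut off before it names the models the paper reviews, so the correction-for-guessing rule (right answers minus wrong answers divided by the number of options minus one) is an assumption, chosen because it is a standard textbook example. The answer key, the responses, and the function names are hypothetical.

```python
def number_right_score(responses, key):
    """Dichotomous scoring: one point per correct answer, zero otherwise."""
    return sum(1 for given, correct in zip(responses, key) if given == correct)


def formula_score(responses, key, n_options):
    """Correction for guessing: right - wrong / (n_options - 1).

    Omitted items (None) are neither rewarded nor penalised.
    """
    right = number_right_score(responses, key)
    wrong = sum(
        1
        for given, correct in zip(responses, key)
        if given is not None and given != correct
    )
    return right - wrong / (n_options - 1)


# Hypothetical five-item, four-option test.
key = ["A", "C", "B", "D", "A"]
responses = ["A", "C", "D", None, "B"]  # two right, two wrong, one omitted

raw = number_right_score(responses, key)
corrected = formula_score(responses, key, n_options=4)
```

Under this rule an examinee who guesses blindly on every item expects a corrected score of zero, which number-right scoring cannot express.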
Sex Differences in Cognitive Abilities Test Scores: A UK National Picture

ERIC Educational Resources Information Center

Strand, Steve; Deary, Ian J.; Smith, Pauline

2006-01-01

Background and aims: There is uncertainty about the extent or even existence of sex differences in the mean and variability of reasoning test scores (Jensen, 1998; Lynn, 1994; Mackintosh, 1996). This paper analyses the Cognitive Abilities Test (CAT) scores of a large and representative sample of UK pupils to determine the extent of any sex…
Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

ERIC Educational Resources Information Center

Matton, Nadine; Vautier, Stephane; Raufaste, Eric

2009-01-01

Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…
The Effect of Schooling and Ability on Achievement Test Scores. NBER Working Paper Series.

ERIC Educational Resources Information Center

Hansen, Karsten; Heckman, James J.; Mullen, Kathleen J.

This study developed two methods for estimating the effect of schooling on achievement test scores that control for the endogeneity of schooling by postulating that both schooling and test scores are generated by a common unobserved latent ability. The methods were applied to data on schooling and test scores. Estimates from the two methods are in…
Multidimensional Scoring of Abilities: The Ordered Polytomous Response Case

ERIC Educational Resources Information Center

de la Torre, Jimmy

2008-01-01

Recent work has shown that multidimensionally scoring responses from different tests can provide better ability estimates. For educational assessment data, applications of this approach have been limited to binary scores. Of the different variants, the de la Torre and Patz model is considered more general because implementing the scoring procedure…
An Investigation of Calculator Use on Employment Tests of Mathematical Ability: Effects on Reliability, Validity, Test Scores, and Speed of Completion

ERIC Educational Resources Information Center

Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl

2009-01-01

Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…
The Score Reliability of Draw-a-Person Intellectual Ability Test (DAP: IQ) for Rural Malawi Students

ERIC Educational Resources Information Center

Khasu, Denis S.; Williams, Thomas O., Jr.

2016-01-01

In this brief article, the reliability of scores for the Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults (DAP: IQ; Reynolds & Hickman, 2004) was examined through several analyses with a sample of 147 children from rural Malawi, Africa using a Chichewa translation of instructions. Cronbach alpha coefficients for…
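
For readers unfamiliar with the reliability statistic named above, Cronbach's alpha for k items is k / (k - 1) multiplied by (1 - sum of item variances / variance of total scores). The sketch below shows that computation. The data are made up and are not DAP: IQ scores, and the function name is hypothetical.

```python
from statistics import variance


def cronbach_alpha(item_scores):
    """Cronbach's alpha from a list of rows (one row of item scores per respondent)."""
    k = len(item_scores[0])                      # number of items
    columns = list(zip(*item_scores))            # one tuple of scores per item
    sum_item_variances = sum(variance(col) for col in columns)
    total_variance = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1 - sum_item_variances / total_variance)


# Made-up responses: three respondents, three perfectly consistent items.
toy_scores = [
    [1, 1, 1],
    [2, 2, 2],
    [3, 3, 3],
]
alpha = cronbach_alpha(toy_scores)
```

Because the toy items rank respondents identically, alpha reaches its maximum of 1 here. Real data would give a lower value.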
What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

ERIC Educational Resources Information Center

McIntosh, James; Munk, Martin D.

2014-01-01

Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…
Interpreting Vocabulary Test Scores: What Do Various Item Formats Tell Us about Learners' Ability to Employ Words?

ERIC Educational Resources Information Center

Kremmel, Benjamin; Schmitt, Norbert

2016-01-01

The scores from vocabulary size tests have typically been interpreted as demonstrating that the target words are "known" or "learned." But "knowing" a word should entail the ability to use it in real language communication in one or more of the four skills. It should also entail deeper knowledge, such as knowing the…
Fluctuation in Spatial Ability Scores during the Menstrual Cycle.

ERIC Educational Resources Information Center

Moody, M. Suzanne

Whether or not fluctuations in spatial ability as measured by S. G. Vandenberg's Mental Rotations Test occur during the menstrual cycle was studied with 133 female students from 9 undergraduate educational psychology and nursing classes. For comparison, 28 male students also took the test. Scores from 55 females fell into the relevant menstrual…
Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally.

PubMed

Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R

2018-05-03

We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Test Scores and Stereotypes.

ERIC Educational Resources Information Center

Gose, Ben

1995-01-01

A psychologist's research suggests that black and female students may have lower standardized test scores and academic achievement because they have accepted stereotypes concerning their ability. Critics feel the researcher, Claude M. Steele, may be overlooking other factors. Steele has developed a program at Stanford University (California) to…
Dynamic testing and test anxiety amongst gifted and average-ability children.

PubMed

Vogelaar, Bart; Bakker, Merel; Elliott, Julian G; Resing, Wilma C M

2017-03-01

Dynamic testing has been proposed as a testing approach that is less disadvantageous for children who may be potentially subject to bias when undertaking conventional assessments. For example, those who encounter high levels of test anxiety, or who are unfamiliar with standardized test procedures, may fail to demonstrate their true potential or capabilities. While dynamic testing has proven particularly useful for special groups of children, it has rarely been used with gifted children. We investigated whether it would be useful to conduct a dynamic test to measure the cognitive abilities of intellectually gifted children. We also investigated whether test anxiety scores would be related to a progression in the children's test scores after dynamic training. Participants were 113 children aged between 7 and 8 years from several schools in the western part of the Netherlands. The children were categorized as either gifted or average-ability and split into an unguided practice or a dynamic testing condition. The study employed a pre-test-training-post-test design. Using linear mixed modelling analysis with a multilevel approach, we inspected the growth trajectories of children in the various conditions and examined the impact of ability and test anxiety on progression and training benefits. Dynamic testing proved to be successful in improving the scores of the children, although no differences in training benefits were found between gifted and average-ability children. Test anxiety was shown to influence the children's rate of change across all test sessions and their improvement in performance accuracy after dynamic training. © 2016 The British Psychological Society.
Addressing criticisms of existing predictive bias research: cognitive ability test scores still overpredict African Americans' job performance.

PubMed

Berry, Christopher M; Zhao, Peng

2015-01-01

Predictive bias studies have generally suggested that cognitive ability test scores overpredict job performance of African Americans, meaning these tests are not predictively biased against African Americans. However, at least 2 issues call into question existing over-/underprediction evidence: (a) a bias identified by Aguinis, Culpepper, and Pierce (2010) in the intercept test typically used to assess over-/underprediction and (b) a focus on the level of observed validity instead of operational validity. The present study developed and utilized a method of assessing over-/underprediction that draws on the math of subgroup regression intercept differences, does not rely on the biased intercept test, allows for analysis at the level of operational validity, and can use meta-analytic estimates as input values. Therefore, existing meta-analytic estimates of key parameters, corrected for relevant statistical artifacts, were used to determine whether African American job performance remains overpredicted at the level of operational validity. African American job performance was typically overpredicted by cognitive ability tests across levels of job complexity and across conditions wherein African American and White regression slopes did and did not differ. Because the present study does not rely on the biased intercept test and because appropriate statistical artifact corrections were carried out, the present study's results are not affected by the 2 issues mentioned above. The present study represents strong evidence that cognitive ability tests generally overpredict job performance of African Americans. (c) 2015 APA, all rights reserved.
Measurement of ability emotional intelligence: results for two new tests.

PubMed

Austin, Elizabeth J

2010-08-01

Emotional intelligence (EI) has attracted considerable interest amongst both individual differences researchers and those in other areas of psychology who are interested in how EI relates to criteria such as well-being and career success. Both trait (self-report) and ability EI measures have been developed; the focus of this paper is on ability EI. The associations of two new ability EI tests with psychometric intelligence, emotion perception, and the Mayer-Salovey-Caruso EI test (MSCEIT) were examined. The new EI tests were the Situational Test of Emotion Management (STEM) and the Situational Test of Emotional Understanding (STEU). Only the STEU and the MSCEIT Understanding Emotions branch were significantly correlated with psychometric intelligence, suggesting that only understanding emotions can be regarded as a candidate new intelligence component. These understanding emotions tests were also positively correlated with emotion perception tests, and STEM and STEU scores were positively correlated with MSCEIT total score and most branch scores. Neither the STEM nor the STEU were significantly correlated with trait EI tests, confirming the distinctness of trait and ability EI. Taking the present results as a starting-point, approaches to the development of new ability EI tests and models of EI are suggested.
Visual-Constructional Ability in Individuals with Severe Obesity: Rey Complex Figure Test Accuracy and the Q-Score.

PubMed

Sargénius, Hanna L; Bylsma, Frederick W; Lydersen, Stian; Hestad, Knut

2017-01-01

The aims of this study were to investigate visual-construction and organizational strategy among individuals with severe obesity, as measured by the Rey Complex Figure Test (RCFT), and to examine the validity of the Q-score as a measure for the quality of performance on the RCFT. Ninety-six non-demented morbidly obese (MO) patients and 100 healthy controls (HC) completed the RCFT. Their performance was calculated by applying the standard scoring criteria. The quality of the copying process was evaluated per the directions of the Q-score scoring system. Results revealed that the MO did not perform significantly lower than the HC on Copy accuracy (mean difference -0.302, CI -1.374 to 0.769, p = 0.579). In contrast, the groups did statistically differ from each other, with MO performing more poorly than the HC on the Q-score (mean -1.784, CI -3.237 to -0.331, p = 0.016) and the Unit points (mean -1.409, CI -2.291 to -0.528, p = 0.002), but not on the Order points score (mean -0.351, CI -0.994 to 0.293, p = 0.284). Differences on the Unit score and the Q-score were slightly reduced when adjusting for gender, age, and education. This study presents evidence supporting the presence of inefficiency in visuospatial constructional ability among MO patients. We believe we have found an indication that the Q-score captures a wider range of cognitive processes that are not described by traditional scoring methods. Rather than considering accuracy and placement of the different elements only, the Q-score focuses more on how the subject has approached the task.
Equating Scores from Adaptive to Linear Tests

ERIC Educational Resources Information Center

van der Linden, Wim J.

2006-01-01

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index.

PubMed

Roelen, Corné A M; van Rhenen, Willem; Groothoff, Johan W; van der Klink, Jac J L; Twisk, Jos W R; Heymans, Martijn W

2014-07-01

Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. This prospective cohort study comprised 11 537 male construction workers, who completed the WAI at baseline and reported DP after a mean 2.3 years of follow-up. WAS and WAI were calibrated for DP risk predictions with the Hosmer-Lemeshow (H-L) test and their ability to discriminate between high- and low-risk construction workers was investigated with the area under the receiver operating characteristic curve (AUC). At follow-up, 336 (3%) construction workers reported DP. Both WAS [odds ratio (OR) 0.72, 95% confidence interval (95% CI) 0.66-0.78] and WAI (OR 0.57, 95% CI 0.52-0.63) scores were associated with DP at follow-up. The WAS showed miscalibration (H-L model χ²=10.60; df=3; P=0.01) and poorly discriminated between high- and low-risk construction workers (AUC 0.67, 95% CI 0.64-0.70). In contrast, calibration (H-L model χ²=8.20; df=8; P=0.41) and discrimination (AUC 0.78, 95% CI 0.75-0.80) were both adequate for the WAI. Although associated with the risk of future DP, the single-item WAS poorly identified male construction workers at risk of DP. We recommend using the multi-item WAI to screen for risk of DP in occupational health practice.
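
The discrimination measure reported above, the area under the ROC curve, can be read as the probability that a randomly chosen worker who later reported DP has a higher predicted risk than a randomly chosen worker who did not. The sketch below computes it by direct pairwise comparison. The six scores and outcomes are invented for illustration and are not the study's data. Negating the work ability score to obtain a risk score is an assumption, made because lower work ability goes with higher DP risk. The function name is hypothetical.

```python
def roc_auc(risk_scores, outcomes):
    """AUC as the share of (case, non-case) pairs ranked correctly; ties count half."""
    cases = [s for s, y in zip(risk_scores, outcomes) if y == 1]
    non_cases = [s for s, y in zip(risk_scores, outcomes) if y == 0]
    concordant = 0.0
    for case_score in cases:
        for other_score in non_cases:
            if case_score > other_score:
                concordant += 1.0
            elif case_score == other_score:
                concordant += 0.5
    return concordant / (len(cases) * len(non_cases))


# Invented example: single-item work ability scores (0-10) and DP at follow-up.
work_ability = [3, 7, 8, 9, 6, 10]
disability_pension = [1, 1, 0, 0, 0, 0]

# Lower work ability means higher risk, so negate the score before ranking.
risk = [-score for score in work_ability]
toy_auc = roc_auc(risk, disability_pension)
```

An AUC of 0.5 corresponds to chance-level ranking and 1.0 to perfect separation, which is the scale on which the reported 0.67 and 0.78 are compared.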
Latent ability: grades and test scores systematically underestimate the intellectual ability of negatively stereotyped students.

PubMed

Walton, Gregory M; Spencer, Steven J

2009-09-01

Past research has assumed that group differences in academic performance entirely reflect genuine differences in ability. In contrast, extending research on stereotype threat, we suggest that standard measures of academic performance are biased against non-Asian ethnic minorities and against women in quantitative fields. This bias results not from the content of performance measures, but from the context in which they are assessed, that is, from psychological threats in common academic environments, which depress the performances of people targeted by negative intellectual stereotypes. Like the time of a track star running into a stiff headwind, such performances underestimate the true ability of stereotyped students. Two meta-analyses, combining data from 18,976 students in five countries, tested this latent-ability hypothesis. Both meta-analyses found that, under conditions that reduce psychological threat, stereotyped students performed better than nonstereotyped students at the same level of past performance. We discuss implications for the interpretation of and remedies for achievement gaps.
Can Tracking Raise the Test Scores of High-Ability Minority Students?

ERIC Educational Resources Information Center

Card, David; Giuliano, Laura

2016-01-01

We evaluate a tracking program in a large urban district where schools with at least one gifted fourth grader create a separate "gifted/high achiever" classroom. Most seats are filled by non-gifted high achievers, ranked by previous-year test scores. We study the program's effects on the high achievers using (1) a rank-based regression…

The Relationship between Deductive Reasoning Ability, Test Anxiety, and Standardized Test Scores in a Latino Sample

ERIC Educational Resources Information Center

Rich, John D., Jr.; Fullard, William; Overton, Willis

2011-01-01

One hundred and twelve Latino students from Philadelphia participated in this study, which examined the development of deductive reasoning across adolescence, and the relation of reasoning to test anxiety and standardized test scores. As predicted, 11th and ninth graders demonstrated significantly more advanced reasoning than seventh graders.…
Graduate Students' Administration and Scoring Errors on the Woodcock-Johnson III Tests of Cognitive Abilities

ERIC Educational Resources Information Center

Ramos, Erica; Alfonso, Vincent C.; Schermerhorn, Susan M.

2009-01-01

The interpretation of cognitive test scores often leads to decisions concerning the diagnosis, educational placement, and types of interventions used for children. Therefore, it is important that practitioners administer and score cognitive tests without error. This study assesses the frequency and types of examiner errors that occur during the…
A Factor Analysis of Learning Data and Selected Ability Test Scores

ERIC Educational Resources Information Center

Jones, Dorothy L.

1976-01-01

A verbal concept-learning task permitting the externalizing and quantifying of learning behavior and 16 ability tests were administered to female graduate students. Data were analyzed by alpha factor analysis and incomplete image analysis. Six alpha factors and 12 image factors were extracted and orthogonally rotated. Four areas of cognitive…
Relationship between candidate communication ability and oral certification examination scores.

PubMed

Lunz, Mary E; Bashook, Philip G

2008-12-01

Structured case-based oral examinations are widely used in medical certifying examinations in the USA. These orals assess the candidate's decision-making skills using real or realistic patient cases. Frequently mentioned but not empirically evaluated is the potential bias introduced by the candidate's communication ability. This study aimed to assess the relationship between candidate communication ability and medical certification oral examination scores. Non-doctor communication observers rated a random sample of 90 candidates on communication ability during a medical oral certification examination. The multi-facet Rasch model was used to analyse the communication survey and the oral examination data. The multi-facet model accounts for observer and examiner severity bias. ANOVA was used to measure differences in communication ability between passing and failing candidates and candidates grouped by level of communication ability. Pearson's correlations were used to compare candidate communication ability and oral certification examination performance. Candidate separation reliability values for the communication survey and the oral examination were 0.85 and 0.97, respectively, suggesting accurate candidate measurement. The correlation between communication scores and oral examination scores was 0.10. No significant difference was found between passing and failing candidates for measured communication ability. When candidates were grouped by high, moderate and low communication ability, there was no significant difference in their oral certification examination performance. Candidates' communication ability has little relationship to candidate performance on high-stakes, case-based oral examinations. Examiners for this certifying examination focused on assessing candidate decision-making ability and were not influenced by candidate communication ability.
Evaluating Pekin duck walking ability using a treadmill performance test.

PubMed

Byrd, C J; Main, R P; Makagon, M M

2016-10-01

Gait scoring is the most popular method for assessing the walking ability of poultry species. Although inexpensive and easy to implement, gait scoring systems are often criticized for being subjective. Using a treadmill performance test we assessed whether observable differences in Pekin duck walking ability identified using a gait scoring system translated to differences in walking performance. One hundred and eighty ducks were selected using a three-category gait scoring system (GS0 = smooth gait, n = 55; GS0.5 = labored walk without easily identifiable impediment, n = 56; GS1 = obvious impediment, n = 59) and the amount of time each duck was able to sustain walking on a treadmill at a speed of 0.31 m/s was evaluated. The walking test ended when each duck met one of three elimination criteria: (1) The duck walked for a maximum time of ten minutes, (2) the duck required support from the observer's hand for more than three seconds in order to continue walking on the treadmill, or (3) the duck sat down on the treadmill and made no attempt to stand despite receiving assistance from the observer. Data were analyzed in SAS 9.4 using PROC GLM. Tukey's multiple comparison test was used to compare differences in time spent walking between gait scores. Significant differences were found between all gait scores (P < 0.05). Behavioral correlates of walking performance were investigated. Video recorded during the treadmill test was analyzed for counts of sitting, standing, and leaning behaviors. Data were analyzed in SAS 9.4 using a negative binomial model for count data. No differences were found between gait scores for counts of sitting, standing, and leaning behaviors (P > 0.05). In conclusion, the amount of time spent walking on the treadmill corresponded to gait score and was an effective measurement for quantifying Pekin duck walking ability. The test could be a valuable tool for assessing the development of walking issues or the effectiveness of…
The Effects of Item by Item Feedback Given during an Ability Test.

ERIC Educational Resources Information Center

Whetton, C.; Childs, R.

1981-01-01

Answer-until-correct (AUC) is a procedure for providing feedback during a multiple-choice test, giving an increased range of scores. The performance of secondary students on a verbal ability test using AUC procedures was compared with a group using conventional instructions. AUC scores considerably enhanced reliability but not validity.…
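
The "increased range of scores" mentioned above follows from giving partial credit according to how many attempts an item needed. The abstract does not state the scoring rule the authors used, so the rule below (number of options minus number of attempts) is an assumption for illustration only. The attempt counts and function names are likewise hypothetical.

```python
def answer_until_correct_item_score(attempts, n_options):
    """Partial credit: full marks on the first try, zero if every option was needed."""
    if not 1 <= attempts <= n_options:
        raise ValueError("attempts must lie between 1 and n_options")
    return n_options - attempts


def answer_until_correct_test_score(attempts_per_item, n_options):
    """Sum the partial-credit item scores across the whole test."""
    return sum(
        answer_until_correct_item_score(a, n_options) for a in attempts_per_item
    )


# Hypothetical examinee on a five-item, four-option test:
# number of attempts needed to reach the correct answer on each item.
attempts_needed = [1, 2, 1, 4, 3]

partial_credit_total = answer_until_correct_test_score(attempts_needed, n_options=4)
first_try_total = sum(1 for a in attempts_needed if a == 1)  # conventional score
```

With five four-option items the conventional score can take 6 distinct values (0 to 5), while this partial-credit rule allows 16 (0 to 15).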
Accountancy, teaching methods, sex, and American College Test scores.

PubMed

Heritage, J; Harper, B S; Harper, J P

1990-10-01

This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.
A Quick Assessment of Visuospatial Abilities in Adolescents Using the Design Organization Test (DOT).

PubMed

Burggraaf, Rudolf; Frens, Maarten A; Hooge, Ignace T C; van der Geest, Jos N

2016-01-01

Tests measuring visuospatial abilities have shown that these abilities increase during adolescence. Unfortunately, the Block Design test and other such tests are complicated and time-consuming to administer, making them unsuitable for use with large groups of restless adolescents. The results of the Design Organization Test (DOT), a quick pen-and-paper test, have been shown to correlate with those of the Block Design test. A group of 198 healthy adolescents (110 male and 88 female) aged 12 to 19 years old participated in this study. A slightly modified version of the DOT has been used in which we shortened the administration time to avoid a ceiling effect in the score. Scores show a linear increase with age (on average 2.0 points per year, r = .61) independent of sex. Scores did not differ between individual setting and group setting. Thus, the DOT is a simple and effective way to assess visuospatial ability in large groups, such as in schools, and it can be easily administered year after year to follow the development of students.
Qualitative Dimensions in Scoring the Rey Visual Memory Test of Malingering.

ERIC Educational Resources Information Center

Griffin, G. A. Elmer; And Others

1996-01-01

A new qualitative scoring system for the Rey Visual Memory Test was tested for its ability to distinguish between malingerers and nonmalingerers. The new system, based on the types of errors made, was able to distinguish between 53 psychiatrically disabled and 64 normal nonmalingerers, and between nonmalingerers and 91 possible malingerers. (SLD)
Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

PubMed

Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

2015-11-01

In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. (c) 2015 APA, all rights reserved.
Understanding pretest and posttest reactions to cognitive ability and personality tests.

PubMed

Chan, D; Schmitt, N; Sacco, J M; DeShon, R P

1998-06-01

To understand the nature of test reactions and their relationship to test performance, the relationships among belief in tests, pretest reactions, test performance, and posttest reactions were modeled for cognitive ability and personality tests. Results from structural equation models that were fitted to responses from 197 undergraduate examinees supported the hypothesized relationships. On the cognitive ability test, pretest reactions affected test performance and mediated the relationship between belief in tests and test performance. Test performance affected posttest reactions even after taking into account the effect of pretest reactions. On the personality test, belief in tests affected pretest and posttest reactions, but the three variables were unrelated to test performance (Conscientiousness scores). Conceptual, methodological, and practical implications of the findings are discussed in the context of research on test reactions and test performance.
Work ability score and future work ability as predictors of register-based disability pension and long-term sickness absence: A three-year follow-up study.

PubMed

Kinnunen, Ulla; Nätti, Jouko

2018-05-01

We investigated two single items of the Work Ability Index - work ability score, and future work ability - as predictors of register-based disability pension and long-term sickness absence over a three-year follow-up. Survey responses of 11,131 Finnish employees were linked to pension and long-term (more than 10 days) sickness absence register data by Statistics Finland. Work ability score was divided into poor (0-5), moderate (6-7) and good/excellent (8-10) and future work ability into poor (1-2) and good (3) work ability at baseline. Cox proportional hazard regressions were used in the analysis of disability pension, and a negative binomial model in the analysis of long-term sickness absence. The results were adjusted for several background, work- and health-related covariates. Compared with those with good/excellent work ability scores, the hazard ratios of disability pension after adjusting for all covariates were 9.84 (95% CI 6.68-14.49) for poor and 2.25 (95% CI 1.51-3.35) for moderate work ability score. For future work ability, the hazard ratio was 8.19 (95% CI 4.71-14.23) among those with poor future work ability. The incidence rate ratios of accumulated long-term sickness absence days were 3.08 (95% CI 2.19-4.32) and 1.59 (95% CI 1.32-1.92) for poor and moderate work ability scores, and 1.51 (95% CI 0.97-2.36) for poor future work ability. The single items of work ability score and future work ability predicted register-based disability pension equally well, but work ability score was a better predictor of register-based long-term sickness absence days than future work ability in a three-year follow-up. Both items seem to be of use especially when examining the risk of poor work ability for disability but also for long sick leave.
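
The baseline groupings described above can be written as two small lookup functions. The cut-offs come from the abstract (work ability score: 0-5 poor, 6-7 moderate, 8-10 good/excellent; future work ability: 1-2 poor, 3 good). The function names and the assumption that responses are whole numbers are mine.

```python
def categorize_work_ability_score(score):
    """Map a 0-10 work ability score to the three groups used in the abstract."""
    if score not in range(0, 11):
        raise ValueError("work ability score must be a whole number from 0 to 10")
    if score <= 5:
        return "poor"
    if score <= 7:
        return "moderate"
    return "good/excellent"


def categorize_future_work_ability(rating):
    """Map a 1-3 future work ability rating to the two groups used in the abstract."""
    if rating not in (1, 2, 3):
        raise ValueError("future work ability rating must be 1, 2 or 3")
    return "poor" if rating <= 2 else "good"


# Hypothetical survey responses.
groups = [categorize_work_ability_score(s) for s in (0, 5, 6, 7, 8, 10)]
```

The reported hazard ratios and incidence rate ratios compare the poor and moderate groups against good/excellent as the reference category.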
Evaluating the Stability of Test Score Means for the "TOEIC"® Speaking and Writing Tests. Research Report. ETS RR-17-50

ERIC Educational Resources Information Center

Qu, Yanxuan; Huo, Yan; Chan, Eric; Shotts, Matthew

2017-01-01

For educational tests, it is critical to maintain consistency of score scales and to understand the sources of variation in score means over time. This practice helps to ensure that interpretations about test takers' abilities are comparable from one administration (or one form) to another. This study examines the consistency of reported scores…
Estimating verbal fluency and naming ability from the test of premorbid functioning and demographic variables: Regression equations derived from a regional UK sample.

PubMed

Jenkinson, Toni-Marie; Muncer, Steven; Wheeler, Miranda; Brechin, Don; Evans, Stephen

2018-06-01

Neuropsychological assessment requires accurate estimation of an individual's premorbid cognitive abilities. Oral word reading tests, such as the test of premorbid functioning (TOPF), and demographic variables, such as age, sex, and level of education, provide a reasonable indication of premorbid intelligence, but their ability to predict other related cognitive abilities is less well understood. This study aimed to develop regression equations, based on the TOPF and demographic variables, to predict scores on tests of verbal fluency and naming ability. A sample of 119 healthy adults provided demographic information and were tested using the TOPF, FAS, animal naming test (ANT), and graded naming test (GNT). Multiple regression analyses, using the TOPF and demographics as predictor variables, were used to estimate verbal fluency and naming ability test scores. Change scores and cases of significant impairment were calculated for two clinical samples with diagnosed neurological conditions (TBI and meningioma) using the method in Knight, McMahon, Green, and Skeaff (). Demographic variables provided a significant contribution to the prediction of all verbal fluency and naming ability test scores; however, adding TOPF score to the equation considerably improved prediction beyond that afforded by demographic variables alone. The percentage of variance accounted for by demographic variables and/or TOPF score ranged from 19 per cent (FAS) through 28 per cent (ANT) to 41 per cent (GNT). Change scores revealed significant differences in performance in the clinical groups, particularly the TBI group. Demographic variables, particularly education level, and scores on the TOPF should be taken into consideration when interpreting performance on tests of verbal fluency and naming ability. © 2017 The British Psychological Society.
Do Test Scores Buy Happiness?

ERIC Educational Resources Information Center

McCluskey, Neal

2017-01-01

Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…
Predicting occupational personality test scores.

PubMed

Furnham, A; Drakeley, R

2000-01-01

The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.
An Analysis of Time-Related Score Increments and/or Decrements for GRE Repeaters across Ability and Sex Groups.

ERIC Educational Resources Information Center

Rock, Donald; Werts, Charles

The purpose of this study was to obtain information on both the number of individuals who retest and their patterns of score gain (or decrement) by sex and ability. Individuals who retested only once were found to gain about 26-27 points on the Graduate Record Examination (GRE) verbal test and about 23 points on the GRE quantitative test. This…
Interpreting the g loadings of intelligence test composite scores in light of Spearman's law of diminishing returns.

PubMed

Reynolds, Matthew R

2013-03-01

The linear loadings of intelligence test composite scores on a general factor (g) have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the g loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of this study was to (a) investigate whether the g loadings of composite scores from the Differential Ability Scales (2nd ed.) (DAS-II, C. D. Elliott, 2007a, Differential Ability Scales (2nd ed.). San Antonio, TX: Pearson) were nonlinear and (b) if they were nonlinear, to compare them with linear g loadings to demonstrate how SLODR alters the interpretation of these loadings. Linear and nonlinear confirmatory factor analysis (CFA) models were used to model Nonverbal Reasoning, Verbal Ability, Visual Spatial Ability, Working Memory, and Processing Speed composite scores in four age groups (5-6, 7-8, 9-13, and 14-17) from the DAS-II norming sample. The nonlinear CFA models provided better fit to the data than did the linear models. In support of SLODR, estimates obtained from the nonlinear CFAs indicated that g loadings decreased as g level increased. The nonlinear portion for the nonverbal reasoning loading, however, was not statistically significant across the age groups. Knowledge of general ability level informs composite score interpretation because g is less likely to produce differences, or is measured less, in those scores at higher g levels. One implication is that it may be more important to examine the pattern of specific abilities at higher general ability levels. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Exploring a Source of Uneven Score Equity across the Test Score Range

ERIC Educational Resources Information Center

Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.

2018-01-01

Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…
School accountability and the black-white test score gap.

PubMed

Gaddis, S Michael; Lauen, Douglas Lee

2014-03-01

Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. Copyright © 2013 Elsevier Inc. All rights reserved.

Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

ERIC Educational Resources Information Center

Livingston, Samuel A.

2014-01-01

In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…
Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?

ERIC Educational Resources Information Center

Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.

2011-01-01

This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
Work ability score of solvent-exposed workers.

PubMed

Furu, Heidi; Sainio, Markku; Hyvärinen, Hanna-Kaisa; Kaukiainen, Ari

2018-03-28

Occupational chronic solvent encephalopathy (CSE), characterized by neurocognitive dysfunction, often leads to early retirement. However, only the more severe cases are diagnosed with CSE, and little is known about the work ability of solvent-exposed workers in general. The aim was to study memory and concentration symptoms, work ability and the effect of both solvent-related and non-occupational factors on work ability, in an actively working solvent-exposed population. A questionnaire on exposure and health was sent to 3640 workers in four solvent-exposed fields, i.e. painters and floor-layers, boat builders, printers, and metal workers. The total number of responses was 1730. We determined the work ability score (WAS), a single question item of the Work Ability Index, and studied solvent exposure, demographic factors, Euroquest memory and concentration symptoms, chronic diseases, and employment status using univariate and multivariate analyses. The findings were compared to those of a corresponding national blue-collar reference population (n = 221), and a small cohort of workers with CSE (n = 18). The proportion of workers with memory and concentration symptoms was significantly associated with solvent exposure. The WAS of solvent-exposed workers was lower than that of the national blue-collar reference group, and the difference was significant in the oldest age group (those aged over 60). Solvent-exposed workers' WAS were higher than those of workers diagnosed with CSE. The WAS were lowest among painters and floor-layers, followed by metal workers and printers, and highest among boat builders. The strongest explanatory factors for poor work ability were the number of chronic diseases, age and employment status. Solvent exposure was a weak independent risk factor for reduced WAS, comparable to a level of high alcohol consumption. Even if memory and concentration symptoms were associated with higher solvent exposure, the effect of solvents on self
TIE: an ability test of emotional intelligence.

PubMed

Śmieja, Magdalena; Orzechowski, Jarosław; Stolarski, Maciej S

2014-01-01

The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with the theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions.
TIE: An Ability Test of Emotional Intelligence

PubMed Central

Śmieja, Magdalena; Orzechowski, Jarosław; Stolarski, Maciej S.

2014-01-01

The Test of Emotional Intelligence (TIE) is a new ability scale based on a theoretical model that defines emotional intelligence as a set of skills responsible for the processing of emotion-relevant information. Participants are provided with descriptions of emotional problems, and asked to indicate which emotion is most probable in a given situation, or to suggest the most appropriate action. Scoring is based on the judgments of experts: professional psychotherapists, trainers, and HR specialists. The validation study showed that the TIE is a reliable and valid test, suitable for both scientific research and individual assessment. Its internal consistency measures were as high as .88. In line with the theoretical model of emotional intelligence, the results of the TIE shared about 10% of common variance with a general intelligence test, and were independent of major personality dimensions. PMID:25072656
Relationship between Age and the Ability to Break Scored Tablets

PubMed Central

Notenboom, Kim; Vromans, Herman; Schipper, Maarten; Leufkens, Hubert G. M.; Bouvy, Marcel L.

2016-01-01

Background: Practical problems with the use of medicines, such as difficulties with breaking tablets, are an often overlooked cause of non-adherence. Tablets frequently break into uneven parts and loss of product can occur due to crumbling and powdering. Health characteristics, such as the presence of peripheral neuropathy, decreased grip strength and manual dexterity, can affect a patient's ability to break tablets. As these impairments are associated with aging and age-related diseases, such as Parkinson's disease and arthritis, difficulties with breaking tablets could be more prevalent among older adults. The objective of this study was to investigate the relationship between age and the ability to break scored tablets. Methods: A comparative study design was chosen. Thirty-six older adults and 36 young adults were systematically observed while breaking scored tablets. Twelve different tablets were included. All participants were asked to break each tablet by three techniques: between the fingers with the use of nails, between the fingers without the use of nails, and by pushing the tablet downward with one finger on a solid surface. It was established whether a tablet was broken or not, and if broken, whether the tablet was broken accurately or not. Results: The older adults had more difficulty breaking tablets than the young adults. On average, the older persons broke 38.1% of the tablets, of which 71.0% were broken accurately. The young adults broke 78.2% of the tablets, of which 77.4% were broken accurately. Further analysis by mixed effects logistic regression revealed that age was associated with the ability to break tablets, but not with the accuracy of breaking. Conclusions: Breaking scored tablets by hand is less successful in an elderly population than in a group of young adults. Health care providers should be aware that tablet breaking is not appropriate for all patients and for all drugs. In case tablet breaking is unavoidable, a
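A small worked example may help when reading the two-stage percentages in the record above: the share of all tablets that ended up accurately broken is the product of the share broken and the share of those broken accurately. The helper name below is hypothetical, and the calculation is a reader's sketch rather than an analysis reported by the authors.

```python
def overall_accurate_share(pct_broken, pct_accurate_of_broken):
    """Percent of all tablets that were both broken and broken accurately."""
    return pct_broken * pct_accurate_of_broken / 100.0

# Percentages as reported in the abstract
older = overall_accurate_share(38.1, 71.0)
young = overall_accurate_share(78.2, 77.4)
print(round(older, 1), round(young, 1))
```

On these figures roughly 27% of tablets were accurately broken by the older adults against roughly 61% by the young adults, so the gap arises mainly at the breaking step rather than at the accuracy step.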
The effects of calculator-based laboratories on standardized test scores

NASA Astrophysics Data System (ADS)

Stevens, Charlotte Bethany Rains

Nationwide, the goal of providing a productive science and math education to our youth in today's educational institutions is centering itself around the technology being utilized in these classrooms. In this age of digital technology, educational software and calculator-based laboratories (CBL) have become significant devices in the teaching of science and math for many states across the United States. The Texas Instruments graphing calculator and Vernier Labpro interface are among the calculator-based laboratories becoming increasingly popular among middle and high school science and math teachers in many school districts across this country. In Tennessee, however, it is reported that this type of technology is not regularly utilized at the student level in most high school science classrooms, especially in the area of Physical Science (Vernier, 2006). This research explored the effect of calculator-based laboratory instruction on standardized test scores. The purpose of this study was to determine the effect of traditional teaching methods versus graphing calculator teaching methods on the state mandated End-of-Course (EOC) Physical Science exam based on ability, gender, and ethnicity. The sample included 187 total tenth and eleventh grade physical science students, 101 of whom belonged to a control group and 87 of whom belonged to the experimental group. Physical Science End-of-Course scores obtained from the Tennessee Department of Education during the spring of 2005 and the spring of 2006 were used to examine the hypotheses. The findings of this research study suggested the type of teaching method, traditional or calculator based, did not have an effect on standardized test scores. However, the students' ability level, as demonstrated on the End-of-Course test, had a significant effect on End-of-Course test scores. This study focused on a limited population of high school physical science students in the middle Tennessee
How Accurate Is a Test Score?

ERIC Educational Resources Information Center

Doppelt, Jerome E.

1956-01-01

The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
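For orientation, the standard error of measurement is conventionally computed in classical test theory as SEM = SD × sqrt(1 − reliability). The truncated abstract does not confirm that the article uses exactly this form, so the sketch below is an assumption, and the numbers and function names in it are hypothetical.

```python
import math

def standard_error_of_measurement(sd, reliability):
    """Classical test theory form: SEM = SD * sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return sd * math.sqrt(1.0 - reliability)

def score_band(observed, sd, reliability, z=1.0):
    """Band of +/- z SEMs around an observed score."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - z * sem, observed + z * sem

# Hypothetical test: SD of 15 and reliability of 0.91 (not from the article)
sem = standard_error_of_measurement(sd=15.0, reliability=0.91)
low, high = score_band(observed=100.0, sd=15.0, reliability=0.91)
print(round(sem, 2), round(low, 1), round(high, 1))
```

With these made-up values the SEM is 4.5 points, so an observed score of 100 would be read as a band from about 95.5 to 104.5 rather than as an exact value.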
Cognitive Ability and Personality Variables as Predictors of School Grades and Test Scores in Adolescents

ERIC Educational Resources Information Center

Hofer, Manfred; Kuhnle, Claudia; Kilian, Britta; Fries, Stefan

2012-01-01

The predictive power of cognitive ability and self-control strength for self-reported grades and an achievement test were studied. It was expected that the variables use of time structure, academic procrastination, and motivational interference during learning further aid in predicting students' achievement because they are operative in situations…
Predicting student performance in sonographic scanning using spatial ability as an ability determinant of skill acquisition

NASA Astrophysics Data System (ADS)

Clem, Douglas Wayne

Spatial ability refers to an individual's capacity to visualize and mentally manipulate three dimensional objects. Since sonographers manually manipulate 2D and 3D sonographic images to generate multi-viewed, logical, sequential renderings of an anatomical structure, it can be assumed that spatial ability is central to the perception and interpretation of these medical images. Using Ackerman's theory of ability determinants of skilled performance as a conceptual framework, this study explored the relationship of spatial ability and learning sonographic scanning. Beginning first year sonography students from four different educational institutions were administered a spatial abilities test prior to their initial scanning lab coursework. The students' spatial test scores were compared with their scanning competency performance scores. A significant relationship between the students' spatial ability scores and their scanning performance scores was found. This result suggests that the use of spatial ability tests for admission to sonography programs may improve candidate selection, as well as assist programs in adjusting instruction and curriculum for students who demonstrate low spatial ability.
The Anatomy Competence Score--A New Marker for Anatomical Ability

ERIC Educational Resources Information Center

Schoeman, Scarpa; Chandratilake, Madawa

2012-01-01

The assessment of students' ability in gross anatomy is a complex process as it involves the measurement of multiple facets. In this work, the authors developed and introduced the Anatomy Competence Score (ACS), which incorporates the three domains of anatomy teaching and assessment namely: theoretical knowledge, practical 3D application of the…
Innovative testing of spatial ability: interactive responding and the use of complex stimuli material.

PubMed

Jelínek, Martin; Květon, Petr; Vobořil, Dalibor

2015-02-01

Despite initial expectations, which have emerged with the advancement of computer technology over the last decade of the twentieth century, scientific literature does not contain many relevant references regarding the development and use of innovative items in psychological testing. Our study presents and evaluates two novel item types. One item type is derived from a standard schematic test item used for the assessment of the spatial perception aspect of spatial ability, enhanced by an interactive response module. The performance on this item type is correlated with the performance on its paper and pencil counterpart. The other innovative item type used complex stimuli in the form of a short video of a ride through a city presented in an on-route perspective, which is intended to measure navigation skills and the ability to keep oneself oriented in space. In this case, the scores were related to the capacity of visuo-spatial working memory and also to the overall score in the paper/pencil test of spatial ability. The second relationship was moderated by gender.
Evaluation of 2 cognitive abilities tests in a dual-task environment

NASA Technical Reports Server (NTRS)

Vidulich, M. A.; Tsang, P. S.

1986-01-01

Most real world operators are required to perform multiple tasks simultaneously. In some cases, such as flying a high performance aircraft or troubleshooting a failing nuclear power plant, the operator's ability to time share or "process in parallel" can be driven to extremes. This has created interest in selection tests of cognitive abilities. Two tests that have been suggested are the Dichotic Listening Task and the Cognitive Failures Questionnaire. Correlations between these test results and time sharing performance were obtained and the validity of these tests was examined. The primary task was a tracking task with dynamically varying bandwidth. This was performed either alone or concurrently with either another tracking task or a spatial transformation task. The results were: (1) An unexpected negative correlation was detected between the two tests; (2) The lack of correlation between either test and task performance made the predictive utility of the test scores appear questionable; (3) Pilots made more errors on the Dichotic Listening Task than college students.
Work ability assessment in a worker population: comparison and determinants of Work Ability Index and Work Ability score.

PubMed

El Fassi, Mehdi; Bocquet, Valery; Majery, Nicole; Lair, Marie Lise; Couffignal, Sophie; Mairiaux, Philippe

2013-04-08

Public authorities in European countries are paying increasing attention to the promotion of work ability throughout working life and the best method to monitor work ability in populations of workers is becoming a significant question. The present study aims to compare the assessment of work ability based on the use of the Work Ability Index (WAI), a 7-item questionnaire, with another one based on the use of WAI's first item, which consists in the worker's self-assessment of his/her current work ability level as opposed to his/her lifetime best, this single question being termed "Work Ability score" (WAS). Using a database created by an occupational health service, the study intends to answer the following questions: could the assessment of work ability be based on a single-item measure and which are the variables significantly associated with self-reported work ability among those systematically recorded by the occupational physician during health examinations? A logistic regression model was used in order to estimate the probability of observing "poor" or "moderate" WAI levels depending on age, gender, body mass index, smoking status, position held, firm size and diseases reported by the worker in a population of workers aged 40 to 65 and examined between January 2006 and June 2010 (n=12389). The convergent validity between WAS and WAI was statistically significant (rs=0.63). In the multivariable model, age (p<0.001), reported diseases (OR=1.13, 95%CI [1.11-1.15]) and holding a position mostly characterized by physical activity (OR=1.67, 95%CI [1.49-1.87]) increased the probability of reporting moderate or poor work ability. A work position characterized by the predominance of mental activity (OR=0.71, 95%CI [0.61-0.84]) had a favourable impact on work ability. These relations were observed regardless of the work ability measurement tool used. The convergent validity and the similarity in results between WAI and WAS observed in a large population of employed
From Test Scores to Language Use: Emergent Bilinguals Using English to Accomplish Academic Tasks

ERIC Educational Resources Information Center

Rodriguez-Mojica, Claudia

2018-01-01

Prominent discourses about emergent bilinguals' academic abilities tend to focus on performance as measured by test scores and perpetuate the message that emergent bilinguals trail far behind their peers. When we remove the constraints of formal testing situations, what can emergent bilinguals do in English as they engage in naturally occurring…
Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

ERIC Educational Resources Information Center

Cizek, Gregory J.

2016-01-01

Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…
Using implicit association tests in age-heterogeneous samples: The importance of cognitive abilities and quad model processes.

PubMed

Wrzus, Cornelia; Egloff, Boris; Riediger, Michaela

2017-08-01

Implicit association tests (IATs) are increasingly used to indirectly assess people's traits, attitudes, or other characteristics. In addition to measuring traits or attitudes, IAT scores also reflect differences in cognitive abilities because scores are based on reaction times (RTs) and errors. As cognitive abilities change with age, questions arise concerning the usage and interpretation of IATs for people of different ages. To address these questions, the current study examined how cognitive abilities and cognitive processes (i.e., quad model parameters) contribute to IAT results in a large age-heterogeneous sample. Participants (N = 549; 51% female) in an age-stratified sample (range = 12-88 years) completed different IATs and 2 tasks to assess cognitive processing speed and verbal ability. From the IAT data, D2-scores were computed based on RTs, and quad process parameters (activation of associations, overcoming bias, detection, guessing) were estimated from individual error rates. Substantial IAT scores and quad processes except guessing varied with age. Quad processes AC and D predicted D2-scores of the content-specific IAT. Importantly, the effects of cognitive abilities and quad processes on IAT scores were not significantly moderated by participants' age. These findings suggest that IATs seem suitable for age-heterogeneous studies from adolescence to old age when IATs are constructed and analyzed appropriately, for example with D-scores and process parameters. We offer further insight into how D-scoring controls for method effects in IATs and what IAT scores capture in addition to implicit representations of characteristics. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP

ERIC Educational Resources Information Center

Chudowsky, Naomi; Chudowsky, Victor

2010-01-01

In recent years, scores on the annual state reading and mathematics tests used for accountability have gone up in most states. These trends in state test scores do not always coincide, however, with trends on the National Assessment of Educational Progress (NAEP), the federally sponsored assessment that is administered periodically to…
Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

1980-01-01

The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Washington

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…

State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Utah

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Utah's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading. In grade 4 reading, the percentage scoring proficient on the state test showed a…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arkansas

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Arkansas's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) went up in math at grades 4 and 8. In reading, the percentages scoring proficient on the state test went up at…
Do scores on a tachistoscope test correlate with baseball batting averages?

PubMed

Reichow, Alan W; Garchow, Kenneth E; Baird, Richard Y

2011-05-01

Millions of dollars are spent each year by individuals seeking to improve their athletic performance. One area of visual training is the use of the tachistoscope, which measures inspection time or visual recognition time. Although the potential of the tachistoscope as a training tool has received some research attention, its use as a means of measurement or predictor of athletic ability in sports has not been explored. The purpose of this pilot study is to assess the potential of the tachistoscope as a measurement instrument by determining if a baseball player's ability to identify a tachistoscopically presented picture of a pitch is correlated with hitting performance as measured by batting average. Using sport-specific slides, 20 subjects, all non-pitching members of the Pacific University Baseball Team, were administered a tachistoscopic test. The test consisted of identifying the type of pitch illustrated in 30 randomly ordered slides depicting a pitcher throwing four different baseball pitches. Each slide was presented for 0.2 sec. The results of the test were compared with the athlete's previous season's batting average. A positive correlation was found between an athlete's ability to correctly identify a picture of a pitch presented tachistoscopically and batting average (r=0.648; P<0.01). These results suggest that a superior ability to recognize pitches presented via tachistoscope may correlate with a higher skill level in batting. Tachistoscopic test scores correlated positively with batting averages. The tachistoscope may be an acceptable tool to help in assessing batting performance. Additional testing with players from different sports, different levels of ability, and different tachistoscopic times should be performed to determine if the tachistoscope is a valid measure of athletic ability. Implications may also be drawn in other areas such as military and police work.
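As a reader's check on the reported significance level, a Pearson correlation can be converted to a t statistic with n − 2 degrees of freedom. The sketch below assumes that r is a Pearson coefficient and that all 20 subjects entered the correlation; neither detail is stated in the abstract, and the function name is hypothetical.

```python
import math

def t_from_r(r, n):
    """t statistic for testing a Pearson correlation against zero (df = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)

# Values reported in the abstract: r = 0.648 with 20 subjects (df = 18)
t_stat = t_from_r(r=0.648, n=20)
print(round(t_stat, 2))
```

The resulting t of about 3.6 exceeds the two-tailed critical value of roughly 2.88 for 18 degrees of freedom at the 0.01 level, which agrees with the reported P<0.01.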
EDUCATION AND PSYCHOLOGICAL TEST SCORES

PubMed Central

Pershad, Dwarka; Verma, S. K.

1980-01-01

Education, a long-neglected variable affecting psychological test scores, deserves renewed emphasis. Some evidence for this has accumulated on the psychological tests constructed and standardized at the Department of Psychiatry, P.G.I., Chandigarh. Tentative norms, prepared by education level, are reported for adults aged 16-50 on the WAIS-Verbal section, PGI-Memory Scale, Proverb and Similarity Tests, Psychoticism Questionnaire, and PGI MQN 2. The results showed marked differences in the mean scores of different educational categories and thus stressed the need for reporting norms separately for different educational levels. PMID:22064617
Tests, Abilities, Race, and Conflict.

ERIC Educational Resources Information Center

Elliott, Rogers

1988-01-01

Relationship between ability tests and race and issues of famous lawsuits concerning possible bias in intelligence tests are summarized. Reasons for the origins of ethnic and racial differences in general intellectual ability are considered. Prospects for the reduction of group differences and conflicts are discussed. (SLD)
[The Impact of Visual Perceptual Abilities on the Performance on the Wechsler Nonverbal Scale of Ability (WNV)].

PubMed

Werpup-Stüwe, L; Petermann, F; Daseking, M

2015-10-01

The use of psychometric tests with children and adolescents is especially important in psychological diagnostics. Nonverbal intelligence tests are very often used to diagnose psychological abnormalities and generate developmental prognosis independent of the child's verbal abilities. The correlation of the German version of the Developmental Test of Visual Perception - Adolescents and Adults (DTVP-A) with the Wechsler Nonverbal Scale of Ability (WNV) was calculated based on the results of 172 children, adolescents and young adults aged 9-21 years. Furthermore, it was examined if individuals with poor visual perceptual abilities scored lower on the WNV than healthy subjects. The correlations of the results scored on DTVP-A and WNV ranged from moderate to strong. The group with poor visual perceptual abilities scored significantly lower on the WNV than the control group. Nonverbal intelligence tests like the WNV are not reliable for estimating the intelligence of individuals with low visual perceptual abilities. Therefore, the intelligence of these subjects should be tested with a test that also contains verbal subtests. If poor visual perceptual abilities are suspected, then they should be tested. The DTVP-A seems to be the right instrument for achieving this goal. © Georg Thieme Verlag KG Stuttgart · New York.
Use of Multi-Response Format Test in the Assessment of Medical Students' Critical Thinking Ability.

PubMed

Mafinejad, Mahboobeh Khabaz; Arabshahi, Seyyed Kamran Soltani; Monajemi, Alireza; Jalili, Mohammad; Soltani, Akbar; Rasouli, Javad

2017-09-01

To evaluate students' critical thinking skills effectively, a change in assessment practices is a must. The assessment of a student's ability to think critically is a constant challenge, and yet there is considerable debate on the best assessment method. There is evidence that the intrinsic nature of open and closed-ended response questions is to measure separate cognitive abilities. To assess critical thinking ability of medical students by using multi-response format of assessment. A cross-sectional study was conducted on a group of 159 undergraduate third-year medical students. All the participants completed the California Critical Thinking Skills Test (CCTST) consisting of 34 multiple-choice questions to measure general critical thinking skills and a researcher-developed test that combines open and closed-ended questions. A researcher-developed 48-question exam, consisting of 8 short-answer and 5 essay questions, 19 Multiple-Choice Questions (MCQ), and 16 True-False (TF) questions, was used to measure critical thinking skills. Correlation analyses were performed using Pearson's coefficient to explore the association between the total scores of tests and subtests. One hundred and fifty-nine students participated in this study. The sample comprised 81 females (51%) and 78 males (49%) with an age range of 20±2.8 years (mean 21.2 years). The response rate was 64.1%. A significant positive correlation was found between types of questions and critical thinking scores, of which the correlations of MCQ (r=0.82) and essay questions (r=0.77) were strongest. The significant positive correlations between multi-response format test and CCTST's subscales were seen in analysis, evaluation, inference and inductive reasoning. Unlike CCTST subscales, the multi-response format test has a weak correlation with CCTST total score (r=0.45, p=0.06). This study highlights the importance of considering multi-response format test in the assessment of critical thinking abilities of medical
Use of Multi-Response Format Test in the Assessment of Medical Students’ Critical Thinking Ability

PubMed Central

Mafinejad, Mahboobeh Khabaz; Monajemi, Alireza; Jalili, Mohammad; Soltani, Akbar; Rasouli, Javad

2017-01-01

Introduction To evaluate students’ critical thinking skills effectively, a change in assessment practices is a must. The assessment of a student’s ability to think critically is a constant challenge, and yet there is considerable debate on the best assessment method. There is evidence that the intrinsic nature of open and closed-ended response questions is to measure separate cognitive abilities. Aim To assess critical thinking ability of medical students by using multi-response format of assessment. Materials and Methods A cross-sectional study was conducted on a group of 159 undergraduate third-year medical students. All the participants completed the California Critical Thinking Skills Test (CCTST) consisting of 34 multiple-choice questions to measure general critical thinking skills and a researcher-developed test that combines open and closed-ended questions. A researcher-developed 48-question exam, consisting of 8 short-answer and 5 essay questions, 19 Multiple-Choice Questions (MCQ), and 16 True-False (TF) questions, was used to measure critical thinking skills. Correlation analyses were performed using Pearson’s coefficient to explore the association between the total scores of tests and subtests. Results One hundred and fifty-nine students participated in this study. The sample comprised 81 females (51%) and 78 males (49%) with an age range of 20±2.8 years (mean 21.2 years). The response rate was 64.1%. A significant positive correlation was found between types of questions and critical thinking scores, of which the correlations of MCQ (r=0.82) and essay questions (r=0.77) were strongest. The significant positive correlations between multi-response format test and CCTST’s subscales were seen in analysis, evaluation, inference and inductive reasoning. Unlike CCTST subscales, the multi-response format test has a weak correlation with CCTST total score (r=0.45, p=0.06). Conclusion This study highlights the importance of considering multi-response format test in
Prediction of true test scores from observed item scores and ancillary data.

PubMed

Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

2015-05-01

In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE® General Analytical Writing and until 2009 in the case of TOEFL® iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater®. In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.
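The simplest instance of a best linear predictor of a true score in classical test theory is Kelley's regressed estimate, which uses only the observed score, the group mean, and the reliability. The sketch below shows that single-predictor case; it is not the multi-predictor method of the paper, and the function name and example numbers are hypothetical.

```python
def kelley_true_score(observed: float, group_mean: float, reliability: float) -> float:
    """Kelley's estimate: shrink the observed score toward the group mean.

    T_hat = mean + reliability * (observed - mean)
    With reliability 1 the observed score is returned unchanged;
    with reliability 0 the estimate collapses to the group mean.
    """
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must be between 0 and 1")
    return group_mean + reliability * (observed - group_mean)


# Hypothetical writing score of 5.0 in a group averaging 3.5,
# with an assumed score reliability of 0.8.
estimate = kelley_true_score(observed=5.0, group_mean=3.5, reliability=0.8)
print(f"estimated true score: {estimate:.2f}")  # estimated true score: 4.70
```

Adding further predictors (for example automated scores or scores on related tests) generalizes this to a multiple-regression form, which requires the error variances and covariances that the abstract discusses.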
Development of WAIS-III General Ability Index Minus WMS-III memory discrepancy scores.

PubMed

Lange, Rael T; Chelune, Gordon J; Tulsky, David S

2006-09-01

Analysis of the discrepancy between intellectual functioning and memory ability has received some support as a useful means for evaluating memory impairment. In recent additions to Wechsler scale interpretation, the WAIS-III General Ability Index (GAI) and the WMS-III Delayed Memory Index (DMI) were developed. The purpose of this investigation is to develop base rate data for GAI-IMI, GAI-GMI, and GAI-DMI discrepancy scores using data from the WAIS-III/WMS-III standardization sample (weighted N = 1250). Base rate tables were developed using the predicted-difference method and two simple-difference methods (i.e., stratified and non-stratified). These tables provide valuable data for clinical reference purposes to determine the frequency of GAI-IMI, GAI-GMI, and GAI-DMI discrepancy scores in the WAIS-III/WMS-III standardization sample.
Gender differences in variance and means on the Naglieri Non-verbal Ability Test: data from the Philippines.

PubMed

Vista, Alvin; Care, Esther

2011-06-01

Research on gender differences in intelligence has focused mostly on samples from Western countries and empirical evidence on gender differences from Southeast Asia is relatively sparse. This article presents results on gender differences in variance and means on a non-verbal intelligence test using a national sample of public school students from the Philippines. More than 2,700 sixth graders from public schools across the country were tested with the Naglieri Non-verbal Ability Test (NNAT). Variance ratios (VRs) and log-transformed VRs were computed. Proportion ratios for each of the ability levels were also calculated and a chi-square goodness-of-fit test was performed. An analysis of variance was performed to determine the overall gender difference in mean scores as well as within each of three age subgroups. Our data show non-existent or trivial gender difference in mean scores. However, the tails of the distributions show differences between the males and females, with greater variability among males in the upper half of the distribution and greater variability among females in the lower half of the distribution. Descriptions of the results and their implications are discussed. Results on mean score differences support the hypothesis that there are no significant gender differences in cognitive ability. The unusual results regarding differences in variance and the male-female proportion in the tails require more complex investigations. ©2010 The British Psychological Society.
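A variance ratio and its log transform can be computed as in the following sketch. It assumes the common convention of male variance divided by female variance, which the abstract does not state explicitly, and the scores are made-up illustrative values rather than NNAT data.

```python
import math
from statistics import variance


def variance_ratio(male_scores: list[float], female_scores: list[float]) -> tuple[float, float]:
    """Return (VR, ln VR), where VR = var(male) / var(female).

    VR > 1 (ln VR > 0) indicates greater male variability;
    VR < 1 (ln VR < 0) indicates greater female variability.
    """
    vr = variance(male_scores) / variance(female_scores)  # sample variances (n - 1)
    return vr, math.log(vr)


# Hypothetical scores, both groups with mean 100.
males = [80.0, 90.0, 100.0, 110.0, 120.0]
females = [90.0, 95.0, 100.0, 105.0, 110.0]

vr, log_vr = variance_ratio(males, females)
print(f"VR = {vr:.2f}, ln VR = {log_vr:.3f}")  # VR = 4.00, ln VR = 1.386
```

The log transform makes the ratio symmetric around zero, so a VR of 2 and a VR of 0.5 sit at equal distances on either side.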
Test/score/report: Simulation techniques for automating the test process

NASA Technical Reports Server (NTRS)

Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

1994-01-01

A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no-go recommendation. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP

ERIC Educational Resources Information Center

Chudowsky, Naomi; Chudowsky, Victor

2010-01-01

This report compares state math and reading proficiency scores in grades 4 and 8 to National Assessment of Educational Progress (NAEP) basic scores for the period of 2005 to 2009. The study found that scores on state tests and NAEP have increased in most states with sufficient data. Also included with the report are profiles for the 23 states that…
Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the 110-item Stanford Mental…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Ohio

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Ohio's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and grade 8 math. In grade 8 reading, the percentage of students scoring proficient…
Estimating the Reliability of a Test Battery Composite or a Test Score Based on Weighted Item Scoring

ERIC Educational Resources Information Center

Feldt, Leonard S.

2004-01-01

In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.
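One widely used formula for the reliability of a weighted composite subtracts the weighted error variance of the parts from the composite variance, assuming measurement errors are uncorrelated across parts. The sketch below implements that formula; it is not necessarily one of the methods the article describes, and all names and numbers are hypothetical.

```python
def composite_reliability(
    weights: list[float],
    sds: list[float],
    reliabilities: list[float],
    correlations: list[list[float]],
) -> float:
    """Reliability of C = sum_i w_i * X_i, assuming uncorrelated errors.

    rho_C = 1 - sum_i w_i^2 * sd_i^2 * (1 - rho_i) / var(C)
    var(C) = sum_i sum_j w_i * w_j * sd_i * sd_j * r_ij
    """
    k = len(weights)
    if not (len(sds) == len(reliabilities) == len(correlations) == k):
        raise ValueError("all inputs must describe the same number of parts")

    composite_var = sum(
        weights[i] * weights[j] * sds[i] * sds[j] * correlations[i][j]
        for i in range(k)
        for j in range(k)
    )
    error_var = sum(
        (weights[i] * sds[i]) ** 2 * (1.0 - reliabilities[i]) for i in range(k)
    )
    return 1.0 - error_var / composite_var


# Two hypothetical parts: the first is weighted twice as heavily as the second.
rho = composite_reliability(
    weights=[2.0, 1.0],
    sds=[10.0, 10.0],
    reliabilities=[0.8, 0.7],
    correlations=[[1.0, 0.5], [0.5, 1.0]],
)
print(f"composite reliability = {rho:.3f}")  # composite reliability = 0.843
```

In this example the composite variance is 700 and the weighted error variance is 110, so weighting the more reliable part more heavily yields a composite that is more reliable than either part alone.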
Improvement in intelligence test scores from 6 to 10 years in children of teenage mothers.

PubMed

Cornelius, Marie D; Goldschmidt, Lidush; De Genna, Natacha M; Richardson, Gale A; Leech, Sharon L; Day, Richard

2010-06-01

This study investigates change in IQ scores among 290 children born to teenage mothers and identifies social, economic, and environmental variables that may be associated with change in intelligence test performance. The children of 290 teenage mothers (72% African-American and 28% European American) were assessed with the Stanford-Binet Intelligence Scale-4th Edition at ages 6 and 10. The mean composite score at age 6 was 84.8 and 91.2 at age 10, an improvement of 6.4 points. Significant cross-sectional predictors at both ages 6 and 10 of higher Stanford-Binet Intelligence Scale scores were maternal cognitive ability, school grade, white ethnicity, and caregiver education. Having more children in the household significantly predicted lower Stanford-Binet Intelligence Scale scores at age 6. Higher satisfaction with maternal social support predicted higher Stanford-Binet Intelligence Scale scores at age 10. Change in IQ scores was not related to maternal socioeconomic status, social support, home environment, ethnicity, or family interactions. Custodial stability was associated with an improvement in IQ scores, whereas increase in caregiver depression was related to decline in IQ scores. Our findings suggest that improvement in IQ scores of offspring of teenage mothers may be related to stability of maternal custody. More research is needed to determine the impact of the maturation of adolescent mothers' parenting and the role of early education on improvement in cognitive abilities.
ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

ERIC Educational Resources Information Center

Allalouf, Avi

2014-01-01

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
A psychometric evaluation of the Arm Motor Ability Test.

PubMed

O'Dell, Michael W; Kim, Grace; Rivera, Lisa; Fieo, Robert; Christos, Paul; Polistena, Caitlin; Fitzgerald, Kerri; Gorga, Delia

2013-06-01

To further examine the psychometric properties of a 9-item version of the Arm Motor Ability Test (AMAT-9) in persons with stroke. Thirty-two community-dwelling persons > 6 months post-stroke undergoing robotics treatment (mean age = 56.0 years, time post-stroke = 4.1 years, National Institutes of Health Stroke Scale score = 4.1, and AMAT-9 score = 1.22). Construct validity (including Rasch analyses) used baseline data prior to treatment (n = 32). Standardized response mean was calculated for subjects completing the protocol (n = 29). The Wolf Motor Function Test (WMFT), Fugl-Meyer Assessment (FMA), Action Research Arm Test (ARAT), and Stroke Impact Scale (SIS) were also administered. Spearman-rank correlation coefficients between AMAT-9 and the WMFT, FMA, and ARAT were strong (0.78-0.79, all p < 0.001). The correlation between the AMAT-9 and SIS Hand Function sub-score was stronger than that between the AMAT-9 and the Communication sub-score (0.40, p = 0.025 and -0.16, p = 0.39, respectively). Rasch analyses provided evidence for an appropriate hierarchical structure of item difficulties, unidimensionality, and good reliability. The AMAT demonstrated a comparable standardized response mean of 0.98. The AMAT-9 is valid and responsive among subjects scoring in the lower range of the scale. It has the advantage of assessing function and by eliminating the standing item from the previous iteration, it may be more easily used with severely impaired patients.
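The standardized response mean is conventionally the mean change score divided by the standard deviation of the change scores. A minimal sketch with hypothetical pre- and post-treatment scores (not data from the study) follows.

```python
from statistics import mean, stdev


def standardized_response_mean(baseline: list[float], follow_up: list[float]) -> float:
    """SRM = mean(change) / SD(change), with change = follow-up minus baseline."""
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow-up must be paired")
    changes = [post - pre for pre, post in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)  # stdev uses the n - 1 denominator


# Hypothetical scores for four subjects before and after treatment.
pre = [1.0, 1.2, 0.8, 1.5]
post = [1.5, 1.4, 1.6, 1.9]

srm = standardized_response_mean(pre, post)
print(f"SRM = {srm:.2f}")  # SRM = 1.90
```

By a common rule of thumb borrowed from effect sizes, values near 0.8 or above are read as large responsiveness, which is how a reported value of 0.98 would usually be interpreted.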
Does Test Anxiety Induce Measurement Bias in Cognitive Ability Tests?

ERIC Educational Resources Information Center

Reeve, Charlie L.; Bonaccio, Silvia

2008-01-01

Although test anxiety is typically negatively related to performance on cognitive ability tests, little research has systematically investigated whether differences in test anxiety result in measurement bias on cognitive ability tests. The current paper uses a structural equation modeling technique to explicitly test for measurement bias due to…

Summary of Score Changes (in other Tests).

ERIC Educational Resources Information Center

Cleary, T. Anne; McCandless, Sam A.

Scholastic Aptitude Test (SAT) scores have declined during the last 14 years. Similar score declines have been observed in many different testing programs, many groups, and tested areas. The declines, while not large in any given year, have been consistent over time, area, and group. The period around 1965 is critical for the interpretation of…
On the Myth and the Reality of the Temporal Validity Degradation of General Mental Ability Test Scores

ERIC Educational Resources Information Center

Reeve, Charlie L.; Bonaccio, Silvia

2011-01-01

Claims of changes in the validity coefficients associated with general mental ability (GMA) tests due to the passage of time (i.e., temporal validity degradation) have been the focus of an on-going debate in applied psychology. To evaluate whether and, if so, under what conditions this degradation may occur, we integrate evidence from multiple…
Testing Intelligently Includes Double-Checking Wechsler IQ Scores

ERIC Educational Resources Information Center

Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

2011-01-01

The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…
Physiologic Dysfunction Scores and Cognitive Function Test Performance in United States Adults

PubMed Central

Kobrosly, Roni W; Seplaki, Christopher L; Jones, Courtney M; van Wijngaarden, Edwin

2013-01-01

Objective To investigate the relationship between a measure of cumulative physiologic dysfunction and specific domains of cognitive function. Methods We examined a summary score measuring physiological dysfunction, a multisystem measure of the body’s ability to effectively adapt to physical and psychological demands, in relation to cognitive function deficits in a population of 4511 adults aged 20 to 59 who participated in the third National Health and Nutrition Examination Survey (1988–1994). Measures of cognitive function comprised three domains: working memory, visuomotor speed, and perceptual-motor speed. ‘Physiologic dysfunction’ scores summarizing measures of cardiovascular, immunologic, kidney, and liver function were explored. We used multiple linear regression models to estimate associations between cognitive function measures and physiological dysfunction scores, adjusting for socioeconomic factors, test conditions, and self-reported health factors. Results We noted a dose-response relationship between physiologic dysfunction and working memory (coefficient = 0.207, 95% CI = (0.066, 0.348), p < 0.0001) that persisted after adjustment for all covariates (p = 0.03). We did not observe any significant relationships between dysfunction scores and visuomotor (p = 0.37) or perceptual-motor ability (p = 0.33). Conclusions Our findings suggest that multisystem physiologic dysfunction is associated with working memory. Future longitudinal studies are needed to clarify the underlying mechanisms and explore the persistency of this association into later life. We suggest that such studies should incorporate physiologic data, neuroendocrine parameters, and a wide range of specific cognitive domains. PMID:22155941
Structural brain MRI trait polygenic score prediction of cognitive abilities

PubMed Central

Luciano, Michelle; Marioni, Riccardo E; Hernández, Maria Valdés; Maniega, Susana Munoz; Hamilton, Iona F; Royle, Natalie A.; Generation Scotland; Chauhan, Ganesh; Bis, Joshua C.; Debette, Stephanie; DeCarli, Charles; Fornage, Myriam; Schmidt, Reinhold; Ikram, M. Arfan; Launer, Lenore J.; Seshadri, Sudha; Bastin, Mark E.; Porteous, David J.; Wardlaw, Joanna; Deary, Ian J

2016-01-01

Structural brain magnetic resonance imaging (MRI) traits share part of their genetic variance with cognitive traits. Here, we use genetic association results from large meta-analytic studies of genome-wide association for brain infarcts, white matter hyperintensities, intracranial, hippocampal and total brain volumes to estimate polygenic scores for these traits in three Scottish samples: Generation Scotland: Scottish Family Health Study (GS:SFHS), and the Lothian Birth Cohorts of 1936 (LBC1936) and 1921 (LBC1921). These five brain MRI trait polygenic scores were then used to 1) predict corresponding MRI traits in the LBC1936 (numbers ranged 573 to 630 across traits) and 2) predict cognitive traits in all three cohorts (in 8,115 to 8,250 persons). In the LBC1936, all MRI phenotypic traits were correlated with at least one cognitive measure; and polygenic prediction of MRI traits was observed for intracranial volume. Meta-analysis of the correlations between MRI polygenic scores and cognitive traits revealed a significant negative correlation (maximal r=0.08) between the hippocampal volume polygenic score and measures of global cognitive ability collected in childhood and in old age in the Lothian Birth Cohorts. The lack of association to a related general cognitive measure when including the GS:SFHS points to either type 1 error or the importance of using prediction samples that closely match the demographics of the genome-wide association samples from which prediction is based. Ideally, these analyses should be repeated in larger samples with data on both MRI and cognition, and using MRI GWA results from even larger meta-analysis studies. PMID:26427786
Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

ERIC Educational Resources Information Center

Meijer, Rob R.

2004-01-01

Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…
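To illustrate how a hypergeometric distribution can flag an unexpected subtest sum score, the sketch below computes the probability of a subtest score at or below an observed value, given the examinee's total score. It rests on a simplifying assumption not taken from the abstract: that, conditional on the total score, correct answers are equally likely to fall on any item. The function names and example numbers are hypothetical, and this is not claimed to be the exact bound used in the article.

```python
from math import comb


def hypergeom_pmf(k: int, total_items: int, total_correct: int, subtest_items: int) -> float:
    """P(exactly k correct on the subtest | total_correct correct overall)."""
    return (
        comb(total_correct, k)
        * comb(total_items - total_correct, subtest_items - k)
        / comb(total_items, subtest_items)
    )


def lower_tail(score: int, total_items: int, total_correct: int, subtest_items: int) -> float:
    """P(subtest score <= score) under the hypergeometric model."""
    # Smallest feasible subtest score: items left over once all incorrect
    # answers have been placed on the subtest.
    k_min = max(0, subtest_items - (total_items - total_correct))
    k_max = min(score, subtest_items, total_correct)
    return sum(
        hypergeom_pmf(k, total_items, total_correct, subtest_items)
        for k in range(k_min, k_max + 1)
    )


# Hypothetical case: 40-item test, 30 correct overall, but only 4 correct
# on a 10-item subtest.
p = lower_tail(score=4, total_items=40, total_correct=30, subtest_items=10)
print(f"P(subtest score <= 4) = {p:.5f}")
```

A very small tail probability would mark the subtest score as unexpectedly low relative to the total score, which is the kind of misfit the article sets out to detect.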
A Dynamic Speech Comprehension Test for Assessing Real-World Listening Ability.

PubMed

Best, Virginia; Keidser, Gitte; Freeston, Katrina; Buchholz, Jörg M

2016-07-01

Many listeners with hearing loss report particular difficulties with multitalker communication situations, but these difficulties are not well predicted using current clinical and laboratory assessment tools. The overall aim of this work is to create new speech tests that capture key aspects of multitalker communication situations and ultimately provide better predictions of real-world communication abilities and the effect of hearing aids. A test of ongoing speech comprehension introduced previously was extended to include naturalistic conversations between multiple talkers as targets, and a reverberant background environment containing competing conversations. In this article, we describe the development of this test and present a validation study. Thirty listeners with normal hearing participated in this study. Speech comprehension was measured for one-, two-, and three-talker passages at three different signal-to-noise ratios (SNRs), and working memory ability was measured using the reading span test. Analyses were conducted to examine passage equivalence, learning effects, and test-retest reliability, and to characterize the effects of number of talkers and SNR. Although we observed differences in difficulty across passages, it was possible to group the passages into four equivalent sets. Using this grouping, we achieved good test-retest reliability and observed no significant learning effects. Comprehension performance was sensitive to the SNR but did not decrease as the number of talkers increased. Individual performance showed associations with age and reading span score. This new dynamic speech comprehension test appears to be valid and suitable for experimental purposes. Further work will explore its utility as a tool for predicting real-world communication ability and hearing aid benefit. American Academy of Audiology.
Does Test Preparation Work? Implications for Score Validity

ERIC Educational Resources Information Center

Xie, Qin

2013-01-01

This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…
Woodcock-Johnson-III, Kaufman Adolescent and Adult Intelligence Test (KAIT), Kaufman Assessment Battery for Children (KABC), and Differential Ability Scales (DAS) support Carroll but not Cattell-Horn.

PubMed

Cucina, Jeffrey M; Howardson, Garett N

2017-08-01

Recently emerging evidence suggests that the dominant structural model of mental abilities-the Cattell-Horn-Carroll (CHC) model-may not adequately account for observed scores for mental abilities batteries, leading scholars to call into question the model's validity. Establishing the robustness of these findings is important since CHC is the foundation for several contemporary mental abilities test batteries, such as the Woodcock-Johnson III (WJ-III). Using confirmatory factor analysis, we investigated CHC's robustness across 4 archival samples of mental abilities test battery data, including the WJ-III, the Kaufman Adolescent & Adult Intelligence Test (KAIT), the Kaufman Assessment Battery for Children (KABC), and the Differential Ability Scales (DAS). We computed omega hierarchical (ωH) and omega subscale (ωS) coefficients for g and the broad factors, which estimated the relationship of composite scores to g and the broad factors, respectively. Across all 4 samples, we found strong evidence for a general ability, g. We additionally found evidence for 3 to 9 residualized, orthogonal broad abilities existing independently of g, many of which also explained reliable variance in test battery scores that cannot be accounted for by g alone. The reliabilities of these broad factors, however, were less than desirable (i.e., <.80) and achieving desirable reliabilities would be practically infeasible (e.g., requiring excessively large numbers of subtests). Our results, and those of CHC critics, are wholly consistent with Carroll's model. Essentially, both g and orthogonal broad abilities are required to explain variance in mental abilities test battery scores, which is consistent with Carroll but not Cattell-Horn. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
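The omega coefficients mentioned above can be computed from standardized bifactor loadings. The sketch below uses the standard definitions for an orthogonal bifactor model in which each subtest loads on g and on exactly one group factor. The loadings are invented for illustration, and the subscale coefficient follows the usual "omega hierarchical subscale" definition, which is assumed to match the abstract's ωS.

```python
def omega_coefficients(
    g_loadings: list[float],
    s_loadings: list[float],
    groups: list[str],
) -> tuple[float, dict[str, float]]:
    """Return (omega_H, {group: omega_S}) for an orthogonal bifactor model.

    groups[i] names the single group factor on which subtest i loads.
    Unique variance of each standardized subtest is 1 - g^2 - s^2.
    """
    uniq = [1.0 - g * g - s * s for g, s in zip(g_loadings, s_loadings)]
    labels = sorted(set(groups))
    members = {lab: [i for i, name in enumerate(groups) if name == lab] for lab in labels}

    g_part = sum(g_loadings) ** 2
    s_parts = {lab: sum(s_loadings[i] for i in members[lab]) ** 2 for lab in labels}

    # Share of total-composite variance attributable to g.
    total_var = g_part + sum(s_parts.values()) + sum(uniq)
    omega_h = g_part / total_var

    # Share of each subscale composite's variance due to its group factor alone.
    omega_s = {}
    for lab in labels:
        idx = members[lab]
        sub_var = sum(g_loadings[i] for i in idx) ** 2 + s_parts[lab] + sum(uniq[i] for i in idx)
        omega_s[lab] = s_parts[lab] / sub_var
    return omega_h, omega_s


# Hypothetical battery: six subtests, g loadings of 0.7, two group factors
# ("Gf" and "Gc" are placeholder labels) with loadings of 0.4.
omega_h, omega_s = omega_coefficients(
    g_loadings=[0.7] * 6,
    s_loadings=[0.4] * 6,
    groups=["Gf", "Gf", "Gf", "Gc", "Gc", "Gc"],
)
print(f"omega_H = {omega_h:.3f}")  # omega_H = 0.780
print({name: round(value, 3) for name, value in omega_s.items()})  # {'Gc': 0.209, 'Gf': 0.209}
```

With these made-up loadings the general factor dominates the total score while each broad factor accounts for only about a fifth of its subscale composite, the same qualitative pattern the abstract describes (strong g, broad factors with reliabilities below .80).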
Self-assessment of social cognitive ability in schizophrenia: Association with social cognitive test performance, informant assessments of social cognitive ability, and everyday outcomes.

PubMed

Silberstein, Juliet M; Pinkham, Amy E; Penn, David L; Harvey, Philip D

2018-04-17

Impairments in self-assessment are common in people with schizophrenia and impairments in self-assessment of cognitive ability have been found to predict impaired functional outcome. In this study, we examined self-assessment of social cognitive ability and related it to assessments of social cognition provided by informants, to performance on tests of social cognition, and to everyday outcomes. The difference between self-reported social cognition and informant ratings was used to predict everyday functioning. People with schizophrenia (n=135) performed 8 different tests of social cognition. They were asked to rate their social cognitive abilities on the Observable Social Cognition Rating Scale (OSCARs). High contact informants also rated social cognitive ability and everyday outcomes, while unaware of the patients' social cognitive performance and self-assessments. Social competence was measured with a performance-based assessment and clinical ratings of negative symptoms were also performed. Patient reports of their social cognitive abilities were uncorrelated with performance on social cognitive tests and with three of the four domains of functional outcomes. Differences between self-reported and informant rated social cognitive ability predicted impaired everyday functioning across all four functional domains. This difference score predicted disability even when the influences of social cognitive performance, social competence, and negative symptoms were considered. Mis-estimation of social cognitive ability was an important predictor of social and nonsocial outcomes in schizophrenia compared to performance on social cognitive tests. These results suggest that consideration of self-assessment is critical when attempting to evaluate the causes of disability and when trying to implement interventions targeting disability reduction. Copyright © 2018 Elsevier B.V. All rights reserved.
Learning Anatomy Enhances Spatial Ability

ERIC Educational Resources Information Center

Vorstenbosch, Marc A. T. M.; Klaassen, Tim P. F. M.; Donders, A. R. T.; Kooloos, Jan G. M.; Bolhuis, Sanneke M.; Laan, Roland F. J. M.

2013-01-01

Spatial ability is an important factor in learning anatomy. Students with high scores on a mental rotation test (MRT) systematically score higher on anatomy examinations. This study aims to investigate whether learning anatomy in turn improves the MRT score. Five hundred first-year students of medicine ("n" = 242, intervention) and…
Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

ERIC Educational Resources Information Center

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

2017-01-01

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
Predictive ability of the ISS, NISS, and APACHE II score for SIRS and sepsis in polytrauma patients.

PubMed

Mica, L; Furrer, E; Keel, M; Trentz, O

2012-12-01

Systemic inflammatory response syndrome (SIRS) and sepsis as causes of multiple organ dysfunction syndrome (MODS) remain challenging to treat in polytrauma patients. In this study, the focus was set on widely used scoring systems to assess their diagnostic quality. A total of 512 patients (mean age: 39.2 ± 16.2, range: 16-88 years) who had an Injury Severity Score (ISS) ≥17 were included in this retrospective study. The patients were subdivided into four groups: no SIRS, slight SIRS, severe SIRS, and sepsis. The ISS, New Injury Severity Score (NISS), Acute Physiology and Chronic Health Evaluation II (APACHE II) scores, and prothrombin time were collected at admission. The Kruskal-Wallis test, χ² test, multinomial regression analysis, and kernel density estimates were performed. Receiver operating characteristic (ROC) analysis is reported as the area under the curve (AUC). Results were considered significant if p < 0.05. All variables were significantly different in all groups (p < 0.001). The odds ratio increased with increasing SIRS severity for NISS (slight vs. no SIRS, 1.06, p = 0.07; severe vs. no SIRS, 1.07, p = 0.04; and sepsis vs. no SIRS, 1.11, p = 0.0028) and APACHE II score (slight vs. no SIRS, 0.97, p = 0.44; severe vs. no SIRS, 1.08, p = 0.02; and sepsis vs. no SIRS, 1.12, p = 0.0028). ROC analysis revealed that the NISS (slight vs. no SIRS, AUC 0.61; severe vs. no SIRS, AUC 0.67; and sepsis vs. no SIRS, AUC 0.77) and APACHE II score (slight vs. no SIRS, AUC 0.60; severe vs. no SIRS, AUC 0.74; and sepsis vs. no SIRS, AUC 0.82) had the best predictive ability for SIRS and sepsis. Quick assessment with the NISS or APACHE II score could preselect possible candidates for sepsis following polytrauma and provide guidance in trauma surgeons' decision-making.
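The AUC values reported above can be read as the probability that a randomly chosen patient from the more severe group has a higher score than a randomly chosen patient from the comparison group. The sketch below illustrates that pairwise definition only; it is not the authors' analysis code, the function name `auc` is hypothetical, and the example scores are invented for illustration.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the share of (positive, negative) pairs in which the positive
    case has the higher score; ties count as one half.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


# Invented severity scores, NOT data from the study.
sepsis = [34, 41, 27, 50]
no_sirs = [22, 29, 18, 27]
print(auc(sepsis, no_sirs))
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, which is why the sepsis vs. no SIRS comparisons (0.77 and 0.82) are described as the most informative.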
Simulating the Effects of Common and Specific Abilities on Test Performance: An Evaluation of Factor Analysis

ERIC Educational Resources Information Center

McFarland, Dennis J.

2014-01-01

Purpose: Factor analysis is a useful technique to aid in organizing multivariate data characterizing speech, language, and auditory abilities. However, knowledge of the limitations of factor analysis is essential for proper interpretation of results. The present study used simulated test scores to illustrate some characteristics of factor…
A physical function test for use in the intensive care unit: validity, responsiveness, and predictive utility of the physical function ICU test (scored).

PubMed

Denehy, Linda; de Morton, Natalie A; Skinner, Elizabeth H; Edbrooke, Lara; Haines, Kimberley; Warrillow, Stephen; Berney, Sue

2013-12-01

Several tests have recently been developed to measure changes in patient strength and functional outcomes in the intensive care unit (ICU). The original Physical Function ICU Test (PFIT) demonstrates reliability and sensitivity. The aims of this study were to further develop the original PFIT, to derive an interval score (the PFIT-s), and to test the clinimetric properties of the PFIT-s. A nested cohort study was conducted. One hundred forty-four and 116 participants performed the PFIT at ICU admission and discharge, respectively. Original test components were modified using principal component analysis. Rasch analysis examined the unidimensionality of the PFIT, and an interval score was derived. Correlations tested validity, and multiple regression analyses investigated predictive ability. Responsiveness was assessed using the effect size index (ESI), and the minimal clinically important difference (MCID) was calculated. The shoulder lift component was removed. Unidimensionality of combined admission and discharge PFIT-s scores was confirmed. The PFIT-s displayed moderate convergent validity with the Timed "Up & Go" Test (r=-.60), the Six-Minute Walk Test (r=.41), and the Medical Research Council (MRC) sum score (rho=.49). The ESI of the PFIT-s was 0.82, and the MCID was 1.5 points (interval scale range=0-10). A higher admission PFIT-s score was predictive of: an MRC score of ≥48, increased likelihood of discharge home, reduced likelihood of discharge to inpatient rehabilitation, and reduced acute care hospital length of stay. Scoring of sit-to-stand assistance required is subjective, and cadence cutpoints used may not be generalizable. The PFIT-s is a safe and inexpensive test of physical function with high clinical utility. It is valid, responsive to change, and predictive of key outcomes. It is recommended that the PFIT-s be adopted to test physical function in the ICU.
Conservatism and Cognitive Ability

ERIC Educational Resources Information Center

Stankov, Lazar

2009-01-01

Conservatism and cognitive ability are negatively correlated. The evidence is based on 1254 community college students and 1600 foreign students seeking entry to United States' universities. At the individual level of analysis, conservatism scores correlate negatively with SAT, Vocabulary, and Analogy test scores. At the national level of…
The Truth about Scores Children Achieve on Tests.

ERIC Educational Resources Information Center

Brown, Jonathan R.

1989-01-01

The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
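As a rough illustration of the calculation the abstract refers to, the standard error of measurement in classical test theory is commonly computed as the score standard deviation times the square root of one minus the reliability. The sketch below assumes that textbook formula; the function names and the example values (SD = 15, reliability = .91) are hypothetical and are not taken from the article.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """SEm = SD * sqrt(1 - reliability), the classical test theory formula."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed, sd, reliability, z=1.96):
    """Approximate band of observed score +/- z * SEm (z = 1.96 for about 95%)."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - z * sem, observed + z * sem


# Hypothetical scale: SD of 15 and a reliability coefficient of .91.
print(standard_error_of_measurement(15, 0.91))
print(score_band(100, 15, 0.91))
```

Reporting a band rather than a single number makes the measurement error visible: two children whose bands overlap substantially cannot be confidently ranked on the basis of their observed scores.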
Concurrent and Predictive Validity of the Raven Progressive Matrices and the Naglieri Nonverbal Ability Test

ERIC Educational Resources Information Center

Balboni, Giulia; Naglieri, Jack A.; Cubelli, Roberto

2010-01-01

The concurrent and predictive validities of the Naglieri Nonverbal Ability Test (NNAT) and Raven's Colored Progressive Matrices (CPM) were investigated in a large group of Italian third- and fifth-grade students with different sociocultural levels evaluated at the beginning and end of the school year. CPM and NNAT scores were related to math and…
A Test of the Relationship between Reading Ability & Standardized Biology Assessment Scores

ERIC Educational Resources Information Center

Allen, Denise A.

2014-01-01

Little empirical evidence suggested that independent reading abilities of students enrolled in biology predicted their performance on the Biology I Graduation End-of-Course Assessment (ECA). An archival study was conducted at one Indiana urban public high school in Indianapolis, Indiana, by examining existing educational assessment data to test…
Association between the gait pattern characteristics of older people and their two-step test scores.

PubMed

Kobayashi, Yoshiyuki; Ogata, Toru

2018-04-27

The Two-Step test is one of three official tests authorized by the Japanese Orthopedic Association to evaluate the risk of locomotive syndrome (a condition of reduced mobility caused by an impairment of the locomotive organs). It has been reported that the Two-Step test score has a good correlation with one's walking ability; however, its association with the gait pattern of older people during normal walking is still unknown. Therefore, this study aims to clarify the associations between the gait patterns of older people observed during normal walking and their Two-Step test scores. We analyzed the whole waveforms obtained from the lower-extremity joint angles and joint moments of 26 older people in various stages of locomotive syndrome using principal component analysis (PCA). The PCA was conducted using a 260 × 2424 input matrix constructed from the participants' time-normalized pelvic and right-lower-limb-joint angles along three axes (ten trials of 26 participants, 101 time points, 4 angles, 3 axes, and 2 variable types per trial). The Pearson product-moment correlation coefficient between the scores of the principal component vectors (PCVs) and the scores of the Two-Step test revealed that only one PCV (PCV 2) among the 61 obtained relevant PCVs is significantly related to the score of the Two-Step test. We therefore concluded that the joint angles and joint moments related to PCV 2 (ankle plantar-flexion, ankle plantar-flexor moments during the late stance phase, and ranges of motion and moments on the hip, knee, and ankle joints in the sagittal plane during the entire stance phase) are the motions associated with the Two-Step test.
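The 260 × 2424 dimensions quoted in the abstract follow directly from the counts given in parentheses: one row per trial and one column per combination of time point, angle, axis, and variable type. The short sketch below only checks that arithmetic; the constant and function names are hypothetical, and the column ordering inside the real matrix is not stated in the abstract.

```python
# Counts taken from the abstract.
TRIALS_PER_PARTICIPANT = 10
PARTICIPANTS = 26
TIME_POINTS = 101      # time-normalized samples per waveform
ANGLES = 4             # pelvis plus three right-lower-limb joints
AXES = 3
VARIABLE_TYPES = 2     # joint angles and joint moments


def input_matrix_shape():
    """Return (rows, columns) of the PCA input matrix implied by the counts."""
    rows = TRIALS_PER_PARTICIPANT * PARTICIPANTS
    cols = TIME_POINTS * ANGLES * AXES * VARIABLE_TYPES
    return rows, cols


print(input_matrix_shape())
```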

The ability of the 2013 ACC/AHA cardiovascular risk score to identify rheumatoid arthritis patients with high coronary artery calcification scores

PubMed Central

Kawai, Vivian K.; Chung, Cecilia P.; Solus, Joseph F.; Oeser, Annette; Raggi, Paolo; Stein, C. Michael

2014-01-01

Objective: Patients with rheumatoid arthritis (RA) have increased risk of atherosclerotic cardiovascular disease (ASCVD) that is underestimated by the Framingham risk score (FRS). We hypothesized that the 2013 ACC/AHA 10-year risk score would perform better than the FRS and the Reynolds risk score (RRS) in identifying RA patients known to have elevated cardiovascular risk based on high coronary artery calcification (CAC) scores. Methods: Among 98 RA patients eligible for risk stratification using the ACC/AHA score we identified 34 patients with high CAC (≥ 300 Agatston units or ≥75th percentile) and compared the ability of the 10-year FRS, RRS and the ACC/AHA risk scores to correctly assign these patients to an elevated risk category. Results: All three risk scores were higher in patients with high CAC (P values <0.05). The percentage of patients with high CAC correctly assigned to the elevated risk category was similar among the three scores (FRS 32%, RRS 32%, ACC/AHA 41%) (P=0.233). The c-statistics for the FRS, RRS and ACC/AHA risk scores predicting the presence of high CAC were 0.65, 0.66, and 0.65, respectively. Conclusions: The ACC/AHA 10-year risk score does not offer any advantage compared to the traditional FRS and RRS in the identification of RA patients with elevated risk as determined by high CAC. The ACC/AHA risk score assigned almost 60% of patients with high CAC into a low risk category. Risk scores and standard risk prediction models used in the general population do not adequately identify many RA patients with elevated cardiovascular risk. PMID:25371313
The Probability of Obtaining Two Statistically Different Test Scores as a Test Index

ERIC Educational Resources Information Center

Muller, Jorg M.

2006-01-01

A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
Concurrent Validity of the Woodcock-Johnson Tests of Cognitive Ability with the WISC-R: EMR Children.

ERIC Educational Resources Information Center

Cummings, Jack A.; Sanville, David

1983-01-01

Administered the Wechsler Intelligence Scale for Children-Revised (WISC-R) and the Woodcock-Johnson Tests of Cognitive Ability (WJTCA) to educable mentally retarded children (N=30). Results showed significant mean differences between WISC-R and WJTCA full-scale standard scores, providing implications for placement of children in classes for the…
Predicting Job Performance by Use of Ability Tests and Studying Job Satisfaction as a Moderating Variable

ERIC Educational Resources Information Center

Ivancevich, John M.

1976-01-01

This empirically based study of 324 technicians investigated the moderating impact of job satisfaction in the prediction of job performance criteria from ability test scores. The findings suggest that the type of job satisfaction facet and the performance criterion used are important considerations when examining satisfaction as a moderator.…
The Relationship between Students' Performance on the Cognitive Abilities Test (CogAT) and the Fourth and Fifth Grade Reading and Math Achievement Tests in Ohio

ERIC Educational Resources Information Center

Warnimont, Chad S.

2010-01-01

The purpose of this quantitative study was to examine the relationship between students' performance on the Cognitive Abilities Test (CogAT) and the fourth and fifth grade Reading and Math Achievement Tests in Ohio. The sample utilized students from a suburban school district in Northwest Ohio. Third grade CogAT scores (2006-2007 school year), 4th…
Minority Performance on the Naglieri Nonverbal Ability Test, Second Edition, versus the Cognitive Abilities Test, Form 6: One Gifted Program's Experience

ERIC Educational Resources Information Center

Giessman, Jacob A.; Gambrell, James L.; Stebbins, Molly S.

2013-01-01

The Naglieri Nonverbal Ability Test, Second Edition (NNAT2), is used widely to screen students for possible inclusion in talent development programs. The NNAT2 claims to provide a more culturally neutral evaluation of general ability than tests such as Form 6 of the Cognitive Abilities Test (CogAT6), which has Verbal and Quantitative batteries in…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Nevada

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Nevada's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in both subjects. Trends in average (mean) test scores…
Application of new WAIS-III/WMS-III discrepancy scores for evaluating memory functioning: relationship between intellectual and memory ability.

PubMed

Lange, Rael T; Chelune, Gordon J

2006-05-01

Analysis of the discrepancy between memory and intellectual ability has received some support as a means for evaluating memory impairment. Recently, comprehensive base rate tables for General Ability Index (GAI) minus memory discrepancy scores (i.e., GAI-memory) were developed using the WAIS-III/WMS-III standardization sample (Lange, Chelune, & Tulsky, in press). The purpose of this study was to evaluate the clinical utility of GAI-memory discrepancy scores to identify memory impairment in 34 patients with Alzheimer's type dementia (DAT) versus a sample of 34 demographically matched healthy participants. On average, patients with DAT obtained significantly lower scores on all WAIS-III and WMS-III indexes and had larger GAI-memory discrepancy scores. Clinical outcome analyses revealed that GAI-memory scores were useful at identifying memory impairment in patients with DAT versus matched healthy participants. However, GAI-memory discrepancy scores failed to provide unique interpretive information beyond that which is gained from the memory indexes alone. Implications and future research directions are discussed.
Predictive value of background experiences and visual spatial ability testing on laparoscopic baseline performance among residents entering postgraduate surgical training.

PubMed

Louridas, Marisa; Quinn, Lauren E; Grantcharov, Teodor P

2016-03-01

Emerging evidence suggests that despite dedicated practice, not all surgical trainees have the ability to reach technical competency in minimally invasive techniques. While selecting residents that have the ability to reach technical competence is important, evidence to guide the incorporation of technical ability into selection processes is limited. Therefore, the purpose of the present study was to evaluate whether background experiences and 2D-3D visual spatial test results are predictive of baseline laparoscopic skill for the novice surgical trainee. First-year residents were studied. Demographic data and background surgical and non-surgical experiences were obtained using a questionnaire. Visual spatial ability was evaluated using the PicSOr, cube comparison (CC) and card rotation (CR) tests. Technical skill was assessed using the camera navigation (LCN) task and laparoscopic circle cut (LCC) task. Resident performance on these technical tasks was compared and correlated with the questionnaire and visual spatial findings. Previous experience in observing laparoscopic procedures was associated with significantly better LCN performance, and experience in navigating the laparoscopic camera was associated with significantly better LCC task results. Residents who scored higher on the CC test demonstrated a more accurate LCN path length score (r_s(PL) = -0.36, p = 0.03) and angle path score (r_s(AP) = -0.426, p = 0.01) when completing the LCN task. No other significant correlations were found between the visual spatial tests (PicSOr, CC or CR) and LCC performance. While identifying selection tests for incoming surgical trainees that predict technical skill performance is appealing, the surrogate markers evaluated correlate with specific metrics of surgical performance related to a single task but do not appear to reliably predict technical performance of different laparoscopic tasks. Predicting the acquisition of technical skills will require the development
Do Gains in Test Scores Explain Labor Market Outcomes?

ERIC Educational Resources Information Center

Rose, Heather

2006-01-01

Using data from the National Education Longitudinal Study of 1988, this article investigates whether students who made relatively large test score gains during high school had larger earnings 7 years after high school compared to students whose scores improved little. In models that control for pre-high school test scores, family background, and…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Louisiana

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Louisiana's test score trends through 2008-09. Between 2005 and 2009, trends on state tests and NAEP (National Assessment of Educational Progress) sometimes differed. On the state test, the percentages of students reaching the proficient level increased at grades 4 and 8 in both reading and math. On NAEP, the percentage of…
The Effects of Video Game Experience on Computer-Based Air Traffic Controller Specialist, Air Traffic Scenario Test Scores.

DTIC Science & Technology

1997-02-01

application with a strong resemblance to a video game, concern has been raised that prior video game experience might have a moderating effect on scores. Much...such as spatial ability. The effects of computer or video game experience on work sample scores have not been systematically investigated. The purpose...of this study was to evaluate the incremental validity of prior video game experience over that of general aptitude as a predictor of work sample test
An approach to analyzing a single subject's scores obtained in a standardized test with application to the Aachen Aphasia Test (AAT).

PubMed

Willmes, K

1985-08-01

Methods for the analysis of a single subject's test profile(s) proposed by Huber (1973) are applied to the Aachen Aphasia Test (AAT). The procedures are based on the classical test theory model (Lord & Novick, 1968) and are suited for any (achievement) test with standard norms from a large standardization sample and satisfactory reliability estimates. Two test profiles of a Wernicke's aphasic, obtained before and after a 3-month period of speech therapy, are analyzed using inferential comparisons between (groups of) subtest scores on one test application and between two test administrations for single (groups of) subtests. For each of these comparisons, the two aspects of (i) significant (reliable) differences in performance beyond measurement error and (ii) the diagnostic validity of that difference in the reference population of aphasic patients are assessed. Significant differences between standardized subtest scores and a remarkably better preserved reading and writing ability could be found for both test administrations using the multiple test procedure of Holm (1979). Comparison of both profiles revealed an overall increase in performance for each subtest as well as changes in level of performance relations between pairs of subtests.
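The multiple test procedure of Holm (1979) mentioned above controls the familywise error rate by comparing the ordered p-values against progressively less strict thresholds. The sketch below shows the generic step-down rule only; it is not the AAT-specific analysis, the function name `holm` is hypothetical, and the p-values are invented for illustration.

```python
def holm(p_values, alpha=0.05):
    """Holm step-down procedure.

    Sort the p-values in ascending order and compare the k-th smallest
    (k = 0, 1, ...) against alpha / (m - k). Stop at the first failure;
    every hypothesis from that point on is retained.
    Returns a list of booleans (True = rejected) in the original order.
    """
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    reject = [False] * m
    for rank, idx in enumerate(order):
        if p_values[idx] <= alpha / (m - rank):
            reject[idx] = True
        else:
            break
    return reject


# Invented p-values for three subtest comparisons, NOT values from the article.
print(holm([0.01, 0.04, 0.03]))
```

Compared with a plain Bonferroni correction, which would test every comparison at alpha / m, the step-down rule is never less powerful while offering the same familywise protection.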
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Tennessee

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Tennessee's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. At grade 4, trends on the state test and NAEP differed somewhat. In…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Maryland

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Maryland's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased at grades 4 and 8 in both reading and math. Average annual gains were larger on the state test than…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Pennsylvania

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Pennsylvania's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in…
Spatial abilities, Earth science conceptual understanding, and psychological gender of university non-science majors

NASA Astrophysics Data System (ADS)

Black, Alice A. (Jill)

Research has shown the presence of many Earth science misconceptions and conceptual difficulties that may impede concept understanding, and has also identified a number of categories of spatial ability. Although spatial ability has been linked to high performance in science, some researchers believe it has been overlooked in traditional education. Evidence exists that spatial ability can be improved. This correlational study investigated the relationship among Earth science conceptual understanding, three types of spatial ability, and psychological gender, a self-classification that reflects socially-accepted personality and gender traits. A test of Earth science concept understanding, the Earth Science Concepts (ESC) test, was developed and field tested from 2001 to 2003 in 15 sections of university classes. Criterion validity was .60, significant at the .01 level. Spearman/Brown reliability was .74 and Kuder/Richardson reliability was .63. The Purdue Visualization of Rotations (PVOR) (mental rotation), the Group Embedded Figures Test (GEFT) (spatial perception), the Differential Aptitude Test: Space Relations (DAT) (spatial visualization), and the Bem Inventory (BI) (psychological gender) were administered to 97 non-major university students enrolled in undergraduate science classes. Spearman correlations revealed moderately significant correlations at the .01 level between ESC scores and each of the three spatial ability test scores. Stepwise regression analysis indicated that PVOR scores were the best predictor of ESC scores, and showed that spatial ability scores accounted for 27% of the total variation in ESC scores. Spatial test scores were moderately or weakly correlated with each other. No significant correlations were found among BI scores and other test scores. Scantron difficulty analysis of ESC items produced difficulty ratings ranging from 33.04 to 96.43, indicating the percentage of students who answered incorrectly. Mean score on the ESC was 34
The Woodcock-Johnson Tests of Cognitive Abilities III's Cognitive Performance Model: Empirical Support for Intermediate Factors within CHC Theory

ERIC Educational Resources Information Center

Taub, Gordon E.; McGrew, Kevin S.

2014-01-01

The Woodcock-Johnson Tests of Cognitive Ability Third Edition is developed using the Cattell-Horn-Carroll (CHC) measurement-theory test design as the instrument's theoretical blueprint. The instrument provides users with cognitive scores based on the Cognitive Performance Model (CPM); however, the CPM is not a part of CHC theory. Within the…
Assessing manual dexterity: Comparing the WorkAbility Rate of Manipulation Test with the Minnesota Manual Dexterity Test.

PubMed

Wang, Ying-Chih; Wickstrom, Rick; Yen, Sheng-Che; Kapellusch, Jay; Grogan, Kimberly A

2017-05-10

Study design: cross-sectional study. The WorkAbility Rate of Manipulation Test (WRMT), an adaptation of the Minnesota Manual Dexterity Test (MMDT), contains a revised board and protocols to improve its utility for therapy or fitness assessment. Purpose: to describe the development and preliminary psychometric properties of WRMT. Sixty-six healthy participants completed MMDT and WRMT in a random order followed by a user experience survey. We compared tests using repeated-measures analysis of variance, test-retest reliability, and examined agreement between tests. Despite the similarities of these 2 instruments, the different administration protocols resulted in statistically different score distributions (P < .001). Results supported good test-retest reliability of WRMT (placing test ICC = 0.88-0.90 and turning test ICC = 0.68-0.82). The WRMT correlated moderately with MMDT (r = 0.81 in placing test and r = 0.44-0.57 in turning test). Bland-Altman plot showed that the differences in completion time were 3.8 seconds between placing tests and 19.6 (both hands), 0.3 (right hand), and 3.9 (left hand) seconds between turning tests. Overall, participants felt that the instruction of WRMT was easier to follow (44%) and preferred its setup, color, and depth of the test board (49%). Time required to complete 1 panel of 20 disks correlated highly with the time needed to finish a complete trial of 60 disks in both MMDT (r = 0.91-0.97) and WRMT (r = 0.88-0.95). Caution is warranted in comparing scores from these 2 test variants. Level of evidence: 3b. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
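The agreement figures above come from a Bland-Altman analysis, which summarizes paired measurements by their mean difference (bias) and a range expected to contain most individual differences. The sketch below assumes the conventional limits of agreement (bias ± 1.96 standard deviations of the differences); the function name and the completion times are hypothetical, not data from the study.

```python
import statistics


def bland_altman(method_a, method_b):
    """Return (bias, lower_limit, upper_limit) for paired measurements.

    bias is the mean of the paired differences (a - b); the limits of
    agreement are bias +/- 1.96 sample standard deviations of those differences.
    """
    if len(method_a) != len(method_b):
        raise ValueError("both methods need one measurement per participant")
    diffs = [a - b for a, b in zip(method_a, method_b)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Invented completion times in seconds, NOT data from the study.
mmdt_times = [62.0, 58.5, 70.1, 65.3, 59.8]
wrmt_times = [58.9, 55.0, 65.2, 61.0, 56.4]
print(bland_altman(mmdt_times, wrmt_times))
```

A high correlation between two instruments does not by itself imply agreement, so a nonzero bias of this kind is one reason to be cautious about treating scores from the two test variants as interchangeable.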
Racial/Ethnic Differences in the Criterion-Related Validity of Cognitive Ability Tests: A Qualitative and Quantitative Review

ERIC Educational Resources Information Center

Berry, Christopher M.; Clark, Malissa A.; McClure, Tara K.

2011-01-01

The correlation between cognitive ability test scores and performance was separately meta-analyzed for Asian, Black, Hispanic, and White racial/ethnic subgroups. Compared to the average White observed correlation ([image omitted] = 0.33, N = 903,779), average correlations were lower for Black samples ([image omitted] = 0.24, N = 112,194) and…

The Ability of Standardized Test Instruments to Differentiate Membership in Different Vocational-Technical Curricula. Project MINI-SCORE, Final Technical Report.

ERIC Educational Resources Information Center

Pucel, David J.; And Others

Using post-secondary vocational education students as the populations, these two sub-studies of the Project MINI-SCORE sought to determine the extent to which pre-enrollment standardized test data can be used to predict vocational success. For the purpose of the study, vocational success was defined either as successful graduation or successful…
Prepharmacy predictors of success in pharmacy school: grade point averages, pharmacy college admissions test, communication abilities, and critical thinking skills.

PubMed

Allen, D D; Bond, C A

2001-07-01

Good admissions decisions are essential for identifying successful students and good practitioners. Various parameters have been shown to have predictive power for academic success. Previous academic performance, the Pharmacy College Admissions Test (PCAT), and specific prepharmacy courses have been suggested as academic performance indicators. However, critical thinking abilities have not been evaluated. We evaluated the connection between academic success and each of the following predictive parameters: the California Critical Thinking Skills Test (CCTST) score, PCAT score, interview score, overall academic performance prior to admission at a pharmacy school, and performance in specific prepharmacy courses. We confirmed previous reports but demonstrated intriguing results in predicting practice-based skills. Critical thinking skills predict practice-based course success. Also, the CCTST and PCAT scores (Pearson correlation [pc] = 0.448, p < 0.001) were closely related in our students. The strongest predictors of practice-related courses and clerkship success were PCAT (pc=0.237, p<0.001) and CCTST (pc = 0.201, p < 0.001). These findings and other analyses suggest that PCAT may predict critical thinking skills in pharmacy practice courses and clerkships. Further study is needed to confirm this finding and determine which PCAT components predict critical thinking abilities.
New and updated tests of print exposure and reading abilities in college students

PubMed Central

Acheson, Daniel J.; Wells, Justine B.; MacDonald, Maryellen C.

2010-01-01

The relationship between print exposure and measures of reading skill was examined in college students (N = 99, 58 female; mean age = 20.3 years). Print exposure was measured with several new self-reports of reading and writing habits, as well as updated versions of the Author Recognition Test and the Magazine Recognition Test (Stanovich & West, 1989). Participants completed a sentence comprehension task with syntactically complex sentences, and reading times and comprehension accuracy were measured. An additional measure of reading skill was provided by participants’ scores on the verbal portions of the ACT, a standardized achievement test. Higher levels of print exposure were associated with higher sentence processing abilities and superior verbal ACT performance. The relative merits of different print exposure assessments are discussed. PMID:18411551
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Nebraska

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Nebraska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the percentages reaching the basic level on NAEP (National Assessment of Educational Progress) increased at grade 4 in both reading and math. At grade 8, however, the percentages…
Stability of scores for the Slosson Full-Range Intelligence Test.

PubMed

Williams, Thomas O; Eaves, Ronald C; Woods-Groves, Suzanne; Mariano, Gina

2007-08-01

The test-retest stability of the Slosson Full-Range Intelligence Test by Algozzine, Eaves, Mann, and Vance was investigated with test scores from a sample of 103 students. With a mean interval of 13.7 mo. and different examiners for each of the two test administrations, the test-retest reliability coefficients for the Full-Range IQ, Verbal Reasoning, Abstract Reasoning, Quantitative Reasoning, and Memory were .93, .85, .80, .80, and .83, respectively. Mean test-retest differences were not statistically significant for any of the scales. Results suggest that Slosson scores are stable over time even when different examiners administer the test.
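As a rough illustration of how a test-retest coefficient of this kind can be obtained, the sketch below computes a Pearson correlation between two administrations of the same test. It assumes the reported coefficients are Pearson correlations, which the abstract does not state; the function name `pearson_r` and the score lists are hypothetical and are not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equally long lists of scores."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and squared deviations around the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical scores for six examinees at the first and second administration.
first_administration = [98, 105, 112, 87, 120, 101]
second_administration = [101, 103, 115, 90, 117, 99]

r = pearson_r(first_administration, second_administration)
print(f"test-retest r = {r:.2f}")
```

A coefficient near 1 means examinees keep roughly the same rank order across administrations; it does not by itself show that mean scores are unchanged, which is why the abstract reports mean differences separately.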
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Alaska

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Alaska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in math and grade 8 in reading. In grade 4 reading, the percentage reaching the…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Massachusetts

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. California

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles California's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were larger on the state test…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Montana

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Montana's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 reading. In grade 8 math, however, the percentage proficient…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Colorado

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Colorado's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on NAEP than…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Wisconsin

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Wisconsin's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in math at grades 4 and 8 and in reading at grade 8. In grade 4 reading, the percentage scoring…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Alabama

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Alabama's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Texas

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Texas' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in reading at grades 4 and 8 and in math at grade 8. In grade 4 math, however, the percentage scoring…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Florida

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Florida's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arizona

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Arizona's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Iowa

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Iowa's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and in grade 8 math. In grade 8 reading, the percentage of students reaching…
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats.

PubMed

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions.
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats

PubMed Central

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Background: Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. Objective: To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. Methods: This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. Results: 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Conclusions: Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions. PMID:27900181
A Test for the Assessment of Pragmatic Abilities and Cognitive Substrates (APACS): Normative Data and Psychometric Properties

PubMed Central

Arcara, Giorgio; Bambini, Valentina

2016-01-01

The Assessment of Pragmatic Abilities and Cognitive Substrates (APACS) test is a new tool to evaluate pragmatic abilities in clinical populations with acquired communicative deficits, ranging from schizophrenia to neurodegenerative diseases. APACS focuses on two main domains, namely discourse and non-literal language, combining traditional tasks with refined linguistic materials in Italian, in a unified framework inspired by language pragmatics. The test includes six tasks (Interview, Description, Narratives, Figurative Language 1, Humor, Figurative Language 2) and three composite scores (Pragmatic Productions, Pragmatic Comprehension, APACS Total). Psychometric properties and normative data were computed on a sample of 119 healthy participants representative of the general population. The analysis revealed acceptable internal consistency and good test-retest reliability for almost every APACS task, suggesting that items are coherent and performance is consistent over time. Factor analysis supports the validity of the test, revealing two factors possibly related to different facets and substrates of the pragmatic competence. Finally, excellent match between APACS items and scores and the pragmatic constructs measured in the test was evidenced by experts' evaluation of content validity. The performance on APACS showed a general effect of demographic variables, with a negative effect of age and a positive effect of education. The norms were calculated by means of state-of-the-art regression methods. Overall, APACS is a valuable tool for the assessment of pragmatic deficits in verbal communication. The short duration and easiness of administration make the test especially suitable to use in clinical settings. In presenting APACS, we also aim at promoting the inclusion of pragmatics in the assessment practice, as a relevant dimension in defining the patient's cognitive profile, given its vital role for communication and social interaction in daily life. The combined
Exploring the general motor ability construct.

PubMed

Ibrahim, Halijah; Heard, N Paul; Blanksby, Brian

2011-10-01

Malaysian students ages 12 to 15 years (N = 330; 165 girls, 165 boys) took the Australian Institute of Sport Talent Identification Test (AIST) and the Balance and Movement Coordination Test (BMC), developed specifically to identify sport talent in Malaysian adolescents. To investigate evidence for general aptitude ("g") in motor ability, a higher-order factor analysis was applied to the motor skills subtests from the AIST and BMC. First-order principal components analysis indicated that scores for the adolescent boys and girls could be described by similar sets of specific motor abilities. In particular, sets of skills identified as Movement Coordination and Postural Control were found, with Balancing Ability also emerging. For the girls, a factor labeled Static Balance was indicated. However, for the boys a more general balance ability labeled Kinesthetic Integration was found, along with an ability labeled Explosive Power. These first-order analyses accounted for 45% to 60% of the variance in the scores on the motor skills tests for the boys and girls, respectively. Separate second-order factor analyses for the boys and girls extracted a single higher-order factor, which was consistent with the existence of a motoric "g".

The effect of perceptual reasoning abilities on confrontation naming performance: An examination of three naming tests.

PubMed

Soble, Jason R; Marceaux, Janice C; Galindo, Juliette; Sordahl, Jeffrey A; Highsmith, Jonathan M; O'Rourke, Justin J F; González, David Andrés; Critchfield, Edan A; McCoy, Karin J M

2016-01-01

Confrontation naming tests are a common neuropsychological method of assessing language and a critical diagnostic tool in identifying certain neurodegenerative diseases; however, there is limited literature examining the visual-perceptual demands of these tasks. This study investigated the effect of perceptual reasoning abilities on three confrontation naming tests, the Boston Naming Test (BNT), Neuropsychological Assessment Battery (NAB) Naming Test, and Visual Naming Test (VNT) to elucidate the diverse cognitive functions underlying these tasks to assist with test selection procedures and increase diagnostic accuracy. A mixed clinical sample of 121 veterans were administered the BNT, NAB, VNT, and Wechsler Adult Intelligence Scale-4th Edition (WAIS-IV) Verbal Comprehension Index (VCI) and Perceptual Reasoning Index (PRI) as part of a comprehensive neuropsychological evaluation. Multiple regression indicated that PRI accounted for 23%, 13%, and 15% of the variance in BNT, VNT, and NAB scores, respectively, but dropped out as a significant predictor once VCI was added. Follow-up bootstrap mediation analyses revealed that PRI had a significant indirect effect on naming performance after controlling education, primary language, and severity of cognitive impairment, as well as the mediating effect of general verbal abilities for the BNT (B = 0.13; 95% confidence interval, CI [.07, .20]), VNT (B = 0.01; 95% CI [.002, .03]), and NAB (B = 0.03; 95% CI [.01, .06]). Findings revealed a complex relationship between perceptual reasoning abilities and confrontation naming that is mediated by general verbal abilities. However, when verbal abilities were statistically controlled, perceptual reasoning abilities were found to have a significant indirect effect on performance across all three confrontation naming measures with the largest effect noted with the BNT relative to the VNT and NAB Naming Test.
Surgical simulation tasks challenge visual working memory and visual-spatial ability differently.

PubMed

Schlickum, Marcus; Hedman, Leif; Enochsson, Lars; Henningsohn, Lars; Kjellin, Ann; Felländer-Tsai, Li

2011-04-01

New strategies for selection and training of physicians are emerging. Previous studies have demonstrated a correlation of visual-spatial ability and visual working memory with surgical simulator performance. The aim of this study was to perform a detailed analysis of how these abilities are associated with metrics in simulator performance with different task content. The hypothesis is that the importance of visual-spatial ability and visual working memory varies with different task contents. Twenty-five medical students participated in the study that involved testing visual-spatial ability using the MRT-A test and visual working memory using the RoboMemo computer program. Subjects were also trained and tested for performance in three different surgical simulators. The scores from the psychometric tests and the performance metrics were then correlated using multivariate analysis. MRT-A score correlated significantly with the performance metrics Efficiency of screening (p = 0.006) and Total time (p = 0.01) in the GI Mentor II task and Total score (p = 0.02) in the MIST-VR simulator task. In the Uro Mentor task, both the MRT-A score and the visual working memory 3-D cube test score as presented in the RoboMemo program (p = 0.02) correlated with Total score (p = 0.004). In this study we have shown that some differences exist regarding the impact of visual abilities and task content on simulator performance. When designing future cognitive training programs and testing regimes, one might have to consider that the design must be adjusted to the specific surgical task to be trained.
THE EFFECTS ON ACHIEVEMENT TEST RESULTS OF VARYING CONDITIONS OF EXPERIMENTAL ATMOSPHERE, NOTICE OF TEST, TEST ADMINISTRATION, AND TEST SCORING.

ERIC Educational Resources Information Center

GOODWIN, WILLIAM L.; AND OTHERS

NULL HYPOTHESES WERE TESTED TO DETERMINE THE DIFFERENTIAL EFFECTS OF (1) EXPERIMENTAL ATMOSPHERE AND ABSENCE OF SAME, (2) NOTICE OF TEST (10 SCHOOL DAYS) AND NO NOTICE (1 SCHOOL DAY), (3) TEACHER ADMINISTRATION AND OUTSIDE ADMINISTRATION OF TESTS, AND (4) TEACHER SCORING AND OUTSIDE SCORING OF TESTS. SIXTH-GRADE CLASSES (N=64), EACH FROM A…
Score Equating and Nominally Parallel Language Tests.

ERIC Educational Resources Information Center

Moy, Raymond

Score equating requires that the forms to be equated are functionally parallel. That is, the two test forms should rank order examinees in a similar fashion. In language proficiency testing situations, this assumption is often put into doubt because of the numerous tests that have been proposed as measures of language proficiency and the…
Relationships between the handball-specific complex test, non-specific field tests and the match performance score in elite professional handball players.

PubMed

Hermassi, Souhail; Chelly, Mohamed-Souhaiel; Wollny, Rainer; Hoffmeyer, Birgit; Fieseler, Georg; Schulze, Stephan; Irlenbusch, Lars; Delank, Karl-Stefan; Shephard, Roy J; Bartels, Thomas; Schwesig, René

2018-06-01

This study assessed the validity of the handball-specific complex test (HBCT) and two non-specific field tests in professional elite handball athletes, using the match performance score (MPS) as the gold standard of performance. Thirteen elite male handball players (age: 27.4±4.8 years; premier German league) performed the HBCT, the Yo-Yo Intermittent Recovery (YYIR) test and a repeated shuttle sprint ability (RSA) test at the beginning of pre-season training. The RSA results were evaluated in terms of best time, total time, and fatigue decrement. Heart rates (HR) were assessed at selected times throughout all tests; the recovery HR was measured immediately post-test and 10 minutes later. The match performance score was based on various handball specific parameters (e.g., field goals, assists, steals, blocks, and technical mistakes) as seen during all matches of the immediately subsequent season (2015/2016). The parameters of run 1, run 2, and HR recovery at minutes 6 and 10 of the RSA test all showed a variance of more than 10% (range: 11-15%). However, the variance of scores for the YYIR test was much smaller (range: 1-7%). The resting HR (r2=0.18), HR recovery at minute 10 (r2=0.10), lactate concentration at rest (r2=0.17), recovery of heart rate from 0 to 10 minutes (r2=0.15), and velocity of second throw at first trial (r2=0.37) were the most valid HBCT parameters. Much effort is necessary to assess MPS and to develop valid tests. Speed and the rate of functional recovery seem the best predictors of competitive performance for elite handball players.
Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

ERIC Educational Resources Information Center

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

2010-01-01

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
The reliability and validity of qualitative scores for the Controlled Oral Word Association Test.

PubMed

Ross, Thomas P; Calhoun, Emily; Cox, Tara; Wenner, Carolyn; Kono, Whitney; Pleasant, Morgan

2007-05-01

The reliability and validity of two qualitative scoring systems for the Controlled Oral Word Association Test [Benton, A. L., Hamsher, de S. K., & Sivan, A. B. (1983). Multilingual aphasia examination (2nd ed.). Iowa City, IA: AJA Associates] were examined in 108 healthy young adults. The scoring systems developed by Troyer et al. [Troyer, A. K., Moscovitch, M., & Winocur, G. (1997). Clustering and switching as two components of verbal fluency: Evidence from younger and older healthy adults. Neuropsychology, 11, 138-146] and by Abwender et al. [Abwender, D. A., Swan, J. G., Bowerman, J. T., & Connolly, S. W. (2001a). Qualitative analysis of verbal fluency output: Review and comparison of several scoring methods. Assessment, 8, 323-336] each demonstrated excellent interrater reliability (all indices at or above r(icc)=.9). Consistent with previous research [e.g., Ross, T. P. (2003). The reliability of cluster and switch scores for the COWAT. Archives of Clinical Neuropsychology, 18, 153-164], test-retest reliability coefficients (N=53; M interval 44.6 days) for the qualitative scores were modest to poor (r(icc)=.6 to .4 range). Correlations among COWAT scores, measures of executive functioning, verbal learning, working memory, and vocabulary were examined. The idea that qualitative scores represent distinct executive functions such as cognitive flexibility or strategy utilization was not supported. We offer the interpretation that COWAT performance may require the ability to retrieve words in a non-routine manner while suppressing habitual responses and associated processing interference, presumably due to a spread of activation across semantic or lexical networks. This interpretation, though speculative at present, implies that clustering and switching on the COWAT may not be entirely deliberate, but rather an artifact of a passive (i.e., state-dependent) process. Ideas for future research, most notably experimental studies using cognitive methods (e.g., priming), are
Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

PubMed

Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

2001-01-01

Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.
Score Reporting for the 1991 Medical College Admission Test.

ERIC Educational Resources Information Center

Mitchell, Karen J.; Haynes, Robert

1990-01-01

Data used in a major review of the system for reporting scores on the Medical College Admission Test (MCAT) are presented and discussed. The data demonstrated the value of the current score-reporting system and led to retention of the 15-point MCAT score scale in 1991. (Author/MSE)
Teacher Greetings Increase College Students' Test Scores

ERIC Educational Resources Information Center

Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

2009-01-01

The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. New Mexico

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles New Mexico's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 math and grade 8 reading and math. In grade 4 reading, the percentage basic on NAEP …
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. North Dakota

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles North Dakota's test score trends through 2008-09. Between 2005 and 2009, the percentage of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were larger on the state test…
Specific algorithm method of scoring the Clock Drawing Test applied in cognitively normal elderly

PubMed Central

Mendes-Santos, Liana Chaves; Mograbi, Daniel; Spenciere, Bárbara; Charchat-Fichman, Helenice

2015-01-01

The Clock Drawing Test (CDT) is an inexpensive, fast and easily administered measure of cognitive function, especially in the elderly. This instrument is a popular clinical tool widely used in screening for cognitive disorders and dementia. The CDT can be applied in different ways and scoring procedures also vary. Objective: The aims of this study were to analyze the performance of elderly on the CDT and evaluate inter-rater reliability of the CDT scored by using a specific algorithm method adapted from Sunderland et al. (1989). Methods: We analyzed the CDT of 100 cognitively normal elderly aged 60 years or older. The CDT ("free-drawn") and Mini-Mental State Examination (MMSE) were administered to all participants. Six independent examiners scored the CDT of 30 participants to evaluate inter-rater reliability. Results and Conclusion: A score of 5 on the proposed algorithm ("Numbers in reverse order or concentrated"), equivalent to 5 points on the original Sunderland scale, was the most frequent (53.5%). The CDT specific algorithm method used had high inter-rater reliability (p<0.01), and mean score ranged from 5.06 to 5.96. The high frequency of an overall score of 5 points may suggest the need to create more nuanced evaluation criteria, which are sensitive to differences in levels of impairment in visuoconstructive and executive abilities during aging. PMID:29213954
A process dissociation approach to objective-projective test score interrelationships.

PubMed

Bornstein, Robert F

2002-02-01

Even when self-report and projective measures of a given trait or motive both predict theoretically related features of behavior, scores on the 2 tests correlate modestly with each other. This article describes a process dissociation framework for personality assessment, derived from research on implicit memory and learning, which can resolve these ostensibly conflicting results. Research on interpersonal dependency is used to illustrate 3 key steps in the process dissociation approach: (a) converging behavioral predictions, (b) modest test score intercorrelations, and (c) delineation of variables that differentially affect self-report and projective test scores. Implications of the process dissociation framework for personality assessment and test development are discussed.
A Strategy for Replacing Sum Scoring

ERIC Educational Resources Information Center

Ramsay, James O.; Wiberg, Marie

2017-01-01

This article promotes the use of modern test theory in testing situations where sum scores for binary responses are now used. It directly compares the efficiencies and biases of classical and modern test analyses and finds an improvement in the root mean squared error of ability estimates of about 5% for two designed multiple-choice tests and…
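The contrast between sum scoring and model-based scoring can be sketched as follows. This is only an illustration under stated assumptions: a two-parameter logistic (2PL) item response model is swapped in as a familiar example of modern test theory, and the abstract does not say which model the article uses. The item parameters, response patterns, and function names are all hypothetical.

```python
import math


def p_correct(theta, a, b):
    """2PL probability of a correct response (a: discrimination, b: difficulty)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def log_likelihood(theta, responses, items):
    """Log-likelihood of a binary response pattern at ability theta."""
    total = 0.0
    for u, (a, b) in zip(responses, items):
        p = p_correct(theta, a, b)
        total += math.log(p) if u == 1 else math.log(1.0 - p)
    return total


def ml_ability(responses, items, lo=-4.0, hi=4.0, steps=801):
    """Maximum-likelihood ability estimate found by a simple grid search."""
    grid = [lo + i * (hi - lo) / (steps - 1) for i in range(steps)]
    return max(grid, key=lambda t: log_likelihood(t, responses, items))


# Hypothetical (discrimination, difficulty) pairs for three items.
items = [(0.5, -1.0), (1.0, 0.0), (2.0, 1.0)]

# Two hypothetical examinees with the same sum score but different patterns.
pattern_a = [1, 1, 0]  # correct on the two easier, less discriminating items
pattern_b = [0, 1, 1]  # correct on the two harder, more discriminating items

print("sum scores:", sum(pattern_a), sum(pattern_b))
print("ability estimates:", ml_ability(pattern_a, items), ml_ability(pattern_b, items))
```

Under these assumptions the two examinees are indistinguishable by sum score, while the model-based estimate separates them because it weights items by how informative they are.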
Visuospatial ability correlates with performance in simulated gynecological laparoscopy.

PubMed

Ahlborg, Liv; Hedman, Leif; Murkes, Daniel; Westman, Bo; Kjellin, Ann; Felländer-Tsai, Li; Enochsson, Lars

2011-07-01

To analyze the relationship between visuospatial ability and simulated laparoscopy performed by consultants in obstetrics and gynecology (OBGYN). This was a prospective cohort study carried out at two community hospitals in Sweden. Thirteen consultants in obstetrics and gynecology were included. They had previously independently performed 10-100 advanced laparoscopies. Participants were tested for visuospatial ability by the Mental Rotations Test version A (MRT-A). After a familiarization session and standardized instruction, all participants subsequently conducted three consecutive virtual tubal occlusions followed by three virtual salpingectomies. Performance in the simulator was measured by Total Time, Score and Ovarian Diathermy Damage. Linear regression was used to analyze the relationship between visuospatial ability and simulated laparoscopic performance. The learning curves in the simulator were assessed in order to interpret the relationship with the visuospatial ability. Visuospatial ability correlated with Total Time (r=-0.62; p=0.03) and Score (r=0.57; p=0.05) in the medium level of the virtual tubal occlusion. In the technically more advanced virtual salpingectomy the visuospatial ability correlated with Total Time (r=-0.64; p=0.02), Ovarian Diathermy Damage (r=-0.65; p=0.02) and with overall Score (r=0.64; p=0.02). Visuospatial ability appears to be related to the performance of gynecological laparoscopic procedures in a simulator. Testing visuospatial ability might be helpful when designing individual training programs. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
A prognostic scoring system for arm exercise stress testing.

PubMed

Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

2016-01-01

Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% of patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs-10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.
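The score relation quoted in the abstract above, HRR (bpm) + 7.3 × METs - 10.5 × ST depression, can be written out as a small function. This is a minimal sketch of the arithmetic only, not the authors' implementation: the function name `arm_exercise_score` and its parameter names are hypothetical, and the inputs in the usage lines are made-up values for illustration, not data from the study.

```python
def arm_exercise_score(hrr_bpm: float, mets: float, st_depression: bool) -> float:
    """Evaluate HRR (bpm) + 7.3 x METs - 10.5 x ST depression (0 = no, 1 = yes).

    hrr_bpm: 1 min heart rate recovery in beats per minute.
    mets: arm exercise capacity in resting metabolic equivalents.
    st_depression: True if ST segment depression >= 1 mm was observed.
    All names here are illustrative assumptions, not taken from the paper.
    """
    st_term = 1 if st_depression else 0
    return hrr_bpm + 7.3 * mets - 10.5 * st_term


# Illustrative inputs only (not study data).
without_st = arm_exercise_score(hrr_bpm=20, mets=5, st_depression=False)
with_st = arm_exercise_score(hrr_bpm=20, mets=5, st_depression=True)
print(without_st, with_st)
```

By construction, a larger heart rate recovery or exercise capacity raises the value, and the presence of ST depression lowers it by 10.5 points.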
Test Operations Procedure (TOP) 03-2-827 Test Procedures for Video Target Scoring Using Calibration Lights

DTIC Science & Technology

2016-04-04

This Test Operations Procedure (TOP) describes typical equipment and procedures to set up and operate a Video Target Scoring System (VTSS) to...lights. Subject terms: Video Target Scoring System, VTSS, witness screens, camera, target screen, light pole.
Semi-Quantitative Scoring of an Immunochromatographic Test for Circulating Filarial Antigen

PubMed Central

Chesnais, Cédric B.; Missamou, François; Pion, Sébastien D. S.; Bopda, Jean; Louya, Frédéric; Majewski, Andrew C.; Weil, Gary J.; Boussinesq, Michel

2013-01-01

The value of a semi-quantitative scoring of the filarial antigen test (Binax Now Filariasis card test, ICT) results was evaluated during a field survey in the Republic of Congo. One hundred and thirty-four (134) of 774 tests (17.3%) were clearly positive and were scored 1, 2, or 3; and 11 (1.4%) had questionable results. Wuchereria bancrofti microfilariae (mf) were detected in 41 of those 133 individuals with an ICT test score ≥ 1 who also had a night blood smear; none of the 11 individuals with questionable ICT results harbored night mf. Cuzick's test showed a significant trend for higher microfilarial densities in groups with higher ICT scores (P < 0.001). The ICT scores were also significantly correlated with blood mf counts. Because filarial antigen levels provide an indication of adult worm infection intensity, our results suggest that semi-quantitative reading of the ICT may be useful for grading the intensity of filarial infections in individuals and populations. PMID:24019435
The academic penalty for gaining weight: a longitudinal, change-in-change analysis of BMI and perceived academic ability in middle school students.

PubMed

Kenney, E L; Gortmaker, S L; Davison, K K; Bryn Austin, S

2015-09-01

Worse educational outcomes for obese children regardless of academic ability may begin early in the life course. This study tested whether an increase in children's relative weight predicted lower teacher- and child-perceived academic ability even after adjusting for standardized test scores. Three thousand three hundred and sixty-two children participating in the Early Childhood Longitudinal Study-Kindergarten Cohort were studied longitudinally from fifth to eighth grade. Heights, weights, standardized test scores in maths and reading, and teacher and self-ratings of ability in maths and reading were measured at each wave. Longitudinal, within-child linear regression models estimated the impact of a change in body mass index (BMI) z-score on change in normalized teacher and student ratings of ability in reading and maths, adjusting for test score. A change in BMI z-score from fifth to eighth grade was not independently associated with a change in standardized test scores. However, adjusting for standardized test scores, an increasing BMI z-score was associated with significant reductions in teachers' perceptions of girls' ability in reading (-0.12, 95% confidence interval (CI): -0.23, -0.03, P=0.03) and boys' ability in maths (-0.30, 95% CI: -0.43, -0.17, P<0.001). Among children who were overweight at fifth grade and increased in BMI z-score, there were even larger reductions in teacher ratings for boys' reading ability (-0.37, 95% CI: -0.71, -0.03, P=0.03) and in girls' self-ratings of maths ability (-0.47, 95% CI: -0.83, -0.11, P=0.01). From fifth to eighth grade, increase in BMI z-score was significantly associated with worsening teacher perceptions of academic ability for both boys and girls, regardless of objectively measured ability (standardized test scores). Future research should examine potential interventions to reduce bias and promote positive school climate.

Critical Thinking: More than Test Scores

ERIC Educational Resources Information Center

Smith, Vernon G.; Szymanski, Antonia

2013-01-01

This article is for practicing or aspiring school administrators. The demand for excellence in public education has led to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…
Transferability of Norms and Its Implication in Cross-Cultural Gifted Education: Norming Naglieri Nonverbal Ability Test (NNAT) in the Philippine Public Schools

ERIC Educational Resources Information Center

Vista, Alvin; Grantham, Tarek

2009-01-01

This is a normative study to investigate the transferability of norms from western-based intelligence tests to Filipino students. More than 2,700 Filipino sixth graders were sampled across the country and administered the Naglieri Nonverbal Ability Test (NNAT). Scores were then compared to the US normative sample. The results showed no significant…
Scores Obtained from a Simple Cognitive Test of Visuospatial Episodic Memory Performed Decades before Death Are Associated with the Ultimate Presence of Alzheimer Disease Pathology.

PubMed

Robinson, Andrew C; McNamee, Roseanne; Davidson, Yvonne S; Horan, Michael A; Snowden, Julie S; McInnes, Lynn; Pendleton, Neil; Mann, David M A

2018-04-25

Community- or population-based longitudinal studies of cognitive ability with a brain donation end point offer an opportunity to examine relationships between pathology and cognitive state prior to death. Discriminating the earliest signs of dementing disorders, such as Alzheimer disease (AD), is necessary to undertake early interventions and treatments. The neuropathological profile of brains donated from The University of Manchester Longitudinal Study of Cognition in Normal Healthy Old Age, including CERAD (Consortium to Establish a Registry for Alzheimer's Disease) and Braak stage, was assessed by immunohistochemistry. Cognitive test scores collected 20 years prior to death were correlated with the extent of AD pathology present at death. Baseline scores from the Memory Circle test had the ability to distinguish between individuals who developed substantial AD pathology and those with no, or low, AD pathology. Predicted test scores at the age of 65 years also discriminated between these pathology groups. The addition of APOE genotype further improved the discriminatory ability of the model. The results raise the possibility of identifying individuals at future risk of the neuropathological changes associated with AD over 20 years before death using a simple cognitive test. This work may facilitate early interventions, therapeutics and treatments for AD by identifying at-risk and minimally affected (in pathological terms) individuals. © 2018 S. Karger AG, Basel.
Developing an Academic Ability Scale for the Kuder Occupational Interest Survey.

ERIC Educational Resources Information Center

Figel, William J.

Earlier studies had shown that differences in measured interests are related to differences in scores on tests of academic ability. Specifically, scores on the college major interest scales of the Kuder Occupational Interest Survey (KOIS) were found to be related to scores on the National Merit Scholarship Qualifying Test (NMSQT). This suggested…
GalaxyDock BP2 score: a hybrid scoring function for accurate protein-ligand docking

NASA Astrophysics Data System (ADS)

Baek, Minkyung; Shin, Woong-Hee; Chung, Hwan Won; Seok, Chaok

2017-07-01

Protein-ligand docking is a useful tool for providing atomic-level understanding of protein functions in nature and design principles for artificial ligands or proteins with desired properties. The ability to identify the true binding pose of a ligand to a target protein among numerous possible candidate poses is an essential requirement for successful protein-ligand docking. Many previously developed docking scoring functions were trained to reproduce experimental binding affinities and were also used for scoring binding poses. However, in this study, we developed a new docking scoring function, called GalaxyDock BP2 Score, by directly training the scoring power of binding poses. This function is a hybrid of physics-based, empirical, and knowledge-based score terms that are balanced to strengthen the advantages of each component. The performance of the new scoring function exhibits significant improvement over existing scoring functions in decoy pose discrimination tests. In addition, when the score is used with the GalaxyDock2 protein-ligand docking program, it outperformed other state-of-the-art docking programs in docking tests on the Astex diverse set, the Cross2009 benchmark set, and the Astex non-native set. GalaxyDock BP2 Score and GalaxyDock2 with this score are freely available at http://galaxy.seoklab.org/softwares/galaxydock.html.
The Black-White Test Score Gap.

ERIC Educational Resources Information Center

Jencks, Christopher, Ed.; Phillips, Meredith, Ed.

The 15 chapters of this book address issues related to the continuing test score gap between black and white students. The editors argue against traditional explanations which emphasize differences in economic resources and demographic factors, and they urge that more emphasis be put on psychological and cultural factors. The book suggests studies…
Ability Tests? A Shot in the Dark

ERIC Educational Resources Information Center

Petrovsky, Arthur V.

1973-01-01

Several areas of controversy between Soviet psychologists and their Western colleagues concerning the usefulness of tests for measurement of mental ability are noted in this article. The author outlines test procedures suggested by Russian psychologist Lev Vygotsky. (SM)
Test Takers and the Validity of Score Interpretations

ERIC Educational Resources Information Center

Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

2016-01-01

This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…
Intuitive Sense of Number Correlates With Math Scores on College-Entrance Examination

PubMed Central

Libertus, Melissa E.; Odic, Darko; Halberda, Justin

2012-01-01

Many educated adults possess exact mathematical abilities in addition to an approximate, intuitive sense of number, often referred to as the Approximate Number System (ANS). Here we investigate the link between ANS precision and mathematics performance in adults by testing participants on an ANS-precision test and collecting their scores on the Scholastic Aptitude Test (SAT), a standardized college-entrance exam in the USA. In two correlational studies, we found that ANS precision correlated with SAT-Quantitative (i.e., mathematics) scores. This relationship remained robust even when controlling for SAT-Verbal scores, suggesting a small but specific relationship between our primitive sense for number and formal mathematical abilities. PMID:23098904
21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

Code of Federal Regulations, 2011 CFR

2011-04-01

... Immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum or...
ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

ERIC Educational Resources Information Center

Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

2016-01-01

Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…
Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

ERIC Educational Resources Information Center

Pellicer-Sanchez, Ana; Schmitt, Norbert

2012-01-01

Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…
Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

PubMed

Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

2016-03-01

This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
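The per-group statistic named in the abstract above is the ordinary Pearson correlation coefficient between two paired lists of scores. A minimal sketch of that computation follows; the function name `pearson_r` and the sample numbers are hypothetical and are not data from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sums of cross-products and of squared deviations from the means.
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


# Made-up written-test and tutor-assessment scores for five students.
written = [62, 70, 75, 81, 90]
tutor = [3.0, 3.5, 3.2, 4.1, 4.4]
print(pearson_r(written, tutor))
```

Computing the coefficient separately for each tutor group, as the study describes, amounts to calling the function once per group on that group's paired scores.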
The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

PubMed

Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

2017-10-01

Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between
The Contributions of Memory and Vocabulary to Non-Verbal Ability Scores in Adolescents with Intellectual Disability

PubMed Central

Mungkhetklang, Chantanee; Bavin, Edith L.; Crewther, Sheila G.; Goharpey, Nahal; Parsons, Carl

2016-01-01

It is usually assumed that performance on non-verbal intelligence tests reflects visual cognitive processing and that aspects of working memory (WM) will be involved. However, the unique contribution of memory to non-verbal scores is not clear, nor is the unique contribution of vocabulary. Thus, we aimed to investigate these contributions. Non-verbal test scores for 17 individuals with intellectual disability (ID) and 39 children with typical development (TD) of similar mental age were compared to determine the unique contribution of visual and verbal short-term memory (STM) and WM and the additional variance contributed by vocabulary scores. No significant group differences were found in the non-verbal test scores or receptive vocabulary scores, but there was a significant difference in expressive vocabulary. Regression analyses indicate that for the TD group STM and WM (both visual and verbal) contributed similar variance to the non-verbal scores. For the ID group, visual STM and verbal WM contributed most of the variance to the non-verbal test scores. The addition of vocabulary scores to the model contributed greater variance for both groups. More unique variance was contributed by vocabulary than memory for the TD group, whereas for the ID group memory contributed more than vocabulary. Visual and auditory memory and vocabulary contributed significantly to solving visual non-verbal problems for both the TD group and the ID group. However, for each group, there were different weightings of these variables. Our findings indicate that for individuals with TD, vocabulary is the major factor in solving non-verbal problems, not memory, whereas for adolescents with ID, visual STM, and verbal WM are more influential than vocabulary, suggesting different pathways to achieve solutions to non-verbal problems. PMID:28082922
An Empirical Comparison of Two-Stage and Pyramidal Adaptive Ability Testing.

ERIC Educational Resources Information Center

Larkin, Kevin C.; Weiss, David J.

A 15-stage pyramidal test and a 40-item two-stage test were constructed and administered by computer to 111 college undergraduates. The two-stage test was found to utilize a smaller proportion of its potential score range than the pyramidal test. Score distributions for both tests were positively skewed but not significantly different from the…
Observed-Score Equating as a Test Assembly Problem.

ERIC Educational Resources Information Center

van der Linden, Wim J.; Luecht, Richard M.

1998-01-01

Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
Parkinson's disease and driving ability

PubMed Central

Singh, Rajiv; Pentland, Brian; Hunter, John; Provan, Frances

2007-01-01

Objectives: To explore the driving problems associated with Parkinson's disease (PD) and to ascertain whether any clinical features or tests predict driver safety. Methods: The driving ability of 154 individuals with PD referred to a driving assessment centre was determined by a combination of clinical tests, reaction times on a test rig and an in‐car driving test. Results: The majority of cases (104, 66%) were able to continue driving although 46 individuals required an automatic transmission and 10 others needed car modifications. Ability to drive was predicted by the severity of physical disease, age, presence of other associated medical conditions, particularly dementia, duration of disease, brake reaction time on a test rig and score on a driving test (all p<0.001). The level of drug treatment and the length of driving history were not correlated. Discriminant analysis revealed that the most important features in distinguishing safety to drive were severe physical disease (Hoehn and Yahr stage 3), reaction time, moderate disease associated with another medical condition and high score on car testing. Conclusions: Most individuals with PD are safe to drive, although many benefit from car modifications or from using an automatic transmission. A combination of clinical tests and in‐car driving assessment will establish safety to drive, and a number of clinical correlates can be shown to predict the likely outcome and may assist in the decision process. This is the largest series of consecutive patients seen at a driving assessment centre reported to date, and the first to devise a scoring system for on‐road driving assessment. PMID:17178820
The Impact of Conditional Scores on the Performance of DETECT.

ERIC Educational Resources Information Center

Zhang, Yanwei Oliver; Yu, Feng; Nandakumar, Ratna

DETECT is a nonparametric, conditional covariance-based procedure to identify dimensional structure and the degree of multidimensionality of test data. The ability composite or conditional score used to estimate conditional covariance plays a significant role in the performance of DETECT. The number correct score of all items in the test (T) and…
Comparison of trait and ability measures of emotional intelligence in medical students.

PubMed

Brannick, Michael T; Wahi, Monika M; Arce, Melissa; Johnson, Hazel-Anne; Nazian, Stanley; Goldin, Steven B

2009-11-01

Emotional intelligence (EI), the ability to perceive emotions in the self and others, and to understand, regulate and use such information in productive ways, is believed to be important in health care delivery for both recipients and providers of health care. There are two types of EI measure: ability and trait. Ability and trait measures differ in terms of both the definition of constructs and the methods of assessment. Ability measures conceive of EI as a capacity that spans the border between reason and feeling. Items on such a measure include showing a person a picture of a face and asking what emotion the pictured person is feeling; such items are scored by comparing the test-taker's response to a keyed emotion. Trait measures include a very large array of non-cognitive abilities related to success, such as self-control. Items on such measures ask individuals to rate themselves on such statements as: 'I generally know what other people are feeling.' Items are scored by giving higher scores to greater self-assessments. We compared one of each type of test with the other for evidence of reliability, convergence and overlap with personality. Year 1 and 2 medical students completed the Meyer-Salovey-Caruso Emotional Intelligence Test (MSCEIT, an ability measure), the Wong and Law Emotional Intelligence Scale (WLEIS, a trait measure) and an industry standard personality test (the Neuroticism-Extroversion-Openness [NEO] test). The MSCEIT showed problems with reliability. The MSCEIT and the WLEIS did not correlate highly with one another (overall scores correlated at 0.18). The WLEIS was more highly correlated with personality scales than the MSCEIT. Different tests that are supposed to measure EI do not measure the same thing. The ability measure was not correlated with personality, but the trait measure was correlated with personality.

Clinical implications of using the arm motor ability test in stroke rehabilitation.

PubMed

O'Dell, Michael W; Kim, Grace; Finnen, Lisa Rivera; Polistena, Caitlin

2011-05-01

To identify all published studies using the Arm Motor Ability Test (AMAT), a standardized, laboratory-based measure for selected upper extremity activities of daily living (ADLs); and to summarize its current uses and provide recommendations for its future use. An Ovid online search was performed using the terms "Arm Motor Ability Test" and "AMAT." The reference lists of all articles obtained were reviewed for additional studies not appearing in the literature search. In addition, the original manual for the use and administration of the AMAT was reviewed. All studies examining the psychometric properties of the AMAT or using the AMAT as an outcome measure were identified. Articles simply mentioning the AMAT without providing data and case reports or abstracts (other than those addressing a specific aspect of the scale of interest) were excluded. Studies were reviewed by the primary author. No formal system of quality review was used. The AMAT has been used as an outcome measure in stroke rehabilitation research examining upper extremity robotics, functional electrical stimulation, and cortical stimulation. The most recent version contains 10 ADL tasks, each of which is composed of 1 to 3 subtasks. Of the 3 domains originally proposed, only the "functional ability" domain is routinely assessed. Psychometric studies have demonstrated good reliability and at least reasonable construct validity. The instrument's sensitivity to change over time is less well established, and no recommendation can be made regarding a minimal clinically important difference. We recommend that the 10-item version of the AMAT and assessment of only the functional ability domain be adopted as standard going forward. Further research should include examination of sensitivity over time, minimal clinically important change, reliability and validity in the mid and lower range of scores, and in neurologic diagnoses other than stroke. Copyright © 2011 American Congress of Rehabilitation Medicine
Score tests for independence in semiparametric competing risks models.

PubMed

Saïd, Mériem; Ghazzali, Nadia; Rivest, Louis-Paul

2009-12-01

A popular model for competing risks postulates the existence of a latent unobserved failure time for each risk. Assuming that these underlying failure times are independent is attractive since it allows standard statistical tools for right-censored lifetime data to be used in the analysis. This paper proposes simple independence score tests for the validity of this assumption when the individual risks are modeled using semiparametric proportional hazards regressions. It assumes that covariates are available, making the model identifiable. The score tests are derived for alternatives that specify that copulas are responsible for a possible dependency between the competing risks. The test statistics are constructed by adding to the partial likelihoods for the individual risks an explanatory variable for the dependency between the risks. A variance estimator is derived by writing the score function and the Fisher information matrix for the marginal models as stochastic integrals. Pitman efficiencies are used to compare test statistics. A simulation study and a numerical example illustrate the methodology proposed in this paper.
Cognitive ability in young adulthood predicts risk of early-onset dementia in Finnish men.

PubMed

Rantalainen, Ville; Lahti, Jari; Henriksson, Markus; Kajantie, Eero; Eriksson, Johan G; Räikkönen, Katri

2018-06-06

To test if the Finnish Defence Forces Basic Intellectual Ability Test scores at 20.1 years predicted risk of organic dementia or Alzheimer disease (AD). Dementia was defined as inpatient or outpatient diagnosis of organic dementia or AD risk derived from Hospital Discharge or Causes of Death Registers in 2,785 men from the Helsinki Birth Cohort Study, divided based on age at first diagnosis into early onset (<65 years) or late onset (≥65 years). The Finnish Defence Forces Basic Intellectual Ability Test comprises verbal, arithmetic, and visuospatial subtests and a total score (scores transformed into a mean of 100 and SD of 15). We used Cox proportional hazard models and adjusted for age at testing, childhood socioeconomic status, mother's age at delivery, parity, participant's birthweight, education, and stroke or coronary heart disease diagnosis. Lower cognitive ability total and verbal ability (hazard ratio [HR] per 1 SD disadvantage >1.69, 95% confidence interval [CI] 1.01-2.63) scores predicted higher early-onset any dementia risk across the statistical models; arithmetic and visuospatial ability scores were similarly associated with early-onset any dementia risk, but these associations weakened after covariate adjustments (HR per 1 SD disadvantage >1.57, 95% CI 0.96-2.57). All associations were rendered nonsignificant when we adjusted for participant's education. Cognitive ability did not predict late-onset dementia risk. These findings reinforce previous suggestions that lower cognitive ability in early life is a risk factor for early-onset dementia. © 2018 American Academy of Neurology.
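The abstract above mentions scores transformed to a mean of 100 and an SD of 15. One common way to obtain such a scale is a linear rescaling of z-scores, sketched below. This is an assumption about the form of the transformation: the abstract does not say which reference mean and SD were used, so the sketch standardizes against the supplied sample itself, and the name `to_standard_scale` and the raw values are hypothetical.

```python
from statistics import mean, pstdev


def to_standard_scale(raw_scores, target_mean=100.0, target_sd=15.0):
    """Linearly rescale scores so the sample has the target mean and SD.

    Assumption: standardization uses the mean and population SD of the
    supplied sample; a real test would typically use a norming sample.
    """
    m = mean(raw_scores)
    s = pstdev(raw_scores)
    if s == 0:
        raise ValueError("cannot standardize scores with zero variance")
    return [target_mean + target_sd * (x - m) / s for x in raw_scores]


# Made-up raw subtest totals for illustration.
raw = [12, 18, 25, 31, 40]
scaled = to_standard_scale(raw)
print([round(v, 1) for v in scaled])
```

On this scale, a hazard ratio reported "per 1 SD disadvantage" corresponds to a 15-point lower score.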
Relationships of Declining Test Scores and Grade Inflation.

ERIC Educational Resources Information Center

Bellott, Fred K.

The relationship between declining scores on national standardized tests and grade inflation is explored. Grade inflation refers to the indicated measure of evaluation of student performance having higher placement than is usual based on the performances. Data for this study were taken from the American College Testing (ACT) Program Class Profile…
D.C. Student Test Scores Show Uneven Progress. Data Snapshot

ERIC Educational Resources Information Center

DuPre, Mary

2011-01-01

Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…
Reliability of Total Test Scores When Considered as Ordinal Measurements

ERIC Educational Resources Information Center

Biswas, Ajoy Kumar

2006-01-01

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
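For orientation, Kendall's tau-a between two score vectors can be sketched as below. This shows only the underlying coefficient, not the article's ordinal reliability measure, which the truncated abstract does not spell out; the function name is an assumption.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """Kendall's tau-a: (concordant - discordant) pairs over all n(n-1)/2 pairs."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length sequences with at least 2 entries")
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
        # tied pairs count toward neither, but stay in the denominator (tau-a)
    n_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / n_pairs
```

Applied to scores from two administrations of a test, a value near 1 would indicate that examinees are ranked in nearly the same order both times.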
Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study.

PubMed

Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I

2015-11-01

Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
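The reported correlation (Pearson r=0.48 among 19 students) can be checked against its p-value with the usual t test for a correlation coefficient, t = r·sqrt((n−2)/(1−r²)) on n−2 degrees of freedom. The sketch below assumes that test was the one used, which the abstract does not confirm; the function names are illustrative, and the p-value is obtained by numerically integrating the Student t density.

```python
import math

def correlation_t_statistic(r, n):
    """t statistic for testing a Pearson correlation against zero."""
    return r * math.sqrt((n - 2) / (1.0 - r * r))

def t_pdf(t, df):
    """Density of Student's t distribution with df degrees of freedom."""
    c = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return c * (1 + t * t / df) ** (-(df + 1) / 2)

def two_sided_p(t, df, steps=2000):
    """Two-sided p-value via Simpson's rule on [0, |t|] (steps must be even)."""
    upper = abs(t)
    h = upper / steps
    total = t_pdf(0.0, df) + t_pdf(upper, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    central_area = total * h / 3  # probability mass between 0 and |t|
    return 1.0 - 2.0 * central_area

t_stat = correlation_t_statistic(0.48, 19)
p_value = two_sided_p(t_stat, df=17)
```

Under these assumptions the p-value falls just under 0.05, in line with the reported p=0.04.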
Ability evaluation by binary tests: Problems, challenges & recent advances

NASA Astrophysics Data System (ADS)

Bashkansky, E.; Turetsky, V.

2016-11-01

Binary tests designed to measure abilities of objects under test (OUTs) are widely used in different fields of measurement theory and practice. The number of test items in such tests is usually very limited. The response to each test item provides only one bit of information per OUT. The problem of correct ability assessment is even more complicated, when the levels of difficulty of the test items are unknown beforehand. This fact makes the search for effective ways of planning and processing the results of such tests highly relevant. In recent years, there has been some progress in this direction, generated by both the development of computational tools and the emergence of new ideas. The latter are associated with the use of so-called “scale invariant item response models”. Together with maximum likelihood estimation (MLE) approach, they helped to solve some problems of engineering and proficiency testing. However, several issues related to the assessment of uncertainties, replications scheduling, the use of placebo, as well as evaluation of multidimensional abilities still present a challenge for researchers. The authors attempt to outline the ways to solve the above problems.
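As a rough illustration of maximum likelihood ability estimation from binary responses, the sketch below uses the Rasch model with known item difficulties. This model is swapped in for illustration; it is not the "scale invariant item response model" the authors refer to, and all names are assumptions.

```python
import math

def rasch_ability_mle(responses, difficulties, tol=1e-8, max_iter=100):
    """Newton-Raphson MLE of ability under a Rasch model with known difficulties."""
    if len(responses) != len(difficulties):
        raise ValueError("one difficulty per response is required")
    score = sum(responses)
    if score == 0 or score == len(responses):
        raise ValueError("MLE is not finite for all-0 or all-1 response patterns")
    theta = 0.0
    for _ in range(max_iter):
        probs = [1.0 / (1.0 + math.exp(-(theta - b))) for b in difficulties]
        gradient = score - sum(probs)                 # d logL / d theta
        information = sum(p * (1 - p) for p in probs)  # minus the second derivative
        step = gradient / information
        theta += step
        if abs(step) < tol:
            break
    return theta
```

With only a handful of items, the information term is small, so the resulting estimate carries a large standard error (1/sqrt(information)), which reflects the one-bit-per-item limitation noted in the abstract.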
The Experimental Design Ability Test (EDAT)

ERIC Educational Resources Information Center

Sirum, Karen; Humburg, Jennifer

2011-01-01

Higher education goals include helping students develop evidence based reasoning skills; therefore, scientific thinking skills such as those required to understand the design of a basic experiment are important. The Experimental Design Ability Test (EDAT) measures students' understanding of the criteria for good experimental design through their…
Improving the Quality of Ability Estimates through Multidimensional Scoring and Incorporation of Ancillary Variables

ERIC Educational Resources Information Center

de la Torre, Jimmy

2009-01-01

For one reason or another, various sources of information, namely, ancillary variables and correlational structure of the latent abilities, which are usually available in most testing situations, are ignored in ability estimation. A general model that incorporates these sources of information is proposed in this article. The model has a general…
Estimating Premorbid Cognitive Abilities in Low-Educated Populations

PubMed Central

Apolinario, Daniel; Brucki, Sonia Maria Dozzi; Ferretti, Renata Eloah de Lucena; Farfel, José Marcelo; Magaldi, Regina Miksian; Busse, Alexandre Leopold; Jacob-Filho, Wilson

2013-01-01

Objective To develop an informant-based instrument that would provide a valid estimate of premorbid cognitive abilities in low-educated populations. Methods A questionnaire was drafted by focusing on the premorbid period with a 10-year time frame. The initial pool of items was submitted to classical test theory and a factorial analysis. The resulting instrument, named the Premorbid Cognitive Abilities Scale (PCAS), is composed of questions addressing educational attainment, major lifetime occupation, reading abilities, reading habits, writing abilities, calculation abilities, use of widely available technology, and the ability to search for specific information. The validation sample was composed of 132 older Brazilian adults from the following three demographically matched groups: normal cognitive aging (n = 72), mild cognitive impairment (n = 33), and mild dementia (n = 27). The scores of a reading test and a neuropsychological battery were adopted as construct criteria. Post-mortem inter-informant reliability was tested in a sub-study with two relatives from each deceased individual. Results All items presented good discriminative power, with corrected item-total correlation varying from 0.35 to 0.74. The summed score of the instrument presented high correlation coefficients with global cognitive function (r = 0.73) and reading skills (r = 0.82). Cronbach's alpha was 0.90, showing optimal internal consistency without redundancy. The scores did not decrease across the progressive levels of cognitive impairment, suggesting that the goal of evaluating the premorbid state was achieved. The intraclass correlation coefficient was 0.96, indicating excellent inter-informant reliability. Conclusion The instrument developed in this study has shown good properties and can be used as a valid estimate of premorbid cognitive abilities in low-educated populations. The applicability of the PCAS, both as an estimate of premorbid intelligence and cognitive
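The internal-consistency figure quoted above (Cronbach's alpha) is computed from item variances and the variance of the summed score. A minimal sketch of the standard formula follows, with made-up data and an illustrative function name; it is not the authors' analysis code.

```python
from statistics import variance

def cronbach_alpha(item_scores):
    """Cronbach's alpha; item_scores is a list of respondents, each a list of k item scores."""
    k = len(item_scores[0])
    if k < 2 or len(item_scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    columns = list(zip(*item_scores))  # one tuple of scores per item
    item_variance_sum = sum(variance(col) for col in columns)
    total_variance = variance([sum(row) for row in item_scores])
    return k / (k - 1) * (1.0 - item_variance_sum / total_variance)

# Made-up responses: 4 respondents x 3 items
example = [[2, 3, 3], [4, 4, 5], [1, 2, 1], [3, 3, 4]]
alpha = cronbach_alpha(example)
```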
Spatial and Visual Reasoning: Do These Abilities Improve in First-Year Veterinary Medical Students Exposed to an Integrated Curriculum?

PubMed

Gutierrez, J Claudio; Chigerwe, Munashe; Ilkiw, Jan E; Youngblood, Patricia; Holladay, Steven D; Srivastava, Sakti

Spatial visualization ability refers to the human cognitive ability to form, retrieve, and manipulate mental models of spatial nature. Visual reasoning ability has been linked to spatial ability. There is currently limited information about how entry-level spatial and visual reasoning abilities may predict veterinary anatomy performance or may be enhanced with progression through the veterinary anatomy content in an integrated curriculum. The present study made use of two tests that measure spatial ability and one test that measures visual reasoning ability in veterinary students: Guay's Visualization of Views Test, adapted version (GVVT), the Mental Rotations Test (MRT), and Raven's Advanced Progressive Matrices Test, short form (RavenT). The tests were given to the entering class of veterinary students during their orientation week and at week 32 in the veterinary medical curriculum. Mean score on the MRT significantly increased from 15.2 to 20.1, and on the RavenT significantly increased from 7.5 to 8.8. When females only were evaluated, results were similar to the total class outcome; however, all three tests showed significant increases in mean scores. A positive correlation between the pre- and post-test scores was found for all three tests. The present results should be considered preliminary at best for associating anatomic learning in an integrated curriculum with spatial and visual reasoning abilities. Other components of the curriculum, for instance histology or physiology, could also influence the improved spatial visualization and visual reasoning test scores at week 32.
Between-District Test Score Variation, 2009-2012

ERIC Educational Resources Information Center

Fahle, Erin; Reardon, Sean

2016-01-01

Describing the variation in test scores between and within school districts is critical: (1) for policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) for methodological work planning future experiments or interventions. Intraclass…
Impact of the Ability to Divide Attention on Reading Performance in Glaucoma.

PubMed

Swenor, Bonnielin K; Varadaraj, Varshini; Dave, Paulomi; West, Sheila K; Rubin, Gary S; Ramulu, Pradeep Y

2017-05-01

To determine if the ability to divide attention affects the relationship between glaucoma-related vision loss and reading speed. Better eye mean deviation (MD), contrast sensitivity (CS), and better-eye distance visual acuity (VA) were measured in 28 participants with glaucoma and 21 controls. Reading speeds were assessed using MNRead, IRest, and sustained silent reading tests (words per minute, wpm). The ability to divide attention was measured using the Brief Test of Attention (BTA; scored 0-10). Multivariable linear regression models were used to determine the relationship between visual factors and reading speeds. Effect modification by BTA score (low BTA: <7; high BTA: ≥7) was examined. Worse CS (per 0.1 log unit) was associated with slower maximum reading speed on MNRead test for participants with low BTA scores (β = -9 wpm; 95% confidence interval [CI]: -16, -2), but not for those with high BTA scores (β = -2 wpm; 95% CI: -6, +2). Similarly, for the IRest test, worse CS was associated with slower reading speeds (β = -12 wpm; 95% CI: -20, -4) among those with low, but not high BTA scores (β = -4 wpm; 95% CI: -10, +2). For the sustained silent reading test, glaucoma status (versus controls), worse visual field (VF) MD (per 5 dB), and worse CS were associated with 39%, 21%, and 19% slower reading speeds, respectively, for those with low BTA scores (P < 0.05), but these associations were not significant among those with high BTA scores (P > 0.1 for all). Decreased ability to divide attention, indicated by lower BTA scores, is associated with slower reading speeds in glaucoma with reduced CS and VF defects.
Performance on a virtual reality angled laparoscope task correlates with spatial ability of trainees.

PubMed

Rosenthal, Rachel; Hamel, Christian; Oertli, Daniel; Demartines, Nicolas; Gantert, Walter A

2010-08-01

The aim of the present study was to investigate whether trainees' performance on a virtual reality angled laparoscope navigation task correlates with scores obtained on a validated conventional test of spatial ability. 56 participants of a surgery workshop performed an angled laparoscope navigation task on the Xitact LS 500 virtual reality Simulator. Performance parameters were correlated with the score of a validated paper-and-pencil test of spatial ability. Performance at the conventional spatial ability test significantly correlated with performance at the virtual reality task for overall task score (p < 0.001), task completion time (p < 0.001) and economy of movement (p = 0.035), not for endoscope travel speed (p = 0.947). In conclusion, trainees' performance in a standardized virtual reality camera navigation task correlates with their innate spatial ability. This VR session holds potential to serve as an assessment tool for trainees.
[Appraisal of occupational stress and work ability].

PubMed

Yang, Xinwei; Wang, Zhiming; Lan, Yajia; Wang, Mianzhen

2004-01-01

This study was conducted to assess occupational stress and work ability. A test of occupational stress and work ability was carried out with the revised occupational stress inventory (OSI-R) and work ability index (WAI) for 2270 workers. (1) Occupational stress and strain were significantly higher in males than in females, but self-care and social support were higher in females than in males (P < 0.01). The level of occupational stress and strain, except interpersonal strain, increased with age, while work ability decreased (P < 0.05). (2) Among the 6 items of the occupational role questionnaire, the scores for role boundary and responsibility were obviously higher in those with college education (P < 0.05). The scores for occupational role, psychological strain, and physical strain were higher in married and divorced workers than in unmarried workers (P < 0.05). (3) The scores for occupational role and strain in the good work ability category were significantly lower than in the others, but personal resources were higher (P < 0.05). (4) The correlations of work ability with occupational stress, strain, and personal resources were significant (P < 0.01); occupational role and personal strain were positively correlated, and both correlated negatively with personal resources (P < 0.01). (5) The major influential factors of personal strain were age, recreation, self-care, social support, rational/cognitive, role insufficiency, role ambiguity and role boundary.
The Persisting Racial Scoring Gap on Graduate and Professional School Admission Tests.

ERIC Educational Resources Information Center

Journal of Blacks in Higher Education, 2003

2003-01-01

Discusses the racial scoring gap on tests for admission to medical, business, law, and other graduate programs, noting that in the highest-scoring brackets on the Medical College Admission Test (MCAT), the racial gap is even larger. Whites are five times, twelve times, and seven times more likely, respectively, to score higher on the MCAT, Law…
Academic self-concept of ability and cortisol reactivity.

PubMed

Minkley, N; Westerholt, D M; Kirchner, W H

2014-05-01

The present study aimed to clarify the relationship between a school-specific trait (academic self-concept of ability [ASCA]) and hormonal stress response by using a trait-compatible stressor (test). First, we determined 52 students' ASCA scores for biology and measured their salivary cortisol concentration before and after a biology test (experimental group, n=28) or a free writing task (control group, n=24). For participants who took the test, statistical analysis indicated a significant negative correlation between ASCA score and cortisol response. In contrast, the control group showed a decrease in cortisol concentrations between test times, and no correlation between cortisol concentration and ASCA scores was found. These findings indicated an interaction between ASCA scores and hormonal stress response when an academic-related stressor is present. Furthermore, these variables might influence each other adversely: high cortisol concentrations during a test situation may lead to greater feelings of insecurity, resulting in low ASCA scores, and awareness of these low scores may lead to a further increase in cortisol, creating a vicious cycle.
Number sense in infancy predicts mathematical abilities in childhood.

PubMed

Starr, Ariel; Libertus, Melissa E; Brannon, Elizabeth M

2013-11-05

Human infants in the first year of life possess an intuitive sense of number. This preverbal number sense may serve as a developmental building block for the uniquely human capacity for mathematics. In support of this idea, several studies have demonstrated that nonverbal number sense is correlated with mathematical abilities in children and adults. However, there has been no direct evidence that infant numerical abilities are related to mathematical abilities later in childhood. Here, we provide evidence that preverbal number sense in infancy predicts mathematical abilities in preschool-aged children. Numerical preference scores at 6 months of age correlated with both standardized math test scores and nonsymbolic number comparison scores at 3.5 years of age, suggesting that preverbal number sense facilitates the acquisition of numerical symbols and mathematical abilities. This relationship held even after controlling for general intelligence, indicating that preverbal number sense imparts a unique contribution to mathematical ability. These results validate the many prior studies purporting to show number sense in infancy and support the hypothesis that mathematics is built upon an intuitive sense of number that predates language.
A Confirmatory Factor Analysis of Cattell-Horn-Carroll Theory and Cross-Age Invariance of the Woodcock-Johnson Tests of Cognitive Abilities III

ERIC Educational Resources Information Center

Taub, Gordon E.; McGrew, Kevin S.

2004-01-01

Establishing an instrument's factorial invariance provides the empirical foundation to compare an individual's score across time or to examine the pattern of correlations between variables in differentiated age groups. In the recently published Woodcock-Johnson Tests of Cognitive Ability (WJ COG) and Achievement (WJ ACH) Third Edition (III) the…

Comparability of IQ Scores on Five Widely Used Intelligence Tests

ERIC Educational Resources Information Center

Hieronymus, A. N.; Stroud, James B.

1969-01-01

Attempts to fill research gap on testing by obtaining comparisons of deviation scores, at grade levels four, seven, and ten, from the California Test of Mental Maturity, Henmon-Nelson Tests, and Lorge-Thorndike Intelligence tests. Results tabulated. (CJ)
Reading Ability and Print Exposure: Item Response Theory Analysis of the Author Recognition Test

PubMed Central

Moore, Mariah; Gordon, Peter C.

2015-01-01

In the Author Recognition Test (ART) participants are presented with a series of names and foils and are asked to indicate which ones they recognize as authors. The test is a strong predictor of reading skill, with this predictive ability generally explained as occurring because author knowledge is likely acquired through reading or other forms of print exposure. This large-scale study (1012 college student participants) used Item Response Theory (IRT) to analyze item (author) characteristics to facilitate identification of the determinants of item difficulty, provide a basis for further test development, and to optimize scoring of the ART. Factor analysis suggests a potential two factor structure of the ART differentiating between literary vs. popular authors. Effective and ineffective author names were identified so as to facilitate future revisions of the ART. Analyses showed that the ART is a highly significant predictor of time spent encoding words as measured using eye-tracking during reading. The relationship between the ART and time spent reading provided a basis for implementing a higher penalty for selecting foils, rather than the standard method of ART scoring (names selected minus foils selected). The findings provide novel support for the view that the ART is a valid indicator of reading volume. Further, they show that frequency data can be used to select items of appropriate difficulty and that frequency data from corpora based on particular time periods and types of text may allow test adaptation for different populations. PMID:25410405
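The standard ART score described above (names selected minus foils selected) and a variant with a heavier foil penalty can be sketched as follows. The penalty weight, the function name, and the placeholder names are hypothetical; the abstract does not give the weight the authors adopted.

```python
def art_score(selected, authors, foils, foil_penalty=1.0):
    """ART score: authors correctly selected minus a weighted count of foils selected.

    foil_penalty=1.0 reproduces the standard scoring; larger values penalize
    foil selections more heavily (the value is an assumption, not from the paper).
    """
    chosen = set(selected)
    hits = len(chosen & set(authors))
    false_alarms = len(chosen & set(foils))
    return hits - foil_penalty * false_alarms

# Placeholder item names, not actual ART items
authors = {"Author A", "Author B", "Author C"}
foils = {"Foil X", "Foil Y"}
picked = ["Author A", "Author B", "Foil X"]

standard = art_score(picked, authors, foils)                    # 2 hits - 1 foil
penalized = art_score(picked, authors, foils, foil_penalty=2.0)  # 2 hits - 2 * 1 foil
```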
Do Self-Efficacy and Ability Self-Estimate Scores Reflect Distinct Facets of Ability Judgments?

ERIC Educational Resources Information Center

Hansen, Jo-Ida C.; Bubany, Shawn T.

2008-01-01

Vocational psychology has generated a number of concepts and assessment instruments considered to reflect ability self-concept (i.e., one's view of one's own abilities) relevant to career development. These concepts and measures often are categorized as either self efficacy beliefs or self-estimated (i.e., self-rated, self-evaluated) abilities.…
Emotional Intelligence Abilities and Traits in Different Career Paths

ERIC Educational Resources Information Center

Kafetsios, Konstantinos; Maridaki-Kassotaki, Aikaterini; Zammuner, Vanda L.; Zampetakis, Leonidas A.; Vouzas, Fotios

2009-01-01

Two studies tested hypotheses about differences in emotional intelligence (EI) abilities and traits between followers of different career paths. Compared to their social science peers, science students had higher scores in adaptability and general mood traits measured with the Emotion Quotient Inventory, but lower scores in strategic EI abilities…
Socioeconomic Position Across the Life Course and Cognitive Ability Later in Life: The Importance of Considering Early Cognitive Ability.

PubMed

Foverskov, Else; Mortensen, Erik Lykke; Holm, Anders; Pedersen, Jolene Lee Masters; Osler, Merete; Lund, Rikke

2017-11-01

Investigate direct and indirect associations between markers of socioeconomic position (SEP) across the life course and midlife cognitive ability while addressing methodological limitations in prior work. Longitudinal data from the Danish Metropolit cohort of men born in 1953 (N = 2,479) who completed ability tests at age 12, 18, and 56-58 linked to register-based information on paternal occupational class, educational attainment, and occupational level. Associations were assessed using structural equation models, and different models were estimated to examine the importance of accounting for childhood ability and measurement error. Associations between adult SEP measures and midlife ability decreased significantly when adjusting for childhood ability and measurement error. The association between childhood and midlife ability was by far the strongest. The impact of adult SEP on later life ability may be exaggerated when not accounting for the stability of individual differences in cognitive ability and measurement error in test scores.
Exploring visuospatial abilities and their contribution to constructional abilities and nonverbal intelligence.

PubMed

Trojano, Luigi; Siciliano, Mattia; Cristinzio, Chiara; Grossi, Dario

2018-01-01

The present study aimed at exploring relationships among the visuospatial tasks included in the Battery for Visuospatial Abilities (BVA), and at assessing the relative contribution of different facets of visuospatial processing on tests tapping constructional abilities and nonverbal abstract reasoning. One hundred forty-four healthy subjects with a normal score on the Mini Mental State Examination completed the BVA plus Raven's Coloured Progressive Matrices and the Constructional Apraxia test. We used Principal Axis Factoring and Parallel Analysis to investigate relationships among the BVA visuospatial tasks, and performed regression analyses to assess the visuospatial contribution to constructional abilities and nonverbal abstract reasoning. Principal Axis Factoring and Parallel Analysis revealed two eigenvalues exceeding 1, accounting for about 60% of the variance. A 2-factor model provided the best fit. Factor 1 included subtests exploring "complex" visuospatial skills, whereas Factor 2 included two subtests tapping "simple" visuospatial skills. Regression analyses revealed that both Factor 1 and Factor 2 significantly affected performance on Raven's Coloured Progressive Matrices, whereas only Factor 1 affected performance on the Constructional Apraxia test. Our results supported the functional segregation proposed by De Renzi, suggesting caution in relying on a single test to assess the visuospatial domain, and qualified the visuospatial contribution to drawing and nonverbal intelligence tests.
Laterality, spatial abilities, and accident proneness.

PubMed

Voyer, Susan D; Voyer, Daniel

2015-01-01

Although handedness as a measure of cerebral specialization has been linked to accident proneness, more direct measures of laterality are rarely considered. The present study aimed to fill that gap in the existing research. In addition, individual difference factors in accident proneness were further examined with the inclusion of mental rotation and navigation abilities measures. One hundred and forty participants were asked to complete the Mental Rotations Test, the Santa Barbara Sense of Direction scale, the Greyscales task, the Fused Dichotic Word Test, the Waterloo Handedness Questionnaire, and a grip strength task before answering questions related to number of accidents in five areas. Results indicated that handedness scores, absolute visual laterality score, absolute response time on the auditory laterality index, and navigation ability were significant predictors of the total number of accidents. Results are discussed with respect to cerebral hemispheric specialization and risk-taking attitudes and behavior.
Teacher Use of Achievement Test Score Data

ERIC Educational Resources Information Center

Miller, Steven C.

2012-01-01

The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which eighth-grade…
Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

ERIC Educational Resources Information Center

Kim, Seonghoon

2013-01-01

With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…
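The original Lord-Wingersky recursion for dichotomous items, which this article generalizes, can be sketched as below. The code covers only the number-correct case with 0/1 item scores, not the real-number item scores treated in the article; the per-item probabilities are taken as given at a fixed proficiency, and the function name is an assumption.

```python
def lord_wingersky(p_correct):
    """Distribution of the number-correct score given per-item success probabilities.

    p_correct[i] is the probability of a correct response to item i at a fixed
    proficiency. Returns a list whose entry s is P(score == s).
    """
    dist = [1.0]  # before any item is added, the score is 0 with certainty
    for p in p_correct:
        updated = [0.0] * (len(dist) + 1)
        for score, prob in enumerate(dist):
            updated[score] += prob * (1.0 - p)  # item answered incorrectly
            updated[score + 1] += prob * p      # item answered correctly
        dist = updated
    return dist

distribution = lord_wingersky([0.5, 0.5])
```

Each item is folded in once, so the cost grows with the square of the number of items instead of enumerating all 2^n response patterns.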
Longitudinal Assessment of Intellectual Abilities of Children with Williams Syndrome: Multilevel Modeling of Performance on the Kaufman Brief Intelligence Test--Second Edition

ERIC Educational Resources Information Center

Mervis, Carolyn B.; Kistler, Doris J.; John, Angela E.; Morris, Colleen A.

2012-01-01

Multilevel modeling was used to address the longitudinal stability of standard scores (SSs) measuring intellectual ability for children with Williams syndrome (WS). Participants were 40 children with genetically confirmed WS who completed the Kaufman Brief Intelligence Test--Second Edition (KBIT-2; A. S. Kaufman & N. L. Kaufman, 2004) 4-7…
Impact of the Ability to Divide Attention on Reading Performance in Glaucoma

PubMed Central

Swenor, Bonnielin K.; Varadaraj, Varshini; Dave, Paulomi; West, Sheila K.; Rubin, Gary S.; Ramulu, Pradeep Y.

2017-01-01

Purpose To determine if the ability to divide attention affects the relationship between glaucoma-related vision loss and reading speed. Methods Better eye mean deviation (MD), contrast sensitivity (CS), and better-eye distance visual acuity (VA) were measured in 28 participants with glaucoma and 21 controls. Reading speeds were assessed using MNRead, IRest, and sustained silent reading tests (words per minute, wpm). The ability to divide attention was measured using the Brief Test of Attention (BTA; scored 0–10). Multivariable linear regression models were used to determine the relationship between visual factors and reading speeds. Effect modification by BTA score (low BTA: <7; high BTA: ≥7) was examined. Results Worse CS (per 0.1 log unit) was associated with slower maximum reading speed on MNRead test for participants with low BTA scores (β = −9 wpm; 95% confidence interval [CI]: −16, −2), but not for those with high BTA scores (β = −2 wpm; 95% CI: −6, +2). Similarly, for the IRest test, worse CS was associated with slower reading speeds (β = −12 wpm; 95% CI: −20, −4) among those with low, but not high BTA scores (β = −4 wpm; 95% CI: −10, +2). For the sustained silent reading test, glaucoma status (versus controls), worse visual field (VF) MD (per 5 dB), and worse CS were associated with 39%, 21%, and 19% slower reading speeds, respectively, for those with low BTA scores (P < 0.05), but these associations were not significant among those with high BTA scores (P > 0.1 for all). Conclusions Decreased ability to divide attention, indicated by lower BTA scores, is associated with slower reading speeds in glaucoma with reduced CS and VF defects. PMID:28460047
Measuring intellectual ability in children with cerebral palsy: can we do better?

PubMed

Sherwell, Sarah; Reid, Susan M; Reddihough, Dinah S; Wrennall, Jacquie; Ong, Ben; Stargatt, Robyn

2014-10-01

Standard intelligence tests such as the WPPSI-III have limitations when testing children with motor impairment. This study aimed to determine the proportion of children with cerebral palsy with sufficient verbal and motor skills to complete the WPPSI-III, to determine their comparative ability to complete tasks with and without a significant motor component, and to investigate short forms of the WPPSI-III as alternatives. Participants were 78 of 235 eligible 4-5 year old children with cerebral palsy resident in the Australian state of Victoria. Verbal IQ (VIQ), Performance IQ (PIQ), and Full-scale IQ (FSIQ) were determined using the WPPSI-III. Initial screening for pointing and verbal abilities determined which tests were attempted. The impact of speed was investigated by comparing scores on the Block Design subtest with and without an imposed time limit. FSIQ scores were calculated from two short forms of the WPPSI-III and compared to the full form. On screening, 16 children had inadequate pointing (14) and verbal abilities (2). FSIQ was obtained in 62 (82%) children. Strong associations were seen between completion of the entire test battery and topographical pattern, level of manual ability and level of gross motor function. Scores on subtests requiring manual ability were depressed relative to other scores. Children performed better using short forms of the WPPSI-III and, for a minority, when time limits were disregarded. In summary, children with cerebral palsy often lack the fine and gross motor skills necessary to complete the WPPSI-III, scoring relatively poorly on tasks requiring a fine motor response. Using short-form estimations of FSIQ comprised of subtests without a significant fine motor component has the potential to increase a child's FSIQ by approximately 5 points. These findings have important clinical implications when assessing a child with both motor and cognitive limitations. Copyright © 2014 Elsevier Ltd. All rights reserved.
Misidentifying Factors Underlying Singapore's High Test Scores

ERIC Educational Resources Information Center

Usiskin, Zalman

2012-01-01

Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…
Comparison of Measures of Ability in Adolescents with Intellectual Disability

PubMed Central

Mungkhetklang, Chantanee; Crewther, Sheila G.; Bavin, Edith L.; Goharpey, Nahal; Parsons, Carl

2016-01-01

Finding the most appropriate intelligence test for adolescents with Intellectual Disability (ID) is challenging given their limited language, attention, perceptual, and motor skills and ability to stay on task. The study compared performance of 23 adolescents with ID on the Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV), one of the most widely used intelligence tests, and three non-verbal IQ tests, the Raven's Colored Progressive Matrices (RCPM), the Test of Non-verbal Intelligence-Fourth Edition and the Wechsler Non-verbal test of Ability. Results showed that the WISC-IV Full Scale IQ raw and scaled scores were highly correlated with total scores from the three non-verbal tests, although the correlations were higher for raw scores, suggesting they may lead to better understanding of within group differences and what individuals with ID can do at the time of assessment. All participants attempted more questions on the non-verbal tests than the verbal. A preliminary analysis showed that adolescents with ID without ASD (n = 15) achieved higher scores overall than those presenting with ID+ASD (n = 8). Our findings support the view that short non-verbal tests are more likely to give a similar IQ result as obtained from the WISC-IV. In terms of the time to administer and the stress for participants, they are more appropriate for assessing adolescents with ID. PMID:27242597
Development of an Itemwise Efficiency Scoring Method: Concurrent, Convergent, Discriminant, and Neuroimaging-Based Predictive Validity Assessed in a Large Community Sample

PubMed Central

Moore, Tyler M.; Reise, Steven P.; Roalf, David R.; Satterthwaite, Theodore D.; Davatzikos, Christos; Bilker, Warren B.; Port, Allison M.; Jackson, Chad T.; Ruparel, Kosha; Savitt, Adam P.; Baron, Robert B.; Gur, Raquel E.; Gur, Ruben C.

2016-01-01

Traditional “paper-and-pencil” testing is imprecise in measuring speed and hence limited in assessing performance efficiency, but computerized testing permits precision in measuring itemwise response time. We present a method of scoring performance efficiency (combining information from accuracy and speed) at the item level. Using a community sample of 9,498 youths age 8-21, we calculated item-level efficiency scores on four neurocognitive tests, and compared the concurrent, convergent, discriminant, and predictive validity of these scores to simple averaging of standardized speed and accuracy-summed scores. Concurrent validity was measured by the scores' abilities to distinguish men from women and their correlations with age; convergent and discriminant validity were measured by correlations with other scores inside and outside of their neurocognitive domains; predictive validity was measured by correlations with brain volume in regions associated with the specific neurocognitive abilities. Results provide support for the ability of itemwise efficiency scoring to detect signals as strong as those detected by standard efficiency scoring methods. We find no evidence of superior validity of the itemwise scores over traditional scores, but point out several advantages of the former. The itemwise efficiency scoring method shows promise as an alternative to standard efficiency scoring methods, with overall moderate support from tests of four different types of validity. This method allows the use of existing item analysis methods and provides the convenient ability to adjust the overall emphasis of accuracy versus speed in the efficiency score, thus adjusting the scoring to the real-world demands the test is aiming to fulfill. PMID:26866796
A weighted generalized score statistic for comparison of predictive values of diagnostic tests

PubMed Central

Kosinski, Andrzej S.

2013-01-01

Positive and negative predictive values are important measures of a medical diagnostic test's performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates the empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for the difference of predictive values. The introduced concepts have the potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343
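As background for the record above, the two quantities being compared can be computed directly from a 2x2 table of test result against true disease status. The sketch below is only an illustration of the standard definitions (PPV = TP / (TP + FP), NPV = TN / (TN + FN)); the function name and the counts are hypothetical, and it does not implement the Wald or weighted generalized score statistics described in the abstract.

```python
def predictive_values(tp, fp, fn, tn):
    """Return (PPV, NPV) from the four cells of a 2x2 diagnostic table.

    tp, fp: diseased / non-diseased patients with a positive test result
    fn, tn: diseased / non-diseased patients with a negative test result
    """
    ppv = tp / (tp + fp)  # share of positive results that are true positives
    npv = tn / (tn + fn)  # share of negative results that are true negatives
    return ppv, npv


# Hypothetical counts, chosen only for illustration.
ppv, npv = predictive_values(tp=90, fp=30, fn=10, tn=170)
print(f"PPV = {ppv:.3f}, NPV = {npv:.3f}")
```

In a paired design both tests are applied to the same patients, so the two PPVs (or NPVs) are correlated; that correlation is what the test statistics in the abstract account for and what this sketch leaves out.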
Using Raters from India to Score a Large-Scale Speaking Test

ERIC Educational Resources Information Center

Xi, Xiaoming; Mollaun, Pam

2011-01-01

We investigated the scoring of the Speaking section of the Test of English as a Foreign Language™ Internet-based (TOEFL iBT®) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…
The impact of testing accommodations on MCAT scores: descriptive results.

PubMed

Julian, Ellen R; Ingersoll, Deborah J; Etienne, Patricia M; Hilger, Anthony E

2004-04-01

Medical College Admission Test (MCAT) examinees with disabilities who receive accommodations receive flagged scores indicating nonstandard administration. This report compares MCAT examinees who received accommodations, and their performances, with standard examinees. Aggregate history records of all 1994-2000 MCAT examinees were identified as flagged (2,401) or standard (297,880), then further sorted by race/ethnicity (broadly identified as underrepresented minority and non-URM, at the time of testing) and gender. Those with flagged scores were also classified by disability (LD = learning disability, ADHD = attention deficit hyperactivity disorder, LD/ADHD = learning disability and attention deficit hyperactivity disorder, and Other = other disability) and type of accommodation. Mean MCAT scores were calculated for all groups. A group of 866 examinees took the MCAT first as a standard administration and subsequently with accommodations. In a separate analysis, their two sets of scores were compared. Fewer than 1% of examinees (2,401) had accommodations; of these, 55% were LD, 17% ADHD, 5% LD/ADHD, and 23% Other. Extended time was the most frequently provided accommodation. Mean flagged scores slightly exceeded mean standard scores on all MCAT sections. Examinees who retook the MCAT with accommodations after a standard administration increased their scores by six points, quadrupling the average gain of a Standard-Standard retest cohort from another study. The small but statistically significantly higher flagged scores may reflect either appropriate compensation or overly generous accommodations. Extended time had a positive impact on the scores of those who retested with this accommodation. The validity of the flagged MCAT in predicting success in medical school is not known, and further investigation is underway.
[Study on relationship between fatigue and work ability in chemistry workers].

PubMed

Wu, Si-Ying; Wang, Mian-Zhen; Wang, Zhi-Ming; Lan, Ya-Jia

2005-01-01

To explore the relationship between fatigue and work ability in 976 chemistry workers. Fatigue and work ability were assessed in 976 workers with a fatigue scale and the work ability index (WAI); other factors influencing work ability (such as work environment, labor load, and job factors) were investigated by questionnaire. (1) The frequency of fatigue among unmarried workers was significantly lower than among married workers and workers of other marital status, while the WAI score of unmarried workers was significantly higher than that of the other groups (P < 0.05); (2) the frequency of fatigue among mental workers was significantly lower than among mixed physical and mental workers, while the WAI score of mental workers was significantly higher than that of physical workers and mixed physical and mental workers (P < 0.05); (3) compared with workers free of fatigue, the other workers had lower WAI scores; (4) the fatigue score correlated negatively with the WAI score (r = -0.499, P < 0.01); (5) cumulative odds model analysis showed that, after controlling for the other risk factors, fatigue was an important risk factor for work ability (OR = 4.005). Fatigue affects work ability in chemistry workers: the higher the frequency of fatigue, the lower the WAI score.

Leveraging Gender Differences to Boost Test Scores

ERIC Educational Resources Information Center

Costello, Bill

2008-01-01

According to the 2004 National Assessment of Educational Progress, males who have made it through 12 years of school have significantly poorer reading skills than their female peers. In every age group, boys have been scoring lower than girls annually for more than three decades on U.S. Department of Education reading tests. The longer boys are in…
Toward a Nonspeech Test of Auditory Cognition: Semantic Context Effects in Environmental Sound Identification in Adults of Varying Age and Hearing Abilities

PubMed Central

Sheft, Stanley; Norris, Molly; Spanos, George; Radasevich, Katherine; Formsma, Paige; Gygi, Brian

2016-01-01

Objective: Sounds in everyday environments tend to follow one another as events unfold over time. The tacit knowledge of contextual relationships among environmental sounds can influence their perception. We examined the effect of semantic context on the identification of sequences of environmental sounds by adults of varying age and hearing abilities, with an aim to develop a nonspeech test of auditory cognition. Method: The familiar environmental sound test (FEST) consisted of 25 individual sounds arranged into ten five-sound sequences: five contextually coherent and five incoherent. After hearing each sequence, listeners identified each sound and arranged them in the presentation order. FEST was administered to young normal-hearing, middle-to-older normal-hearing, and middle-to-older hearing-impaired adults (Experiment 1), and to postlingual cochlear-implant users and young normal-hearing adults tested through vocoder-simulated implants (Experiment 2). Results: FEST scores revealed a strong positive effect of semantic context in all listener groups, with young normal-hearing listeners outperforming other groups. FEST scores also correlated with other measures of cognitive ability, and for CI users, with the intelligibility of speech-in-noise. Conclusions: Being sensitive to semantic context effects, FEST can serve as a nonspeech test of auditory cognition for diverse listener populations to assess and potentially improve everyday listening skills. PMID:27893791
Verbal ability and delinquency: testing the moderating role of psychopathic traits.

PubMed

Muñoz, Luna C; Frick, Paul J; Kimonis, Eva R; Aucoin, Katherine J

2008-04-01

Impaired verbal abilities are one of the most consistent risk factors for serious antisocial and delinquent behavior. However, individuals with psychopathic traits often show serious antisocial behavior, despite showing no impairment in their verbal abilities. Thus, the aim of the current study was to examine whether psychopathy moderates the relationship between verbal abilities and delinquent behavior in a sample of detained youth. The sample included 100 detained adolescent boys who were assessed on self-reported delinquent acts and psychopathic traits, as well as their age at first offense based on official records. Participants also completed a competitive computer task involving two levels of provocation, during which skin conductance was measured. A standard measure of receptive vocabulary was individually administered. As predicted, there was a significant interaction between callous-unemotional (CU) traits (a critical dimension of psychopathy) and verbal ability when predicting violent delinquency. Individuals who were high on CU traits with higher scores on the measure of verbal abilities reported the greatest violent delinquency. These individuals also showed the lowest level of skin conductance reactivity during the provocation task. The results suggest CU traits are an important moderator of the relation between verbal abilities and violent delinquency.
Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

ERIC Educational Resources Information Center

Lee, Yi-Hsuan; Zhang, Jinming

2017-01-01

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…
How Do Executive Functions Fit with the Cattell-Horn-Carroll Model? Some Evidence from a Joint Factor Analysis of the Delis-Kaplan Executive Function System and the Woodcock-Johnson III Tests of Cognitive Abilities

ERIC Educational Resources Information Center

Floyd, Randy G.; Bergeron, Renee; Hamilton, Gloria; Parra, Gilbert R.

2010-01-01

This study investigated the relations among executive functions and cognitive abilities through a joint exploratory factor analysis and joint confirmatory factor analysis of 25 test scores from the Delis-Kaplan Executive Function System and the Woodcock-Johnson III Tests of Cognitive Abilities. Participants were 100 children and adolescents…
Use of the Short Physical Performance Battery Score to predict loss of ability to walk 400 meters: analysis from the InCHIANTI study.

PubMed

Vasunilashorn, Sarinnapha; Coppin, Antonia K; Patel, Kushang V; Lauretani, Fulvio; Ferrucci, Luigi; Bandinelli, Stefania; Guralnik, Jack M

2009-02-01

Early detection of mobility limitations remains an important goal for preventing mobility disability. The purpose of this study was to examine the association between the Short Physical Performance Battery (SPPB) and the loss of ability to walk 400 m, an objectively assessed mobility outcome increasingly used in clinical trials. The study sample consisted of 542 adults from the InCHIANTI study aged 65 and older, who completed the 400 m walk at baseline and had evaluations on the SPPB and 400 m walk at baseline and 3-year follow-up. Multiple logistic regression models were used to determine whether SPPB scores predict the loss of ability to walk 400 m at follow-up among persons able to walk 400 m at baseline. The 3-year incidence of failing the 400 m walk was 15.5%. After adjusting for age, sex, education, body mass index, Mini-Mental State Examination, number of medical conditions, and 400 m walk gait speed at baseline, SPPB score was significantly associated with loss of ability to walk 400 m after 3 years. Participants with SPPB scores of 10 or lower at baseline had significantly higher odds of mobility disability at follow-up (odds ratio [OR] = 3.38, 95% confidence interval [CI]: 1.32-8.65) compared with those who scored 12, with a graded response across the range of SPPB scores (OR = 26.93, 95% CI: 7.51-96.50; OR = 7.67, 95% CI: 2.26-26.04; OR = 8.28, 95% CI: 3.32-20.67 for SPPB ≤ 7, SPPB 8, and SPPB 9, respectively). The SPPB strongly predicts loss of ability to walk 400 m. Thus, using the SPPB to identify older persons at high risk of lower body functional limitations seems a valid means of recognizing individuals who would benefit most from preventive interventions.
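The odds ratios and intervals quoted in the record above can be read on the log scale, where confidence intervals from logistic regression are usually symmetric. The sketch below assumes, without confirmation from the abstract, that the reported intervals are Wald-type 95% intervals, and recovers the implied point estimate (the geometric mean of the limits) and the standard error of log(OR). The function name is hypothetical; the numbers are the ones printed in the abstract.

```python
import math

Z_95 = 1.959964  # standard normal quantile for a two-sided 95% interval


def log_scale_summary(lower, upper, z=Z_95):
    """Recover (OR, SE of log OR) from the limits of a Wald-type interval.

    Assumes the interval was built as exp(log(OR) +/- z * SE).
    """
    log_or = (math.log(lower) + math.log(upper)) / 2
    se = (math.log(upper) - math.log(lower)) / (2 * z)
    return math.exp(log_or), se


# (reported OR, lower limit, upper limit) as printed in the abstract
reported = [
    (3.38, 1.32, 8.65),     # SPPB <= 10 vs 12
    (26.93, 7.51, 96.50),   # SPPB <= 7
    (7.67, 2.26, 26.04),    # SPPB 8
    (8.28, 3.32, 20.67),    # SPPB 9
]

for or_reported, lower, upper in reported:
    or_implied, se = log_scale_summary(lower, upper)
    print(f"reported OR {or_reported:6.2f}  implied OR {or_implied:6.2f}  SE(log OR) {se:.3f}")
```

The implied estimates agree with the reported odds ratios to within rounding of the published limits, which is consistent with the log-scale assumption. The wide interval for the lowest SPPB group reflects a large standard error, so the graded response should be read with that imprecision in mind.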
Test Score Stability and the Relationship of Adult Manifest Anxiety Scale-College Version Scores to External Variables among Graduate Students

ERIC Educational Resources Information Center

Lowe, Patricia A.; Peyton, Vicki; Reynolds, Cecil R.

2007-01-01

A sample of 79 individuals participated in the present study to evaluate the test score stability (8-week test-retest interval) and construct validity of the scores of the Adult Manifest Anxiety Scale-College Version, a new measure used to assess anxiety in college students, for application to graduate-level students. Results of the study…
Number sense in infancy predicts mathematical abilities in childhood

PubMed Central

Starr, Ariel; Libertus, Melissa E.; Brannon, Elizabeth M.

2013-01-01

Human infants in the first year of life possess an intuitive sense of number. This preverbal number sense may serve as a developmental building block for the uniquely human capacity for mathematics. In support of this idea, several studies have demonstrated that nonverbal number sense is correlated with mathematical abilities in children and adults. However, there has been no direct evidence that infant numerical abilities are related to mathematical abilities later in childhood. Here, we provide evidence that preverbal number sense in infancy predicts mathematical abilities in preschool-aged children. Numerical preference scores at 6 months of age correlated with both standardized math test scores and nonsymbolic number comparison scores at 3.5 years of age, suggesting that preverbal number sense facilitates the acquisition of numerical symbols and mathematical abilities. This relationship held even after controlling for general intelligence, indicating that preverbal number sense imparts a unique contribution to mathematical ability. These results validate the many prior studies purporting to show number sense in infancy and support the hypothesis that mathematics is built upon an intuitive sense of number that predates language. PMID:24145427
Are teachers' judgements of pupils' ability influenced by body shape?

PubMed

Shackleton, N L; Campbell, T

2014-04-01

Evidence indicates that teachers can judge pupils on the basis of their physical appearance, including their body shape. Teacher bias towards obese pupils has been suggested as a potential pathway through which obese children attain relatively lower academic levels. The aim of this study was to investigate whether teachers' judgements of pupils' ability are influenced by the body shape of the child. The sample includes English, singleton children in state schools from the Millennium Cohort Study. The data were taken from the fourth wave of data collection, when the children were approximately 7 years old. In all, 5086/5072 children had teacher ability ratings of reading and maths. Logistic regression analyses were used to test whether teachers' perceptions of the child's reading and mathematics ability were influenced by the pupil's waist circumference, conditional upon cognitive test scores of reading and maths ability. After adjustment for cognitive test scores, no significant overall relationship was found between the pupil's waist circumference and the teacher's judgements of ability. No statistically significant differences were observed in the probability of being judged as above average after further adjustments were made for potential confounders. There is little evidence that teachers' judgements of pupils' ability are influenced by obesity.
An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments

ERIC Educational Resources Information Center

Dimitrov, Dimiter M.

2016-01-01

This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…
Reading ability and print exposure: item response theory analysis of the author recognition test.

PubMed

Moore, Mariah; Gordon, Peter C

2015-12-01

In the author recognition test (ART), participants are presented with a series of names and foils and are asked to indicate which ones they recognize as authors. The test is a strong predictor of reading skill, and this predictive ability is generally explained as occurring because author knowledge is likely acquired through reading or other forms of print exposure. In this large-scale study (1,012 college student participants), we used item response theory (IRT) to analyze item (author) characteristics in order to facilitate identification of the determinants of item difficulty, provide a basis for further test development, and optimize scoring of the ART. Factor analysis suggested a potential two-factor structure of the ART, differentiating between literary and popular authors. Effective and ineffective author names were identified so as to facilitate future revisions of the ART. Analyses showed that the ART is a highly significant predictor of the time spent encoding words, as measured using eyetracking during reading. The relationship between the ART and time spent reading provided a basis for implementing a higher penalty for selecting foils, rather than the standard method of ART scoring (names selected minus foils selected). The findings provide novel support for the view that the ART is a valid indicator of reading volume. Furthermore, they show that frequency data can be used to select items of appropriate difficulty, and that frequency data from corpora based on particular time periods and types of texts may allow adaptations of the test for different populations.
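The two scoring rules mentioned in the record above differ only in how heavily a selected foil counts against the participant. A minimal sketch follows, assuming a simple linear penalty; the abstract does not state the penalty weight the authors adopted, so `foil_penalty=2.0` and the placeholder item names are purely illustrative.

```python
def art_score(selected, authors, foils, foil_penalty=1.0):
    """Score one author recognition test (ART) response sheet.

    selected:     names the participant marked as authors
    authors:      set of real author names on the checklist
    foils:        set of non-author names on the checklist
    foil_penalty: points subtracted per selected foil; 1.0 gives the
                  standard score (names selected minus foils selected)
    """
    selected = set(selected)
    hits = len(selected & authors)
    false_alarms = len(selected & foils)
    return hits - foil_penalty * false_alarms


# Placeholder checklist; real ART items are actual author names and foils.
authors = {"Author A", "Author B", "Author C"}
foils = {"Foil X", "Foil Y"}
response = ["Author A", "Author B", "Foil X"]

standard = art_score(response, authors, foils)                    # 2 hits - 1 foil
penalized = art_score(response, authors, foils, foil_penalty=2.0)  # 2 hits - 2 * 1 foil
print(standard, penalized)
```

Raising the penalty lowers the scores of participants who guess liberally while leaving the scores of those who select no foils unchanged.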
Exploratory study of the relations between spatial ability and drawing from memory.

PubMed

Czarnolewski, Mark Y; Eliot, John

2012-04-01

Test scores of 119 students, attending either a public four-year college or a technical school, were related to their proportionality and detail drawing scores on the Memory for Designs Test. In regression models, the ETS Maze Tracing, Eliot-Price Mental Rotations, and Bender-Gestalt tests were consistent predictors of proportionality scores, with the latter two tests uniquely related to these. The ETS Shapes Memory Test and the Form Board Test were the strongest predictors for detail accuracy scores. The Shapes test predicted proportionality when the CTY Visual Memory Test BB was excluded. The models then provided support for the hypothesis that drawing designs from memory, a critical skill in drawing, regardless of whether one focuses on accuracy for proportionality scores or for detail scores, is jointly related to the measures of recognition, production, and traditional spatial ability measures. This study identified multifaceted skills in drawing from memory.
Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

ERIC Educational Resources Information Center

Zenisky, April L.; Hambleton, Ronald K.

2012-01-01

Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…
Testing Students with Special Educational Needs in Large-Scale Assessments – Psychometric Properties of Test Scores and Associations with Test Taking Behavior

PubMed Central

Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H.; Weinert, Sabine

2016-01-01

Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are—at the same time—measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665
Children's human figure drawings do not measure intellectual ability.

PubMed

Willcock, Emma; Imuta, Kana; Hayne, Harlene

2011-11-01

Children typically follow a well-defined series of stages as they learn to draw, but the rate at which they progress through these stages varies from child to child. Some experts have argued that these individual differences in drawing development reflect individual differences in intelligence. Here we assessed the validity of a drawing test that is commonly used to assess children's intellectual abilities. In a single study, 125 5- and 6-year-olds completed the Draw-A-Person: A Quantitative Scoring System (DAP:QSS) and the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R) or the Wechsler Abbreviated Scale of Intelligence (WASI). Although there was a statistically significant correlation between scores on the DAP:QSS and scores on the Wechsler tests, when the scores of individual children were examined, the DAP:QSS yielded a high number of false positives and false negatives for low intellectual functioning. We conclude that the DAP:QSS is not a valid measure of intellectual ability and should not be used as a screening tool.
Construction and Evaluation of Reliability and Validity of Reasoning Ability Test

ERIC Educational Resources Information Center

Bhat, Mehraj A.

2014-01-01

This paper is based on the construction and evaluation of reliability and validity of a reasoning ability test for secondary school students. In this paper an attempt was made to evaluate validity and reliability and to determine the appropriate standards to interpret the results of the reasoning ability test. The test includes 45 items to measure six types…
Flow and diffusion of high-stakes test scores.

PubMed

Marder, M; Bansal, D

2009-10-13

We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades.
Scoring systems for the Clock Drawing Test: A historical review

PubMed Central

Spenciere, Bárbara; Alves, Heloisa; Charchat-Fichman, Helenice

2017-01-01

The Clock Drawing Test (CDT) is a simple neuropsychological screening instrument that is well accepted by patients and has solid psychometric properties. Several different CDT scoring methods have been developed, but no consensus has been reached regarding which scoring method is the most accurate. This article reviews the literature on these scoring systems and the changes they have undergone over the years. Historically, different types of scoring systems emerged. Initially, the focus was on screening for dementia, and the methods were both quantitative and semi-quantitative. Later, the need for an early diagnosis called for a scoring system that can detect subtle errors, especially those related to executive function. Therefore, qualitative analyses began to be used for both differential and early diagnoses of dementia. A widely used qualitative method was proposed by Rouleau et al. (1992). Tracing the historical path of these scoring methods is important for developing additional scoring systems and furthering dementia prevention research. PMID:29213488
The Validity of Scores from the "GRE"® revised General Test for Forecasting Performance in Business Schools: Phase One. ETS GRE® Board Research Report. ETS GRE®-14-01. ETS Research Report. RR-14-17

ERIC Educational Resources Information Center

Young, John W.; Klieger, David; Bochenek, Jennifer; Li, Chen; Cline, Fred

2014-01-01

Scores from the "GRE"® revised General Test provide important information regarding the verbal and quantitative reasoning abilities and analytical writing skills of applicants to graduate programs. The validity and utility of these scores depend upon the degree to which the scores predict success in graduate and business school in…
Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

ERIC Educational Resources Information Center

Zou, Xiao-Ling; Chen, Yan-Min

2016-01-01

The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

Descriptive Statistics for Modern Test Score Distributions: Skewness, Kurtosis, Discreteness, and Ceiling Effects.

PubMed

Ho, Andrew D; Yu, Carol C

2015-06-01

Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micceri similarly showed that the normality assumption is met rarely in educational and psychological practice. In this article, the authors extend these previous analyses to state-level educational test score distributions that are an increasingly common target of high-stakes analysis and interpretation. Among 504 scale-score and raw-score distributions from state testing programs from recent years, nonnormal distributions are common and are often associated with particular state programs. The authors explain how scaling procedures from item response theory lead to nonnormal distributions as well as unusual patterns of discreteness. The authors recommend that distributional descriptive statistics be calculated routinely to inform model selection for large-scale test score data, and they illustrate consequences of nonnormality using sensitivity studies that compare baseline results to those from normalized score scales.
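The routine descriptive statistics recommended in the record above are inexpensive to compute. The sketch below uses the plain moment-based definitions (skewness = m3 / m2^1.5, excess kurtosis = m4 / m2^2 - 3, with population moments); the abstract does not say which estimators the authors used, and the score vector is hypothetical, constructed to pile up at the maximum the way a ceiling effect would.

```python
def central_moment(values, k):
    """k-th population central moment of a sequence of numbers."""
    mean = sum(values) / len(values)
    return sum((v - mean) ** k for v in values) / len(values)


def skewness(values):
    """Moment-based skewness: m3 / m2**1.5 (0 for a symmetric distribution)."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5


def excess_kurtosis(values):
    """Moment-based excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution)."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3


# Hypothetical raw scores on a 20-point test with many examinees at the ceiling.
scores = [10, 14, 17, 19, 20, 20, 20, 20]

print(f"skewness        = {skewness(scores):.3f}")
print(f"excess kurtosis = {excess_kurtosis(scores):.3f}")
```

A clearly negative skewness on data like this signals the long left tail that a ceiling produces, one of the departures from normality the abstract describes.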
The Bangor Voice Matching Test: A standardized test for the assessment of voice perception ability.

PubMed

Mühl, Constanze; Sheil, Orla; Jarutytė, Lina; Bestelmeyer, Patricia E G

2017-11-09

Recognising the identity of conspecifics is an important yet highly variable skill. Approximately 2% of the population suffers from a socially debilitating deficit in face recognition. More recently the existence of a similar deficit in voice perception has emerged (phonagnosia). Face perception tests have been readily available for years, advancing our understanding of underlying mechanisms in face perception. In contrast, voice perception has received less attention, and the construction of standardized voice perception tests has been neglected. Here we report the construction of the first standardized test for voice perception ability. Participants make a same/different identity decision after hearing two voice samples. Item Response Theory guided item selection to ensure the test discriminates between a range of abilities. The test provides a starting point for the systematic exploration of the cognitive and neural mechanisms underlying voice perception. With a high test-retest reliability (r = .86) and short assessment duration (~10 min) this test examines individual abilities reliably and quickly and therefore also has potential for use in developmental and neuropsychological populations.
Do in-training evaluation reports deserve their bad reputations? A study of the reliability and predictive ability of ITER scores and narrative comments.

PubMed

Ginsburg, Shiphra; Eva, Kevin; Regehr, Glenn

2013-10-01

Although scores on in-training evaluation reports (ITERs) are often criticized for poor reliability and validity, ITER comments may yield valuable information. The authors assessed across-rotation reliability of ITER scores in one internal medicine program, ability of ITER scores and comments to predict postgraduate year three (PGY3) performance, and reliability and incremental predictive validity of attendings' analysis of written comments. Numeric and narrative data from the first two years of ITERs for one cohort of residents at the University of Toronto Faculty of Medicine (2009-2011) were assessed for reliability and predictive validity of third-year performance. Twenty-four faculty attendings rank-ordered comments (without scores) such that each resident was ranked by three faculty. Mean ITER scores and comment rankings were submitted to regression analyses; dependent variables were PGY3 ITER scores and program directors' rankings. Reliabilities of ITER scores across nine rotations for 63 residents were 0.53 for both postgraduate year one (PGY1) and postgraduate year two (PGY2). Interrater reliabilities across three attendings' rankings were 0.83 for PGY1 and 0.79 for PGY2. There were strong correlations between ITER scores and comments within each year (0.72 and 0.70). Regressions revealed that PGY1 and PGY2 ITER scores collectively explained 25% of variance in PGY3 scores and 46% of variance in PGY3 rankings. Comment rankings did not improve predictions. ITER scores across multiple rotations showed decent reliability and predictive validity. Comment ranks did not add to the predictive ability, but correlation analyses suggest that trainee performance can be measured through these comments.
Principles and Practices of Test Score Equating. Research Report. ETS RR-10-29

ERIC Educational Resources Information Center

Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R.

2010-01-01

Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…
The Role of Test Scores in Explaining Race and Gender Differences in Wages

ERIC Educational Resources Information Center

Blackburn, McKinley L.

2004-01-01

Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…
Do candidate reactions relate to job performance or affect criterion-related validity? A multistudy investigation of relations among reactions, selection test scores, and job performance.

PubMed

McCarthy, Julie M; Van Iddekinge, Chad H; Lievens, Filip; Kung, Mei-Chuan; Sinar, Evan F; Campion, Michael A

2013-09-01

Considerable evidence suggests that how candidates react to selection procedures can affect their test performance and their attitudes toward the hiring organization (e.g., recommending the firm to others). However, very few studies of candidate reactions have examined one of the outcomes organizations care most about: job performance. We attempt to address this gap by developing and testing a conceptual framework that delineates whether and how candidate reactions might influence job performance. We accomplish this objective using data from 4 studies (total N = 6,480), 6 selection procedures (personality tests, job knowledge tests, cognitive ability tests, work samples, situational judgment tests, and a selection inventory), 5 key candidate reactions (anxiety, motivation, belief in tests, self-efficacy, and procedural justice), 2 contexts (industry and education), 3 continents (North America, South America, and Europe), 2 study designs (predictive and concurrent), and 4 occupational areas (medical, sales, customer service, and technological). Consistent with previous research, candidate reactions were related to test scores, and test scores were related to job performance. Further, there was some evidence that reactions affected performance indirectly through their influence on test scores. Finally, in no cases did candidate reactions affect the prediction of job performance by increasing or decreasing the criterion-related validity of test scores. Implications of these findings and avenues for future research are discussed.
Does breastfeeding contribute to the racial gap in reading and math test scores?

PubMed

Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

2013-10-01

The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented.
Ability-performance relationships in education and employment settings: critical tests of the more-is-better and the good-enough hypotheses.

PubMed

Arneson, Justin J; Sackett, Paul R; Beatty, Adam S

2011-10-01

The nature of the relationship between ability and performance is of critical importance for admission decisions in the context of higher education and for personnel selection. Although previous research has supported the more-is-better hypothesis by documenting linearity of ability-performance relationships, such research has not been sensitive enough to detect deviations at the top ends of the score distributions. An alternative position receiving considerable attention is the good-enough hypothesis, which suggests that although higher levels of ability may result in better performance up to a threshold, above this threshold greater ability does not translate to better performance. In this study, the nature of the relationship between cognitive ability and performance was examined throughout the score range in four large-scale data sets. Monotonicity was maintained in all instances. Contrary to the good-enough hypothesis, the ability-performance relationship was commonly stronger at the top end of the score distribution than at the bottom end.
Discrepancies between modified Medical Research Council dyspnea score and COPD assessment test score in patients with COPD

PubMed Central

Rhee, Chin Kook; Kim, Jin Woo; Hwang, Yong Il; Lee, Jin Hwa; Jung, Ki-Suck; Lee, Myung Goo; Yoo, Kwang Ha; Lee, Sang Haak; Shin, Kyeong-Cheol; Yoon, Hyoung Kyu

2015-01-01

Background and objective: According to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines, either a modified Medical Research Council (mMRC) dyspnea score of ≥2 or a chronic obstructive pulmonary disease (COPD) assessment test (CAT) score of ≥10 is considered to represent COPD patients who are more symptomatic. We aimed to identify the ideal CAT score that exhibits minimal discrepancy with the mMRC score. Methods: A receiver operating characteristic curve of the CAT score was generated for mMRC scores of 1 and 2. A concordance analysis was applied to quantify the association between the frequencies of patients categorized into GOLD groups A–D using symptom cutoff points. A κ-coefficient was calculated. Results: For an mMRC score of 2, a CAT score of 15 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.70 and 0.66, respectively (area under the receiver operating characteristic curve [AUC] 0.74; 95% confidence interval [CI], 0.70–0.77). For an mMRC score of 1, a CAT score of 10 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.77 and 0.65, respectively (AUC 0.77; 95% CI, 0.72–0.83). The κ value for concordance was highest between an mMRC score of 1 and a CAT score of 10 (0.66), followed by an mMRC score of 2 and a CAT score of 15 (0.56), an mMRC score of 2 and a CAT score of 10 (0.47), and an mMRC score of 1 and a CAT score of 15 (0.43). Conclusion: A CAT score of 10 was most concordant with an mMRC score of 1 when classifying patients with COPD into GOLD groups A–D. However, a discrepancy remains between the CAT and mMRC scoring systems. PMID:26316736
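Youden's index, used in this record to pick the CAT cutoff, is sensitivity + specificity − 1. The sketch below recomputes it from the sensitivity and specificity pairs reported in the abstract and shows how a cutoff that maximizes the index could be selected from raw data. The helper names and the small labelled data set are hypothetical and are not taken from the study.

```python
def youden_index(sensitivity, specificity):
    """Youden's J = sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


def best_cutoff(scores, is_positive, cutoffs):
    """Return (cutoff, J) with the largest J, calling 'score >= cutoff' positive.

    Assumes both classes are present, so the denominators are non-zero.
    """
    best = None
    pairs = list(zip(scores, is_positive))
    for c in cutoffs:
        tp = sum(1 for s, p in pairs if p and s >= c)
        fn = sum(1 for s, p in pairs if p and s < c)
        tn = sum(1 for s, p in pairs if not p and s < c)
        fp = sum(1 for s, p in pairs if not p and s >= c)
        j = youden_index(tp / (tp + fn), tn / (tn + fp))
        if best is None or j > best[1]:
            best = (c, j)
    return best


# Values reported in the abstract.
j_cat15_mmrc2 = youden_index(0.70, 0.66)   # about 0.36
j_cat10_mmrc1 = youden_index(0.77, 0.65)   # about 0.42

# Hypothetical toy data, for illustration only.
toy_scores = [5, 8, 12, 16, 20, 25]
toy_labels = [False, False, False, True, True, True]
toy_best = best_cutoff(toy_scores, toy_labels, cutoffs=[10, 15, 20])
```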
An Investigation into the Relationships Between Cloze Test Scores and Informal Reading Inventory Scores of Fifth Grade Pupils.

ERIC Educational Resources Information Center

Walter, Richard Barry

This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…
Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

ERIC Educational Resources Information Center

Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

2017-01-01

We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…
Students' Ability in Science: Results from a Test Development Study

ERIC Educational Resources Information Center

Akkanat, Cigdem; Gokdere, Murat

2017-01-01

Students' ability to use and manipulate scientific concepts has been widely explored; however, there is still a need to define the characteristics and nature of science ability. Also, tests and performance scales that require minimal conceptual knowledge to measure this ability are relatively uncommon. The aim of this study was to develop an…
Online pre-race education improves test scores for volunteers at a marathon.

PubMed

Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

2017-09-01

This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p < 0.001). While subjects with all levels of volunteer experience demonstrated improvement, only change among first time marathon volunteers was significantly different from the others. Subjects reporting all degree/certification levels demonstrated improvement, but no difference in improvement was found between degree/certification levels. In this follow-up to the previous year's pilot study, online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.
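The paired comparison reported above (mean post-pre difference 3.8 ± 2.7, t = 10.5, df = 53) can be roughly checked from the summary figures alone. This sketch assumes that ± 2.7 is the standard deviation of the paired differences and that n = 54, the number of subjects who completed both tests; the function name is hypothetical. The recomputed value is close to, but not identical with, the reported t, which is consistent with the mean and SD having been rounded in the abstract.

```python
import math


def paired_t_from_summary(mean_diff, sd_diff, n):
    """Paired t statistic and degrees of freedom from summary statistics.

    t = mean_diff / (sd_diff / sqrt(n)), with n - 1 degrees of freedom.
    """
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error, n - 1


# Summary values from the abstract; the SD interpretation is an assumption.
t_stat, df = paired_t_from_summary(mean_diff=3.8, sd_diff=2.7, n=54)
```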
Examining the Validity of GED[R] Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

ERIC Educational Resources Information Center

George-Ezzelle, Carol E.; Skaggs, Gary

2004-01-01

Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…
Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

PubMed Central

Hayes, Taylor R.; Petrov, Alexander A.; Sederberg, Per B.

2014-01-01

Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training—now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven’s Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future. PMID:25395695
Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

PubMed

Hayes, Taylor R; Petrov, Alexander A; Sederberg, Per B

2015-01-01

Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training, now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven's Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future.
A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

PubMed

Kosinski, Andrzej S

2013-03-15

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting.
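For readers unfamiliar with the quantities being compared, the sketch below computes positive and negative predictive values from 2×2 counts for two tests given to the same patients. It illustrates only the definitions (PPV = TP / (TP + FP), NPV = TN / (TN + FN)); it does not implement the Wald, generalized score, or WGS statistics discussed in the abstract. The counts and names are hypothetical.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV) from the cells of a 2x2 diagnostic table."""
    ppv = tp / (tp + fp)   # share of positive results that are true cases
    npv = tn / (tn + fn)   # share of negative results that are true non-cases
    return ppv, npv


# Hypothetical paired design: the same 100 patients (45 diseased, 55 not)
# receive both test A and test B.
ppv_a, npv_a = predictive_values(tp=40, fp=10, tn=45, fn=5)
ppv_b, npv_b = predictive_values(tp=36, fp=4, tn=51, fn=9)

# Differences that a formal test of equality would then evaluate.
ppv_difference = ppv_b - ppv_a
npv_difference = npv_b - npv_a
```

Because both tests are applied to the same patients, the two PPV (or NPV) estimates are correlated, which is why the abstract works within a paired design rather than treating them as independent samples.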
Motor ability and inhibitory processes in children with ADHD: a neuroelectric study.

PubMed

Hung, Chiao-Ling; Chang, Yu-Kai; Chan, Yuan-Shuo; Shih, Chia-Hao; Huang, Chung-Ju; Hung, Tsung-Min

2013-06-01

The purpose of the current study was to examine the relationship between motor ability and response inhibition using behavioral and electrophysiological indices in children with ADHD. A total of 32 participants were recruited and underwent a motor ability assessment by administering the Basic Motor Ability Test-Revised (BMAT) as well as the Go/No-Go task and event-related potential (ERP) measurements at the same time. The results indicated that the BMAT scores were positively associated with the behavioral and ERP measures. Specifically, the BMAT average score was associated with a faster reaction time and higher accuracy, whereas higher BMAT subset scores predicted a shorter P3 latency in the Go condition. Although the association between the BMAT average score and the No-Go accuracy was limited, higher BMAT average and subset scores predicted a shorter N2 and P3 latency and a larger P3 amplitude in the No-Go condition. These findings suggest that motor abilities may play roles that benefit the cognitive performance of ADHD children.
Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

NASA Astrophysics Data System (ADS)

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while an additional 7% was explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.
Proficiency Standards and Cut-Scores for Language Proficiency Tests.

ERIC Educational Resources Information Center

Moy, Raymond H.

1984-01-01

Discusses the problems associated with "grading on a curve," the approach often used for standard setting on language proficiency tests. Proposes four main steps presented in the setting of a non-arbitrary cut-score. These steps not only establish a proficiency standard checked by external criteria, but also check to see that the test covers the…

Effort Analysis: Individual Score Validation of Achievement Test Data

ERIC Educational Resources Information Center

Wise, Steven L.

2015-01-01

Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…
Student Laptop Use and Scores on Standardized Tests

ERIC Educational Resources Information Center

Kposowa, Augustine J.; Valdez, Amanda D.

2013-01-01

Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

PubMed Central

Xu, Jian

2017-01-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers’ listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms. PMID:29312063
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach.

PubMed

Xu, Jian

2017-01-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers' listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.
The Dynamics of the Evolution of the Black-White Test Score Gap

ERIC Educational Resources Information Center

Sohn, Kitae

2012-01-01

We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…
The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

ERIC Educational Resources Information Center

Longenbecker, Sueann; Wood, Peter H.

1984-01-01

Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)
Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

ERIC Educational Resources Information Center

Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

2014-01-01

Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…
The Glasgow Voice Memory Test: Assessing the ability to memorize and recognize unfamiliar voices.

PubMed

Aglieri, Virginia; Watson, Rebecca; Pernet, Cyril; Latinus, Marianne; Garrido, Lúcia; Belin, Pascal

2017-02-01

One thousand one hundred and twenty subjects as well as a developmental phonagnosic subject (KH) along with age-matched controls performed the Glasgow Voice Memory Test, which assesses the ability to encode and immediately recognize, through an old/new judgment, both unfamiliar voices (delivered as vowels, making language requirements minimal) and bell sounds. The inclusion of non-vocal stimuli allows the detection of significant dissociations between the two categories (vocal vs. non-vocal stimuli). The distributions of accuracy and sensitivity scores (d') reflected a wide range of individual differences in voice recognition performance in the population. As expected, KH showed a dissociation between the recognition of voices and bell sounds, her performance being significantly poorer than matched controls for voices but not for bells. By providing normative data of a large sample and by testing a developmental phonagnosic subject, we demonstrated that the Glasgow Voice Memory Test, available online and accessible from all over the world, can be a valid screening tool (~5 min) for a preliminary detection of potential cases of phonagnosia and of "super recognizers" for voices.
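The sensitivity score d' mentioned above comes from signal detection theory. A minimal sketch under the standard equal-variance Gaussian model is d' = z(hit rate) − z(false-alarm rate), where z is the inverse of the standard normal cumulative distribution. The abstract does not state how the authors computed d' or handled extreme rates, so the function and the example rates below are illustrative assumptions.

```python
from statistics import NormalDist


def d_prime(hit_rate, false_alarm_rate):
    """Sensitivity index d' = z(hit rate) - z(false-alarm rate).

    Rates must lie strictly between 0 and 1; a rate of exactly 0 or 1
    needs a correction before use, which this sketch does not apply.
    """
    z = NormalDist().inv_cdf   # inverse standard normal CDF
    return z(hit_rate) - z(false_alarm_rate)


# Hypothetical old/new judgment rates for one listener.
chance_level = d_prime(0.5, 0.5)      # no discrimination between old and new
good_listener = d_prime(0.9, 0.2)
```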
Oral Cancer Knowledge and Diagnostic Ability Among Dental Students.

PubMed

Hassona, Y; Scully, C; Abu Tarboush, N; Baqain, Z; Ismail, F; Hawamdeh, S; Sawair, F

2017-09-01

The purpose of this study is to examine factors that influence the diagnostic ability of dental students with regard to oral cancer and oral potentially malignant disorders. Dental students at different levels of study were directly interviewed to examine their oral cancer knowledge and diagnostic ability using a validated and pre-tested survey instrument containing validated clinical images of oral cancer and oral potentially malignant disorders. An oral cancer knowledge scale (0 to 31) was generated from correct responses on oral cancer general knowledge, and a diagnostic ability scale (0 to 100) was generated from correct selections of suspicious oral lesions. Knowledge scores ranged from 0 to 27 (mean 10.1 ± 6.0); mean knowledge scores increased with year of study; 5th year students had the highest mean knowledge score (19.1 ± 4.0), while 1st year students had the lowest (5.6 ± 3.5). Diagnostic ability scores increased with year of study and ranged from 0 to 88.5% (mean 41.8% ± 15.6). The ability to recognize suspicious oral lesions was significantly correlated with knowledge about oral cancer and oral potentially malignant disorders (r = 0.28; P < 0.001). There is a need to improve oral cancer education curricula; increasing students' contact with patients who have oral lesions including oral cancer will help to improve their future diagnostic ability and early detection practices.
Rank score and permutation testing alternatives for regression quantile estimates

USGS Publications Warehouse

Cade, B.S.; Richards, J.D.; Mielke, P.W.

2006-01-01

Performance of quantile rank score tests used for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1) was evaluated by simulation for models with p = 2 and 6 predictors, moderate collinearity among predictors, homogeneous and heterogeneous errors, small to moderate samples (n = 20–300), and central to upper quantiles (0.50–0.99). Test statistics evaluated were the conventional quantile rank score T statistic distributed as χ² random variable with q degrees of freedom (where q parameters are constrained by H0:) and an F statistic with its sampling distribution approximated by permutation. The permutation F-test maintained better Type I errors than the T-test for homogeneous error models with smaller n and more extreme quantiles τ. An F distributional approximation of the F statistic provided some improvements in Type I errors over the T-test for models with > 2 parameters, smaller n, and more extreme quantiles but not as much improvement as the permutation approximation. Both rank score tests required weighting to maintain correct Type I errors when heterogeneity under the alternative model increased to 5 standard deviations across the domain of X. A double permutation procedure was developed to provide valid Type I errors for the permutation F-test when null models were forced through the origin. Power was similar for conditions where both T- and F-tests maintained correct Type I errors but the F-test provided some power at smaller n and extreme quantiles when the T-test had no power because of excessively conservative Type I errors. When the double permutation scheme was required for the permutation F-test to maintain valid Type I errors, power was less than for the T-test with decreasing sample size and increasing quantiles. Confidence intervals on parameters and tolerance intervals for future predictions were constructed based on test inversion for an example application
Super-recognizers: people with extraordinary face recognition ability.

PubMed

Russell, Richard; Duchaine, Brad; Nakayama, Ken

2009-04-01

We tested 4 people who claimed to have significantly better than ordinary face recognition ability. Exceptional ability was confirmed in each case. On two very different tests of face recognition, all 4 experimental subjects performed beyond the range of control subject performance. They also scored significantly better than average on a perceptual discrimination test with faces. This effect was larger with upright than with inverted faces, and the 4 subjects showed a larger "inversion effect" than did control subjects, who in turn showed a larger inversion effect than did developmental prosopagnosics. This result indicates an association between face recognition ability and the magnitude of the inversion effect. Overall, these "super-recognizers" are about as good at face recognition and perception as developmental prosopagnosics are bad. Our findings demonstrate the existence of people with exceptionally good face recognition ability and show that the range of face recognition and face perception ability is wider than has been previously acknowledged.
Super-recognizers: People with extraordinary face recognition ability

PubMed Central

Russell, Richard; Duchaine, Brad; Nakayama, Ken

2014-01-01

We tested four people who claimed to have significantly better than ordinary face recognition ability. Exceptional ability was confirmed in each case. On two very different tests of face recognition, all four experimental subjects performed beyond the range of control subject performance. They also scored significantly better than average on a perceptual discrimination test with faces. This effect was larger with upright than inverted faces, and the four subjects showed a larger ‘inversion effect’ than control subjects, who in turn showed a larger inversion effect than developmental prosopagnosics. This indicates an association between face recognition ability and the magnitude of the inversion effect. Overall, these ‘super-recognizers’ are about as good at face recognition and perception as developmental prosopagnosics are bad. Our findings demonstrate the existence of people with exceptionally good face recognition ability, and show that the range of face recognition and face perception ability is wider than previously acknowledged. PMID:19293090
Impaired reading comprehension and mathematical abilities in male adolescents with average or above general intellectual abilities are associated with comorbid and future psychopathology.

PubMed

Weiser, Mark; Reichenberg, Abraham; Rabinowitz, Jonathan; Nahon, Daniella; Kravitz, Efrat; Lubin, Gad; Knobler, Haim Y; Davidson, Michael; Noy, Shlomo

2007-11-01

Research indicates that persons with learning disorders often suffer from psychopathology. We assessed current and future psychopathology in male adolescents with discrete impairments in reading comprehension (IRC) or arithmetic abilities (IAA) but with average or above-average general intellectual abilities. Subjects were a population-based cohort of 174,994 male adolescents screened by the Israeli Draft Board with average or above-average intellectual abilities but with low scores (8.6th and 10th lowest percentile respectively) on reading or arithmetic tests. They were compared with adolescents who scored in the 10th percentile and above on these tests (comparison group). Relative to the comparison group, male adolescents with IRC, IAA, or IRC and IAA (0.69%), had poorer scores on most behavioral assessments and higher prevalence of current psychopathology: 4.2% (comparison group), 8.0% (IRC), 7.0% (IAA), and 9.8% (IRC and IAA). Adolescents with IRC were also at increased risk for later hospitalization for schizophrenia (hazard ratios = 1.8, 95% confidence interval: 1.3-2.6). Male adolescents with average and above-average general intellectual abilities but with IRC or IAA are more likely to have current and future psychopathology. Impairments in intellectual functioning and abnormal behaviors leading to mental illnesses may share common neurobiological substrates. The results support screening male adolescents with learning disorders for psychopathology.
Effects of work-related stress on work ability index among refinery workers

PubMed Central

Habibi, Ehsanollah; Dehghan, Habibollah; Safari, Shahram; Mahaki, Behzad; Hassanzadeh, Akbar

2014-01-01

Introduction: Work-related stress is one of the basic problems in industry and ranks among the top 10 work-related health problems. It is increasingly implicated in the development of a number of problems among employees, such as cardiovascular disease, musculoskeletal diseases, and early retirement. Early retirement of employees has, in turn, added to the problems of today's industries. Improving work ability is therefore one of the most effective ways to prevent disability and early retirement. The aim of this study was to determine the relationship between job stress score and work ability index (WAI) among refinery workers. Materials and Methods: This cross-sectional study, conducted in 2012, included 171 workers from different occupational groups at a refinery in Isfahan. Based on appropriate assignment sampling, 33 office workers, 69 operational workers, and 69 maintenance workers were invited to participate. Two questionnaires, one on work-related stress and one on the WAI, were completed. The data were analyzed in SPSS 20 using analysis of covariance, the Kruskal-Wallis test, the Pearson correlation coefficient, ANOVA, and the t-test. Results: Data analysis revealed that 86% and 14% of participants had moderate and severe stress, respectively. The mean stress score was 158.7 ± 17.3 (mean ± standard deviation), which was in the extreme stress range. The mean WAI score was 37.18 with a standard deviation of 3.86, which was in the good range. The Pearson correlation coefficient showed a significant inverse relationship between WAI score and stress score. Conclusion: According to the results, the mean stress score among refinery workers was high, and high stress was one factor that affected work ability; hence, training in communication skills and a safe working environment, aimed at decreasing stress, could enhance the work ability of workers. PMID
Effects of work-related stress on work ability index among refinery workers.

PubMed

Habibi, Ehsanollah; Dehghan, Habibollah; Safari, Shahram; Mahaki, Behzad; Hassanzadeh, Akbar

2014-01-01

Work-related stress is one of the basic problems in industry and ranks among the top 10 work-related health problems. It is increasingly implicated in the development of a number of problems among employees, such as cardiovascular disease, musculoskeletal diseases, and early retirement. Early retirement of employees has, in turn, added to the problems of today's industries. Improving work ability is therefore one of the most effective ways to prevent disability and early retirement. The aim of this study was to determine the relationship between job stress score and work ability index (WAI) among refinery workers. This cross-sectional study, conducted in 2012, included 171 workers from different occupational groups at a refinery in Isfahan. Based on appropriate assignment sampling, 33 office workers, 69 operational workers, and 69 maintenance workers were invited to participate. Two questionnaires, one on work-related stress and one on the WAI, were completed. The data were analyzed in SPSS 20 using analysis of covariance, the Kruskal-Wallis test, the Pearson correlation coefficient, ANOVA, and the t-test. Data analysis revealed that 86% and 14% of participants had moderate and severe stress, respectively. The mean stress score was 158.7 ± 17.3 (mean ± standard deviation), which was in the extreme stress range. The mean WAI score was 37.18 with a standard deviation of 3.86, which was in the good range. The Pearson correlation coefficient showed a significant inverse relationship between WAI score and stress score. According to the results, the mean stress score among refinery workers was high, and high stress was one factor that affected work ability; hence, training in communication skills and a safe working environment, aimed at decreasing stress, could enhance the work ability of workers.
Differential Ability Scales-II Prediction of Reading Performance: Global Scores Are Not Enough

ERIC Educational Resources Information Center

Elliott, Colin D.; Hale, James B.; Fiorello, Catherine A.; Dorvil, Cledicianne; Moldovan, Jaime

2010-01-01

This study investigated the effects of broad cognitive abilities derived from the Cattell-Horn-Carroll (CHC) taxonomy, together with the effect of the general factor ("g"), on Wechsler Individual Achievement Test, Second Edition (WIAT-II) reading achievement. Structural equation modeling (SEM) and commonality analyses were applied to the…
Contributions of Hamstring Stiffness to Straight-Leg-Raise and Sit-and-Reach Test Scores.

PubMed

Miyamoto, Naokazu; Hirata, Kosuke; Kimura, Noriko; Miyamoto-Mikami, Eri

2018-02-01

The passive straight-leg-raise (PSLR) and the sit-and-reach (SR) tests have been widely used to assess hamstring extensibility. However, it remains unclear to what extent hamstring stiffness (a measure of material properties) contributes to PSLR and SR test scores. Therefore, we aimed to clarify the relationship between hamstring stiffness and PSLR and SR scores using ultrasound shear wave elastography. Ninety-eight healthy subjects completed the study. Each subject completed PSLR testing, and classic and modified SR testing of the right leg. Muscle shear modulus of the biceps femoris, semitendinosus, and semimembranosus was quantified as an index of muscle stiffness. The relationships between shear modulus of each muscle and PSLR or SR scores were calculated using Pearson's product-moment correlation coefficients. Shear modulus of the semitendinosus and semimembranosus showed negative correlations with the two PSLR and two SR scores (absolute r value≤0.484). Shear modulus of the biceps femoris was significantly correlated with the PSLR score determined by the examiner and the modified SR score (absolute r value≤0.308). The present findings suggest that PSLR and SR test scores are strongly influenced by factors other than hamstring stiffness and therefore might not accurately evaluate hamstring stiffness. © Georg Thieme Verlag KG Stuttgart · New York.
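
To make the closing inference concrete: the squared correlation coefficient is the share of variance in one measure that is linearly associated with the other. The sketch below applies that standard identity to the two upper bounds on |r| quoted in the abstract. The helper name `shared_variance` is hypothetical and is not taken from the study.

```python
def shared_variance(r: float) -> float:
    """Return r squared, the proportion of variance two linearly
    related measures share (the coefficient of determination)."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("a correlation coefficient must lie in [-1, 1]")
    return r * r


# Upper bounds on |r| as reported in the abstract.
bounds = [
    ("semitendinosus / semimembranosus", 0.484),
    ("biceps femoris", 0.308),
]

for label, r in bounds:
    print(f"{label}: |r| <= {r} -> shared variance <= {shared_variance(r):.1%}")
```

On these figures, shear modulus is linearly associated with at most about 23% of the variance in any of the test scores, which is consistent with the authors' conclusion that other factors dominate.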
Manual for Scoring the Test of Directed Imagination.

ERIC Educational Resources Information Center

Veldman, Donald J.; And Others

A scoring manual for the Directed Imagination Test, a projective technique wherein the subject is instructed to write four fictional stories (four minutes are allowed for each) about teachers and their experiences, is presented. The manual provides detailed instructions for rating each story by fifteen dimensions relevant to teacher education…
The Relationship between Spatial Visualization Ability and Students' Ability to Model 3D Objects from Engineering Assembly Drawings

ERIC Educational Resources Information Center

Branoff, T. J.; Dobelis, M.

2012-01-01

Spatial abilities have been used as a predictor of success in several engineering and technology disciplines (Strong & Smith, 2001). In engineering graphics courses, scores on spatial tests have also been used to predict success (Adanez & Velasco, 2002; Leopold, Gorska, & Sorby, 2001). Other studies have shown that some type of…
Use of the Short Physical Performance Battery Score to Predict Loss of Ability to Walk 400 Meters: Analysis From the InCHIANTI Study

PubMed Central

Coppin, Antonia K.; Patel, Kushang V.; Lauretani, Fulvio; Ferrucci, Luigi; Bandinelli, Stefania; Guralnik, Jack M.

2009-01-01

Background Early detection of mobility limitations remains an important goal for preventing mobility disability. The purpose of this study was to examine the association between the Short Physical Performance Battery (SPPB) and the loss of ability to walk 400 m, an objectively assessed mobility outcome increasingly used in clinical trials. Methods The study sample consisted of 542 adults from the InCHIANTI study aged 65 and older, who completed the 400 m walk at baseline and had evaluations on the SPPB and 400 m walk at baseline and 3-year follow-up. Multiple logistic regression models were used to determine whether SPPB scores predict the loss of ability to walk 400 m at follow-up among persons able to walk 400 m at baseline. Results The 3-year incidence of failing the 400 m walk was 15.5%. After adjusting for age, sex, education, body mass index, Mini-Mental State Examination, number of medical conditions, and 400 m walk gait speed at baseline, SPPB score was significantly associated with loss of ability to walk 400 m after 3 years. Participants with SPPB scores of 10 or lower at baseline had significantly higher odds of mobility disability at follow-up (odds ratio [OR] = 3.38, 95% confidence interval [CI]: 1.32–8.65) compared with those who scored 12, with a graded response across the range of SPPB scores (OR = 26.93, 95% CI: 7.51–96.50; OR = 7.67, 95% CI: 2.26–26.04; OR = 8.28, 95% CI: 3.32–20.67 for SPPB ≤ 7, SPPB 8, and SPPB 9, respectively). Conclusions The SPPB strongly predicts loss of ability to walk 400 m. Thus, using the SPPB to identify older persons at high risk of lower body functional limitations seems a valid means of recognizing individuals who would benefit most from preventive interventions. PMID:19182232
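
Confidence intervals for odds ratios from logistic regression are usually built on the log scale, so the point estimate should sit at the geometric mean of the two bounds. The sketch below runs that consistency check on the four odds ratios quoted in the abstract and backs out the implied standard error of the log odds ratio. It assumes Wald-type 95% intervals (z = 1.96), which the abstract does not state, and the helper names are hypothetical.

```python
import math

Z_95 = 1.96  # assumption: two-sided 95% Wald interval on the log-odds scale


def implied_or(lower: float, upper: float) -> float:
    """Geometric mean of the CI bounds, i.e. the midpoint on the log scale."""
    return math.sqrt(lower * upper)


def implied_se_log_or(lower: float, upper: float, z: float = Z_95) -> float:
    """Standard error of log(OR) implied by a symmetric log-scale interval."""
    return (math.log(upper) - math.log(lower)) / (2 * z)


# (reported OR, lower bound, upper bound) as quoted in the abstract.
reported = {
    "SPPB <= 10": (3.38, 1.32, 8.65),
    "SPPB <= 7": (26.93, 7.51, 96.50),
    "SPPB 8": (7.67, 2.26, 26.04),
    "SPPB 9": (8.28, 3.32, 20.67),
}

for group, (odds_ratio, lo, hi) in reported.items():
    print(
        f"{group}: reported OR {odds_ratio}, "
        f"implied OR {implied_or(lo, hi):.2f}, "
        f"implied SE(log OR) {implied_se_log_or(lo, hi):.2f}"
    )
```

All four reported estimates agree with their interval midpoints to within rounding. The wide interval for SPPB ≤ 7 corresponds to a large standard error, which suggests that few participants fell in that category.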

AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

ERIC Educational Resources Information Center

Cech, Scott J.

2008-01-01

More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…
Generalized likelihood ratios for quantitative diagnostic test scores.

PubMed

Tandberg, D; Deely, J J; O'Malley, A J

1997-11-01

The reduction of quantitative diagnostic test scores to the dichotomous case is a wasteful and unnecessary simplification in the era of high-speed computing. Physicians could make better use of the information embedded in quantitative test results if modern generalized curve estimation techniques were applied to the likelihood functions of Bayes' theorem. Hand calculations could be completely avoided and computed graphical summaries provided instead. Graphs showing posttest probability of disease as a function of pretest probability with confidence intervals (POD plots) would enhance acceptance of these techniques if they were immediately available at the computer terminal when test results were retrieved. Such constructs would also provide immediate feedback to physicians when a valueless test had been ordered.
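
The posttest-versus-pretest relationship that the abstract proposes to plot follows from the odds form of Bayes' theorem: posttest odds equal pretest odds multiplied by the likelihood ratio. The sketch below takes the likelihood ratio for a given test score as already known. It does not implement the curve estimation or the confidence intervals the authors describe, and the function name is hypothetical.

```python
def posttest_probability(pretest: float, likelihood_ratio: float) -> float:
    """Convert a pretest probability of disease into a posttest probability
    using the odds form of Bayes' theorem."""
    if not 0.0 < pretest < 1.0:
        raise ValueError("pretest probability must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("a likelihood ratio cannot be negative")
    pretest_odds = pretest / (1.0 - pretest)
    posttest_odds = pretest_odds * likelihood_ratio
    return posttest_odds / (1.0 + posttest_odds)


# Tabulate the curve for a few illustrative likelihood ratios.
pretest_grid = (0.1, 0.3, 0.5, 0.7, 0.9)
for lr in (0.1, 1.0, 10.0):
    row = ", ".join(f"{posttest_probability(p, lr):.2f}" for p in pretest_grid)
    print(f"LR = {lr:>4}: {row}")
```

A likelihood ratio of 1 leaves the probability unchanged at every pretest value, which is the "valueless test" case mentioned in the abstract's last sentence.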
Outcome of older persons admitted to intensive care unit, mortality, prognosis factors, dependency scores and ability trajectory within 1 year: a prospective cohort study.

PubMed

Level, Claude; Tellier, Eric; Dezou, Patrick; Chaoui, Karim; Kherchache, Aissa; Sejourné, Philippe; Rullion-Pac Soo, Anne Marie

2017-12-06

The outcome and functional trajectory of older persons admitted to the intensive care unit (ICU) remain an open question for critical care physicians and geriatricians, owing to the heterogeneity of the geriatric population, the heterogeneity of practices and the absence of guidelines. The aim was to describe the 1-year outcome, prognosis factors and functional trajectory for older people admitted to ICU. In a prospective 1-year cohort study, all patients aged 75 years and over admitted to our ICU were included according to a global comprehensive geriatric assessment. One-year survivors were followed up, in particular for ability scores and living conditions. Of 188 patients included [aged 82.3 ± 4.7 years, 46% of admissions, median SAPS II 53.5 (43-74), ADL of Katz's score 4.2 ± 1.6, median Barthel's index 71 (55-90), AGGIR scale 4.5 ± 1.5], the ICU, hospital and 1-year mortality were, respectively, 34, 42.5 and 65.5%. Prognosis factors were: SAPS II, mechanical ventilation, comorbidity (Lee's and McCabe's scores), disability scores (ADL of Katz's score, Barthel's index and AGGIR scale), admission creatinine, hypoalbuminemia, malignant haemopathy and cognitive impairment. Of the 1-year survivors, 83% lived in their own home, with preserved physical ability and no significant variation in the three assessed ability scores compared with before ICU admission. The mortality of older people admitted to ICU is high, with a significant impact of disability scores, and independence is preserved in 1-year survivors. Other studies, including a better comprehensive geriatric assessment, seem necessary to determine a predictive "phenotype" of survival with a "satisfactory" level of autonomy.
The Development of Extraversion and Ability: Analysis of Data from a Large-Scale Longitudinal Study of Children Tested at 10-11 and 14-15 Years.

ERIC Educational Resources Information Center

Anthony, W. S.

1983-01-01

Results of analysis of correlations collected by Cookson, following Eysenck and Cookson's study of personality and ability in young people, confirm the finding from previous Cattellian test data that the more intelligent children decline in relative extraversion scores and cast doubt on Eysenck's suggestion that introverts gradually show higher…
Validity of GRE General Test Scores and TOEFL Scores for Graduate Admission to a Technical University in Western Europe

ERIC Educational Resources Information Center

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the…
The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

ERIC Educational Resources Information Center

Molenaar, Dylan; Borsboom, Denny

2013-01-01

Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…
Effect of case-based learning on the development of graduate nurses' problem-solving ability.

PubMed

Yoo, Moon-Sook; Park, Jin-Hee

2014-01-01

Case-based learning (CBL) is a teaching strategy which promotes clinical problem-solving ability. This research was performed to investigate the effects of CBL on the problem-solving ability of graduate nurses. The research used a quasi-experimental design with pre-test, intervention, and post-test and a non-synchronized, non-equivalent control group. The study population was composed of 190 new graduate nurses from university hospital A in Korea. The results indicate a statistically significant difference in objective problem-solving ability scores, with the CBL group scoring higher. Subjective problem-solving ability was also significantly higher in the CBL group than in the lecture-based group. These results suggest that CBL is a beneficial and effective instructional method for training graduate nurses to improve their clinical problem-solving ability. Copyright © 2013 Elsevier Ltd. All rights reserved.
Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

ERIC Educational Resources Information Center

Ho, Andrew D.; Reardon, Sean F.

2012-01-01

Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This paper introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…
A Seven-Year Follow-Up of Intelligence Test Scores of Foster Grandparents

ERIC Educational Resources Information Center

Troll, Lillian E.; And Others

1976-01-01

After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…
An Examination of English Speaking Tests and Research on English Speaking Ability.

ERIC Educational Resources Information Center

Nakamura, Yuji

This paper examines both overseas and domestic tests of English speaking ability from the viewpoint of the crucial testing elements such as definition of speaking ability, validity, reliability, and practicality. The paper points out problems to be solved and proposes suggestions for constructing an oral proficiency test in order to determine the…
Investigating the mental abilities of rural Zulu primary school children in South Africa.

PubMed

Jinabhai, C C; Taylor, M; Rangongo, M F; Mkhize, N J; Anderson, S; Pillay, B J; Sullivan, K R

2004-02-01

Maximising the full potential of health and educational interventions in South African schools requires assessment of the current level of mental abilities of the school children as measured by cognitive and scholastic tests and the identification of any barriers to improved performance. This study reports on the application and interpretation of a selected battery of mental ability tests among Zulu school children and the methodological and analytical issues that need to be addressed. The test scores of 806 primary school children from a rural community are presented, based on four tests: Raven's Coloured Progressive Matrices (CPM), an Auditory Verbal Learning Test (AVLT), the Symbol Digit Modalities Test (SDMT) and Young's Group Mathematics Test (GMT). Significant gender differences were found in the test scores, and the mean scores of Zulu children in this study were lower than those reported in other studies. The results of this selected test battery provide data for the further development of appropriate test instruments for South African conditions. These results can contribute towards the development of a test battery for South African children that can be used to assess and improve their school performance.
Evaluation of the utility of the Estimation of Physiologic Ability and Surgical Stress score for predicting post-operative morbidity after orthopaedic surgery.

PubMed

Nagata, Takehiro; Hirose, Jun; Nakamura, Takayuki; Tokunaga, Takuya; Uehara, Yusuke; Mizuta, Hiroshi

2015-11-01

The purpose of this study was to investigate the utility of the Estimation of Physiologic Ability and Surgical Stress (E-PASS) scoring system for predicting post-operative morbidity. We included 1,883 patients (mean age, 52.1 years) who underwent orthopaedic surgery. The post-operative complications were classified as surgical site and non-surgical site complications, and the relationship between the E-PASS scores and post-operative morbidity was investigated. The incidence of post-operative complications (n = 274) significantly increased with an increase in E-PASS scores (p < 0.001). The areas under the curve for the comprehensive risk score of the E-PASS scoring system for overall and non-surgical site complications were 0.777 and 0.794, respectively. The E-PASS scoring system showed some utility in predicting post-operative morbidity after general orthopaedic surgery. However, creating a new risk score that is more suitable for orthopaedic surgery will be challenging.
Measuring creative imagery abilities

PubMed Central

Jankowska, Dorota M.; Karwowski, Maciej

2015-01-01

Over the decades, creativity and imagination research developed in parallel, but they surprisingly rarely intersected. This paper introduces a new theoretical model of creative visual imagination, which bridges creativity and imagination research, as well as presents a new psychometric instrument, called the Test of Creative Imagery Abilities (TCIA), developed to measure creative imagery abilities understood in accordance with this model. Creative imagination is understood as constituted by three interrelated components: vividness (the ability to create images characterized by a high level of complexity and detail), originality (the ability to produce unique imagery), and transformativeness (the ability to control imagery). TCIA enables valid and reliable measurement of these three groups of abilities, yielding the general score of imagery abilities and at the same time making profile analysis possible. We present the results of nine studies on a total sample of more than 1700 participants, showing the factor structure of TCIA using confirmatory factor analysis, as well as provide data confirming this instrument's validity and reliability. The availability of TCIA for interested researchers may result in new insights and possibilities of integrating the fields of creativity and imagination science. PMID:26539140
Simple exercise test score versus cardiac stress test for the prediction of coronary artery disease in patients with type 2 diabetes.

PubMed

Pikto-Pietkiewicz, Witold; Przewłocka, Monika; Chybowska, Barbara; Cyciwa, Alona; Pasierski, Tomasz

2014-01-01

Type 2 diabetes markedly increases the risk of coronary heart disease (CHD), and screening for CHD is suggested by the guidelines. The aim of the study was to compare the diagnostic usefulness of the simple exercise test score, incorporating the clinical data and cardiac stress test results, with the standard stress test in patients with type 2 diabetes. A total of 62 consecutive patients (aged 65.4 ± 8.5 years; 32 men) with type 2 diabetes and clinical symptoms suggesting CHD underwent a stress test followed by coronary angiography. The simple score was calculated for all patients. Significant coronary stenosis was observed in 41 patients (66.1%). Stress test results were positive in 36 patients (58.1%). The mean simple score was high (65.5 ± 14.3 points). A positive linear relationship was observed between the score and the prevalence of CHD (R² = 0.19; P < 0.001) as well as its severity (R² = 0.23; P < 0.001). The area under the receiver-operating characteristic curve for the simple score was 0.74 (95% confidence interval [CI], 0.62-0.86). At the original cut-off value of 60 points, the score had a similar prognostic value to that of the standard stress test. However, in a multivariate analysis, only the simple score (odds ratio [OR], 1.46; 95% CI, 1.11-1.94; P < 0.01 for an increase in the score by 1 point) and male sex (OR, 1.57; 95% CI, 1.24-1.98; P < 0.001) remained independent predictors of CHD. In patients with type 2 diabetes, the simple score correlated with the prevalence and severity of CHD. However, the cut-off value of 60 points was inadequate in the population of diabetic patients with high risk of CHD. The simple score used instead of or together with the stress test was a better predictor of CHD than the stress test alone.
A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

ERIC Educational Resources Information Center

Kamens, David H.

2015-01-01

This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…
Assessment Test Scores of Incoming Students, Fall 2001.

ERIC Educational Resources Information Center

Negron, Maggie; Breindel, Matthew

This assessment of placement test scores in reading, math, and sentence skills from incoming students at College of the Desert (California) shows that students are overwhelmingly underprepared for study at the college. Only 15% of students were prepared in sentence skills, 27% in reading skills, 7% in math skills; only 3% were prepared in all 3…
Test Score Stability and Construct Validity of the Adult Manifest Anxiety Scale-College Version Scores among College Students: A Brief Report

ERIC Educational Resources Information Center

Lowe, Patricia A.; Papanastasiou, Elena C.; DeRuyck, Kimberly A.; Reynolds, Cecil R.

2005-01-01

In this study, the authors investigated the temporal stability and construct validity of the Adult Manifest Anxiety Scale-College Version (AMAS-C; C. R. Reynolds, B. O. Richmond, & P. A. Lowe, 2003b) scores. Results indicated that the AMAS-C scores had adequate to excellent test score stability, and evidence supported the construct validity of the…
The Validity of IQ Scores Derived from Readiness Screening Tests

ERIC Educational Resources Information Center

Telegdy, Gabriel A.

1976-01-01

The Screening Test of Academic Readiness (STAR) and the Peabody Picture Vocabulary Test (PPVT) were administered to 52 kindergarten children to reveal the convergent validity of IQ scores derived from the STAR. The findings raise doubts about the validity of the deviation IQs derived from the STAR. (Author)
Just as smart but not as successful: obese students obtain lower school grades but equivalent test scores to nonobese students.

PubMed

MacCann, C; Roberts, R D

2013-01-01

The obesity epidemic in industrialized nations has important implications for education, as research demonstrates lower academic achievement among obese students. The current paper compares the test scores and school grades of obese, overweight and normal-weight students in secondary and further education, controlling for demographic variables, personality, ability and well-being confounds. This study included 383 eighth-grade students (49% female; study 1) and 1036 students from 24 community colleges and universities (64% female, study 2), both drawn from five regions across the United States. In study 1, body mass index (BMI) was calculated using self-reports and parent reports of weight and height. In study 2, BMI was calculated from self-reported weight and height only. Both samples completed age-appropriate assessments of mathematics, vocabulary and the personality trait conscientiousness. Eighth-grade students additionally completed a measure of life satisfaction, with both self-reports and parent reports of their grades from the previous semester also obtained. Higher education students additionally completed measures of positive and negative affect, and self-reported their grades and college entrance scores. Obese students receive significantly lower grades in middle school (d=0.83), community college (d=0.34) and university (d=0.36), but show no statistically significant differences in intelligence or achievement test scores. Even after controlling for demographic variables, intelligence, personality and well-being, obese students obtain significantly lower grades than normal-weight students in the eighth grade (d=0.39), community college (d=0.42) and university (d=0.31). Lower grades may reflect peer and teacher prejudice against overweight and obese students rather than lack of ability among these students.
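
The effect sizes above are reported as d. The abstract does not say which variant, so the sketch below assumes the common pooled-standard-deviation form of Cohen's d: the difference in group means divided by the pooled sample standard deviation. The function name and the grade values are hypothetical and are not data from the study.

```python
import math
import statistics


def cohens_d(group_a: list[float], group_b: list[float]) -> float:
    """Cohen's d with a pooled sample standard deviation (assumed variant).

    Positive values mean group_a has the higher mean.
    """
    n_a, n_b = len(group_a), len(group_b)
    if n_a < 2 or n_b < 2:
        raise ValueError("each group needs at least two observations")
    var_a = statistics.variance(group_a)  # sample variance (n - 1 denominator)
    var_b = statistics.variance(group_b)
    pooled_sd = math.sqrt(((n_a - 1) * var_a + (n_b - 1) * var_b) / (n_a + n_b - 2))
    return (statistics.mean(group_a) - statistics.mean(group_b)) / pooled_sd


# Made-up grade point averages, for illustration only.
normal_weight = [3.4, 3.1, 3.8, 2.9, 3.5, 3.6]
obese = [3.0, 2.8, 3.5, 2.7, 3.2, 3.1]
print(f"d = {cohens_d(normal_weight, obese):.2f}")
```

By Cohen's widely used rules of thumb, d near 0.2 is small, near 0.5 medium, and near 0.8 large, so the unadjusted middle-school difference (d = 0.83) would count as large and the adjusted differences (d = 0.31 to 0.42) as small to medium.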
Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

ERIC Educational Resources Information Center

Kolen, Michael J.; Lee, Won-Chan

2011-01-01

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

ERIC Educational Resources Information Center

Retnawati, Heri

2015-01-01

This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Study Protocol on Intentional Distortion in Personality Assessment: Relationship with Test Format, Culture, and Cognitive Ability.

PubMed

Van Geert, Eline; Orhon, Altan; Cioca, Iulia A; Mamede, Rui; Golušin, Slobodan; Hubená, Barbora; Morillo, Daniel

2016-01-01

Self-report personality questionnaires, traditionally offered in a graded-scale format, are widely used in high-stakes contexts such as job selection. However, job applicants may intentionally distort their answers when filling in these questionnaires, undermining the validity of the test results. Forced-choice questionnaires are allegedly more resistant to intentional distortion compared to graded-scale questionnaires, but they generate ipsative data. Ipsativity violates the assumptions of classical test theory, distorting the reliability and construct validity of the scales, and producing interdependencies among the scores. This limitation is overcome in the current study by using the recently developed Thurstonian item response theory model. As online testing in job selection contexts is increasing, the focus will be on the impact of intentional distortion on personality questionnaire data collected online. The present study intends to examine the effect of three different variables on intentional distortion: (a) test format (graded-scale versus forced-choice); (b) culture, as data will be collected in three countries differing in their attitudes toward intentional distortion (the United Kingdom, Serbia, and Turkey); and (c) cognitive ability, as a possible predictor of the ability to choose the more desirable responses. Furthermore, we aim to integrate the findings using a comprehensive model of intentional distortion. In the Anticipated Results section, three main aspects are considered: (a) the limitations of the manipulation, theoretical approach, and analyses employed; (b) practical implications for job selection and for personality assessment in a broader sense; and (c) suggestions for further research.
Effects of age, gender, education and race on two tests of language ability in community-based older adults.

PubMed

Snitz, Beth E; Unverzagt, Frederick W; Chang, Chung-Chou H; Bilt, Joni Vander; Gao, Sujuan; Saxton, Judith; Hall, Kathleen S; Ganguli, Mary

2009-12-01

Neuropsychological tests, including tests of language ability, are frequently used to differentiate normal from pathological cognitive aging. However, language can be particularly difficult to assess in a standardized manner in cross-cultural studies and in patients from different educational and cultural backgrounds. This study examined the effects of age, gender, education and race on performance of two language tests: the animal fluency task (AFT) and the Indiana University Token Test (IUTT). We report population-based normative data on these tests from two combined ethnically divergent, cognitively normal, representative population samples of older adults. Participants aged ≥65 years from the Monongahela-Youghiogheny Healthy Aging Team (MYHAT) and from the Indianapolis Study of Health and Aging (ISHA) were selected based on (1) a Clinical Dementia Rating (CDR) score of 0; (2) non-missing baseline language test data; and (3) race self-reported as African-American or white. The combined sample (n = 1885) was 28.1% African-American. Multivariate ordinal logistic regression was used to model the effects of demographic characteristics on test scores. On both language tests, better performance was significantly associated with higher education, younger age, and white race. On the IUTT, better performance was also associated with female gender. We found no significant interactions between age and sex, and between race and education. Age and education are more potent variables than are race and gender influencing performance on these language tests. Demographically stratified normative tables for these measures can be used to guide test interpretation and aid clinical diagnosis of impaired cognition.
New Testing Methods to Assess Technical Problem-Solving Ability.

ERIC Educational Resources Information Center

Hambleton, Ronald K.; And Others

Tests to assess problem-solving ability being provided for the Air Force are described, and some details on the development and validation of these computer-administered diagnostic achievement tests are discussed. Three measurement approaches were employed: (1) sequential problem solving; (2) context-free assessment of fundamental skills and…
Frailty Versus Stopping Elderly Accidents, Deaths and Injuries Initiative Fall Risk Score: Ability to Predict Future Falls.

PubMed

Crow, Rebecca S; Lohman, Matthew C; Pidgeon, Dawna; Bruce, Martha L; Bartels, Stephen J; Batsis, John A

2018-03-01

To compare the ability of frailty status to predict fall risk with that of community fall risk screening tools. Analysis of cross-sectional and longitudinal data from NHATS. National Health and Aging Trend Study (NHATS) 2011-2015. Individuals aged 65 and older (N = 7,392). Fall risk was defined according to the Stopping Elderly Accidents, Deaths and Injuries (STEADI) initiative. Frailty was defined as exhaustion, weight loss, low activity, slow gait speed, and weak grip strength. Robust was defined as meeting 0 criteria, prefrailty as 1 or 2 criteria, and frailty as 3 or more criteria. Falls were self-reported and ascertained using NHATS subsequent rounds (2012-2015). We compared the ability of frailty to predict future falls with that of STEADI score, adjusting for age, race, sex, education, comorbidities, hearing and vision impairment, and disability. Of the 7,392 participants (58.5% female), 3,545 (48.0%) were classified as being at low risk of falling, 2,966 (40.1%) as being at moderate risk, and 881 (11.9%) as being at high risk. The adjusted risk of falling over the 4 subsequent years was 2.5 times as great for the moderate-risk group (hazard ratio (HR) = 2.50, 95% confidence interval (CI) = 2.16-2.89) and almost 4 times as great (HR = 3.79, 95% CI = 2.76-5.21) for the high-risk group as for the low-risk group. Risk of falling was greater for those who were prefrail (HR = 1.34, 95% CI 1.16-1.55) and frail (HR = 1.20, 95% CI = 0.94-1.54) than for those who were robust. STEADI score is a strong predictor of future falls. Addition of frailty status does not improve the ability of the STEADI measure to predict future falls. © 2018, Copyright the Authors Journal compilation © 2018, The American Geriatrics Society.
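The frailty categories in this abstract follow a simple counting rule over five criteria (exhaustion, weight loss, low activity, slow gait speed, weak grip strength). A minimal sketch of that rule is shown here; the function name is hypothetical, and how each individual criterion is measured is not described in the abstract, so the input is just the number of criteria met. The second part only re-derives the reported risk-group percentages from the reported counts.

```python
def frailty_status(criteria_met: int) -> str:
    """Map the number of frailty criteria met (0-5) to a category.

    Rule as stated in the abstract: 0 = robust, 1 or 2 = prefrail,
    3 or more = frail. The function name is illustrative.
    """
    if not 0 <= criteria_met <= 5:
        raise ValueError("criteria_met must be between 0 and 5")
    if criteria_met == 0:
        return "robust"
    if criteria_met <= 2:
        return "prefrail"
    return "frail"


# Counts reported in the abstract for the STEADI fall-risk groups.
total = 7392
risk_counts = {"low": 3545, "moderate": 2966, "high": 881}

# Share of the sample in each group, as a percentage with one decimal.
risk_shares = {k: round(100 * v / total, 1) for k, v in risk_counts.items()}
```

The three counts add up to the full sample, and the rounded shares agree with the percentages given in the abstract.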
Pain scores for intravenous cannulation and arterial blood gas test among emergency department patients.

PubMed

Ballesteros-Peña, Sendoa; Vallejo-De la Hoz, Gorka; Fernández-Aedo, Irrintzi

2017-12-23

To analyse vein catheterisation and blood gas test-related pain among adult patients in the emergency department and to explore pain score-related factors. An observational and multicentre research study was performed. Patients undergoing vein catheterisation or arterial puncture for gas test were included consecutively. After each procedure, patients scored the pain experienced using the NRS-11. 780 vein catheterisations and 101 blood gas tests were analysed. Venipuncture was scored with an average score of 2.8 (95% CI: 2.6-3), and arterial puncture with 3.6 (95% CI: 3.1-4). Iatrogenic pain scores were associated with moderate-to-high difficulty procedures (P<.001) and with the choice of the humeral rather than the radial artery (P=.02) in the gas test, and were correlated with baseline pain in venipunctures (P<.001). Pain scores related to other variables such as sex, place of origin or needle gauge did not present statistically significant differences. Vein catheterisation and blood gas testing can be considered mildly to moderately painful and moderately painful procedures, respectively. The pain score is associated with certain variables such as the difficulty of the procedure, the anatomic area of the puncture or baseline pain. A better understanding of painful effects related to emergency nursing procedures and the factors associated with pain self-perception could help to determine when and how to act to mitigate this undesired effect. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Associations of physical activity with driving-related cognitive abilities in older drivers: an exploratory study.

PubMed

Marmeleira, José; Ferreira, Inês; Melo, Filipe; Godinho, Mário

2012-10-01

The purpose of this study was to examine the associations between physical activity and driving-related cognitive abilities of older drivers. Thirty-eight female and male drivers ages 61 to 81 years (M = 70.2, SD = 5.0) responded to the International Physical Activity Questionnaire and were assessed on a battery of neuropsychological tests, which included measures of visual attention, executive functioning, mental status, visuospatial ability, and memory. A higher amount of reported physical activity was significantly correlated with better scores on tests of visual processing speed and divided visual attention. Higher amounts of physical activity were significantly associated with a better composite score for visual attention, but the correlation with the composite score for executive functioning was not significant. These findings support the hypothesis that physical activity is associated with preservation of specific driving-related cognitive abilities of older adults.
Affective Variables and Japanese L2 Reading Ability

ERIC Educational Resources Information Center

Kondo-Brown, Kimi

2006-01-01

This study investigates how 17 affective factors are related to Japanese second language (L2) reading comprehension and "kanji" knowledge test scores of 43 university students in advanced Japanese courses. Major findings are that: a) reading comprehension ability and "kanji" knowledge have direct associations with…
Cognitive abilities of children on a gray seriation test.

PubMed

Dain, Stephen J; Ling, Barbara Y

2009-06-01

The importance of testing children's color vision, particularly to identify color vision deficiencies at an early age, has long been agreed on by teachers and color vision researchers and healthcare workers. The classic color vision tests were not necessarily developed for children's cognitive abilities, even though they are commonly used to assess children's color vision. Although, in the past, psychologists have studied color seriation abilities of children, they have not necessarily chosen isoluminous stimuli, which would minimize brightness cues. This investigation was designed to assess the ability of children to seriate a gray series. Tests were constructed in the form of the Farnsworth-Munsell style of arrangement test with constant intervals of metric lightness (CIE L*). Four intervals (ΔL* = 15, 10, 5, and 3) were used. The child was instructed to arrange the colors from darker to lighter (or vice versa). Errors were not made on the ΔL* = 15 series. Only isolated errors were made on the ΔL* = 10 series. Errors were made on the ΔL* = 5 series that diminished with age to nil in the older groups. Errors were made on the ΔL* = 3 series at all ages studied, which also diminished with increasing age. Children aged 5 to 12 have sufficiently grasped the concept of seriation. They are able to complete a series with ΔL* = 5, hence are capable of performing color arrangement tests with similar color differences such as the Lanthony New Color Test and the Farnsworth-Munsell D-15. Given the large number of errors made on the ΔL* = 3 series, it may be concluded that children's performance on the 100-hue test, at least to the age of 12 years, could be unduly influenced by non-color vision factors.
Testing Based on Understanding: Implications from Studies of Spatial Ability.

ERIC Educational Resources Information Center

Egan, Dennis E.

1979-01-01

The information-processing approach and results of research on spatial ability are analyzed. Performance consists of a sequence of distinct mental operations that seem general across subjects, and can be individually measured. New interpretations for some classical concepts in psychological testing and procedures for abilities are suggested.…
An alternative to the balance error scoring system: using a low-cost balance board to improve the validity/reliability of sports-related concussion balance testing.

PubMed

Chang, Jasper O; Levy, Susan S; Seay, Seth W; Goble, Daniel J

2014-05-01

Recent guidelines advocate sports medicine professionals to use balance tests to assess sensorimotor status in the management of concussions. The present study sought to determine whether a low-cost balance board could provide a valid, reliable, and objective means of performing this balance testing. Criterion validity testing relative to a gold standard and 7 day test-retest reliability. University biomechanics laboratory. Thirty healthy young adults. Balance ability was assessed on 2 days separated by 1 week using (1) a gold standard measure (ie, scientific grade force plate), (2) a low-cost Nintendo Wii Balance Board (WBB), and (3) the Balance Error Scoring System (BESS). Validity of the WBB center of pressure path length and BESS scores were determined relative to the force plate data. Test-retest reliability was established based on intraclass correlation coefficients. Composite scores for the WBB had excellent validity (r = 0.99) and test-retest reliability (R = 0.88). Both the validity (r = 0.10-0.52) and test-retest reliability (r = 0.61-0.78) were lower for the BESS. These findings demonstrate that a low-cost balance board can provide improved balance testing accuracy/reliability compared with the BESS. This approach provides a potentially more valid/reliable, yet affordable, means of assessing sports-related concussion compared with current methods.
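The validity figures in this abstract (r = 0.99 for the balance board, r = 0.10-0.52 for the BESS) are correlations between each measure and the force-plate reference. As a rough sketch, assuming these are ordinary Pearson correlations (the abstract does not say so explicitly), the statistic can be computed as follows. The function name and the sample values are hypothetical; the intraclass correlation used for test-retest reliability is a different statistic and is not shown.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length samples."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return sxy / math.sqrt(sxx * syy)


# Made-up path lengths (cm) from a reference device and a low-cost device.
reference = [20.1, 25.4, 31.0, 28.3, 35.7]
low_cost = [19.8, 25.9, 30.2, 28.9, 36.1]
validity_r = pearson_r(reference, low_cost)
```

A value close to 1 in such a comparison would indicate that the low-cost device ranks and scales participants much like the reference device does.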
Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

PubMed

Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a school district with 70 elementary schools (140 fifth grade classrooms) in the Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.
Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

ERIC Educational Resources Information Center

Lee, Guemin; Lee, Won-Chan

2016-01-01

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…
Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

ERIC Educational Resources Information Center

Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

2011-01-01

The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…
Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

ERIC Educational Resources Information Center

Klesch, Heather S.

2010-01-01

The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
The challenge of cross-cultural assessment--The Test of Ability To Explain for Zulu-speaking Children.

PubMed

Solarsh, Barbara; Alant, Erna

2006-01-01

A culturally appropriate test, The Test of Ability To Explain for Zulu-speaking Children (TATE-ZC), was developed to measure verbal problem solving skills of rural, Zulu-speaking, primary school children. Principles of 'non-biased' assessment, as well as emic (culture specific) and etic (universal) aspects of intelligence formed the theoretical backdrop. In addition, specific principles relating to test translation; test content; culturally appropriate stimulus material; scoring procedures and test administration were applied. Five categories of abstract thinking skills formed the basis of the TATE-ZC. These were: (a) Explaining Inferences, (b) Determining Cause, (c) Negative Why Questions, (d) Determining Solutions and (e) Avoiding Problem. The process of test development underwent three pilot studies. Results indicate that the TATE-ZC is a reliable and valid test for the target population. A critical analysis of the efficacy of creating a test of verbal reasoning for children from the developing world concludes the article. As a result of this activity (1) the participant will have a clearer understanding of the principles that need to be followed when developing culturally appropriate test material; (2) the participant will understand the process of developing culturally appropriate test material for non-mainstream cultures; (3) the participant will be able to apply the process and principles to other cross-cultural testing situations.
Joint Confirmatory Factor Analysis of the Differential Ability Scales and the "Woodcock-Johnson Tests of Cognitive Abilities--Third Edition"

ERIC Educational Resources Information Center

Sanders, Sarah; McIntosh, David E.; Dunham, Mardis; Rothlisberg, Barbara A.; Finch, Holmes

2007-01-01

This study examined the underlying constructs measured by the "Differential Ability Scales" ("DAS"; C.D. Elliott, 1990a) as they relate to the "Cattell-Horn-Carroll (CHC) Theory" (K.S. McGrew, 1997) of cognitive abilities. The "DAS" and "Woodcock-Johnson Tests of Cognitive Abilities" ("WJ-III COG"; R.W.Woodcock, K.S. McGrew, & N. Mather, 2001)…
Measurement of Nonverbal IQ in Autism Spectrum Disorder: Scores in Young Adulthood compared to Early Childhood

PubMed Central

Bishop, Somer L.; Farmer, Cristan; Thurm, Audrey

2014-01-01

Nonverbal IQ (NVIQ) was examined in 84 individuals with ASD followed from age 2 to 19. Most adults who scored in the range of ID also received scores below 70 as children, and the majority of adults with scores in the average range had scored in this range by age 3. However, within the lower ranges of ability, actual scores declined from age 2 to 19, likely due in part to limitations of appropriate tests. Use of Vineland-II DLS scores in place of NVIQ did not statistically improve the correspondence between age 2 and age 19 scores. Clinicians and researchers should use caution when making comparisons based on exact scores or specific ability ranges within or across individuals with ASD of different ages. PMID:25239176
Development of a Mathematical Ability Test: A Validity and Reliability Study

ERIC Educational Resources Information Center

Dündar, Sefa; Temel, Hasan; Gündüz, Nazan

2016-01-01

The identification of talented students accurately at an early age and the adaptation of the education provided to the students depending on their abilities are of great importance for the future of the countries. In this regard, this study aims to develop a mathematical ability test for the identification of the mathematical abilities of students…
Michael's Informal Test of Student Ability (M.I.T.O.S.A.). Tester's Manual.

ERIC Educational Resources Information Center

Grafius, Thomas M.

Michael's Informal Test of Student Ability (MITOSA) is a diagnostic evaluative tool for adult students designed to test nine skills abilities in adult students functioning below a tenth grade level. The nine test sections are approximate reading level, understanding of basic math concepts and symbols, general thinking/reasoning ability, eye-hand…

Clinical experience of scoring criteria for Familial Hypercholesterolaemia (FH) genetic testing in Wales.

PubMed

Haralambos, K; Whatley, S D; Edwards, R; Gingell, R; Townsend, D; Ashfield-Watt, P; Lansberg, P; Datta, D B N; McDowell, I F W

2015-05-01

Familial Hypercholesterolaemia (FH) is caused by mutations in genes of the Low Density Lipoprotein (LDL) receptor pathway. A definitive diagnosis of FH can be made by the demonstration of a pathogenic mutation. The Wales FH service has developed scoring criteria to guide selection of patients for DNA testing, for those referred to clinics with hypercholesterolaemia. The criteria are based on a modification of the Dutch Lipid Clinic scoring criteria and utilise a combination of lipid values, physical signs, personal and family history of premature cardiovascular disease. They are intended to provide clinical guidance and enable resources to be targeted in a cost effective manner. 623 patients who presented to lipid clinics across Wales had DNA testing following application of these criteria. The proportion of patients with a pathogenic mutation ranged from 4% in those scoring 5 or less up to 85% in those scoring 15 or more. LDL-cholesterol was the strongest discriminatory factor. Scores gained from physical signs, family history, coronary heart disease, and triglycerides also showed a gradient in mutation pick-up rate according to the score. These criteria provide a useful tool to guide selection of patients for DNA testing when applied by health professionals who have clinical experience of FH. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Increasing Racial Isolation and Test Score Gaps in Mathematics: A 30-Year Perspective

ERIC Educational Resources Information Center

Berends, Mark; Penaloza, Roberto V.

2010-01-01

Background/Context: Although there has been progress in closing the test score gaps among student groups over past decades, that progress has stalled. Many researchers have speculated why the test score gaps closed between the early 1970s and the early 1990s, but only a few have been able to empirically study how changes in school factors and…
Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

ERIC Educational Resources Information Center

Sakellariou, Chris

2008-01-01

This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…
The Uses and Misuses of Test Scores: Technical Assistance Perspective.

ERIC Educational Resources Information Center

Echternacht, Gary

The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…
Motivating High School Students to Score Proficient on State Tests

ERIC Educational Resources Information Center

Brown, Sarah Lee

2015-01-01

The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…
Age and gender differences in ability emotional intelligence in adults: A cross-sectional study.

PubMed

Cabello, Rosario; Sorrel, Miguel A; Fernández-Pinto, Irene; Extremera, Natalio; Fernández-Berrocal, Pablo

2016-09-01

The goal of the current investigation was to analyze ability emotional intelligence (EI) in a large cross-sectional sample of Spanish adults (N = 12,198; males, 56.56%) aged from 17 to 76 years (M = 37.71, SD = 12.66). Using the Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT), which measures ability EI according to the 4 branches of the Mayer and Salovey EI model, the authors examined effects of gender on ability EI, as well as the linear and quadratic effects of age. Results suggest that gender affects the total ability EI score as well as scores on the 4 EI branches. Ability EI was greater in women than men. Ability EI varied with age according to an inverted-U curve: Younger and older adults scored lower on ability EI than middle-aged adults, except for the branch of understanding emotions. These findings strongly support the idea that both gender and age significantly influence ability EI during aging. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
A Comparison of Standardized Achievement Test Scores on Right and Left Brain Dominant Fourth-Grade Students.

ERIC Educational Resources Information Center

Bell, Michael L.; Roubinek, Darrell L.

1989-01-01

Compares fourth-graders' subtest scores on the Stanford Achievement Test (SAT), the Iowa Test of Basic Skills (ITBS), and the Metropolitan Achievement Test (MAT). Finds right-brain dominant students scored better on four SAT subtests, and left-brain dominant students scored better on four ITBS subtests and two MAT subtests. (NH)
Increasing Confidence and Ability in Implementing Kangaroo Mother Care Method Among Young Mothers.

PubMed

Kenanga Purbasary, Eleni; Rustina, Yeni; Budiarti, Tri

Mothers giving birth to low birth weight babies (LBWBs) have low confidence in caring for their babies because they are often still young and may lack the knowledge, experience, and ability to care for the baby. This research aims to determine the effect of education about kangaroo mother care (KMC) on the confidence and ability of young mothers to implement KMC. The research methodology used was a controlled-random experimental approach with pre- and post-test equivalent groups of 13 mothers and their LBWBs in the intervention group and 13 mothers and their LBWBs in the control group. Data were collected via an instrument measuring young mothers' confidence, the validity and reliability of which have been tested with a resulting r value of .941, and an observation sheet on KMC implementation. After conducting the education, the confidence score of young mothers and their ability to perform KMC increased meaningfully. The score of confidence of young mothers before education was 37 (p = .1555), and the ability score for KMC implementation before education was 9 (p = .1555). The median score of confidence of young mothers after education in the intervention group was 87 and in the control group was 50 (p = .001, 95% CI 60.36-75.56), and the median ability score for KMC implementation after education in the intervention group was 16 and in the control group was 12 (p = .001, 95% CI 1.50-1.88). KMC education should be conducted gradually, and it is necessary to involve the family, in order for KMC implementation to continue at home. A family visit can be done for LBWBs to evaluate the ability of the young mothers to implement KMC.
Generation of GHS Scores from TEST and online sources ...

EPA Pesticide Factsheets

Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat toxicity, developmental toxicity, endocrine activity, and mutagenicity. It can be used to evaluate ecotoxicity (in terms of acute fathead minnow toxicity) and fate (in terms of bioconcentration factor). It can also be used to estimate a variety of key physicochemical properties such as melting point, boiling point, vapor pressure, water solubility, and bioconcentration factor. A web-based version of T.E.S.T. is currently being developed to allow predictions to be made from other web tools. Online data sources such as from NCCT’s Chemistry Dashboard, REACH dossiers, or from ChemHat.org can also be utilized to obtain GHS (Global Harmonization System) scores for comparing alternatives. The purpose of this talk is to show how GHS (Global Harmonization Score) data can be obtained from literature sources and from T.E.S.T. (Toxicity Estimation Software Tool). This data will be used to compare chemical alternatives in the alternatives assessment dashboard (a 2018 CSS product).
Is education associated with improvements in general cognitive ability, or in specific skills?

PubMed

Ritchie, Stuart J; Bates, Timothy C; Deary, Ian J

2015-05-01

Previous research has indicated that education influences cognitive development, but it is unclear what, precisely, is being improved. Here, we tested whether education is associated with cognitive test score improvements via domain-general effects on general cognitive ability (g), or via domain-specific effects on particular cognitive skills. We conducted structural equation modeling on data from a large (n = 1,091), longitudinal sample, with a measure of intelligence at age 11 years and 10 tests covering a diverse range of cognitive abilities taken at age 70. Results indicated that the association of education with improved cognitive test scores is not mediated by g, but consists of direct effects on specific cognitive skills. These results suggest a decoupling of educational gains from increases in general intellectual capacity. (c) 2015 APA, all rights reserved.
High Test Scores: The Wrong Road to National Economic Success

ERIC Educational Resources Information Center

Baker, Keith

2011-01-01

A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…
Sex Differences in Latent Cognitive Abilities Ages 6 to 59: Evidence from the Woodcock-Johnson III Tests of Cognitive Abilities

ERIC Educational Resources Information Center

Keith, Timothy Z.; Reynolds, Matthew R.; Patel, Puja G.; Ridley, Kristen P.

2008-01-01

Sex differences in the latent general and broad cognitive abilities underlying the Woodcock-Johnson Tests of Cognitive Abilities were investigated for children, youth, and adults ages 6 through 59. A developmental, multiple indicator-multiple cause, structural equation model was used to investigate sex differences in latent cognitive abilities as…
Assessment of nonscholastic abilities and its associated factors among medical students: An exploratory study

PubMed Central

Kumar, S. Ganesh; Sarkar, Sonali

2017-01-01

Background: Nonscholastic abilities among medical students are an important area of concern for health professionals. Very few studies have been conducted on this topic. Objective: This exploratory study aimed to assess the nonscholastic abilities of medical students in a medical institution in coastal South India. Materials and Methods: This study assessed three broad domains of nonscholastic abilities, namely personal qualities, interpersonal activities, and communication skills, among 106 medical students using a structured questionnaire (27 questions with a total score of 27). The data were analyzed by independent t-test and linear regression model. Results: About 41.5% (44) of the subjects were male and 52.8% (56) belonged to the 18–19 years age group. The overall mean score of nonscholastic abilities was 19.40 (standard deviation = 3.27). The 25th, 50th, and 75th percentiles of the score distribution were 17, 20, and 22, respectively. The mean personal quality domain score was proportionately lower than those of the other domains of nonscholastic abilities. Nonscholastic ability score was significantly associated with marks obtained in the previous examination (P = 0.006). However, linear regression analysis revealed that the presence of family problems (P = 0.005) and alcohol use (P = 0.026) were associated with low nonscholastic ability score among medical students. Conclusion: Nonscholastic abilities remain an important need in a medical student's career. Further analytical studies will help to evaluate the associated factors in depth. PMID:28546968
Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

ERIC Educational Resources Information Center

La Marca, Paul M.

2006-01-01

Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…
Relationships between spatial activities and scores on the mental rotation test as a function of sex.

PubMed

Ginn, Sheryl R; Pickens, Stefanie J

2005-06-01

Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.
The value of Bayes' theorem for interpreting abnormal test scores in cognitively healthy and clinical samples.

PubMed

Gavett, Brandon E

2015-03-01

The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed.
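The Bayes' theorem step described in this abstract can be illustrated with a minimal sketch. It uses only the two base rates quoted above (65.4% and 85.6% of examinees with at least one subtest below a scaled score of 7); the prior probabilities of normal cognition are hypothetical values chosen for illustration, and the function name is not from the paper.

```python
def posterior_normal(prior_normal, p_abn_given_normal, p_abn_given_impaired):
    """P(normal cognition | at least one abnormal score), by Bayes' theorem."""
    numerator = p_abn_given_normal * prior_normal
    denominator = numerator + p_abn_given_impaired * (1.0 - prior_normal)
    return numerator / denominator


# Base rates quoted in the abstract (at least one subtest scaled score < 7).
P_ABN_STANDARDIZATION = 0.654   # cognitively healthy (standardization) sample
P_ABN_MIXED_CLINICAL = 0.856    # mixed clinical sample

# Hypothetical prevalence values of normal cognition; not taken from the paper.
for prior in (0.50, 0.80, 0.95):
    post = posterior_normal(prior, P_ABN_STANDARDIZATION, P_ABN_MIXED_CLINICAL)
    print(f"prior P(normal) = {prior:.2f} -> posterior P(normal) = {post:.3f}")
```

Because abnormal scores are common in both samples, observing a single abnormal score lowers the probability of normal cognition only modestly under any of these priors.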
Posttraumatic Stress Disorder and Standardized Test-Taking Ability

ERIC Educational Resources Information Center

Rutkowski, Leslie; Vasterling, Jennifer J.; Proctor, Susan P.; Anderson, Carolyn J.

2010-01-01

Given the widespread use and high-stakes nature of educational standardized assessments, understanding factors that affect test-taking ability in young adults is vital. Although scholarly attention has often focused on demographic factors (e.g., gender and race), sufficiently prevalent acquired characteristics may also help explain widespread…
Superitem Test: An Alternative Assessment Tool to Assess Students' Algebraic Solving Ability

ERIC Educational Resources Information Center

Lian, Lim Hooi; Yew, Wun Thiam; Idris, Noraini

2010-01-01

Superitem test based on the SOLO model (Structure of the Observing Learning Outcome) has become a powerful alternative assessment tool for monitoring the growth of students' cognitive ability in solving mathematics problems. This article focused on developing a superitem test to assess students' algebraic solving ability through interview method.…
The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

ERIC Educational Resources Information Center

Shaw, Tommetta

2010-01-01

Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…
More than Just Test Scores

ERIC Educational Resources Information Center

Levin, Henry M.

2012-01-01

Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…

Predictive effects of teachers and schools on test scores, college attendance, and earnings

PubMed Central

Chamberlain, Gary E.

2013-01-01

I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference. PMID:24101492
Predictive effects of teachers and schools on test scores, college attendance, and earnings.

PubMed

Chamberlain, Gary E

2013-10-22

I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference.
Scoring severity in trauma: comparison of prehospital scoring systems in trauma ICU patients.

PubMed

Llompart-Pou, J A; Chico-Fernández, M; Sánchez-Casado, M; Salaberria-Udabe, R; Carbayo-Górriz, C; Guerrero-López, F; González-Robledo, J; Ballesteros-Sanz, M Á; Herrán-Monge, R; Servià-Goixart, L; León-López, R; Val-Jordán, E

2017-06-01

We evaluated the predictive ability of mechanism, Glasgow coma scale, age and arterial pressure (MGAP), Glasgow coma scale, age and systolic blood pressure (GAP), and triage-revised trauma score (T-RTS) scores in patients from the Spanish trauma ICU registry using the trauma and injury severity score (TRISS) as a reference standard. Patients admitted for traumatic disease in the participating ICU were included. Quantitative data were reported as median [interquartile range (IQR)], categorical data as number (percentage). Comparisons between groups with quantitative variables and categorical variables were performed using Student's t-test and the chi-square test, respectively. We constructed receiver operating characteristic (ROC) curves and evaluated the area under the curve (AUC) with its 95 % confidence interval (CI). Sensitivity, specificity, positive predictive and negative predictive values and accuracy were evaluated in all the scores. A value of p < 0.05 was considered significant. The final sample included 1361 trauma ICU patients. Median age was 45 (30-61) years. 1092 patients (80.3 %) were male. Median ISS was 18 (13-26) and median T-RTS was 11 (10-12). Median GAP was 20 (15-22) and median MGAP 24 (20-27). Observed mortality was 17.7 % whilst predicted mortality using TRISS was 16.9 %. The AUC in the scores evaluated was: TRISS 0.897 (95 % CI 0.876-0.918), MGAP 0.860 (95 % CI 0.835-0.886), GAP 0.849 (95 % CI 0.823-0.876) and T-RTS 0.796 (95 % CI 0.762-0.830). Both MGAP and GAP scores performed better than the T-RTS in the prediction of hospital mortality in Spanish trauma ICU patients. Since these are easy-to-perform scores, they should be incorporated in clinical practice as a triaging tool.
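As a rough illustration of the AUC statistic used to compare these scores, the sketch below computes it from its pairwise definition: the probability that a randomly chosen non-survivor receives a higher predicted risk than a randomly chosen survivor, with ties counted as one half. The data are synthetic, the function name is hypothetical, and the code assumes that higher values mean higher risk; a score oriented the other way would need to be negated first.

```python
def auc(risk, died):
    """AUC from its pairwise (Mann-Whitney) definition; ties count as 0.5."""
    pos = [r for r, d in zip(risk, died) if d]        # non-survivors
    neg = [r for r, d in zip(risk, died) if not d]    # survivors
    if not pos or not neg:
        raise ValueError("need at least one death and one survivor")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Synthetic predicted risks and outcomes, for illustration only.
risk = [0.90, 0.80, 0.35, 0.60, 0.20, 0.10]
died = [1, 1, 1, 0, 0, 0]
print(f"AUC = {auc(risk, died):.3f}")
```

An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation of survivors from non-survivors.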
Controlling Item Exposure Conditional on Ability in Computerized Adaptive Testing.

ERIC Educational Resources Information Center

Stocking, Martha L.; Lewis, Charles

1998-01-01

Ensuring item and pool security in a continuous testing environment is explored through a new method of controlling exposure rate of items conditional on ability level in computerized testing. Properties of this conditional control on exposure rate, when used in conjunction with a particular adaptive testing algorithm, are explored using simulated…
Background Variables, Levels of Aggregation, and Standardized Test Scores

ERIC Educational Resources Information Center

Paulson, Sharon E.; Marchant, Gregory J.

2009-01-01

This article examines the role of student demographic characteristics in standardized achievement test scores at both the individual level and aggregated at the state, district, school levels. For several data sets, the majority of the variance among states, districts, and schools was related to demographic characteristics. Where these background…
Compensation or inhibitory failure? Testing hypotheses of age-related right frontal lobe involvement in verbal memory ability using structural and diffusion MRI

PubMed Central

Cox, Simon R.; Bastin, Mark E.; Ferguson, Karen J.; Allerhand, Mike; Royle, Natalie A.; Maniega, Susanna Muñoz; Starr, John M.; MacLullich, Alasdair M.J.; Wardlaw, Joanna M.; Deary, Ian J.; MacPherson, Sarah E.

2015-01-01

Functional neuroimaging studies report increased right prefrontal cortex (PFC) involvement during verbal memory tasks amongst low-scoring older individuals, compared to younger controls and their higher-scoring contemporaries. Some propose that this reflects inefficient use of neural resources through failure of the left PFC to inhibit non-task-related right PFC activity, via the anterior corpus callosum (CC). For others, it indicates partial compensation – that is, the right PFC cannot completely supplement the failing neural network, but contributes positively to performance. We propose that combining structural and diffusion brain MRI can be used to test predictions from these theories which have arisen from fMRI studies. We test these hypotheses in immediate and delayed verbal memory ability amongst 90 healthy older adults of mean age 73 years. Right hippocampus and left dorsolateral prefrontal cortex (DLPFC) volumes, and fractional anisotropy (FA) in the splenium made unique contributions to verbal memory ability in the whole group. There was no significant effect of anterior callosal white matter integrity on performance. Rather, segmented linear regression indicated that right DLPFC volume was a significantly stronger positive predictor of verbal memory for lower-scorers than higher-scorers, supporting a compensatory explanation for the differential involvement of the right frontal lobe in verbal memory tasks in older age. PMID:25241394
What's in a Teacher Test? Assessing the Relationship between Teacher Test Scores and Student Secondary STEM Achievement. CEDR Working Paper. WP #2016-4

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the predictive validity of teacher credential test scores for student performance in secondary STEM classrooms in Washington state. After replicating earlier findings that teacher basic skills licensure test scores are a modest and statistically significant predictor of student math test score gains in elementary grades, we focus on…
Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

PubMed Central

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643
CognitiveGenesis (CG): Assessing Academic Achievement and Cognitive Ability in Adventist Schools

ERIC Educational Resources Information Center

Thayer, Jerome; Kido, Elissa

2012-01-01

CognitiveGenesis collected achievement and ability test data from 2006-2009 for all students in Seventh-day Adventist schools in North America. Students were above average in achievement compared to national norms and achieved above that predicted by their ability scores. The more years students attended Adventist schools, the higher they…
Can Percentiles Replace Raw Scores in the Statistical Analysis of Test Data?

ERIC Educational Resources Information Center

Zimmerman, Donald W.; Zumbo, Bruno D.

2005-01-01

Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…
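The textbook warning mentioned in this abstract rests on the fact that percentiles are a rank-based transformation of raw scores. A minimal sketch, assuming the common mid-rank definition of a percentile rank (the abstract does not specify a definition) and using made-up scores:

```python
def percentile_ranks(scores):
    """Mid-rank percentile rank of each score within its own sample."""
    n = len(scores)
    ranks = []
    for s in scores:
        below = sum(1 for x in scores if x < s)
        equal = sum(1 for x in scores if x == s)
        # Count half of the tied scores (including s itself) as "below".
        ranks.append(100.0 * (below + 0.5 * equal) / n)
    return ranks


raw = [10, 20, 20, 40]   # made-up raw scores
print(list(zip(raw, percentile_ranks(raw))))
```

In this toy sample the raw gaps of 10 and 20 points both map to a gap of 37.5 percentile points, which shows that the transformation keeps order but not distances.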
Effect of ice massage on lower extremity functional performance and weight discrimination ability in collegiate footballers.

PubMed

Sharma, Geeta; Noohu, Majumi Mohamad

2014-09-01

Cryotherapy, in the form of ice massage, is used to reduce inflammation after acute musculoskeletal injury or trauma. The potential negative effects of ice massage on proprioception are unknown, despite equivocal evidence supporting its effectiveness. The purpose of the study was to test the influence of cooling on weight discrimination ability and hence the performance in footballers. The study was of same subject experimental design (pretest-posttest design). Thirty male collegiate football players, whose mean age was 21.07 years, participated in the study. The participants were assessed for two functional performance tests, single leg hop test and crossed over hop test, and for weight discrimination ability before and after ice massage for 5 minutes on the hamstrings muscle tendon. Pre cooling scores of Single Leg Hop Test of the dominant leg in the subjects was 166.65 (± 10.16) cm and post cooling scores of the dominant leg was 167.25 (± 11.77) cm. Pre cooling scores of Crossed Over Hop Test of the dominant leg in the subjects was 174.14 (± 8.60) cm and post cooling scores of the dominant leg was 174.45 (± 9.28) cm. Pre cooling scores of Weight Discrimination Differential Threshold of the dominant leg in the subjects was 1.625 (± 1.179) kg compared with post cooling scores of the dominant leg 1.85 (± 1.91) kg. Pre cooling scores of single leg hop and crossed over hop test of the dominant leg in the subjects compared with post cooling scores of the dominant leg showed no significant differences, and the weight discrimination ability (weight discrimination differential threshold) also did not show any significant difference. All the values are reported as mean ± SD. This study provides additional evidence that proprioceptive acuity in the hamstring muscles (biceps femoris) remains largely unaffected after ice application to the hamstrings tendon (biceps femoris).
Lifestyle index and work ability.

PubMed

Kaleta, Dorota; Makowiec-Dabrowska, Teresa; Jegier, Anna

2006-01-01

In many countries around the world, negative changes in lifestyles are observed. The aim of this study was to analyze the influence of selected lifestyle indicators on work ability among professionally active individuals. The study was performed in a randomly selected group of full-time employees (94 men and 93 women) living in the city of Łódź. Work ability was measured with the work ability index and lifestyle characteristics were assessed with the healthy lifestyle index. We analyzed four lifestyle indicators: non-smoking, healthy weight, fiber intake per day, and regular physical activity. Logistic regression was used to estimate odds ratios and 95% confidence intervals to control the effects of lifestyle and work ability. The analysis of lifestyle index indicated that 27.7, 30.9, 27.7 and 11.7% of men and 15.1, 21.5, 35.5 and 26.9% of women scored 0, 1, 2, 3 points, respectively. Only 2.1% of men and 1.1% of women met the criteria for the healthy lifestyle (score 4). Work ability was excellent, good and moderate in 38.3, 46.8 and 14.9% of men, and in 39.8, 14.9 and 19.3% of women, respectively. Poor work ability was found in 9.7% of women. Work ability was strongly associated with lifestyle in both men and women. Among men with index score = 0, the risk of moderate work ability was nearly seven times higher than in men whose lifestyle index score was 1 or more points (OR = 6.67; 95% CI: 1.94-22.90). Among women with lifestyle index score = 0, the risk of moderate or lower work ability was also highly elevated as compared to those with lifestyle index = 1 or higher (OR = 14.44; 95% CI: 3.53-59.04). Prophylactic schedules associated with the improvement of lifestyles should be addressed to all adults. Future programs aimed at increasing work ability should consider work- and lifestyle-related factors.
Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

ERIC Educational Resources Information Center

Farnsworth, Tim

2013-01-01

This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…
Emotion Recognition Ability: A Multimethod-Multitrait Study.

ERIC Educational Resources Information Center

Gaines, Margie; And Others

A common paradigm in measuring the ability to recognize facial expressions of emotion is to present photographs of facial expressions and to ask subjects to identify the emotion. The Affect Blend Test (ABT) uses this method of assessment and is scored for accuracy on specific affects as well as total accuracy. Another method of measuring affect…
Deviation from expected cognitive ability across psychotic disorders.

PubMed

Hochberger, W C; Combs, T; Reilly, J L; Bishop, J R; Keefe, R S E; Clementz, B A; Keshavan, M S; Pearlson, G D; Tamminga, C A; Hill, S K; Sweeney, J A

2018-02-01

Patients with schizophrenia show a deficit in cognitive ability compared to estimated premorbid and familial intellectual abilities. However, the degree to which this pattern holds across psychotic disorders and is familial is unclear. The present study examined deviation from expected cognitive level in schizophrenia, schizoaffective disorder, and psychotic bipolar disorder probands and their first-degree relatives. Using a norm-based regression approach, parental education and WRAT-IV Reading scores (both significant predictors of cognitive level in the healthy control group) were used to predict global neuropsychological function as measured by the composite score from the Brief Assessment of Cognition in Schizophrenia (BACS) test in probands and relatives. When compared to healthy control group, psychotic probands showed a significant gap between observed and predicted BACS composite scores and a greater likelihood of robust cognitive decline. This effect was not seen in unaffected relatives. While BACS and WRAT-IV Reading scores were themselves highly familial, the decline in cognitive function from expectation had lower estimates of familiality. Thus, illness-related factors such as epigenetic, treatment, or pathophysiological factors may be important causes of illness related decline in cognitive abilities across psychotic disorders. This is consistent with the markedly greater level of cognitive impairment seen in affected individuals compared to their unaffected family members. Copyright © 2017 Elsevier B.V. All rights reserved.
Loss of ability to work and ability to live independently in Parkinson's disease.

PubMed

Jasinska-Myga, Barbara; Heckman, Michael G; Wider, Christian; Putzke, John D; Wszolek, Zbigniew K; Uitti, Ryan J

2012-02-01

Ability to work and live independently is of particular concern for patients with Parkinson's disease (PD). We studied a series of PD patients able to work or live independently at baseline, and evaluated potential risk factors for two separate outcomes: loss of ability to work and loss of ability to live independently. The series comprised 495 PD patients followed prospectively. Ability to work and ability to live independently were based on clinical interview and examination. Cox regression models adjusted for age and disease duration were used to evaluate associations of baseline characteristics with loss of ability to work and loss of ability to live independently. Higher UPDRS dyskinesia score, UPDRS instability score, UPDRS total score, Hoehn and Yahr stage, and presence of intellectual impairment at baseline were all associated with increased risk of future loss of ability to work and loss of ability to live independently (P ≤ 0.0033). Five years after initial visit, for patients ≤70 years of age with a disease duration ≤4 years at initial visit, 88% were still able to work and 90% to live independently. These estimates worsened as age and disease duration at initial visit increased; for patients >70 years of age with a disease duration >4 years, estimates at 5 years were 43% able to work and 57% able to live independently. The information provided in this study can offer useful information for PD patients in preparing for future ability to perform activities of daily living. Copyright © 2011 Elsevier Ltd. All rights reserved.
A knowledge-based theory of rising scores on "culture-free" tests.

PubMed

Fox, Mark C; Mitchum, Ainsley L

2013-08-01

Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.
Toy-playing behavior, sex-role orientation, spatial ability, and science achievement

NASA Astrophysics Data System (ADS)

Tracy, Dyanne M.

The purpose of this correlational study was to examine the possible relationships among children's extracurricular toy-playing habits, sex-role orientations, spatial abilities, and science achievement. Data were gathered from 282 midwestern, suburban, fifth-grade students. It was found that boys had significantly higher spatial skills than girls. No significant differences in spatial ability were found among students with different sex-role orientations. No significant differences in science achievement were found between girls and boys, or among students with the four different sex-role orientations. Students who had high spatial ability also had significantly higher science achievement scores than students with low spatial ability. Femininely oriented boys who reported low playing in the two-dimensional, gross-body-movement, and proportional-arrangement toy categories scored significantly higher on the test of science achievement than girls with the same sex-role and toy-playing behavior.
A Latent Class Approach to Estimating Test-Score Reliability

ERIC Educational Resources Information Center

van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

2011-01-01

This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
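Of the single-administration methods named in this abstract, Cronbach's alpha is the simplest to show. The sketch below applies the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), to a small made-up score matrix. It illustrates alpha only, not the latent class reliability approach the study derives, and the function name and data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a persons-by-items matrix (list of equal-length rows)."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    item_columns = list(zip(*rows))                       # one tuple per item
    sum_item_var = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in rows])      # variance of sum scores
    return k / (k - 1) * (1.0 - sum_item_var / total_var)


# Made-up responses: 5 test takers, 3 items scored 0-4.
responses = [
    [4, 3, 4],
    [2, 2, 3],
    [3, 3, 3],
    [1, 0, 1],
    [0, 1, 1],
]
print(f"alpha = {cronbach_alpha(responses):.3f}")
```

Sample variances (n - 1 denominator) are used for both terms; using population variances throughout gives the same alpha because the factor cancels in the ratio.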
Experiential Awareness of the Effects of Test Score Reports.

ERIC Educational Resources Information Center

Bender, Robert C.

Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…

What We Lose in Winning the Test Score Race

ERIC Educational Resources Information Center

Jorgenson, Olaf

2012-01-01

To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools…
Benefits of Coaching on Test Scores Seen as Negligible.

ERIC Educational Resources Information Center

Report on Education Research, 1983

1983-01-01

THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write…
Structured didactic teaching sessions improve medical student neurology clerkship test scores: a pilot study.

PubMed

Menkes, Daniel L; Reed, Mary

2008-01-01

Objective: To determine the effectiveness of didactic case-based instruction methodology to improve medical student comprehension of common neurological illnesses and neurological emergencies. Setting: Neurology department, academic university. Participants: 415 third and fourth year medical students performing a required four week neurology clerkship. Main outcome measures: Raw test scores on a 1 hour, 50-item clinical vignette based examination and open-ended questions in a post-clerkship feedback session. Results: There was a statistically significant improvement in overall test scores (p<0.001). Conclusions: Didactic teaching sessions have a significant positive impact on neurology student clerkship test score performance and perception of their educational experience. Confirmation of these results across multiple specialties in a multi-center trial is warranted.
Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

ERIC Educational Resources Information Center

Livingston, Samuel A.; Chen, Haiwen H.

2015-01-01

Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
Computerized scoring algorithms for the Autobiographical Memory Test.

PubMed

Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

2018-02-01

Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Implicit theories and ability emotional intelligence

PubMed Central

Cabello, Rosario; Fernández-Berrocal, Pablo

2015-01-01

Previous research has shown that people differ in their implicit theories about the essential characteristics of intelligence and emotions. Some people believe these characteristics to be predetermined and immutable (entity theorists), whereas others believe that these characteristics can be changed through learning and behavior training (incremental theorists). The present study provides evidence that in healthy adults (N = 688), implicit beliefs about emotions and emotional intelligence (EI) may influence performance on the ability-based Mayer-Salovey-Caruso Emotional Intelligence Test (MSCEIT). Adults in our sample with incremental theories about emotions and EI scored higher on the MSCEIT than entity theorists, with implicit theories about EI showing a stronger relationship to scores than theories about emotions. Although our participants perceived both emotion and EI as malleable, they viewed emotions as more malleable than EI. Women and young adults in general were more likely to be incremental theorists than men and older adults. Furthermore, we found that emotion and EI theories mediated the relationship of gender and age with ability EI. Our findings suggest that people’s implicit theories about EI may influence their emotional abilities, which may have important consequences for personal and professional EI training. PMID:26052309
Age-related invariance of abilities measured with the Wechsler Adult Intelligence Scale-IV.

PubMed

Sudarshan, Navaneetham J; Bowden, Stephen C; Saklofske, Donald H; Weiss, Lawrence G

2016-11-01

Assessment of measurement invariance across populations is essential for meaningful comparison of test scores, and is especially relevant where repeated measurements are required for educational assessment or clinical diagnosis. Establishing measurement invariance legitimizes the assumption that test scores reflect the same psychological trait in different populations or across different occasions. Examination of Wechsler Adult Intelligence Scale-Fourth Edition (WAIS-IV) U.S. standardization samples revealed that a first-order 5-factor measurement model was best fitting across 9 age groups from 16 years to 69 years. Strong metric invariance was found for 3 of 5 factors and partial intercept invariance for the remaining 2. Pairwise comparisons of adjacent age groups supported the inference that cognitive-trait group differences are manifested by group differences in the test scores. In educational and clinical settings these findings provide theoretical and empirical support to interpret changes in the index or subtest scores as reflecting changes in the corresponding cognitive abilities. Further, where clinically relevant, the subtest score composites can be used to compare changes in respective cognitive abilities. The model was supported in the Canadian standardization data with pooled age groups but the sample sizes were not adequate for detailed examination of separate age groups in the Canadian sample. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
A seven-year follow-up of intelligence test scores of foster grandparents.

PubMed

Troll, L E; Saltz, R; Dunin-Markiewicz, A

1976-09-01

After 7 years, a group of originally nonemployed poverty-level older people (over 60) who had been employed as foster grandparents were retested with the WAIS. Four WAIS subtests - Vocabulary, Similarities, Digit Span, and Block Design - were employed. Of the original group of 39, complete data were available for 28; 18 of these were still working on the project, and the other 10 had dropped out. Dropouts as a group tested lower originally and also showed more deterioration in functional health ratings over time. For the total group of 32 foster grandparents, three subtest scores showed stability over the 7 years. Only Digit Span showed a statistically significant drop. Neither age nor the initial level of health or WAIS scores was related to test-score changes over time.
Robust joint score tests in the application of DNA methylation data analysis.

PubMed

Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang

2018-05-18

Recently, differential variability has been shown to be valuable in evaluating the association of DNA methylation with the risks of complex human diseases. Statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that the three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against violation of the normality assumption and the presence of outlying observations in comparison with the other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.
External validation of the simple clinical score and the HOTEL score, two scores for predicting short-term mortality after admission to an acute medical unit.

PubMed

Stræde, Mia; Brabrand, Mikkel

2014-01-01

Clinical scores can help predict early mortality after admission to a medical admission unit. A developed scoring system needs to be externally validated to minimise the risk that its discriminatory power and calibration are falsely elevated. We performed the present study with the objective of validating the Simple Clinical Score (SCS) and the HOTEL score, two existing risk stratification systems that predict mortality for medical patients based solely on clinical information, but not only vital signs. Pre-planned prospective observational cohort study. Danish 460-bed regional teaching hospital. We included 3046 consecutive patients from 2 October 2008 until 19 February 2009. 26 (0.9%) died within one calendar day and 196 (6.4%) died within 30 days. We calculated SCS for 1080 patients. We found an AUROC of 0.960 (95% confidence interval [CI], 0.932 to 0.988) for 24-hour mortality and 0.826 (95% CI, 0.774-0.879) for 30-day mortality, and goodness-of-fit test, χ² = 2.68 (10 degrees of freedom), P = 0.998 and χ² = 4.00, P = 0.947, respectively. We included 1470 patients when calculating the HOTEL score. Discriminatory power (AUROC) was 0.931 (95% CI, 0.901-0.962) for 24-hour mortality and goodness-of-fit test, χ² = 5.56 (10 degrees of freedom), P = 0.234. We find that both the SCS and HOTEL scores showed an excellent to outstanding ability in identifying patients at high risk of dying with good or acceptable precision.
Side-to-side difference in dynamic unilateral balance ability and pitching performance in Japanese collegiate baseball pitchers.

PubMed

Yanagisawa, Osamu; Futatsubashi, Genki; Taniguchi, Hidenori

2018-01-01

[Purpose] To evaluate the side-to-side difference in dynamic unilateral balance ability and to determine the correlation of the balance ability with pitching performance in collegiate baseball pitchers. [Subjects and Methods] Twenty-five Japanese collegiate baseball pitchers participated in this study. Dynamic balance ability during a unilateral stance was bilaterally evaluated using the star excursion balance test (SEBT). The pitchers threw 20 fastballs at an official pitching distance; the maximal ball velocity and pitching accuracy (the number of strike/20 pitches × 100) were assessed. Side-to-side difference in scores of SEBT was assessed using a paired t-test. Correlations between SEBT scores and pitching performance were evaluated for both legs using a Pearson's correlation analysis. [Results] The pivot side showed significantly higher score of the SEBT in the anteromedial direction than the stride side. On the other hand, the SEBT scores in the pivot and stride legs did not have significant correlations with maximal ball velocity and pitching accuracy. [Conclusion] These findings suggest that marked side-to-side difference does not exist in the dynamic unilateral balance ability of collegiate baseball pitchers and that the dynamic unilateral balance ability of each leg is not directly related to maximal ball velocity and pitching accuracy.
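The two analyses named in this abstract, a paired t-test for the side-to-side difference and Pearson's correlation for the link to pitching performance, can be sketched as follows. This is a minimal illustration and not the authors' code: the function names and the sample values are hypothetical, and only the test statistics are computed, not p-values.

```python
import math
import statistics


def paired_t(x, y):
    """Return (t statistic, degrees of freedom) for paired samples x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two paired samples with at least 2 observations")
    diffs = [a - b for a, b in zip(x, y)]
    sd_d = statistics.stdev(diffs)  # sample SD of the within-pair differences
    if sd_d == 0:
        raise ValueError("t is undefined when all differences are identical")
    t = statistics.fmean(diffs) / (sd_d / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


def pearson_r(x, y):
    """Return Pearson's product-moment correlation between x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two samples of equal length (at least 2)")
    mx, my = statistics.fmean(x), statistics.fmean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical SEBT scores for five pitchers (not data from the study).
pivot = [80, 82, 78, 85, 90]
stride = [78, 81, 75, 84, 86]
velocity = [130, 128, 135, 131, 129]  # hypothetical ball velocities

t_stat, df = paired_t(pivot, stride)
r_pivot = pearson_r(pivot, velocity)
```

In practice a statistics library such as SciPy (`scipy.stats.ttest_rel`, `scipy.stats.pearsonr`) would also return the p-values needed to judge significance.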
Treatment for Schistosoma japonicum, Reduction of Intestinal Parasite Load, and Cognitive Test Score Improvements in School-Aged Children

PubMed Central

Ezeamama, Amara E.; McGarvey, Stephen T.; Hogan, Joseph; Lapane, Kate L.; Bellinger, David C.; Acosta, Luz P.; Leenstra, Tjalling; Olveda, Remigio M.; Kurtis, Jonathan D.; Friedman, Jennifer F.

2012-01-01

learning test independent of schistosome infection. Hookworm and Trichuris trichiura declines were independently associated with improvements in WRAML memory scores as was the joint decline in ≥2 STH species. Baseline coinfection by ≥2 STH species was associated with low PNIT scores (β = −1.9; P = 0.04). Conclusion/Significance: Children cured/S. japonicum-free for >12 months post-treatment and those who experienced declines of ≥2 STH species scored higher in three of four cognitive tests. Our result suggests that sustained deworming and simultaneous control for schistosome and STH infections could improve children's ability to take advantage of educational opportunities in helminth-endemic regions. PMID:22563514
The reliability and validity of the Danish Draft Board Cognitive Ability Test: Børge Prien's Prøve.

PubMed

Teasdale, Thomas W; Hartmann, Peter V W; Pedersen, Christoffer H; Bertelsen, Mette

2011-04-01

The Danish Draft Board has used the same test for assessing general cognitive ability, the Børge Prien's Prøve (BPP), for over 50 years during which time all men on reaching the age of 18 become liable for conscription. Data from the test has, over the decades, been used in numerous and wide-ranging research studies. Nonetheless, owing to the special circumstances of its administration, some psychometric properties, which are generally assessed for psychological tests, have not previously been investigated for the BPP. First, since the test is only used at the assessment phase, retesting with the BPP occurs only rarely and under exceptional circumstances. Therefore, its Test-Retest reliability has hitherto not been documented. Second, questions have often been raised as to whether the validity of the BPP is undermined by either a lack of motivation and under-performing among some of the men taking the test, being, as they are, compelled to do so, and/or by gradual obsolescence of the test over the decades of its use. We here present findings from three new studies to show that (a) the BPP has a satisfactory Test-Retest reliability, r=0.77, (b) BPP test scores are not positively associated with expressed attitude to being called upon to serve conscription and (c) the correlation between the BPP and a measure of educational level has remained stable (at about 0.5) through the last two decades. Taken together these three findings further support the continuing value of the BPP in research relating to cognitive ability. © 2010 The Authors. Scandinavian Journal of Psychology © 2010 The Scandinavian Psychological Associations.
The ability of video image analysis to predict lean meat yield and EUROP score of lamb carcasses.

PubMed

Einarsson, E; Eythórsdóttir, E; Smith, C R; Jónmundsson, J V

2014-07-01

A total of 862 lamb carcasses that were evaluated by both the VIAscan® and the current EUROP classification system were deboned and the actual yield was measured. Models were derived for predicting lean meat yield of the legs (Leg%), loin (Loin%) and shoulder (Shldr%) using the best VIAscan® variables selected by stepwise regression analysis of a calibration data set (n=603). The equations were tested on a validation data set (n=259). The results showed that the VIAscan® predicted lean meat yield in the leg, loin and shoulder with an R² of 0.60, 0.31 and 0.47, respectively, whereas the current EUROP system predicted lean yield with an R² of 0.57, 0.32 and 0.37, respectively, for the three carcass parts. The VIAscan® also predicted the EUROP score of the trial carcasses, using a model derived from an earlier trial. The EUROP classification from VIAscan® and the current system were compared for their ability to explain the variation in lean yield of the whole carcass (LMY%) and trimmed fat (FAT%). The predicted EUROP scores from the VIAscan® explained 36% of the variation in LMY% and 60% of the variation in FAT%, compared with the current EUROP system that explained 49% and 72%, respectively. The EUROP classification obtained by the VIAscan® was tested against a panel of three expert classifiers (n=696). The VIAscan® classification agreed with 82% of conformation and 73% of the fat classes assigned by a panel of expert classifiers. It was concluded that VIAscan® provides a technology that can directly predict LMY% of lamb carcasses with more accuracy than the current EUROP classification system. The VIAscan® is also capable of classifying lamb carcasses into EUROP classes with an accuracy that fulfils minimum demands for the Icelandic sheep industry. Although the VIAscan® prediction of the Loin% is low, it is comparable to the current EUROP system, and should not hinder the adoption of the technology to estimate the yield of Icelandic lambs as it delivered
The Effects of Repeated Testing on Verbal and Nonverbal Ability Assessment.

ERIC Educational Resources Information Center

Lewis, Ernest L.; Beggs, Donald L.

The purpose of this study was to attempt to determine if score gains obtained upon repeated testing with an intelligence test result from a practice effect, from students remembering specific items, or from a combination of both. The verbal and nonverbal batteries of an I.Q. test were administered to 860 sixth graders on three occasions with…
The impact of menopausal symptoms on work ability.

PubMed

Geukes, Marije; van Aalst, Mariëlle P; Nauta, Mary C E; Oosterhof, Henk

2012-03-01

Menopause is an important life event that may have a negative influence on quality of life. Work ability, a concept widely used in occupational health, can predict both future impairment and duration of sickness absence. The aim of this study was to examine the impact of menopausal symptoms on work ability. This was a cross-sectional study that used a sample of healthy working Dutch women aged 44 to 60 years. Work ability was measured using the Work Ability Index, and menopausal symptoms were measured using the Greene Climacteric Scale. Stepwise multiple linear regression models were used to examine the relationship between menopausal symptoms and work ability. A total of 208 women were included in this study. There was a significant negative correlation between total Greene Climacteric Scale score and Work Ability Index score. Total Greene Climacteric Scale score predicted 33.8% of the total variance in the Work Ability Index score. Only the psychological and somatic subscales of the Greene Climacteric Scale were significant predictors in multiple linear regression analysis. Together, they accounted for 36.5% of total variance in Work Ability Index score. Menopausal symptoms are negatively associated with work ability and may increase the risk of sickness absence.
Growth trajectories and intellectual abilities in young adulthood: The Helsinki Birth Cohort study.

PubMed

Räikkönen, Katri; Forsén, Tom; Henriksson, Markus; Kajantie, Eero; Heinonen, Kati; Pesonen, Anu-Katriina; Leskinen, Jukka T; Laaksonen, Ilmo; Osmond, Clive; Barker, David J P; Eriksson, Johan G

2009-08-15

Slow childhood growth is associated with poorer intellectual ability. The critical periods of growth remain uncertain. Among 2,786 Finnish male military conscripts (1952-1972) born in 1934-1944, the authors tested how specific growth periods from birth to age 20 years predicted verbal, visuospatial, and arithmetic abilities at age 20. Small head circumference at birth predicted poorer verbal, visuospatial, and arithmetic abilities. The latter 2 measures were also associated with lower weight and body mass index (weight (kg)/height (m)²) at birth (for a 1-standard-deviation (SD) decrease in test score per SD decrease in body size ≥ 0.05, P's < 0.04). Slow linear growth and weight gain between birth and age 6 months, between ages 6 months and 2 years, or both predicted poorer performance on all 3 tests (for a 1-SD decrease in test score per SD decrease in growth ≥ 0.05, P's < 0.03). Reduced linear growth between ages 2 and 7 years predicted worse verbal ability, and between age 11 years and conscription it predicted worse performance on all 3 tests. Prenatal brain growth and linear growth up to 2 years after birth form a first critical period for intellectual development. There is a second critical period, specific for verbal development, between ages 2 and 7 years and a third critical period for all 3 tested outcomes during adolescence.
Development of inquiry-based learning activities integrated with the local learning resource to promote learning achievement and analytical thinking ability of Mathayomsuksa 3 student

NASA Astrophysics Data System (ADS)

Sukji, Paweena; Wichaidit, Pacharee Rompayom; Wichaidit, Sittichai

2018-01-01

The objectives of this study were to: 1) compare the learning achievement and analytical thinking ability of Mathayomsuksa 3 students before and after learning through inquiry-based learning activities integrated with the local learning resource, and 2) compare the average post-test scores of learning achievement and analytical thinking ability with their cutting scores. The target group of this study was 23 Mathayomsuksa 3 students who were studying in the second semester of the 2016 academic year at Banchatfang School, Chainat Province. The research instruments comprised: 1) 6 lesson plans on Environment and Natural Resources, 2) the learning achievement test, and 3) the analytical thinking ability test. The results showed that 1) students' learning achievement and analytical thinking ability after learning were higher than before at the .05 level of statistical significance, and 2) the average post-test scores of students' learning achievement and analytical thinking ability were higher than their cutting scores at the .05 level of statistical significance. The implication of this research is for science teachers and curriculum developers to design inquiry activities that relate to students' context.
Stability of person ability measures in people with acquired brain injury in the use of everyday technology: the test-retest reliability of the Management of Everyday Technology Assessment (META).

PubMed

Malinowsky, Camilla; Kassberg, Ann-Charlotte; Larsson-Lund, Maria; Kottorp, Anders

2016-01-01

To evaluate the test-retest reliability of the Management of Everyday Technology Assessment (META) in a sample of people with acquired brain injury (ABI). The META was administered twice within a two-week period to 25 people with ABI. A Rasch measurement model was used to convert the META ordinal raw scores into equal-interval linear measures of each participant's ability to manage everyday technology (ET). Test-retest reliability of the stability of the person ability measures in the META was examined by a standardized difference Z-test and an intra-class correlations analysis (ICC 1). The results showed that the paired person ability measures generated from the META were stable over the test-retest period for 22 of the 25 subjects. The ICC 1 correlation was 0.63, which indicates good overall reliability. The META demonstrated acceptable test-retest reliability in a sample of people with ABI. The results illustrate the importance of using sufficiently challenging ETs (relative to a person's abilities) to generate stable META measurements over time. Implications for Rehabilitation: The findings add evidence regarding the test-retest reliability of the person ability measures generated from the observation assessment META in a sample of people with ABI. The META might support professionals in the evaluation of interventions that are designed to improve clients' performance of activities including the ability to manage ET.
Developing a Measure of General Academic Ability: An Application of Maximal Reliability and Optimal Linear Combination to High School Students' Scores

ERIC Educational Resources Information Center

Dimitrov, Dimiter M.; Raykov, Tenko; AL-Qataee, Abdullah Ali

2015-01-01

This article is concerned with developing a measure of general academic ability (GAA) for high school graduates who apply to colleges, as well as with the identification of optimal weights of the GAA indicators in a linear combination that yields a composite score with maximal reliability and maximal predictive validity, employing the framework of…

A Diet Score Assessing Norwegian Adolescents’ Adherence to Dietary Recommendations—Development and Test-Retest Reproducibility of the Score

PubMed Central

Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

2016-01-01

Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance with recommendations was seen for choosing water as a beverage and limiting the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health-promoting work targeting adolescents should emphasize increasing the intake of recommended foods to approach nutritional guidelines. PMID:27483312
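The abstract above reports both a raw proportion of consistent classification and Cohen's kappa, which corrects that proportion for agreement expected by chance. A minimal sketch of the calculation follows, assuming two paired lists of category labels; the function name and the ten-subject data set are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohens_kappa(first, second):
    """Return (observed agreement, Cohen's kappa) for two paired ratings."""
    if len(first) != len(second) or not first:
        raise ValueError("need two non-empty sequences of equal length")
    n = len(first)
    # Observed agreement: share of subjects classified identically both times.
    p_o = sum(a == b for a, b in zip(first, second)) / n
    # Chance agreement, from the marginal category frequencies of each round.
    c1, c2 = Counter(first), Counter(second)
    p_e = sum(c1[k] * c2[k] for k in c1) / (n * n)
    if p_e == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")
    return p_o, (p_o - p_e) / (1 - p_e)


# Hypothetical test and retest classifications (1 = adheres, 0 = does not).
test_round = [1, 1, 0, 0, 1, 0, 0, 0, 1, 0]
retest_round = [1, 0, 0, 0, 1, 0, 1, 0, 1, 0]

agreement, kappa = cohens_kappa(test_round, retest_round)
```

With these made-up labels the observed agreement is 0.80 while kappa is about 0.58, which illustrates how a high percentage of consistent classification can go together with a more moderate kappa once chance agreement is removed.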
A Diet Score Assessing Norwegian Adolescents' Adherence to Dietary Recommendations-Development and Test-Retest Reproducibility of the Score.

PubMed

Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

2016-07-29

Assessment of adolescents' dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents' adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen's Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14-15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance with recommendations was seen for choosing water as a beverage and limiting the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health-promoting work targeting adolescents should emphasize increasing the intake of recommended foods to approach nutritional guidelines.
A Bad Idea: National Standards Based on Test Scores

ERIC Educational Resources Information Center

Baker, Keith

2010-01-01

The justification for national standards is that test scores predict a nation's future economic success. There is no evidence that supports this assumption. There is evidence that it is wrong. For more than half a century, reformers have been trying to fix our schools with little success. The obvious conclusion is that something that can't be…
America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

ERIC Educational Resources Information Center

Petrilli, Michael J.; Wright, Brandon L.

2016-01-01

At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…
Some Effects of Social Class and Race on Children's Language and Intellectual Abilities.

ERIC Educational Resources Information Center

Whiteman, Martin; And Others

A cross-sectional study of 292 first and fifth grade Negro and white children examined the relationship between environmental factors and performance test scores of verbal and cognitive ability. The socioeconomic status (SES) of each subject was determined and included in a deprivation index formed by obtaining a composite score for each subject…
Using Heteroskedastic Ordered Probit Models to Recover Moments of Continuous Test Score Distributions from Coarsened Data

ERIC Educational Resources Information Center

Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D.

2017-01-01

Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…
Higher-Order Asymptotics and Its Application to Testing the Equality of the Examinee Ability Over Two Sets of Items.

PubMed

Sinharay, Sandip; Jensen, Jens Ledet

2018-06-27

In educational and psychological measurement, researchers and/or practitioners are often interested in examining whether the ability of an examinee is the same over two sets of items. Such problems can arise in measurement of change, detection of cheating on unproctored tests, erasure analysis, detection of item preknowledge, etc. Traditional frequentist approaches that are used in such problems include the Wald test, the likelihood ratio test, and the score test (e.g., Fischer, Appl Psychol Meas 27:3-26, 2003; Finkelman, Weiss, & Kim-Kang, Appl Psychol Meas 34:238-254, 2010; Glas & Dagohoy, Psychometrika 72:159-180, 2007; Guo & Drasgow, Int J Sel Assess 18:351-364, 2010; Klauer & Rettig, Br J Math Stat Psychol 43:193-206, 1990; Sinharay, J Educ Behav Stat 42:46-68, 2017). This paper shows that approaches based on higher-order asymptotics (e.g., Barndorff-Nielsen & Cox, Inference and asymptotics. Springer, London, 1994; Ghosh, Higher order asymptotics. Institute of Mathematical Statistics, Hayward, 1994) can also be used to test for the equality of the examinee ability over two sets of items. The modified signed likelihood ratio test (e.g., Barndorff-Nielsen, Biometrika 73:307-322, 1986) and the Lugannani-Rice approximation (Lugannani & Rice, Adv Appl Prob 12:475-490, 1980), both of which are based on higher-order asymptotics, are shown to provide some improvement over the traditional frequentist approaches in three simulations. Two real data examples are also provided.
Neurocognitive abilities in young adults with very low birth weight.

PubMed

Pyhälä, R; Lahti, J; Heinonen, K; Pesonen, A-K; Strang-Karlsson, S; Hovi, P; Järvenpää, A-L; Eriksson, J G; Andersson, S; Kajantie, E; Räikkönen, K

2011-12-06

Although severely preterm birth has been associated with impaired neurocognitive abilities in children, follow-up studies in adulthood are scarce. We set out to study whether adults born with very low birth weight (VLBW) (<1,500 g), either small for gestational age (SGA) (birth weight ≤-2 SD) or appropriate for gestational age (AGA), differ in a range of neurocognitive abilities and academic performance from adults born at term and not SGA. As part of the Helsinki Study of Very Low Birth Weight Adults, 103 VLBW (37 SGA) and 105 term-born control adults (mean age 25.0, range 21.4-29.7 years) without major neurosensory impairments participated in the follow-up study in 2007-2008. The test battery included measures of general cognitive ability as well as executive functioning and related abilities. Academic performance was self-reported. With adjustment for sex and age, the VLBW group scored lower or performed slower than the control group in some indices of all tests (these mean differences ranged from 0.3 to 0.5 SD units, p ≤ 0.03) and they had received remedial education at school more frequently; however, no differences existed in self-reported academic performance. The differences were evident in both VLBW-SGA and VLBW-AGA groups. Further covariate adjustments for parental education, current head circumference, and head circumference at birth and, in tests of executive functioning and related abilities, adjustment for IQ estimate had minor effects on the results. In comparison with control adults, VLBW adults scored lower on several neurocognitive tests. Poorer neurocognitive performance is associated with VLBW irrespective of the intrauterine growth pattern.
The Design and Development of the Phillips-Patterson Test of Inference Ability in Reading Comprehension.

ERIC Educational Resources Information Center

Phillips, Linda M.

The design and development of a test of inference ability in reading comprehension for grades 6, 7, and 8 (the Phillips-Patterson Test of Inference Ability in Reading Comprehension) are described. After development of a contemporary theoretical framework for the test of inference ability in reading comprehension, the design, item development, and…
Power and sample size evaluation for the Cochran-Mantel-Haenszel mean score (Wilcoxon rank sum) test and the Cochran-Armitage test for trend.

PubMed

Lachin, John M

2011-11-10

The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is a monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
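The two ideas in this abstract can be sketched in a few lines. The block below computes the usual textbook form of the Cochran-Armitage trend statistic and the power of a 1-degree-of-freedom chi-square test for a given noncentrality parameter. It does not reproduce the paper's own expressions for the noncentrality parameter; the function names and the example counts are hypothetical.

```python
import math
from statistics import NormalDist


def cochran_armitage(events, totals, scores):
    """Z statistic and two-sided p-value for a linear trend in proportions.

    events[i] of totals[i] subjects respond in group i, which carries the
    ordinal score scores[i] (for example, a dose level).
    """
    n_all = sum(totals)
    p_bar = sum(events) / n_all                       # pooled proportion
    t = sum(s * (r - n * p_bar) for s, r, n in zip(scores, events, totals))
    s1 = sum(n * s for n, s in zip(totals, scores))
    s2 = sum(n * s * s for n, s in zip(totals, scores))
    var_t = p_bar * (1.0 - p_bar) * (s2 - s1 * s1 / n_all)
    z = t / math.sqrt(var_t)
    p_value = math.erfc(abs(z) / math.sqrt(2.0))      # two-sided normal tail
    return z, p_value


def chi2_1df_power(ncp, alpha=0.05):
    """Power of a 1-df chi-square test with noncentrality parameter ncp."""
    nd = NormalDist()
    z_crit = nd.inv_cdf(1.0 - alpha / 2.0)
    root = math.sqrt(ncp)
    # A noncentral chi-square with 1 df is the square of N(sqrt(ncp), 1).
    return nd.cdf(root - z_crit) + nd.cdf(-root - z_crit)


# Hypothetical dose-response counts, for illustration only.
print(cochran_armitage(events=[5, 10, 20], totals=[50, 50, 50], scores=[0, 1, 2]))
print(chi2_1df_power(ncp=7.85))
```

The required sample size follows by increasing N until the noncentrality parameter implied by the alternative reaches the target power.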
The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

ERIC Educational Resources Information Center

Silles, Mary A.

2010-01-01

This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at ages 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…
The Weighted Airman Promotion System: Standardizing Test Scores

DTIC Science & Technology

2008-01-01

The Weighted Airman Promotion System: Standardizing Test Scores. Performing organization: Rand Corporation, PO Box 2138, Santa Monica…
Intellectual Ability in Young Adulthood as an Antecedent of Physical Functioning in Older Age

PubMed Central

Poranen-Clark, Taina; von Bonsdorff, Mikaela B.; Törmäkangas, Timo; Lahti, Jari; Wasenius, Niko; Räikkönen, Katri; Osmond, Clive; Salonen, Minna K.; Rantanen, Taina; Kajantie, Eero; Eriksson, Johan G.

2016-01-01

Objectives: Low cognitive ability is associated with subsequent functional disability. Whether this association extends across adult life has been little studied. The aim of this study was to examine the association between intellectual ability in young adulthood and physical functioning during a 10-year follow-up in older age. Methods: 360 male members of the Helsinki Birth Cohort Study (HBCS), born between 1934-1944 and residing in Finland in 1971, took part in The Finnish Defence Forces Basic Intellectual Ability Test during the first two weeks of their military service training between 1952-72. Their physical functioning was assessed twice using the Short Form 36 (SF-36) questionnaire at average ages of 61 and 71 years. A longitudinal path model linking Intellectual Ability Test score to the physical functioning assessments was used to explore the effect of intellectual ability in young adulthood on physical functioning in older age. Results: After adjustments for age at measurement, childhood socioeconomic status and adult BMI (kg/m²), better intellectual ability total and arithmetic and verbal reasoning subtest scores in young adulthood predicted better physical functioning at age 61 years (P-values < 0.021). Intellectual ability total and arithmetic and verbal reasoning subtest scores in young adulthood had indirect effects on physical functioning at age 71 years (P-values < 0.022) through better physical functioning at age 61 years. Adjustment for main chronic diseases did not change the results materially. Conclusion: Better early life intellectual ability helps in maintaining better physical functioning in older age. PMID:27189726
Emotional Intelligence and cognitive abilities - associations and sex differences.

PubMed

Pardeller, Silvia; Frajo-Apor, Beatrice; Kemmler, Georg; Hofer, Alex

2017-09-01

In order to expand on previous research, this cross-sectional study investigated the relationship between Emotional Intelligence (EI) and cognitive abilities in healthy adults with a special focus on potential sex differences. EI was assessed by means of the Mayer-Salovey-Caruso-Emotional-Intelligence Test (MSCEIT), whereas cognitive abilities were investigated using the Brief Assessment of Cognition in Schizophrenia (BACS), which measures key aspects of cognitive functioning, i.e. verbal memory, working memory, motor speed, verbal fluency, attention and processing speed, and reasoning and problem solving. 137 subjects (65% female) with a mean age of 38.7 ± 11.8 years were included in the study. While males and females were comparable with regard to EI, men achieved significantly higher BACS composite scores and outperformed women in the BACS subscales motor speed, attention and processing speed, and reasoning and problem solving. Verbal fluency significantly predicted EI, whereas the MSCEIT subscale understanding emotions significantly predicted the BACS composite score. Our findings support previous research and emphasize the relevance of considering cognitive abilities when assessing ability EI in healthy individuals.
Correlations between the Hand Test Pathology score and Personality Assessment Inventory scales for pain clinic patients.

PubMed

George, J M; Wagner, E E

1995-06-01

Pearson correlations between the Hand Test Pathology (PATH) score and Personality Assessment Inventory scales produced a cluster of relationships characteristic of an antisocial orientation. Likewise, PATH significantly differentiated between a "P" (Pathology) group flagged by a high Negative Impression score on the inventory, and an "N" (Normal) group of 100 pain patients. It was suggested that the interpretive simplicity of Hand Test scores renders the scores amenable to further correlational studies involving the inventory.
Risk factors for Apgar score using artificial neural networks.

PubMed

Ibrahim, Doaa; Frize, Monique; Walker, Robin C

2006-01-01

Artificial Neural Networks (ANNs) have been used in identifying the risk factors for many medical outcomes. In this paper, the risk factors for low Apgar score are introduced. This is the first time, to our knowledge, that ANNs have been used for Apgar score prediction. The medical domain of interest used is the perinatal database provided by the Perinatal Partnership Program of Eastern and Southeastern Ontario (PPPESO). The ability of feed-forward back-propagation ANNs to generate a strong predictive model with the most influential variables is tested. Finally, minimal sets of variables (risk factors) that are important in predicting Apgar score outcome without degrading the ANN performance are identified.
The Relationship Between Academic Performance and Reading Ability of Pensacola Junior College Freshmen.

ERIC Educational Resources Information Center

Einbecker, Polly Godwin

The purpose of this investigation was to determine the relationship between reading ability and academic performance of junior college freshmen and to what degree a measure of reading ability could predict academic performance. The 313 Pensacola Junior College freshmen for whom 1970 Reading Index Scores on the Florida Twelfth Grade Test were…
Improving Test Score Reporting: Perspectives from the ETS Score Reporting Conference. Research Report. ETS RR-11-45

ERIC Educational Resources Information Center

Zapata-Rivera, Diego, Ed.; Zwick, Rebecca, Ed.

2011-01-01

This volume includes 3 papers based on presentations at a workshop on communicating assessment information to particular audiences, held at Educational Testing Service (ETS) on November 4th, 2010, to explore some issues that influence score reports and new advances that contribute to the effectiveness of these reports. Jessica Hullman, Rebecca…
Standardized Testing of Special Education Students: A Comparison of Service Type and Test Scores

ERIC Educational Resources Information Center

Hogan-Young, Christine

2013-01-01

The purpose of this study was to determine if there was a difference in Tennessee Comprehensive Assessment Program Modified Academic Achievement Standards (TCAP MAAS) achievement test scores for special education students who receive their instruction in the resource classroom or in an inclusion classroom. The study involved third, fourth, and…
Relationship Between Cognitive Perceptual Abilities and Accident and Penalty Histories Among Elderly Korean Drivers

PubMed Central

2016-01-01

Objective: To investigate the relationship between cognitive perceptual abilities of elderly drivers based on the Cognitive Perceptual Assessment for Driving (CPAD) test and their accident and penalty histories. Methods: A total of 168 elderly drivers (aged ≥65 years) participated in the study. Participant data included CPAD scores and incidents of traffic accidents and penalties, obtained from the Korea Road Traffic Authority and Korea National Police Agency, respectively. Results: Drivers' mean age was 70.25±4.1 years and the mean CPAD score was 52.75±4.72. Elderly drivers' age was negatively related to the CPAD score (p<0.001). The accident history group had marginally lower CPAD scores than the non-accident group (p=0.051). However, incidence rates for traffic fines did not differ significantly between the two groups. Additionally, the group that passed the CPAD test had experienced fewer traffic accidents (3.6%) than the group that failed (10.6%). The older age group (12.0%) had also experienced more traffic accidents than the younger group (2.4%). Conclusion: Overall, elderly drivers who experienced driving accidents had lower CPAD scores than those who did not, although the difference was not statistically significant. Thus, driving-related cognitive abilities of elderly drivers with insufficient cognitive ability need to be further evaluated to prevent traffic accidents. PMID:28119840

Genome-Wide Polygenic Scores Predict Reading Performance Throughout the School Years.

PubMed

Selzam, Saskia; Dale, Philip S; Wagner, Richard K; DeFries, John C; Cederlöf, Martin; O'Reilly, Paul F; Krapohl, Eva; Plomin, Robert

2017-07-04

It is now possible to create individual-specific genetic scores, called genome-wide polygenic scores (GPS). We used a GPS for years of education (EduYears) to predict reading performance assessed at UK National Curriculum Key Stages 1 (age 7), 2 (age 12) and 3 (age 14) and on reading tests administered at ages 7 and 12 in a UK sample of 5,825 unrelated individuals. EduYears GPS accounts for up to 5% of the variance in reading performance at age 14. GPS predictions remained significant after accounting for general cognitive ability and family socioeconomic status. Reading performance of children in the lowest and highest 12.5% of the EduYears GPS distribution differed by a mean growth in reading ability of approximately two school years. It seems certain that polygenic scores will be used to predict strengths and weaknesses in education.
Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

ERIC Educational Resources Information Center

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong

2010-01-01

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…
Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

ERIC Educational Resources Information Center

Looney, Marilyn A.; Gilbert, Jennie

2012-01-01

The purpose of the study was to determine if currently used FITNESSGRAM® cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…
Testing the "Work Ability House" Model in hospital workers.

PubMed

Martinez, Maria Carmen; Latorre, Maria do Rosário Dias de Oliveira; Fischer, Frida Marina

2016-01-01

To test the Work Ability House model, verifying the hierarchy of proposed dimensions, among a group of hospital workers. A cohort study (2009-2011) was conducted with a sample of 599 workers from a hospital in the city of São Paulo. A questionnaire including sociodemographics, lifestyle and working conditions was used. The Brazilian versions of Job Stress Scale, Effort-Reward Imbalance, Work-Related Activities That May Contribute To Job-Related Pain and/or Injury, and the Work Ability Index (WAI) were also used. A hierarchical logistic regression analysis was performed: the independent variables were allocated into levels according to the dimensions of the theoretical model in order to evaluate the factors associated with work ability. Variables associated with impairment of work ability in each dimension were as follows: (a) sociodemographics: age < 30 years (p = 0.20), (b) health: without report of occurrence of work injuries (p = 0.029), (c) professional competence: low educational level (p = 0.008), (d) values: intensified in overcommitment (p < 0.001), and (e) work: intensification of effort-reward imbalance (p = 0.009) and high demands (p = 0.040). The results confirmed the dimensions proposed for the Work Ability House model, indicating that it is valid as a representation of a multidimensional construct of multifactorial determination and can be used in the management of work ability.
Speech perception and communication ability over the telephone by Mandarin-speaking children with cochlear implants.

PubMed

Wu, Che-Ming; Liu, Tien-Chen; Wang, Nan-Mai; Chao, Wei-Chieh

2013-08-01

(1) To understand speech perception and communication ability through real telephone calls by Mandarin-speaking children with cochlear implants and compare them to live-voice perception, (2) to report the general condition of telephone use of this population, and (3) to investigate the factors that correlate with telephone speech perception performance. Fifty-six children with over 4 years of implant use (aged 6.8-13.6 years, mean duration 8.0 years) took three speech perception tests administered using telephone and live voice to examine sentence, monosyllabic-word and Mandarin tone perception. The children also filled out a questionnaire survey investigating everyday telephone use. Wilcoxon signed-rank test was used to compare the scores between live-voice and telephone tests, and Pearson's test to examine the correlation between them. The mean scores were 86.4%, 69.8% and 70.5% respectively for sentence, word and tone recognition over the telephone. The corresponding live-voice mean scores were 94.3%, 84.0% and 70.8%. Wilcoxon signed-rank test showed the sentence and word scores were significantly different between telephone and live voice test, while the tone recognition scores were not, indicating tone perception was less worsened by telephone transmission than words and sentences. Spearman's test showed that chronological age and duration of implant use were weakly correlated with the perception test scores. The questionnaire survey showed 78% of the children could initiate phone calls and 59% could use the telephone 2 years after implantation. Implanted children are potentially capable of using the telephone 2 years after implantation, and communication ability over the telephone becomes satisfactory 4 years after implantation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Interpretation and Utilization of Scores on the Air Force Officer Qualifying Test.

ERIC Educational Resources Information Center

Miller, Robert E.

The report summarizes a large body of data relevant to the proper interpretation and use of aptitude scores on the Air Force Officer Qualifying Test (AFOQT). Included are descriptions of the AFOQT testing program and the test itself. Technical data include an extensive sampling of validation studies covering predictors of success in pilot…
Stochastic Processes as True-Score Models for Highly Speeded Mental Tests.

ERIC Educational Resources Information Center

Moore, William E.

The previous theoretical development of the Poisson process as a strong model for the true-score theory of mental tests is discussed, and additional theoretical properties of the model from the standpoint of individual examinees are developed. The paper introduces the Erlang process as a family of test theory models and shows in the context of…
Did Teachers' Verbal Ability and Race Matter in the 1960s? "Coleman" Revisited. RAND Reprints.

ERIC Educational Resources Information Center

Ehrenberg, Ronald G.; Brewer, Dominic J.

This paper reanalyzed data from the classic 1966 study "Equality of Educational Opportunity," or "Coleman Report." It addressed the issue of whether teacher characteristics, including verbal ability and race, influenced "synthetic gain scores" of students (mean test scores of upper grade students in a school minus…
Consideration of "g" as a Common Antecedent for Cognitive Ability Test Performance, Test Motivation, and Perceived Fairness

ERIC Educational Resources Information Center

Reeve, Charlie L.; Lam, Holly

2007-01-01

Several different analyses were used to test the hypothesis that test-taking motivation, perceived test fairness, and actual test performance are correlated only because they share a common antecedent. First, hierarchical regressions reveal that initial test performance has a unique influence on non-ability factors even after controlling for…
Univariate and Bivariate Loglinear Models for Discrete Test Score Distributions.

ERIC Educational Resources Information Center

Holland, Paul W.; Thayer, Dorothy T.

2000-01-01

Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…
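As a rough illustration of the univariate case described in this abstract, the sketch below fits a loglinear model to a score histogram by Newton's method. It assumes a degree-2 polynomial model, log p_j = b0 + b1 z_j + b2 z_j^2, on scores rescaled to [-1, 1]; the degree, the rescaling, the function names and the example counts are assumptions made here, not details taken from the source. At the maximum likelihood estimate the fitted distribution reproduces the first two sample moments of the rescaled scores.

```python
import math


def _probs(b, feats):
    """Fitted probabilities for coefficients b (normalizing constant absorbed)."""
    logits = [b[0] * f[0] + b[1] * f[1] for f in feats]
    m = max(logits)                         # subtract max for stability
    w = [math.exp(v - m) for v in logits]
    total = sum(w)
    return [v / total for v in w]


def fit_loglinear(counts, iters=100, tol=1e-10):
    """Fit a degree-2 loglinear model to score frequencies by Newton's method.

    counts[j] is the number of examinees with score j; at least three
    score points are needed. Returns the fitted probabilities.
    """
    n_scores = len(counts)
    n_all = sum(counts)
    z = [2.0 * j / (n_scores - 1) - 1.0 for j in range(n_scores)]
    feats = [(v, v * v) for v in z]
    # Observed means of the sufficient statistics z and z^2.
    obs = [sum(c * f[k] for c, f in zip(counts, feats)) / n_all for k in (0, 1)]
    b = [0.0, 0.0]
    for _ in range(iters):
        p = _probs(b, feats)
        mean = [sum(pj * f[k] for pj, f in zip(p, feats)) for k in (0, 1)]
        grad = [obs[0] - mean[0], obs[1] - mean[1]]
        # Covariance of the sufficient statistics under the fitted model
        # (the negative Hessian of the per-observation log-likelihood).
        c00 = sum(pj * f[0] * f[0] for pj, f in zip(p, feats)) - mean[0] ** 2
        c01 = sum(pj * f[0] * f[1] for pj, f in zip(p, feats)) - mean[0] * mean[1]
        c11 = sum(pj * f[1] * f[1] for pj, f in zip(p, feats)) - mean[1] ** 2
        det = c00 * c11 - c01 * c01
        d0 = (c11 * grad[0] - c01 * grad[1]) / det   # 2x2 solve by Cramer's rule
        d1 = (c00 * grad[1] - c01 * grad[0]) / det
        b[0] += d0
        b[1] += d1
        if max(abs(d0), abs(d1)) < tol:
            break
    return _probs(b, feats)


# Hypothetical score frequencies, for illustration only.
print(fit_loglinear([1, 3, 6, 10, 12, 10, 6, 3, 1]))
```

This plain Newton iteration has no step-size safeguard, so a production fit would normally add one.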
Contextual approach using VBA learning media to improve students’ mathematical displacement and disposition ability

NASA Astrophysics Data System (ADS)

Chotimah, Siti; Bernard, M.; Wulandari, S. M.

2018-01-01

The main problems addressed by this research were the lack of mathematical reasoning ability and mathematical disposition among high school students in Cimahi, West Java. The lack of mathematical reasoning ability was attributed to the learning process. The teachers did not train the students on reasoning problems. The students still depended on each other. At times, a patient teacher was still guiding the students. In addition, students’ basic ability also affected their mathematical skill. Furthermore, learning with a contextual approach aided by VBA Learning Media (Visual Basic Application for Excel) had a positive influence on students’ mathematical disposition. The students were directly involved in the learning process. The population of the study was all of the high school students in Cimahi. The samples were the students of SMA Negeri 4 Cimahi, classes XIA and XIB. Both test and non-test instruments were used. The test instrument was a descriptive test of mathematical reasoning ability. The non-test instrument was an attitude-scale questionnaire on students’ mathematical disposition. These instruments were used to obtain data on students’ mathematical reasoning and disposition under learning with a contextual approach supported by VBA (Visual Basic Application for Excel) and under conventional learning. The data analyzed in this study were post-test scores from both the experimental class and the control class. The data were processed using SPSS 22 and Microsoft Excel and analyzed using a t-test. The study concluded that the achievement and improvement in reasoning ability and mathematical disposition of students who learned with a contextual approach supported by VBA learning media (Visual Basic Application for Excel) were better than those of students who got…
External Validation of the Simple Clinical Score and the HOTEL Score, Two Scores for Predicting Short-Term Mortality after Admission to an Acute Medical Unit

PubMed Central

Stræde, Mia; Brabrand, Mikkel

2014-01-01

Background: Clinical scores can help predict early mortality after admission to a medical admission unit. A developed scoring system needs to be externally validated to minimise the risk of its discriminatory power and calibration being falsely elevated. We performed the present study with the objective of validating the Simple Clinical Score (SCS) and the HOTEL score, two existing risk stratification systems that predict mortality for medical patients based solely on clinical information, but not only vital signs. Methods: Pre-planned prospective observational cohort study. Setting: Danish 460-bed regional teaching hospital. Findings: We included 3046 consecutive patients from 2 October 2008 until 19 February 2009. 26 (0.9%) died within one calendar day and 196 (6.4%) died within 30 days. We calculated SCS for 1080 patients. We found an AUROC of 0.960 (95% confidence interval [CI], 0.932 to 0.988) for 24-hour mortality and 0.826 (95% CI, 0.774–0.879) for 30-day mortality, and goodness-of-fit test, χ² = 2.68 (10 degrees of freedom), P = 0.998 and χ² = 4.00, P = 0.947, respectively. We included 1470 patients when calculating the HOTEL score. Discriminatory power (AUROC) was 0.931 (95% CI, 0.901–0.962) for 24-hour mortality and goodness-of-fit test, χ² = 5.56 (10 degrees of freedom), P = 0.234. Conclusion: We find that both the SCS and HOTEL scores showed an excellent to outstanding ability in identifying patients at high risk of dying with good or acceptable precision. PMID:25144186
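The discriminatory power reported here, the AUROC, is the probability that a randomly chosen patient who died received a higher risk score than a randomly chosen survivor, with ties counted as one half. A minimal sketch of that definition follows; the function name and the example values are hypothetical and are not the study's data.

```python
def auroc(scores, died):
    """Area under the ROC curve from its pairwise (Mann-Whitney) definition.

    scores[i] is the risk score of patient i, and died[i] is truthy when
    that patient died within the follow-up window.
    """
    pos = [s for s, d in zip(scores, died) if d]
    neg = [s for s, d in zip(scores, died) if not d]
    if not pos or not neg:
        raise ValueError("need at least one death and one survivor")
    wins = 0.0
    for sp in pos:
        for sn in neg:
            if sp > sn:
                wins += 1.0      # death ranked above survivor
            elif sp == sn:
                wins += 0.5      # ties count as half
    return wins / (len(pos) * len(neg))


# Hypothetical scores and outcomes, for illustration only.
print(auroc([0.1, 0.4, 0.35, 0.8], [0, 0, 1, 1]))  # 0.75
```

The pairwise loop is quadratic in the number of patients; a rank-based computation gives the same value for larger cohorts.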
Validity and Relative Ability of 4 Balance Tests to Identify Fall Status of Older Adults With Type 2 Diabetes.

PubMed

Marques, Alda; Silva, Alexandre; Oliveira, Ana; Cruz, Joana; Machado, Ana; Jácome, Cristina

The Berg Balance Scale (BBS), the Balance Evaluation Systems Test (BESTest), the Mini-BESTest, and the Brief-BESTest are useful tests to assess balance; however, their clinimetric properties have not been studied well in older adults with type 2 diabetes (T2D). This study compared the validity and relative ability of the BBS, BESTest, Mini-BESTest, and Brief-BESTest to identify fall status in older adults with T2D. This study involved a cross-sectional design. Sixty-six older adults with T2D (75 ± 7.6 years) were included and asked to report the number of falls during the previous 12 months and to complete the Activities-specific Balance Confidence scale. The BBS and the BESTest were administered, and the Mini-BESTest and Brief-BESTest scores were computed based on the BESTest performance. Receiver operating characteristics were used to assess the ability of each balance test to differentiate between participants with and without a history of falls. The 4 balance tests were able to identify fall status (areas under the curve = 0.74-0.76), with similar sensitivity (60%-67%) and specificity (71%-76%). The 4 balance tests were able to differentiate between older adults with T2D with and without a history of falls. As the BBS and the BESTest require longer application time, the Brief-BESTest may be an appropriate choice to use in clinical practice to detect fall risk.
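The sensitivity and specificity figures in this abstract depend on choosing a cut-off score for each balance test. The sketch below shows the calculation for one cut-off, assuming that lower balance scores indicate worse balance, so that a participant is flagged when the score is at or below the cut-off. The function name, the cut-off and the example scores are hypothetical.

```python
def sensitivity_specificity(scores, fell, cutoff):
    """Sensitivity and specificity of flagging scores <= cutoff as fallers.

    fell[i] is truthy when participant i reported at least one fall.
    """
    tp = fn = tn = fp = 0
    for score, has_fallen in zip(scores, fell):
        flagged = score <= cutoff
        if has_fallen and flagged:
            tp += 1              # faller correctly flagged
        elif has_fallen:
            fn += 1              # faller missed
        elif flagged:
            fp += 1              # non-faller wrongly flagged
        else:
            tn += 1              # non-faller correctly cleared
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need both fallers and non-fallers")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical balance scores and fall histories, for illustration only.
print(sensitivity_specificity([10, 12, 15, 20, 22, 25], [1, 1, 0, 1, 0, 0], cutoff=15))
```

Repeating the calculation over every candidate cut-off traces out the receiver operating characteristic curve mentioned in the abstract.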
Ability Testing for Job Selection: Are the Economic Claims Justified?

ERIC Educational Resources Information Center

Levin, Henry M.

The use of ability testing for job selection has become widespread in the Federal Government and in the U.S. Employment Service, which assists private sector employers. The justification for the practice is based largely on research findings claiming a high level of validity for such tests in predicting job performance. More recently, such claims…
Factors contributing to speech perception scores in long-term pediatric cochlear implant users.

PubMed

Davidson, Lisa S; Geers, Ann E; Blamey, Peter J; Tobey, Emily A; Brenner, Christine A

2011-02-01

The objectives of this report are to (1) describe the speech perception abilities of long-term pediatric cochlear implant (CI) recipients by comparing scores obtained at elementary school (CI-E, 8 to 9 yrs) with scores obtained at high school (CI-HS, 15 to 18 yrs); (2) evaluate speech perception abilities in demanding listening conditions (i.e., noise and lower intensity levels) at adolescence; and (3) examine the relation of speech perception scores to speech and language development over this longitudinal timeframe. All 112 teenagers were part of a previous nationwide study of 8- and 9-yr-olds (N = 181) who received a CI between 2 and 5 yrs of age. The test battery included (1) the Lexical Neighborhood Test (LNT; hard and easy word lists); (2) the Bamford Kowal Bench sentence test; (3) the Children's Auditory-Visual Enhancement Test; (4) the Test of Auditory Comprehension of Language at CI-E; (5) the Peabody Picture Vocabulary Test at CI-HS; and (6) the McGarr sentences (consonants correct) at CI-E and CI-HS. CI-HS speech perception was measured in both optimal and demanding listening conditions (i.e., background noise and low-intensity level). Speech perception scores were compared based on age at test, lexical difficulty of stimuli, listening environment (optimal and demanding), input mode (visual and auditory-visual), and language age. All group mean scores significantly increased with age across the two test sessions. Scores of adolescents significantly decreased in demanding listening conditions. The effect of lexical difficulty on the LNT scores, as evidenced by the difference in performance between easy versus hard lists, increased with age and decreased for adolescents in challenging listening conditions. Calculated curves for percent correct speech perception scores (LNT and Bamford Kowal Bench) and consonants correct on the McGarr sentences plotted against age-equivalent language scores on the Test of Auditory Comprehension of Language and Peabody
Allele-sharing models: LOD scores and accurate linkage tests.

PubMed

Kong, A; Cox, N J

1997-11-01

Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.
Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

ERIC Educational Resources Information Center

Sawyer, Richard

2013-01-01

Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…
The effect of human immunodeficiency virus type 1 antibody status on military applicant aptitude test scores.

PubMed

Arday, D R; Brundage, J F; Gardner, L I; Goldenbaum, M; Wann, F; Wright, S

1991-06-15

The authors conducted a population-based study to attempt to estimate the effect of human immunodeficiency virus type 1 (HIV-1) seropositivity on Armed Services Vocational Aptitude Battery test scores in otherwise healthy individuals with early HIV-1 infection. The Armed Services Vocational Aptitude Battery is a 10-test written multiple aptitude battery administered to all civilian applicants for military enlistment prior to serologic screening for HIV-1 antibodies. A total of 975,489 induction testing records containing both Armed Services Vocational Aptitude Battery and HIV-1 results from October 1985 through March 1987 were examined. An analysis data set (n = 7,698) was constructed by choosing five controls for each of the 1,283 HIV-1-positive cases, matched on five-digit ZIP code, and a multiple linear regression analysis was performed to control for demographic and other factors that might influence test scores. Years of education was the strongest predictor of test scores, raising an applicant's score on a composite test nearly 0.16 standard deviation per year. The HIV-1-positive effect on the composite score was -0.09 standard deviation (99% confidence interval -0.17 to -0.02). Separate regressions on each component test within the battery showed HIV-1 effects between -0.39 and +0.06 standard deviation. The two Armed Services Vocational Aptitude Battery component tests felt a priori to be the most sensitive to HIV-1-positive status showed the least decrease with seropositivity. Much of the variability in test scores was not predicted by either HIV-1 serostatus or the demographic and other factors included in the model. There appeared to be little evidence of a strong HIV-1 effect.
Low aerobic fitness and obesity are associated with lower standardized test scores in children.

PubMed

Roberts, Christian K; Freed, Benjamin; McCarthy, William J

2010-05-01

To investigate whether aerobic fitness and obesity in school children are associated with standardized test performance. Ethnically diverse (n = 1989) 5th, 7th, and 9th graders attending California schools comprised the sample. Aerobic fitness was determined by a 1-mile run/walk test; body mass index (BMI) was obtained from state-mandated measurements. California standardized test scores were obtained from the school district. Students whose mile run/walk times exceeded California Fitnessgram standards or whose BMI exceeded Centers for Disease Control sex- and age-specific body weight standards scored lower on California standardized math, reading, and language tests than students with desirable BMI status or fitness level, even after controlling for parent education among other covariates. Ethnic differences in standardized test scores were consistent with ethnic differences in obesity status and aerobic fitness. BMI-for-age was no longer a significant multivariate predictor when covariates included fitness level. Low aerobic fitness is common among youth and varies among ethnic groups, and aerobic fitness level predicts performance on standardized tests across ethnic groups. More research is needed to uncover the physiological mechanisms by which aerobic fitness may contribute to performance on standardized academic tests.

Convergent Validity of the Reynolds Intellectual Assessment Scales (RIAS) Using the Woodcock-Johnson Tests of Cognitive Ability, Third Edition (WJ-III) with University Students

ERIC Educational Resources Information Center

Krach, S. Kathleen; Loe, Scott A.; Jones, W. Paul; Farrally, Autumn

2009-01-01

Validity studies with the Reynolds Intellectual Assessment Scales (RIAS) indicated that RIAS composite intelligence index (CIX) and verbal intelligence index (VIX) scores have moderate-to-high correlation with comparable scores on other instruments. The authors of the RIAS described the VIX scale as a measure of crystallized ability and the nonverbal…
The Emphasis of Student Test Scores in Teacher Appraisal Systems

ERIC Educational Resources Information Center

Smith, William C.; Kubacka, Katarzyna

2017-01-01

Over the past 30 years teachers have been held increasingly accountable for the quality of education in their classroom. During this transition, the line between teacher appraisals, traditionally an instrument for continuous formative teacher feedback, and summative teacher evaluations has blurred. Student test scores, as an "objective"…
Rising Stars: High School's Change Process Produces Higher Test Scores.

ERIC Educational Resources Information Center

McCown, Claire; Runnebaum, Robert

2001-01-01

Presents Bishop Ward High School (Kansas) as a case study that has seen great improvements in standardized testing results by changing its approach. States that realignment of curriculum, adjusting instructional strategies, and accommodating students with special needs are important aspects of raising assessment scores in high schools. (CJW)
Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

ERIC Educational Resources Information Center

King, Molly Elizabeth

2016-01-01

The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…
Many Children Left Behind? Textbooks and Test Scores in Kenya. NBER Working Paper No. 13300

ERIC Educational Resources Information Center

Glewwe, Paul; Kremer, Michael; Moulin, Sylvie

2007-01-01

A randomized evaluation suggests that a program which provided official textbooks to randomly selected rural Kenyan primary schools did not increase test scores for the average student. In contrast, the previous literature suggests that textbook provision has a large impact on test scores. Disaggregating the results by students' initial academic…
Relationship of Elementary and Secondary School Achievement Test Scores to Later Academic Success.

ERIC Educational Resources Information Center

Loyd, Brenda H.; And Others

1980-01-01

This study investigated the relationship between achievement test scores on the Iowa Tests of Basic Skills (ITBS) and Iowa Tests of Educational Development (ITED), and high school and college grade point average. Support for the predictive validity of the ITBS and ITED achievement test batteries is provided. (Author/GK)
The Effect of Black Peers on Black Test Scores

ERIC Educational Resources Information Center

Armor, David J.; Duck, Stephanie

2007-01-01

Recent studies have used increasingly complex methodologies to estimate the effect of peer characteristics--race, poverty, and ability--on student achievement. A paper by Hanushek, Kain, and Rivkin using Texas state testing data has received particularly wide attention because it found a large negative effect of school percent black on black math…
Dichotomous scoring of Trails B in patients referred for a dementia evaluation.

PubMed

Schmitt, Andrew L; Livingston, Ronald B; Smernoff, Eric N; Waits, Bethany L; Harris, James B; Davis, Kent M

2010-04-01

The Trail Making Test is a popular neuropsychological test and its interpretation has traditionally used time-based scores. This study examined an alternative approach to scoring that is simply based on the examinees' ability to complete the test. If an examinee is able to complete Trails B successfully, they are coded as "completers"; if not, they are coded as "noncompleters." To assess this approach to scoring Trails B, the performance of 97 diagnostically heterogeneous individuals referred for a dementia evaluation was examined. In this sample, 55 individuals successfully completed Trails B and 42 individuals were unable to complete it. Point-biserial correlations indicated a moderate-to-strong association (r(pb)=.73) between the Trails B completion variable and the Total Scale score of the Repeatable Battery for the Assessment of Neurological Status (RBANS), which was larger than the correlation between the Trails B time-based score and the RBANS Total Scale score (r(pb)=.60). As a screen for dementia status, Trails B completion showed a sensitivity of 69% and a specificity of 100% in this sample. These results suggest that dichotomous scoring of Trails B might provide a brief and clinically useful measure of dementia status.
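For readers unfamiliar with the screening statistics quoted in this abstract, the sketch below shows how sensitivity and specificity follow from a 2×2 table. The counts are assumptions: the abstract reports only the totals (97 examinees, 55 completers, 42 noncompleters) and the two percentages, so the table here is one hypothetical breakdown consistent with those figures, and it assumes that failing to complete Trails B counts as a positive screen.

```python
def sensitivity(tp, fn):
    """Share of true cases that the screen flags."""
    return tp / (tp + fn)

def specificity(tn, fp):
    """Share of non-cases that the screen clears."""
    return tn / (tn + fp)

# Illustrative counts only (assumed, not reported in the abstract):
# 42 noncompleters, all with dementia; 55 completers, 19 of them with dementia.
tp, fn, tn, fp = 42, 19, 36, 0

sens = sensitivity(tp, fn)
spec = specificity(tn, fp)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}")
```

Under these assumed counts the script prints a sensitivity of 0.69 and a specificity of 1.00, matching the reported values; other breakdowns may fit equally well.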
Is Education Associated with Improvements in General Cognitive Ability, or in Specific Skills?

ERIC Educational Resources Information Center

Ritchie, Stuart J.; Bates, Timothy C.; Deary, Ian J.

2015-01-01

Previous research has indicated that education influences cognitive development, but it is unclear what, precisely, is being improved. Here, we tested whether education is associated with cognitive test score improvements via domain-general effects on general cognitive ability ("g"), or via domain-specific effects on particular cognitive…
The Effect of Poverty on the Verbal Scores of Gifted Students

ERIC Educational Resources Information Center

Kaya, Fatih; Stough, Laura M.; Juntune, Joyce

2016-01-01

A nonexperimental design was used to determine whether the verbal scores of low-income gifted fifth graders (n = 38) differed from those of their higher income peers (n = 83). The Otis-Lennon School Ability Test, Eighth Edition and the Stanford Achievement Test-Tenth Edition were used to collect student data. Results of a MANOVA showed a…
Observed Score and True Score Equating Procedures for Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Brossman, Bradley Grant

2010-01-01

The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the Multidimensional Item Response Theory (MIRT) framework. Currently, MIRT scale linking procedures exist to place item parameter estimates and ability estimates on the same scale after separate calibrations are conducted.…
Contribution of Psychological, Social, and Mechanical Work Exposures to Low Work Ability

PubMed Central

Knardahl, Stein

2015-01-01

Objective: To determine the contribution of specific psychological, social, and mechanical work exposures to the self-reported low level of work ability. Methods: Employees from 48 organizations were surveyed over a 2-year period (n = 3779). Changes in 16 work exposures and 3 work ability measures—the work ability index score, perceived current, and future work ability—were tested with Spearman rank correlations. Binary logistic regressions were run to determine contribution of work exposures to low work ability. Results: Role conflict, human resource primacy, and positive challenge were the most consistent predictors of low work ability across test designs. Role clarity and fair leadership were less consistent but prominent predictors. Mechanical exposures were not predictive. Conclusions: To protect employee work ability, work place interventions would benefit from focusing on reducing role conflicts and on promoting positive challenges and human resource primacy. PMID:25470453
Separating Contributions of Hearing, Lexical Knowledge, and Speech Production to Speech-Perception Scores in Children with Hearing Impairments.

ERIC Educational Resources Information Center

Paatsch, Louise E.; Blamey, Peter J.; Sarant, Julia Z.; Martin, Lois F.A.; Bow, Catherine P.

2004-01-01

Open-set word and sentence speech-perception test scores are commonly used as a measure of hearing abilities in children and adults using cochlear implants and/or hearing aids. These tests are usually presented auditorily with a verbal response. In the case of children, scores are typically lower and more variable than for adults with hearing…
Predicting space telerobotic operator training performance from human spatial ability assessment

NASA Astrophysics Data System (ADS)

Liu, Andrew M.; Oman, Charles M.; Galvan, Raquel; Natapoff, Alan

2013-11-01

Our goal was to determine whether existing tests of spatial ability can predict an astronaut's qualification test performance after robotic training. Because training astronauts to be qualified robotics operators is so long and expensive, NASA is interested in tools that can predict robotics performance before training begins. Currently, the Astronaut Office does not have a validated tool to predict robotics ability as part of its astronaut selection or training process. Commonly used tests of human spatial ability may provide such a tool to predict robotics ability. We tested the spatial ability of 50 active astronauts who had completed at least one robotics training course, then used logistic regression models to analyze the correlation between spatial ability test scores and the astronauts' performance in their evaluation test at the end of the training course. The fit of the logistic function to our data is statistically significant for several spatial tests. However, the prediction performance of the logistic model depends on the criterion threshold assumed. To clarify the critical selection issues, we show how the probability of correct classification vs. misclassification varies as a function of the mental rotation test criterion level. Since the costs of misclassification are low, the logistic models of spatial ability and robotic performance are reliable enough only to be used to customize regular and remedial training. We suggest several changes in tracking performance throughout robotics training that could improve the range and reliability of predictive models.
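The dependence of classification accuracy on the criterion threshold, noted in this abstract, can be illustrated with a small sketch. Everything specific here is assumed for illustration: the coefficients are hypothetical rather than fitted values, the records are invented, and the function names are not from the study.

```python
import math

def pass_probability(score, intercept, slope):
    """Logistic model: P(pass | spatial test score)."""
    return 1.0 / (1.0 + math.exp(-(intercept + slope * score)))

def classification_rates(records, intercept, slope, threshold):
    """Return (correct, misclassified) fractions at a probability threshold."""
    correct = 0
    for score, passed in records:
        predicted_pass = pass_probability(score, intercept, slope) >= threshold
        correct += predicted_pass == passed
    rate = correct / len(records)
    return rate, 1.0 - rate

# Invented toy data: (spatial test score, passed qualification test).
records = [(10, False), (14, False), (18, True), (19, False),
           (22, True), (24, True), (30, True)]
INTERCEPT, SLOPE = -6.0, 0.3  # hypothetical coefficients, not fitted values

for threshold in (0.1, 0.5, 0.9):
    ok, miss = classification_rates(records, INTERCEPT, SLOPE, threshold)
    print(f"threshold={threshold:.1f} correct={ok:.2f} misclassified={miss:.2f}")
```

In this toy set the share of correct classifications is highest at the middle threshold and falls at either extreme, which is the kind of trade-off a selection criterion has to weigh.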
The Relationship between Cognitive Reserve and Math Abilities

PubMed Central

Arcara, Giorgio; Mondini, Sara; Bisso, Alice; Palmer, Katie; Meneghello, Francesca; Semenza, Carlo

2017-01-01

Cognitive Reserve is the capital of knowledge and experiences that an individual acquires over their life-span. Cognitive Reserve is strictly related to Brain Reserve, which is the ability of the brain to cope with damage. These two concepts could explain many phenomena such as the modality of onset in dementia or the different degree of impairment in cognitive abilities in aging. The aim of this study is to verify the effect of Cognitive Reserve, as measured by a questionnaire, on a variety of numerical abilities (number comprehension, reading and writing numbers, rules and principles, mental calculations and written calculations), in a group of healthy older people (aged 65–98 years). Sixty older individuals were interviewed with the Cognitive Reserve Index questionnaire (CRIq), and assessed with the Numerical Activities of Daily Living battery (NADL), which included formal tasks on math abilities, an informal test on math, one interview with the participant, and one interview with a relative on the perceived math abilities. We also took into account the years of education, as another proxy for Cognitive Reserve. In the multiple regression analyses on all formal tests, CRIq scores did not significantly predict math performance. Other variables, i.e., years of education and Mini-Mental State Examination score, accounted better for math performance on NADL. Only a subsection of CRIq, CRIq-Working-activity, was found to predict performance on a NADL subtest assessing informal use of math in daily life. These results show that education might better explain abstract math functions in late life than other aspects related to Cognitive Reserve, such as lifestyle or occupational attainment. PMID:29311910
The Relationship between Cognitive Reserve and Math Abilities.

PubMed

Arcara, Giorgio; Mondini, Sara; Bisso, Alice; Palmer, Katie; Meneghello, Francesca; Semenza, Carlo

2017-01-01

Cognitive Reserve is the capital of knowledge and experiences that an individual acquires over their life-span. Cognitive Reserve is strictly related to Brain Reserve, which is the ability of the brain to cope with damage. These two concepts could explain many phenomena such as the modality of onset in dementia or the different degree of impairment in cognitive abilities in aging. The aim of this study is to verify the effect of Cognitive Reserve, as measured by a questionnaire, on a variety of numerical abilities (number comprehension, reading and writing numbers, rules and principles, mental calculations and written calculations), in a group of healthy older people (aged 65-98 years). Sixty older individuals were interviewed with the Cognitive Reserve Index questionnaire (CRIq), and assessed with the Numerical Activities of Daily Living battery (NADL), which included formal tasks on math abilities, an informal test on math, one interview with the participant, and one interview with a relative on the perceived math abilities. We also took into account the years of education, as another proxy for Cognitive Reserve. In the multiple regression analyses on all formal tests, CRIq scores did not significantly predict math performance. Other variables, i.e., years of education and Mini-Mental State Examination score, accounted better for math performance on NADL. Only a subsection of CRIq, CRIq-Working-activity, was found to predict performance on a NADL subtest assessing informal use of math in daily life. These results show that education might better explain abstract math functions in late life than other aspects related to Cognitive Reserve, such as lifestyle or occupational attainment.
The Impact of Inclusion and Resource Instruction on Standardized Test Scores of Special Education Students

ERIC Educational Resources Information Center

Derico, Vontrice L.

2017-01-01

The purpose of the proposed quasi-experimental quantitative study was to determine if students who were taught in the inclusive setting yielded higher standardized test scores compared to students who were taught in the resource setting. The researcher analyzed the standardized test scores, in the areas of Language Arts, Reading, and Mathematics…
STABILITY OF ACADEMIC APTITUDE AND READING TEST SCORES OF MOBILE AND NON-MOBILE DISADVANTAGED CHILDREN.

ERIC Educational Resources Information Center

JUSTMAN, JOSEPH

CHANGES IN ACADEMIC APTITUDE AND ACHIEVEMENT TEST SCORES OF PUPILS ATTENDING PUBLIC SCHOOLS IN DISADVANTAGED AREAS IN NEW YORK CITY WERE INVESTIGATED. AN ATTEMPT WAS MADE TO DETERMINE WHETHER VARYING DEGREES OF MOBILITY WERE ASSOCIATED WITH VARIATION IN CHANGES IN TEST SCORES. THE CUMULATIVE RECORD CARDS OF SIXTH-GRADE PUPILS WERE EXAMINED TO…
Kindergarten Black-White Test Score Gaps: Replicating and Updating Previous Findings with New National Data

ERIC Educational Resources Information Center

Quinn, David

2014-01-01

A substantial body of evidence has shown large academic test score gaps between black and white students in early childhood. These gaps remain, and probably grow, as students progress through school. Many researchers have sought to explain these persistent test score gaps, and particularly, to understand the role of students' socio-economic status…
The Influence of an NCLB Accountability Plan on the Distribution of Student Test Score Gains

ERIC Educational Resources Information Center

Springer, Matthew G.

2008-01-01

Previous research on the effect of accountability programs on the distribution of student test score gains is decidedly mixed. This study examines the issue by estimating an educational production function in which test score gains are a function of the incentives schools have to focus instruction on below-proficient students. NCLB's threat of…

A preliminary investigation into the moral reasoning abilities of UK veterinarians.

PubMed

Batchelor, C E M; Creed, A; McKeegan, D E F

2015-08-01

Veterinary medicine is an ethically challenging profession, but the ethical reasoning abilities of practising veterinarians in the UK have never been formally assessed. This study investigated moral reasoning ability in 65 qualified veterinarians (38 practising and 27 academic) and 33 members of the public in the UK using the Defining Issues Test. Academic veterinarians had higher scores than members of the public but practising veterinarians did not. There was large variation in moral reasoning abilities among qualified veterinarians. Moral reasoning score in veterinarians did not improve with years of experience. These results show that, despite having a professional degree, moral reasoning skills of practising veterinarians may be insufficient to deal with the demands of their profession. This could have implications for animal welfare, client services and veterinarian wellbeing. The results highlight the need for more training in this area.
Genome-Wide Polygenic Scores Predict Reading Performance Throughout the School Years

PubMed Central

Selzam, Saskia; Dale, Philip S.; Wagner, Richard K.; DeFries, John C.; Cederlöf, Martin; O’Reilly, Paul F.; Krapohl, Eva; Plomin, Robert

2017-01-01

It is now possible to create individual-specific genetic scores, called genome-wide polygenic scores (GPS). We used a GPS for years of education (EduYears) to predict reading performance assessed at UK National Curriculum Key Stages 1 (age 7), 2 (age 12) and 3 (age 14) and on reading tests administered at ages 7 and 12 in a UK sample of 5,825 unrelated individuals. EduYears GPS accounts for up to 5% of the variance in reading performance at age 14. GPS predictions remained significant after accounting for general cognitive ability and family socioeconomic status. Reading performance of children in the lowest and highest 12.5% of the EduYears GPS distribution differed by a mean growth in reading ability of approximately two school years. It seems certain that polygenic scores will be used to predict strengths and weaknesses in education. PMID:28706435
Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

ERIC Educational Resources Information Center

Educational Testing Service, 2008

2008-01-01

The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…
Use of Standardized Test Scores to Predict Success in a Computer Applications Course

ERIC Educational Resources Information Center

Harris, Robert V.; King, Stephanie B.

2016-01-01

The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…
A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

ERIC Educational Resources Information Center

Lee, Guemin; Park, In-Yong

2012-01-01

Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
How Changes in Families and Schools Are Related to Trends in Black-White Test Scores

ERIC Educational Resources Information Center

Berends, Mark; Lucas, Samuel R.; Penaloza, Roberto V.

2008-01-01

Through several decades of research, a great deal has been written about trends in black-white test scores and the factors that may explain the gaps in different subject areas. Only a few studies have examined the changing relationships between gaps in students' test scores and family and school measures in nationally representative data over…
Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

PubMed

Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

2014-01-01

The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p < .001) and increased specificity (43.8%), but did not increase sensitivity, which remained high (85.4%). A simple 6-point scoring system for the Clock Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.
The Relationship between Academic Averages of Primary School Science and Technology Class and Test Sub-Test Scores of Placement Test of Science

ERIC Educational Resources Information Center

Guzeller, Cem Oktay

2012-01-01

This research examined the relationships among written exam scores for the 6th-, 7th-, and 8th-grade science and technology class; project, class-participation, and performance-work scores; year-end academic success point averages; and sub-test raw scores on the LDT science test for the same grades. Academic success point averages were used as…
Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

ERIC Educational Resources Information Center

Minor, Elizabeth Covay

2016-01-01

Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…
Commentary on "Validating the Interpretations and Uses of Test Scores"

ERIC Educational Resources Information Center

Brennan, Robert L.

2013-01-01

Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Using Test Scores from Students with Disabilities in Teacher Evaluation

ERIC Educational Resources Information Center

Buzick, Heather M.; Jones, Nathan D.

2015-01-01

Much of the recent focus of educational policymakers has been on improving the measurement of teacher effectiveness. Linking student growth to teacher effects has been a large part of reform efforts. To date, neither researchers nor practitioners have arrived at a consensus on how to treat test scores from students with disabilities in…
Piloting a Polychotomous Partial-Credit Scoring Procedure in a Multiple-Choice Test

ERIC Educational Resources Information Center

Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna

2014-01-01

Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…
Scoring life insurance applicants' laboratory results, blood pressure and build to predict all-cause mortality risk.

PubMed

Fulks, Michael; Stout, Robert L; Dolan, Vera F

2012-01-01

Evaluate the degree of medium to longer term mortality prediction possible from a scoring system covering all laboratory testing used for life insurance applicants, as well as blood pressure and build measurements. Using the results of testing for life insurance applicants who reported a Social Security number in conjunction with the Social Security Death Master File, the mortality associated with each test result was defined by age and sex. The individual mortality scores for each test were combined for each individual and a composite mortality risk score was developed. This score was then tested against the insurance applicant dataset to evaluate its ability to discriminate risk across age and sex. The composite risk score was highly predictive of all-cause mortality risk in a linear manner from the best to worst quintile of scores in a nearly identical fashion for each sex and decade of age. Laboratory studies, blood pressure and build from life insurance applicants can be used to create scoring that predicts all-cause mortality across age and sex. Such an approach may hold promise for preventative health screening as well.
What's in a Teacher Test? Assessing the Relationship between Teacher Licensure Test Scores and Student STEM Achievement and Course-Taking. Working Paper 158

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the relationship between teacher licensure test scores and student test achievement and high school course-taking. We focus on three subject/grade combinations--middle school math, ninth-grade algebra and geometry, and ninth-grade biology--and find evidence that a teacher's basic skills test scores are modestly predictive of student…
The Bender Gestalt Test with the Human Figure Drawing Test for Young School Children. A Manual for Use with the Koppitz Scoring System.

ERIC Educational Resources Information Center

Koppitz, Elizabeth Munsterberg

Presented is a manual for scoring the Bender Gestalt Test and the Human Figure Drawing Test for screening and diagnostic uses with emotionally disturbed, brain damaged, or perceptually handicapped 5- to 11-year-old children. Given are suggestions for administering and scoring the Bender test which examines distortion of shape, rotation,…
A pretest prognostic score to assess patients undergoing exercise or pharmacological stress testing.

PubMed

Morise, Anthony; Evans, Matthew; Jalisi, Farrukh; Shetty, Rajendra; Stauffer, Marc

2007-02-01

A previously developed pretest score was validated to stratify patients presenting for exercise testing with suspected coronary disease according to the presence of angiographic coronary disease. Our goal was to determine how well this pretest score risk stratified patients undergoing pharmacological and exercise stress tests concerning prognostic endpoints. Retrospective cohort analysis. University hospital stress laboratory. 7452 unselected ambulatory patients with symptoms of suspected coronary disease undergoing stress testing between 1995 and 2004. All-cause death, cardiac death and non-fatal myocardial infarction. The rate of all-cause death was 5.5% (CI 5.0 to 6.1) with 4.3 (SD 2.4) years of follow-up (Exercise 2.8% (CI 2.3 to 3.2) v Pharmacological group 11.9% (CI 10.5 to 13.3); p<0.001). The rate of cardiac death/myocardial infarction was 2.6% (CI 2.2 to 3.0) (Exercise 1.4% (CI 1.1 to 1.8) v Pharmacological group 5.3% (CI 4.3 to 6.2); p<0.001). In both groups, stratification by pretest score was significant for all-cause death and the combined endpoint. However, stratification was more effective in the pharmacological group using the combined endpoint rather than all-cause death. Pharmacological stress patients in intermediate and high risk groups were at higher risk than their respective exercise test cohorts. Referral for pharmacological stress testing was found to be an independent predictor of time to death (2.7 (CI 2.0 to 3.6); p<0.001). A pretest score previously validated to stratify according to angiographic outcomes, effectively risk stratified pharmacological and exercise stress patients according to the combined endpoint of cardiac death/myocardial infarction.
TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

ERIC Educational Resources Information Center

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

2012-01-01

Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…
Acute Kidney Injury Enhances Outcome Prediction Ability of Sequential Organ Failure Assessment Score in Critically Ill Patients

PubMed Central

Chang, Chih-Hsiang; Fan, Pei-Chun; Chang, Ming-Yang; Tian, Ya-Chung; Hung, Cheng-Chieh; Fang, Ji-Tseng; Yang, Chih-Wei; Chen, Yung-Chang

2014-01-01

Introduction: Acute kidney injury (AKI) is a common and serious complication in intensive care unit (ICU) patients and is often part of a multiple organ failure syndrome. The sequential organ failure assessment (SOFA) score is an excellent tool for assessing the extent of organ dysfunction in critically ill patients. This study aimed to evaluate the outcome prediction ability of the SOFA and Acute Physiology and Chronic Health Evaluation (APACHE) III scores in ICU patients with AKI. Methods: A total of 543 critically ill patients were admitted to the medical ICU of a tertiary-care hospital from July 2007 to June 2008. Demographic, clinical and laboratory variables were prospectively recorded for post hoc analysis as predictors of survival on the first day of ICU admission. Results: One hundred and eighty-seven (34.4%) patients presented with AKI on the first day of ICU admission based on the risk of renal failure, injury to kidney, failure of kidney function, loss of kidney function, and end-stage renal failure (RIFLE) classification. Major causes of ICU admission involved respiratory failure (58%). Overall in-ICU mortality was 37.9% and hospital mortality was 44.7%. The predictive accuracy for ICU mortality of SOFA (areas under the receiver operating characteristic curves: 0.815±0.032) was as good as that of APACHE III in the AKI group. However, cumulative survival rates at 6-month follow-up following hospital discharge differed significantly (p<0.001) for SOFA score ≤10 vs. ≥11 in these ICU patients with AKI. Conclusions: For ICU patients with coexisting AKI, this work recommends that physicians apply SOFA to assess ICU mortality because of its practicality and low cost. A SOFA score of ≥11 on ICU day 1 should be considered an indicator of negative short-term outcome. PMID:25279844
Mixed handedness and achievement test scores of middle school boys.

PubMed

Sarma, P S B

2008-10-01

The purpose of the study was to replicate, with a sample of middle school boys, the findings of an earlier study of fourth grade boys manifesting mixed handedness. Among 32 mixed-handed boys in Grades 6 to 8, the right-handed writer, left-handed thrower group obtained low spelling scores (Normal Curve Equivalent Scores) on the California Achievement Test significantly more frequently than the left-handed writer, right-handed thrower group. These findings are consistent with data for Grade 4 boys in the earlier study. Findings strengthen the hypotheses that mixed handedness is not a unitary neuropsychological entity and that boys who write with the right hand and throw with the left hand might be at risk for certain academic deficits.
Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

PubMed

Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

2017-11-01

Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of the Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenience sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.
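The trade-off between sensitivity and specificity at candidate cut-off scores, as reported in the abstract above, can be illustrated with a minimal sketch. It assumes that a score at or below the cut-off counts as screen-positive (lower scores indicating more impairment). The function name and the toy data are hypothetical and are not the study's data.

```python
def sensitivity_specificity(scores, has_condition, cutoff):
    """Return (sensitivity, specificity) for a 'score <= cutoff' screening rule.

    Assumption: lower scores indicate impairment, so a score at or below
    the cut-off is treated as a positive screen.
    """
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_condition):
        positive = score <= cutoff
        if diseased and positive:
            tp += 1          # correctly flagged
        elif diseased:
            fn += 1          # missed case
        elif positive:
            fp += 1          # false alarm
        else:
            tn += 1          # correctly cleared
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical toy data for illustration only.
scores = [2, 4, 5, 6, 7, 8, 9, 10, 9, 6]
has_dementia = [True, True, True, True,
                False, False, False, False, False, False]

for cutoff in (6, 7):
    sens, spec = sensitivity_specificity(scores, has_dementia, cutoff)
    print(f"cut-off {cutoff}: sensitivity={sens:.2f}, specificity={spec:.2f}")
```

Raising the cut-off can only keep or increase sensitivity and keep or decrease specificity under this rule, which is why a study reports both values for each candidate score.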

A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

PubMed

Bersabé, Rosa; Rivas, Teresa

2010-05-01

The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.
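A short sketch of how such cut-offs can arise, assuming a baseline-category parameterisation of the MLR model in which log(P(Y=j)/P(Y=ref)) = a_j + b_j·x. Setting P(Y=j) = P(Y=j+1) then gives x* = (a_j − a_{j+1}) / (b_{j+1} − b_j). The paper's own general equation may be written differently. The function names and coefficients below are hypothetical and are not estimates from the EAT-26 study.

```python
import math


def category_probs(x, intercepts, slopes):
    """Category probabilities from baseline-category logits.

    The reference category (index 0) has intercept 0 and slope 0.
    """
    logits = [a + b * x for a, b in zip(intercepts, slopes)]
    top = max(logits)                      # subtract max for numerical stability
    exps = [math.exp(v - top) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cutoff_between(j, intercepts, slopes):
    """Score at which categories j and j+1 are equally likely."""
    return (intercepts[j] - intercepts[j + 1]) / (slopes[j + 1] - slopes[j])


# Hypothetical coefficients for three ordinal categories
# (e.g. asymptomatic, symptomatic, disorder); index 0 is the reference.
intercepts = [0.0, -4.0, -10.0]
slopes = [0.0, 0.25, 0.5]

cutoffs = [cutoff_between(j, intercepts, slopes) for j in range(2)]
print(cutoffs)

# At each cut-off the two adjacent categories have equal probability.
for j, x_star in enumerate(cutoffs):
    p = category_probs(x_star, intercepts, slopes)
    print(j, round(p[j], 4), round(p[j + 1], 4))
```

With these illustrative coefficients, scores below the first cut-off favour the first category over the second, and scores above the second cut-off favour the third category over the second.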
Report: States See Test-Score Gains

ERIC Educational Resources Information Center

Viadero, Debra

2004-01-01

This article discusses a report from Education Trust, a Washington-based research and advocacy group. The report says almost half the states have seen rising math scores on their state exams for elementary school pupils since the federal No Child Left Behind law was enacted. It also states that reading scores have improved among 4th and 5th…
School Choice in Suburbia: Test Scores, Race, and Housing Markets

ERIC Educational Resources Information Center

Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane

2009-01-01

Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…
Correlations Between Chiropractic National Board (Part I) Scores and Basic Science Course Grades and Related Data.

ERIC Educational Resources Information Center

Wolfenberger, Virginia

1999-01-01

A study at one institution found significant correlations between students' scores on the National Board of Chiropractic Examiners test and academic achievement data. Results indicate that it is not always course subject matter that influences the relationship between course grade and board scores, but may instead be the ability to assimilate…
The Effect of Mobility on Texas Assessment of Knowledge and Skills Test Scores

ERIC Educational Resources Information Center

Alvarez, Ray

2006-01-01

This research studies the effects of mobility on the high-stakes test scores of a Title I South Central Texas school district. The study involved 10, 5th-grade elementary feeder school populations graduating to the 6th grade in 3 middle schools. The researcher compared the 1st administration scores of the Texas Assessment of Knowledge and Skills…
Automated essay scoring and the future of educational assessment in medical education.

PubMed

Gierl, Mark J; Latifi, Syed; Lai, Hollis; Boulais, André-Philippe; De Champlain, André

2014-10-01

Constructed-response tasks, which range from short-answer tests to essay questions, are included in assessments of medical knowledge because they allow educators to measure students' ability to think, reason, solve complex problems, communicate and collaborate through their use of writing. However, constructed-response tasks are also costly to administer and challenging to score because they rely on human raters. One alternative to the manual scoring process is to integrate computer technology with writing assessment. The process of scoring written responses using computer programs is known as 'automated essay scoring' (AES). An AES system uses a computer program that builds a scoring model by extracting linguistic features from a constructed-response prompt that has been pre-scored by human raters and then, using machine learning algorithms, maps the linguistic features to the human scores so that the computer can be used to classify (i.e. score or grade) the responses of a new group of students. The accuracy of the score classification can be evaluated using different measures of agreement. Automated essay scoring provides a method for scoring constructed-response tests that complements the current use of selected-response testing in medical education. The method can serve medical educators by providing the summative scores required for high-stakes testing. It can also serve medical students by providing them with detailed feedback as part of a formative assessment process. Automated essay scoring systems yield scores that consistently agree with those of human raters at a level as high as, if not higher than, the level of agreement among human raters themselves. The system offers medical educators many benefits for scoring constructed-response tasks, such as improving the consistency of scoring, reducing the time required for scoring and reporting, minimising the costs of scoring, and providing students with immediate feedback on constructed-response tasks. © 2014
Critical-Inquiry-Based-Learning: Model of Learning to Promote Critical Thinking Ability of Pre-service Teachers

NASA Astrophysics Data System (ADS)

Prayogi, S.; Yuanita, L.; Wasis

2018-01-01

This study aimed to develop the Critical-Inquiry-Based-Learning (CIBL) learning model to promote the critical thinking (CT) ability of preservice teachers. The CIBL learning model was developed to meet the criteria of validity, practicality, and effectiveness. Validation of the model involved 4 expert validators through a focus group discussion (FGD). The CIBL learning model was declared valid for promoting CT ability, with a validity level (Va) of 4.20 and a reliability (r) of 90.1% (very reliable). The practicality of the model was evaluated during an implementation involving 17 preservice teachers. The CIBL learning model was declared practical, as measured by learning feasibility (LF), which met the very good criteria (LF-score = 4.75). The effectiveness of the model was evaluated from the improvement in CT ability after the implementation of the model. CT ability was evaluated using a scoring technique adapted from the Ennis-Weir Critical Thinking Essay Test. The average CT ability score was -1.53 on the pretest (uncritical criteria) and 8.76 on the posttest (critical criteria), with an N-gain score of 0.76 (high criteria). Based on the results of this study, it can be concluded that the developed CIBL learning model is feasible for promoting the CT ability of preservice teachers.
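For readers unfamiliar with N-gain, the following sketch shows the usual normalized-gain calculation, (post − pre) / (max − pre), together with the conventional bands (high at 0.7 or above, medium from 0.3, otherwise low). The abstract does not state the maximum score of the adapted test; the value 12 used below is an assumption chosen only because it is consistent with the reported figures, and the function names are hypothetical.

```python
def normalized_gain(pre, post, max_score):
    """Normalized gain: fraction of the available improvement achieved."""
    if max_score <= pre:
        raise ValueError("max_score must exceed the pretest score")
    return (post - pre) / (max_score - pre)


def gain_category(g):
    """Conventional bands: high (g >= 0.7), medium (0.3 <= g < 0.7), low."""
    if g >= 0.7:
        return "high"
    if g >= 0.3:
        return "medium"
    return "low"


# Pretest and posttest means are from the abstract; the maximum score of 12
# is an ASSUMPTION (not stated in the abstract).
ASSUMED_MAX_SCORE = 12
g = normalized_gain(pre=-1.53, post=8.76, max_score=ASSUMED_MAX_SCORE)
print(round(g, 2), gain_category(g))
```

Because the pretest mean is negative, the denominator exceeds the nominal maximum, which is why the assumed scale maximum matters for the result.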
Estimates of premorbid ability in a neurodegenerative disease clinic population: comparing the Test of Premorbid Functioning and the Wide Range Achievement Test, 4th Edition.

PubMed

Berg, Jody-Lynn; Durant, January; Banks, Sarah J; Miller, Justin B

2016-05-01

Two frequently used measures to assess premorbid intellectual ability include the Wide Range Achievement Test, 4th Edition Reading Subtest (WRAT-4 READ) and the Test of Premorbid Functioning (TOPF). The present study compared estimates obtained from these measures in a neurodegenerative disease population. Records from 85 referrals seen for neuropsychological evaluation in a neurodegenerative disorders clinic were reviewed. Evaluations included TOPF, WRAT-4 READ, and measures of memory, reasoning, language, and executive functioning. Pairwise correlations and concordance correlation coefficients (CCC) were calculated between raw scores and predicted intelligence estimates. Discrepancy scores were calculated between estimates and data were divided into three groups based on size of standardized discrepancy score: Equal, WRAT-4 READ > TOPF, and TOPF > WRAT-4 READ. Analyses of variance compared groups on demographic characteristics and cognitive performance. Despite strong Pearson correlation, CCC between predicted IQ estimates showed poor agreement between measures, with evidence of both fixed and proportional bias. Discrepancies ranged from -24.0 to 22.0 (M = 1.78, SD = 6.65), with TOPF generating higher estimates on average. Individuals performing better on WRAT-4 READ were significantly older (M age = 76.26, SD = 7.53) than those performing similarly on both measures and those performing better on TOPF (F (2, 82) = 7.31, p < .001). All other comparisons between groups on demographic variables and cognitive measures were non-significant. Estimates of premorbid intelligence obtained from the TOPF and WRAT-4 READ have a strong linear relationship, but systematically generate inconsistent estimates in a neurodegenerative disease clinical sample and should not be used interchangeably.
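The contrast between a strong Pearson correlation and poor concordance can be made concrete with a small sketch of Lin's concordance correlation coefficient, CCC = 2·s_xy / (s_x² + s_y² + (mean_x − mean_y)²). The function name and the two score lists are hypothetical illustrations, not data from the study. They are constructed so that one measure is a shifted and rescaled copy of the other, which mimics fixed and proportional bias.

```python
from statistics import fmean


def pearson_and_ccc(x, y):
    """Return (Pearson r, Lin's concordance correlation coefficient)."""
    n = len(x)
    mx, my = fmean(x), fmean(y)
    # Population (divide-by-n) variances and covariance, as in Lin's definition.
    sxx = sum((a - mx) ** 2 for a in x) / n
    syy = sum((b - my) ** 2 for b in y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y)) / n
    pearson = sxy / (sxx * syy) ** 0.5
    ccc = 2 * sxy / (sxx + syy + (mx - my) ** 2)
    return pearson, ccc


# Hypothetical estimates: measure_b = 1.2 * measure_a - 10, so the two are
# perfectly linearly related but differ in both location and scale.
measure_a = [90, 95, 100, 105, 110]
measure_b = [98, 104, 110, 116, 122]

r, ccc = pearson_and_ccc(measure_a, measure_b)
print(f"Pearson r = {r:.3f}, CCC = {ccc:.3f}")
```

The Pearson coefficient only measures how well the points fit some straight line, whereas the CCC penalises departures from the identity line, so it drops when the two measures disagree in level or spread.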
Effects of correcting for prematurity on cognitive test scores in childhood.

PubMed

Wilson-Ching, Michelle; Pascoe, Leona; Doyle, Lex W; Anderson, Peter J

2014-03-01

The American Academy of Pediatrics recommends that test scores should be corrected for prematurity up to 3 years of age, but this practice varies greatly in both clinical and research settings. The aim of this study was to contrast the effects of using chronological age and those of using corrected age on measures of cognitive outcome across childhood. A theoretical model was constructed using norms from the Bayley Scales of Infant and Toddler Development, Third Edition; the Wechsler Preschool and Primary Scale of Intelligence, Third Edition Australian; and the Wechsler Intelligence Scales for Children, Fourth Edition Australian. Baseline scores representing different levels of functioning (70, below average; 85, borderline; and 100, average) were recalculated using the normative data for ages 6 months to 16 years to account for 1, 2, 3 and 4 months of prematurity. The model created depicted the difference in standardised scores between chronological and corrected age. Compared with scores corrected for prematurity, the absolute reduction in scores using chronological age was greater for increasing degree of prematurity, younger ages at assessment and higher baseline scores and was substantial even beyond 3 years of age. However, the pattern was erratic, with considerable fluctuation evident across different ages and baseline scores. Chronological age results in a lowering of scores at all ages for preterm-born subjects that is greater in the first few years and in those born at earlier gestational ages. Whether or not to correct for prematurity depends upon the context of the assessment. © 2014 The Authors. Journal of Paediatrics and Child Health © 2014 Paediatrics and Child Health Division (Royal Australasian College of Physicians).
How Parents Can Help Kids Improve Test Scores: Taking the Stakes out of Literacy Testing

ERIC Educational Resources Information Center

Schneider, Steven

2006-01-01

In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…
The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

ERIC Educational Resources Information Center

Ockey, Gary J.

2009-01-01

The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…
Noncognitive Skills and the Gender Disparities in Test Scores and Teacher Assessments: Evidence from Primary School

ERIC Educational Resources Information Center

Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica

2013-01-01

Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…
Using imputed genotype data in the joint score tests for genetic association and gene-environment interactions in case-control studies.

PubMed

Song, Minsun; Wheeler, William; Caporaso, Neil E; Landi, Maria Teresa; Chatterjee, Nilanjan

2018-03-01

Genome-wide association studies (GWAS) are now routinely imputed for untyped single nucleotide polymorphisms (SNPs) based on various powerful statistical algorithms for imputation trained on reference datasets. The use of predicted allele counts for imputed SNPs as the dosage variable is known to produce a valid score test for genetic association. In this paper, we investigate how to best handle imputed SNPs in various modern complex tests for genetic associations incorporating gene-environment interactions. We focus on case-control association studies where inference for an underlying logistic regression model can be performed using alternative methods that rely to varying degrees on an assumption of gene-environment independence in the underlying population. As increasingly large-scale GWAS are being performed through consortium efforts where it is preferable to share only summary-level information across studies, we also describe simple mechanisms for implementing score tests based on standard meta-analysis of "one-step" maximum-likelihood estimates across studies. Applications of the methods in simulation studies and a dataset from a GWAS of lung cancer illustrate the ability of the proposed methods to maintain type-I error rates for the underlying testing procedures. For analysis of imputed SNPs, similar to typed SNPs, the retrospective methods can lead to considerable efficiency gain for modeling of gene-environment interactions under the assumption of gene-environment independence. Methods are made available for public use through the CGEN R software package. © 2017 WILEY PERIODICALS, INC.
Web-based training and interrater reliability testing for scoring the Hamilton Depression Rating Scale.

PubMed

Rosen, Jules; Mulsant, Benoit H; Marino, Patricia; Groening, Christopher; Young, Robert C; Fox, Debra

2008-10-30

Despite the importance of establishing shared scoring conventions and assessing interrater reliability in clinical trials in psychiatry, these elements are often overlooked. Obstacles to rater training and reliability testing include logistic difficulties in providing live training sessions, or mailing videotapes of patients to multiple sites and collecting the data for analysis. To address some of these obstacles, a web-based interactive video system was developed. It uses actors of diverse ages, gender and race to train raters how to score the Hamilton Depression Rating Scale and to assess interrater reliability. This system was tested with a group of experienced and novice raters within a single site. It was subsequently used to train raters of a federally funded multi-center clinical trial on scoring conventions and to test their interrater reliability. The advantages and limitations of using interactive video technology to improve the quality of clinical trials are discussed.
Enhancement of Spatial Ability in Girls in a Single-Sex Environment through Spatial Experience and the Impact on Information Seeking

ERIC Educational Resources Information Center

Swarlis, Linda L.

2008-01-01

The test scores of spatial ability for women lag behind those of men in many spatial tests. On the Mental Rotations Test (MRT), a significant gender gap has existed for over 20 years and continues to exist. High spatial ability has been linked to efficiencies in typical computing tasks including Web and database searching, text editing, and…
Opportunity to learn: Investigating possible predictors for pre-course Test Of Astronomy STandards TOAST scores

NASA Astrophysics Data System (ADS)

Berryhill, Katie J.

As astronomy education researchers become more interested in experimentally testing innovative teaching strategies to enhance learning in introductory astronomy survey courses ("ASTRO 101"), scholars are placing increased attention toward better understanding factors impacting student gain scores on the widely used Test Of Astronomy STandards (TOAST). Usually used in a pre-test and post-test research design, one might naturally assume that the pre-course differences observed between high- and low-scoring college students might be due in large part to their pre-existing motivation, interest, experience in science, and attitudes about astronomy. To explore this notion, 11 non-science majoring undergraduates taking ASTRO 101 at west coast community colleges were interviewed in the first few weeks of the course to better understand students' pre-existing affect toward learning astronomy with an eye toward predicting student success. In answering this question, we hope to contribute to our understanding of the incoming knowledge of students taking undergraduate introductory astronomy classes, but also gain insight into how faculty can best meet those students' needs and assist them in achieving success. Perhaps surprisingly, there was only weak correlation between students' motivation toward learning astronomy and their pre-test scores. Instead, the most fruitful predictor of TOAST pre-test scores was the quantity of pre-existing, informal, self-directed astronomy learning experiences.
Individual Differences in Digit Span, Susceptibility to Proactive Interference, and Aptitude/Achievement Test Scores.

ERIC Educational Resources Information Center

Dempster, Frank N.; Cooney, John B.

1982-01-01

Individual differences in digit span, susceptibility to proactive interference, and various aptitude/achievement test scores were investigated in two experiments with college students. Results indicated that digit span was strongly correlated with aptitude/achievement scores, but did not indicate that susceptibility to proactive interference…
Association between Ability Emotional Intelligence and Left Insula during Social Judgment of Facial Emotions.

PubMed

Quarto, Tiziana; Blasi, Giuseppe; Maddalena, Chiara; Viscanti, Giovanna; Lanciano, Tiziana; Soleti, Emanuela; Mangiulli, Ivan; Taurisano, Paolo; Fazio, Leonardo; Bertolino, Alessandro; Curci, Antonietta

2016-01-01

The human ability to identify, process and regulate emotions from social stimuli is generally referred to as Emotional Intelligence (EI). Within EI, Ability EI identifies a performance measure assessing individual skills at perceiving, using, understanding and managing emotions. Previous models suggest that a brain "somatic marker circuitry" (SMC) sustains emotional sub-processes included in EI. Three primary brain regions are included: the amygdala, the insula and the ventromedial prefrontal cortex (vmPFC). Here, our aim was to investigate the relationship between Ability EI scores and SMC activity during social judgment of emotional faces. Sixty-three healthy subjects completed a test measuring Ability EI and underwent fMRI during a social decision task (i.e. approach or avoid) about emotional faces with different facial expressions. Imaging data revealed that EI scores are associated with left insula activity during social judgment of emotional faces as a function of facial expression. Specifically, higher EI scores are associated with greater left insula activity during social judgment of fearful faces but also with lower activity of this region during social judgment of angry faces. These findings indicate that the association between Ability EI and the SMC activity during social behavior is region- and emotion-specific.
A pretest prognostic score to assess patients undergoing exercise or pharmacological stress testing

PubMed Central

Morise, Anthony; Evans, Matthew; Jalisi, Farrukh; Shetty, Rajendra; Stauffer, Marc

2007-01-01

Objective: A previously developed pretest score was validated to stratify patients presenting for exercise testing with suspected coronary disease according to the presence of angiographic coronary disease. Our goal was to determine how well this pretest score risk stratified patients undergoing pharmacological and exercise stress tests concerning prognostic endpoints. Design: Retrospective cohort analysis. Setting: University hospital stress laboratory. Patients: 7452 unselected ambulatory patients with symptoms of suspected coronary disease undergoing stress testing between 1995 and 2004. Main outcomes measures: All‐cause death, cardiac death and non‐fatal myocardial infarction. Results: The rate of all‐cause death was 5.5% (CI 5.0 to 6.1) with 4.3 (SD 2.4) years of follow‐up (Exercise 2.8% (CI 2.3 to 3.2) v Pharmacological group 11.9% (CI 10.5 to 13.3); p<0.001). The rate of cardiac death/myocardial infarction was 2.6% (CI 2.2 to 3.0) (Exercise 1.4% (CI 1.1 to 1.8) v Pharmacological group 5.3% (CI 4.3 to 6.2); p<0.001). In both groups, stratification by pretest score was significant for all‐cause death and the combined endpoint. However, stratification was more effective in the pharmacological group using the combined endpoint rather than all‐cause death. Pharmacological stress patients in intermediate and high risk groups were at higher risk than their respective exercise test cohorts. Referral for pharmacological stress testing was found to be an independent predictor of time to death (2.7 (CI 2.0 to 3.6); p<0.001). Conclusion: A pretest score previously validated to stratify according to angiographic outcomes, effectively risk stratified pharmacological and exercise stress patients according to the combined endpoint of cardiac death/myocardial infarction. PMID:17228070
Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

PubMed

Curtis, David

2018-01-01

Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.

Obesity and motor coordination ability in Taiwanese children with and without developmental coordination disorder.

PubMed

Zhu, Yi-Ching; Wu, Sheng K; Cairney, John

2011-01-01

The purpose of this study was to investigate the associations between obesity and motor coordination ability in Taiwanese children with and without developmental coordination disorder (DCD). 2029 children (1078 boys, 951 girls) aged nine to ten years were chosen randomly from 14 elementary schools across Taiwan. We used bioelectrical impedance analysis to measure percentage of body fat (PBF) and the Movement Assessment Battery for Children test (MABC test) to evaluate motor coordination ability. Using cut-off points based on PBF from past studies, boys and girls were each divided into obese, overweight and normal-weight groups. In boys, total impairment scores and scores on the balance subtest in the MABC were significantly higher in the obese and overweight groups when compared against the normal-weight group. Girls in the obese and the overweight groups had higher balance impairment scores than those of the normal-weight group. Among boys, the prevalence of obesity was highest in the DCD group, when compared to the borderline DCD and TD boys. A higher percentage of DCD girls were overweight and obese than TD girls. Obesity may be associated with poor motor coordination ability among boys and girls, particularly in relation to balance ability. Children with DCD may have a higher risk of being overweight or obese in Taiwan. Copyright © 2010 Elsevier Ltd. All rights reserved.
A Study of Mental Ability Testing and Its Implications for the Oklahoma City Public Schools.

ERIC Educational Resources Information Center

Hall, Janie L.

This study follows a 1980 moratorium on group mental ability testing called by the district's superintendent when questions relating to the informational value and cost-effectiveness of the Otis Lennon Mental Ability Test (OLMA) were raised by the Oklahoma City Public School District. Criticisms of intelligence tests and relevant issues are…
Can Machine Scoring Deal with Broad and Open Writing Tests as Well as Human Readers?

ERIC Educational Resources Information Center

McCurry, Doug

2010-01-01

This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…
Ability and Motivation: Assessing Individual Factors that Contribute to University Retention

ERIC Educational Resources Information Center

Alarcon, Gene M.; Edwards, Jean M.

2013-01-01

The current study explored individual differences in ability and motivation factors of retention in first-year college students. We used discrete-time survival mixture analysis to model university retention. Parents' education, gender, American College Test (ACT) scores, conscientiousness, and trait affectivity were explored as predictors of…
Pediatric residents' learning styles and temperaments and their relationships to standardized test scores.

PubMed

Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S

2011-12-01

Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.
Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

PubMed

Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

2011-08-15

Cross-sectional study. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female, with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates patients who require surgery from those who do not.
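The abstract above reports summed domain scores and a Cronbach's α for each domain. As an illustration only, the following minimal Python sketch shows how a summed score and Cronbach's α are commonly computed from item responses. The function name and the sample responses are hypothetical and are not taken from the study; the formula is the standard one, α = k/(k-1) · (1 - Σ item variances / variance of summed scores).

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    Uses the sample variance for both the item variances and the
    variance of the summed scores.
    """
    k = len(responses[0])
    items = list(zip(*responses))                 # one tuple of scores per item
    item_var_sum = sum(variance(col) for col in items)
    totals = [sum(row) for row in responses]      # summed domain score per respondent
    return k / (k - 1) * (1 - item_var_sum / variance(totals))


# Hypothetical responses: 4 respondents answering 3 items on a 1-5 scale.
sample = [
    [1, 2, 1],
    [3, 3, 4],
    [4, 5, 4],
    [5, 5, 5],
]
print([sum(row) for row in sample])   # summed scores per respondent
print(round(cronbach_alpha(sample), 3))
```

When every item gives the same score for each respondent, the items are perfectly consistent and α equals 1, which is a quick sanity check on the implementation.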
Benton Judgment of Line Orientation (JoLO) Test: A Brief and Useful Measure for Assessing Visuospatial Abilities in Manifest, but not Premanifest, Huntington's Disease.

PubMed

Corey-Bloom, Jody; Gluhm, Shea; Herndon, Andrew; Haque, Ameera S; Park, Sungmee; Gilbert, Paul E

2016-01-01

Visuospatial deficits have been described in Huntington's disease (HD); however, the extent of these deficits remains unclear. The Benton Judgment of Line Orientation (JoLO) Test, commonly used to assess visuospatial ability, requires minimal motor involvement. It has demonstrated sensitivity to visuospatial deficits in Parkinson's disease; however, few studies have examined performance on this test in HD. The objective of the current study was to assess visuospatial ability in premanifest and manifest HD using the JoLO. A global cognitive measure, the Mattis Dementia Rating Scale (DRS), was used to stratify manifest HD patients as mild (DRS ≥129) vs. moderate-severe (DRS ≤128). Fifty mild, 42 moderate-severe, and 30 premanifest HD subjects, as well as 35 matched controls, were administered the JoLO. HD Burden of Pathology (BOP) scores were used as a measure of disease severity. Results revealed that the total manifest HD sample (p < 0.001), in addition to the mild (p = 0.028), and moderate-severe (p < 0.001), but not premanifest, HD subjects scored significantly lower on the JoLO compared to normal controls. Our results suggest that the JoLO is useful for detecting visuospatial deficits across various stages of manifest HD. However, any visuospatial impairment that might be present during the premanifest stage of HD was not detected using the JoLO in the present sample.
Effect of Item Arrangement, Knowledge of Arrangement, and Test Anxiety on Two Scoring Methods.

ERIC Educational Resources Information Center

Plake, Barbara S.; And Others

1981-01-01

Number right and elimination scores were analyzed on a college level mathematics exam assembled from pretest data. Anxiety measures were administered along with the experimental forms to undergraduates. Results suggest that neither test scores nor attitudes are influenced by item order, knowledge thereof, or anxiety level. (Author/GK)
ACER Mathematics Profile Series: Number Test. (Test Booklet, Answer and Record Sheet, Score Key, and Teachers Handbook).

ERIC Educational Resources Information Center

Cornish, Greg; Wines, Robin

The Number Test of the ACER Mathematics Profile Series contains 30 items for each of three suggested grade levels: 7-8, 8-9, and 9-10. Raw scores on all tests in the ACER Mathematics Profile Series (Number, Operations, Space and Measurement) are converted to a common scale called MAPS, a major feature of the Series. Based on the Rasch Model,…
A non-verbal technique for the assessment of general intellectual ability in selection of aviation personnel.

DOT National Transportation Integrated Search

1971-06-01

A study was conducted in which performance on a non-verbal problem-solving task was correlated with the Otis Quick Scoring Mental Ability Test and the Raven Progressive Matrices Test. The problem-solving task, called 'code-lock' required the subjec...
Reflective ability and moral reasoning in final year medical students: a semi-qualitative cohort study.

PubMed

Chalmers, Patricia; Dunngalvin, Audrey; Shorten, George

2011-01-01

Moral reasoning and reflective ability are important concepts in medical education. To date, the association between reflective ability and moral reasoning in medical students has not been measured. This study tested the hypotheses that, amongst final year medical students, (1) moral reasoning and reflective ability improve over time and (2) positive change in reflective ability favourably influences moral reasoning. With Institutional Ethical approval, 56 medical students (of a class of 110) participated fully both at the beginning and end of the final academic year. Reflective ability and moral reasoning were assessed at each time using Sobral's reflection-in-learning scale (RLS), Boenink's overall reflection (BOR) score and by employing Kohlberg's schema for moral reasoning. The most important findings were that (1) students' level of reflective ability scores related to medicine decreased significantly over the course of the year, (2) students demonstrated a predominantly conventional level of moral reasoning at both the beginning and end of the year, (3) moral reasoning scores tended to decrease over the course of the year and (4) RLS is a strong predictor of change in moral reasoning over time. This study confirms the usefulness of Sobral's RLS and BOR score for evaluating moral development in the context of medical education. This study further documents regression and levelling in the moral reasoning of final year medical students and a decrease in reflective ability applied in the medical context. Further studies are required to determine factors that would favourably influence reflective ability and moral reasoning among final year medical students.
Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

ERIC Educational Resources Information Center

Cascallar, Alicia S.; Dorans, Neil J.

2005-01-01

This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…
Investigation of basic cognitive predictors of reading and spelling abilities in Tunisian third-grade primary school children.

PubMed

Batnini, Soulef; Uno, Akira

2015-06-01

This study first investigated the main cognitive abilities (phonological processing, visual cognition, automatization and receptive vocabulary) in predicting reading and spelling abilities in Arabic. Second, we compared good/poor readers and spellers to detect the characteristics of cognitive predictors which contribute to identifying reading and spelling difficulties in Arabic speaking children. A sample of 116 Tunisian third-grade children was tested on their abilities to read and spell, phonological processing, visual cognition, automatization and receptive vocabulary. For reading, phonological processing and automatization uniquely predicted Arabic word reading and paragraph reading abilities. Automatization uniquely predicted Arabic non-word reading ability. For spelling, phonological processing was a unique predictor for Arabic word spelling ability. Furthermore, poor readers had significantly lower scores on the phonological processing test and slower reading times on the automatization test as compared with good readers. Additionally, poor spellers showed lower scores on the phonological processing test as compared with good spellers. Visual cognitive processing and receptive vocabulary were not significant cognitive predictors of Arabic reading and spelling abilities for Tunisian third grade children in this study. Our results are consistent with previous studies in alphabetic orthographies and demonstrate that phonological processing and automatization are the best cognitive predictors in detecting early literacy problems. We suggest that including phonological processing and automatization tasks in screening tests and in intervention programs may help Tunisian children with poor literacy skills overcome reading and spelling difficulties in Arabic. Copyright © 2014 The Japanese Society of Child Neurology. Published by Elsevier B.V. All rights reserved.
Developmental prosopagnosia and the Benton Facial Recognition Test.

PubMed

Duchaine, Bradley C; Nakayama, Ken

2004-04-13

The Benton Facial Recognition Test is used for clinical and research purposes, but evidence suggests that it is possible to pass the test with impaired face discrimination abilities. The authors tested 11 patients with developmental prosopagnosia using this test, and a majority scored in the normal range. Consequently, scores in the normal range should be interpreted cautiously, and testing should always be supplemented by other face tests.
Lexical-Access Ability and Cognitive Predictors of Speech Recognition in Noise in Adult Cochlear Implant Users

PubMed Central

Smits, Cas; Merkus, Paul; Festen, Joost M.; Goverts, S. Theo

2017-01-01

Not all of the variance in speech-recognition performance of cochlear implant (CI) users can be explained by biographic and auditory factors. In normal-hearing listeners, linguistic and cognitive factors determine most of speech-in-noise performance. The current study explored specifically the influence of visually measured lexical-access ability compared with other cognitive factors on speech recognition of 24 postlingually deafened CI users. Speech-recognition performance was measured with monosyllables in quiet (consonant-vowel-consonant [CVC]), sentences-in-noise (SIN), and digit-triplets in noise (DIN). In addition to a composite variable of lexical-access ability (LA), measured with a lexical-decision test (LDT) and word-naming task, vocabulary size, working-memory capacity (Reading Span test [RSpan]), and a visual analogue of the SIN test (text reception threshold test) were measured. The DIN test was used to correct for auditory factors in SIN thresholds by taking the difference between SIN and DIN: SRTdiff. Correlation analyses revealed that duration of hearing loss (dHL) was related to SIN thresholds. Better working-memory capacity was related to SIN and SRTdiff scores. LDT reaction time was positively correlated with SRTdiff scores. No significant relationships were found for CVC or DIN scores with the predictor variables. Regression analyses showed that together with dHL, RSpan explained 55% of the variance in SIN thresholds. When controlling for auditory performance, LA, LDT, and RSpan separately explained, together with dHL, respectively 37%, 36%, and 46% of the variance in SRTdiff outcome. The results suggest that poor verbal working-memory capacity and to a lesser extent poor lexical-access ability limit speech-recognition ability in listeners with a CI. PMID:29205095
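The abstract above describes SRTdiff as the difference between the SIN and DIN thresholds, used to correct sentence-in-noise results for auditory factors. A minimal Python sketch of that correction follows. The subtraction order (SIN minus DIN), the function name, and the threshold values are assumptions for illustration; the abstract does not give per-listener data.

```python
def srt_diff(sin_threshold_db, din_threshold_db):
    """Return SRTdiff, assumed here to be SIN threshold minus DIN threshold (dB SNR)."""
    return sin_threshold_db - din_threshold_db


# Hypothetical thresholds in dB SNR for three listeners: (SIN, DIN).
listeners = [(6.0, 1.5), (9.5, 2.0), (4.0, 0.5)]
diffs = [srt_diff(sin, din) for sin, din in listeners]
print(diffs)  # [4.5, 7.5, 3.5]
```

Subtracting the DIN threshold removes the part of the SIN threshold that is shared with a task placing low demands on linguistic skill, so what remains is more closely tied to the linguistic and cognitive factors the study examines.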
Evaluating the ability of dental technician students and graduate dentists to match tooth color.

PubMed

Sinmazisik, Gulden; Trakyali, Goksu; Tarcin, Bilge

2014-12-01

The ability of dental technician students to match tooth shade with the Vita 3D-Master shade guide and Toothguide Training Box has not been investigated. The purpose of this study was to evaluate and compare the shade-matching ability of dental technician students and graduate dentists using the Vita 3D-Master shade guide. Twenty-nine dental technician students (DTS group) and 30 graduate dentists (GD group) participated in this study. The Toothguide Training Box (TTB) was used to train the participants and test their shade-matching abilities. Shade-matching ability was evaluated with 3 exercises and a final test, all of which are components of the TTB. The number of mistakes for each participant for value (L), chroma (c), and hue (h) were recorded during the exercises and the final test, and the mistake ratios were calculated. Color difference (ΔE) values for each shade were calculated from the L*, a*, and b* values of the Vita 3D-Master shade guide for each participant in both groups. The Mann-Whitney U test was used to determine statistically significant differences between the L, c, and h mistake ratios of the 2 groups, and the Student t test was used to determine statistically significant differences between the final test scores and the ΔE values of the groups (α=.05). The mistake ratio for L in the GD group was significantly higher than that of the DTS group (P<.05), whereas the mistake ratio for h in the DTS group was higher (P<.001). No significant differences were observed between the groups regarding the mistake ratios for c (P>.05). With regard to the final test scores and the ΔE values, no significant differences were found between the groups (P<.001), and the DTS group received higher scores than the GD group (912 and 851). The mean ΔE values for the DTS and GD groups were 1.72 and 2.92. DTSs made more mistakes in the h parameter than GDs, and GDs made more mistakes in the L parameter than DTSs. With regard to the final test scores and the…
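The abstract above states that ΔE values were calculated from L*, a*, and b* values but does not say which color-difference formula was used. As a sketch only, the Python snippet below assumes the simple CIE76 definition, ΔE = sqrt(ΔL*² + Δa*² + Δb*²), which is the Euclidean distance in CIELAB space. The function name and the coordinates are hypothetical.

```python
import math


def delta_e_cie76(lab_target, lab_selected):
    """CIE76 color difference: Euclidean distance between two (L*, a*, b*) triples.

    Assumption: the study may have used a different ΔE formula.
    """
    return math.dist(lab_target, lab_selected)


# Hypothetical CIELAB coordinates for a target shade and the shade a participant chose.
target = (70.0, 2.0, 18.0)
selected = (73.0, 2.0, 22.0)
print(delta_e_cie76(target, selected))  # 5.0
```

A ΔE of 0 means the selected shade tab matches the target exactly; larger values indicate a larger perceptual mismatch.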
Effect of vowel context on test-retest nasalance score variability in children with and without cleft palate.

PubMed

Ha, Seunghee; Jung, Seungeun; Koh, Kyung S

2018-06-01

The purpose of this study was to determine whether test-retest nasalance score variability differs between Korean children with and without cleft palate (CP) and whether vowel context influences variability in nasalance score. Thirty-four 3-to-5-year-old children with and without CP participated in the study. Three 8-syllable speech stimuli devoid of nasal consonants were used for data collection. Each stimulus was loaded with high, low, or mixed vowels, respectively. All participants were asked to repeat the speech stimuli twice after the examiner, and an immediate test-retest nasalance score was assessed with no headgear change. Children with CP exhibited significantly greater absolute difference in nasalance scores than children without CP. Variability in nasalance scores was significantly different for the vowel context, and the high vowel sentence showed a significantly larger difference in nasalance scores than the low vowel sentence. The cumulative frequencies indicated that, for children with CP in the high vowel sentence, only 8 of 17 (47%) repeated nasalance scores were within 5 points. Test-retest nasalance score variability was greater for children with CP than children without CP, and there was greater variability for the high vowel sentence(s) for both groups. Copyright © 2018 Elsevier B.V. All rights reserved.
Scoring Method of a Situational Judgment Test: Influence on Internal Consistency Reliability, Adverse Impact and Correlation with Personality?

ERIC Educational Resources Information Center

De Leng, W. E.; Stegers-Jager, K. M.; Husbands, A.; Dowell, J. S.; Born, M. Ph.; Themmen, A. P.

2017-01-01

Situational Judgment Tests (SJTs) are increasingly used for medical school selection. Scoring an SJT is more complicated than scoring a knowledge test, because there are no objectively correct answers. The scoring method of an SJT may influence the construct and concurrent validity and the adverse impact with respect to non-traditional students.…
Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules.

PubMed

Bereby-Meyer, Yoella; Meyer, Joachim; Budescu, David V

2003-02-01

This paper assesses framing effects on decision making with internal uncertainty, i.e., partial knowledge, by focusing on examinees' behavior in multiple-choice (MC) tests with different scoring rules. In two experiments participants answered a general-knowledge MC test that consisted of 34 solvable and 6 unsolvable items. Experiment 1 studied two scoring rules involving Positive (only gains) and Negative (only losses) scores. Although answering all items was the dominating strategy for both rules, the results revealed a greater tendency to answer under the Negative scoring rule. These results are in line with the predictions derived from Prospect Theory (PT) [Econometrica 47 (1979) 263]. The second experiment studied two scoring rules, which allowed respondents to exhibit partial knowledge. Under the Inclusion-scoring rule the respondents mark all answers that could be correct, and under the Exclusion-scoring rule they exclude all answers that might be incorrect. As predicted by PT, respondents took more risks under the Inclusion rule than under the Exclusion rule. The results illustrate that the basic process that underlies choice behavior under internal uncertainty and especially the effect of framing is similar to the process of choice under external uncertainty and can be described quite accurately by PT. Copyright 2002 Elsevier Science B.V.
Effects of Scoring by Section and Independent Scorers' Patterns on Scorer Reliability in Biology Essay Tests

ERIC Educational Resources Information Center

Ebuoh, Casmir N.; Ezeudu, S. A.

2015-01-01

The study investigated the effects of scoring by section, use of independent scorers and conventional patterns on scorer reliability in Biology essay tests. The literature review revealed that the conventional pattern of scoring all items at a time in essay tests had been criticized for not being reliable. The study was true experimental study…
