Science.gov

Sample records for act test scores

  1. The Hand Test Acting Out Score as a Predictor of Acting Out in Correctional Settings

    ERIC Educational Resources Information Center

    Porecki, Daniel; Vandergoot, David

    1978-01-01

    The Hand Test was administered to 107 maximum-security prison inmates. The Acting Out Score (AOS) was computed, and one year later, actual acting out for the same inmates was recorded. Determined that the Hand Test AOS is useful in identifying inmates who have potential for acting out. (Author)

  2. The Relationship of Scores on Elizur's Hostility System on the Rorschach to the Acting-Out Score on the Hand Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1978-01-01

    The relationship between Elizur's Hostility Scoring on the Rorschach Test and the Acting-Out Score on the Hand Test was examined. Correlations between the two measures (using several scoring procedures) ranged from .40 to .64. (JKS)

  3. Relating Scores on the Enhanced ACT Assessment and the SAT Test Batteries.

    ERIC Educational Resources Information Center

    Houston, Walter; Sawyer, Richard

    1991-01-01

    A study examined the appropriateness of two methods for relating the enhanced American College Testing Program (ACT) Assessment Composite and Scholastic Aptitude Test (SAT) total scores: equipercentile concordance and regression. Study results, potential inaccuracies, and the usefulness of institutional concordances based on their own data are…

  4. The Effects of Using Selected Metacognitive Strategies on ACT Mathematics Sub-Test Scores

    ERIC Educational Resources Information Center

    LeMay, Jeffrey W.

    2016-01-01

    This quasi-experimental post-test only control group designed quantitative study examined whether or not members of an experimental group of participants who utilized two metacognitive strategy training regimens experienced a significant increase in their ACT mathematics sub-test scores compared to a group of students who did not utilize either of…

  5. The Effects of Using Selected Metacognitive Strategies on ACT Mathematics Sub-Test Scores

    ERIC Educational Resources Information Center

    LeMay, Jeffrey W.

    2016-01-01

    This quasi-experimental post-test only control group designed quantitative study examined whether or not members of an experimental group of participants who utilized two metacognitive strategy training regimens experienced a significant increase in their ACT mathematics sub-test scores compared to a group of students who did not utilize either of…

  6. Predicting Scores on the College-Level Examination Program (CLEP) General Examinations from Scores Earned on the American College Test (ACT) Assessment.

    ERIC Educational Resources Information Center

    Nimmer, Donald N.; Shakiba-Nejad, Hadi

    The study was conducted to provide formulae by which College-Level Examination Program (CLEP) General Examination scores may be predicted from scores earned on the American College Test (ACT) Assessment. Five basic areas of liberal arts achievement are measured by the CLEP General Examinations: English Composition, Humanities, Mathematics, Natural…

  7. Estimating Reading Skill from ACT Assessment Scores.

    ERIC Educational Resources Information Center

    Noble, Julie

    1986-01-01

    A study correlating Nelson-Denny Reading Test scores with American College Testing Program Assessments (ACT) indicates that reading skill can be predicted accurately from the ACT social studies reading and English usage subtests. (MSE)

  8. Test Scoring [book review].

    ERIC Educational Resources Information Center

    Meijer, Rob R.

    2003-01-01

    This book discusses how to obtain test scores and, in particular, how to obtain test scores from tests that consist of a combination of multiple choice and open-ended questions. The strength of the book is that scoring solutions are presented for a diversity of real world scoring problems. (SLD)

  9. Test Scoring [book review].

    ERIC Educational Resources Information Center

    Meijer, Rob R.

    2003-01-01

    This book discusses how to obtain test scores and, in particular, how to obtain test scores from tests that consist of a combination of multiple choice and open-ended questions. The strength of the book is that scoring solutions are presented for a diversity of real world scoring problems. (SLD)

  10. Test facilities for SCORE-D

    NASA Astrophysics Data System (ADS)

    Greuel, Dirk; Deeken, Jan; Suslov, Dmitry; Schäfer, Klaus; Schlechtriem, Stefan

    2013-06-01

    The LOX/LH2 Staged Combustion Rocket Engine Demonstrator (SCORE-D) is part of ESA's Future Launcher Preparatory Program (FLPP). SCORE-D serves as a technology demonstrator in perspective of the development of the High Thrust Engine (HTE), which is designated as a candidate for the main stage engine of the Next Generation Launcher (NGL). To develop and test the SCORE-D engine, ESA investigates configurations of the test benches P3.2 and P5 at DLR test site in Lampoldshausen. For the SCORE-D Hot Combustion Devices (HCD) development, i.e. Pre-burner (PB) and thrust chamber assembly (TCA), the P3.2 test facility has to be modified for further usage. Recently, the first steps in this endeavor have been made with the evaluation of the necessary modifications to the facility. To accommodate the SCORE-D engine, it is foreseen to modify the P5 test facility in the coming years. In the last year, DLR has started the design phase for these modifications. In preparatory test programs at the P8 test facility, Astrium has conducted sub-scale hot combustion devices tests. While Astrium designed and manufactured the sub-scale assembly of the pre-burner and the main combustion chamber (MCC) for SCORE-D, DLR operated the P8 test facility.

  11. Score test variable screening

    PubMed Central

    Zhao, Sihai Dave; Li, Yi

    2015-01-01

    Variable screening has emerged as a crucial first step in the analysis of high-throughput data, but existing procedures can be computationally cumbersome, difficult to justify theoretically, or inapplicable to certain types of analyses. Motivated by a high-dimensional censored quantile regression problem in multiple myeloma genomics, this paper makes three contributions. First, we establish a score test-based screening framework, which is widely applicable, extremely computationally efficient, and relatively simple to justify. Secondly, we propose a resampling-based procedure for selecting the number of variables to retain after screening according to the principle of reproducibility. Finally, we propose a new iterative score test screening method which is closely related to sparse regression. In simulations we apply our methods to four different regression models and show that they can outperform existing procedures. We also apply score test screening to an analysis of gene expression data from multiple myeloma patients using a censored quantile regression model to identify high-risk genes. PMID:25124197

  12. Counseling College-Bound Students: Can ACT Scores Be Predicted?

    ERIC Educational Resources Information Center

    Carvajal, Howard; And Others

    1989-01-01

    Examined relationships between American College Testing Program (ACT) academic test and Peabody Picture Vocabulary Test-Revised (PPVT-R), Wechsler Adult Intelligence Scale-Revised (WAIS-R), and 1986 Stanford-Binet Intelligence Scale: Fourth Edition (Binet IV). Thirty college students for whom ACT scores were available took other tests. Concluded…

  13. Counseling College-Bound Students: Can ACT Scores Be Predicted?

    ERIC Educational Resources Information Center

    Carvajal, Howard; And Others

    1989-01-01

    Examined relationships between American College Testing Program (ACT) academic test and Peabody Picture Vocabulary Test-Revised (PPVT-R), Wechsler Adult Intelligence Scale-Revised (WAIS-R), and 1986 Stanford-Binet Intelligence Scale: Fourth Edition (Binet IV). Thirty college students for whom ACT scores were available took other tests. Concluded…

  14. Do Test Scores Buy Happiness?

    ERIC Educational Resources Information Center

    McCluskey, Neal

    2017-01-01

    Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…

  15. Do later wake times and increased sleep duration of 12th graders result in more studying, higher grades, and improved SAT/ACT test scores?

    PubMed

    Cole, James S

    2016-09-01

    The aim of this study was to investigate the relationship between sleep duration, wake time, and hours studying on high school grades and performance on the Scholastic Aptitude Test (SAT)/ American College Testing (ACT) college entrance exams. Data were collected from 13,071 recently graduated high school seniors who were entering college in the fall of 2014. A column proportions z test with a Bonferroni adjustment was used to analyze proportional differences. Analysis of covariance (ANCOVA) was used to examine mean group differences. Students who woke up prior to 6 a.m. and got less than 8 h of sleep (27 %) were significantly more likely to report studying 11 or more hours per week (30 %), almost double the rate compared to students who got more than 8 h of sleep and woke up the latest (16 %). Post hoc results revealed students who woke up at 7 a.m. or later reported significantly higher high school grades than all other groups (p < 0.001), with the exception of those students who woke up between 6:01 a.m. and 7:00 a.m. and got eight or more hours of sleep. The highest reported SAT/ACT scores were from the group that woke up after 7 a.m. but got less than 8 h sleep (M = 1099.5). Their scores were significantly higher than all other groups. This study provides additional evidence that increased sleep and later wake time are associated with increased high school grades. However, this study also found that students who sleep the longest also reported less studying and lower SAT/ACT scores.

  16. Hedonism or Higher Test Scores?

    ERIC Educational Resources Information Center

    Wold, Donald C.

    2004-01-01

    In the 20 years since the federal report on education "A Nation at Risk" appeared, much has been written on test scores of students in the United States versus their counterparts elsewhere. One of the issues is whether their scores are in fact inferior, or merely a statistical difference due to their universal schooling philosophy. Since…

  17. Development of Predicted Paths for ACT Aspire Score Reports. WP-2014-06

    ERIC Educational Resources Information Center

    Allen, Jeff

    2014-01-01

    This report focuses on the initial development of the predicted score paths for ACT Aspire reporting. The paths provide predicted score ranges for the next two years--as well as predicted ACT score ranges for tests administered at grades 9 and 10. Longitudinal ACT Aspire data for students tested in spring 2013 and spring 2014, as well as…

  18. EDUCATION AND PSYCHOLOGICAL TEST SCORES

    PubMed Central

    Pershad, Dwarka; Verma, S. K.

    1980-01-01

    Education, a long neglected variable affecting psychological test score, is in search of reemphasis. Some evidence for this has accumulated on the psychological tests constructed and standardized here at the department of Psychiatry, P.G.I., Chandigarh. Tentative norms prepared education wise on WAIS-Verbal section, PGI-Memory Scale, Proverb and Similarity Tests, Psychoticism Questionnaire, and PGI MQN 2, for adults, in the age range of 16-50, are reported. The results showed marked difference in the mean scores of different educational categories and thus stressed the need for reporting norms separately for different educational levels. PMID:22064617

  19. Estimating Reading Skill from ACT Assessment Scores. Research Report No. 88.

    ERIC Educational Resources Information Center

    Noble, Julie

    A study examined whether ACT subtest scores can be used to predict reading skills, as measured by the Nelson-Denny Reading Test, with a degree of accuracy that would support their use as a screening device for college placement. ACT test scores of 2,431 students were used to predict Form C Nelson-Denny raw scores. ACT test scores from 3,016…

  20. Television and the Test Scores.

    ERIC Educational Resources Information Center

    Schramm, Wilbur

    There is no conclusive evidence as yet that television has provided sufficient cause for the decline in scores on the Scholastic Aptitude Test and other measures of academic achievement or aptitude, although it may be one of several elements in a complex causal system. The major studies show that television viewing, after the early school years,…

  1. Developing and scoring essay tests.

    PubMed

    Oermann, M

    1999-01-01

    The need to prepare nursing students for the licensing examination has resulted in a predominance of multiple-choice testing in nursing courses. But what about evaluating students' ability to present ideas in their own words and develop creative responses to questions posed by the teacher? Essay items provide an effective means of assessing higher levels of learning and ability to organize and present ideas in writing. The author describes how to develop essay items and score responses.

  2. Reducing Test Length with Polychotomous Scoring

    DTIC Science & Technology

    1993-10-01

    using a dichotomous IRT model . A Brief Description or Polyweighting The examinee scores obtained when using polyweights to score a test are called...scoring and with scoring under the 1-parameter ( Rasch ) and 2-parameter logistic IRT models . The latter scoring methods assign scores to examinees in a...polyweighting and dichotomous Item Response Theory [IRTn scoring) traditional number-correct (NC) scoring. The study used data collected from applicants for

  3. Aptitude Test Score Trends: 1959-1984.

    ERIC Educational Resources Information Center

    Brown, Dianne C.

    The decline in standardized test scores during the 1960s and 1970s is well documented and is seen in both aptitude and achievement test scores. This paper describes and analyzes the test score trends over the 1960s, 1970s and early 1980s for five aptitude tests: (1) the Scholastic Aptitude Test; (2) the American College Test; (3) the Preliminary…

  4. The effect of lab based instruction on ACT science scores

    NASA Astrophysics Data System (ADS)

    Hamilton, Michelle

    Standardized tests, although unpopular, are required for a multitude of reasons. One of these tests is the ACT. The ACT is a college readiness test that many high school juniors take to gain college admittance. Students throughout the United States are unprepared for this assessment. The average high school junior is three points behind twenty-four, the ACT recommended score, for the science section. The science section focuses on reading text and, interpreting graphs, charts, tables and diagrams with an emphasis on experimental design and relationships among variables. For students to become better at interpreting and understanding scientific graphics they must have vast experience developing their own graphics. The purpose of this study was to provide students the opportunity to generate their own graphics to master interpretation of them on the ACT. According to a t-test the results show that students who are continually exposed to creating graphs are able to understand and locate information from graphs at a significantly faster rate.

  5. More than Just Test Scores

    ERIC Educational Resources Information Center

    Levin, Henry M.

    2012-01-01

    Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…

  6. Moving the Cut Score on Rasch Scored Tests.

    ERIC Educational Resources Information Center

    Miller, G. Edward; Beretvas, S. Natasha

    2002-01-01

    Presents empirically based item selection guidelines for moving the cut score on equated tests consisting of "n" dichotomous items calibrated assuming the Rasch model. Derivations of lemmas that underlie the guidelines are provided as well as a simulated example. (SLD)

  7. Predictive Validity of the ACT Tests at Selective Colleges.

    ERIC Educational Resources Information Center

    Lenning, Oscar T.

    Included are three studies, each dealing with an aspect of comparative validity of the American College Test (ACT) and the Scholastic Aptitude Test (SAT) at selective colleges. The first study considered the predictive efficiency of the ACT test scores and ACT test scores plus high school grades at 120 colleges, separated into three groups…

  8. ACT Scores of Incoming Freshmen to New Mexico State University by High School Size.

    ERIC Educational Resources Information Center

    Edington, Everett D.

    The American College Testing (ACT) scores of New Mexico public high school graduates who applied for admission to New Mexico State University in 1981 were analyzed to determine the relationship between ACT scores and school size. The schools were divided into 6 categories according to enrollment: less than 100 students (15 applicants), 100-199…

  9. Moving the cut score on Rasch scored tests.

    PubMed

    Miller, G Edward; Beretvas, S Natasha

    2002-01-01

    Empirically based item selection guidelines are presented for moving the cut score on equated tests consisting of n dichotomous items calibrated assuming the Rasch model. The cut score on a test form B, c(B), may be made higher than test form A's cut score, c(A), in the following ways: (1) select items for test form B such that the variance of test form B's item difficulties, sigma(2)(B), will be equal to test form A's sigma(2)(A), but test form B's mean item difficulty, mu(B), will be less that of test form A, mu(A); (2) given c(A) > n/2, select items for test form B such that mu(B) test form B such that mu(B) s(2)(A). To make c(B) lower than c(A), the direction of the changes listed above for the two tests item difficulties sigma(2) and mu should be reversed. Derivations of lemmas that underlie the guidelines are provided as well as a simulated example.

  10. Accountancy, teaching methods, sex, and American College Test scores.

    PubMed

    Heritage, J; Harper, B S; Harper, J P

    1990-10-01

    This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

  11. The relationship between beginning college chemistry achievement and prior knowledge, number of college mathematics courses completed, GPA of college mathematics courses completed, levels of Piagetian intellectual development, mathematics ACT score, science ACT score, and composite ACT score

    NASA Astrophysics Data System (ADS)

    Barthel, Margaret Gorjanc

    This study was designed to investigate the relationship between achievement in Chem 33 (a third semester general chemistry course) and the following predictor variables: prior knowledge (Chem 32 grade), level of Piagetian intellectual development, number of college mathematics courses previously completed, grades received in prior college mathematics courses, science ACT scores, mathematics ACT scores, and composite ACT scores. During the winter semester of the 1994--1995 school year, the students who were enrolled in two sections of Chem 33, taught by the same professor, were given the TOLT test. In addition, their student records were examined in order to determine the grade they received in Chem 32, the number of mathematics courses Completed in college, the GPA for the mathematics courses completed in college, the science ACT score, mathematics ACT score, and the composite ACT score. A Pearson Correlation was performed on these variables to determine to what extent they were able to predict the student's achievement, measured by the grade in Chem 33. What was found in respect to achievement in Chem 33 was: prior knowledge was r = 0.67, GPA of mathematics courses completed in college was r = 0.58, mathematics ACT was r = 0.50, composite ACT was r = 0.40, and TOLT was r = 0.33. A stepwise multiple regression was also performed to see what interactions could be found between the variables themselves. There was no single predictor variable that adequately serves as a dominant predictor variable for success in Chem 33. It can be concluded that the Chem 32 grade (prior knowledge) coupled with the GPA of the college mathematics courses completed in college as well as the mathematics ACT score are the best predictors for achievement in Chem 33.

  12. Conditional Reliability Coefficients for Test Scores.

    PubMed

    Nicewander, W Alan

    2017-04-06

    The most widely used, general index of measurement precision for psychological and educational test scores is the reliability coefficient-a ratio of true variance for a test score to the true-plus-error variance of the score. In item response theory (IRT) models for test scores, the information function is the central, conditional index of measurement precision. In this inquiry, conditional reliability coefficients for a variety of score types are derived as simple transformations of information functions. It is shown, for example, that the conditional reliability coefficient for an ordinary, number-correct score, X, is equal to, ρ(X,X'|θ)=I(X,θ)/[I(X,θ)+1] Where: θ is a latent variable measured by an observed test score, X; p(X, X'|θ) is the conditional reliability of X at a fixed value of θ; and I(X, θ) is the score information function. This is a surprisingly simple relationship between the 2, basic indices of measurement precision from IRT and classical test theory (CTT). This relationship holds for item scores as well as test scores based on sums of item scores-and it holds for dichotomous as well as polytomous items, or a mix of both item types. Also, conditional reliabilities are derived for computerized adaptive test scores, and for θ-estimates used as alternatives to number correct scores. These conditional reliabilities are all related to information in a manner similar-or-identical to the 1 given above for the number-correct (NC) score. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  13. The Uses and Misuses of Test Scores: Technical Assistance Perspective.

    ERIC Educational Resources Information Center

    Echternacht, Gary

    The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…

  14. The Uses and Misuses of Test Scores: Technical Assistance Perspective.

    ERIC Educational Resources Information Center

    Echternacht, Gary

    The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…

  15. Confidence Test Scoring and Incentive Conditions.

    ERIC Educational Resources Information Center

    Rippey, Robert M.

    The effects of incentive conditions on the results of a confidence test were investigated. Two hundred thirty high school subjects were administered a very difficult confidence scored test under two conditions: 1) that the test would count heavily on their grades (incentive condition) and 2) that the test was for research purposes and would not be…

  16. Polychotomous Responses and the Test Score.

    ERIC Educational Resources Information Center

    Samejima, Fumiko

    Traditionally, the test score represented by the number of items answered correctly was taken as an indicator of the examinee's ability level. Researchers still tend to think that the number-correct score is a way of ordering individuals with respect to the latent trait. The objective of this study is to depict the benefits of using ability…

  17. Deriving Enrollment Management Scores from ACT Data. IR Applications, Volume 16, November 15, 2008

    ERIC Educational Resources Information Center

    Saupe, Joe L.; Curs, Bradley R.

    2008-01-01

    This study is an investigation of the derivation of scores that predict whether or not prospective first-time freshmen will apply or will enroll and whether or not first-time freshman enrollees will graduate using data from the ACT (American College Testing) assessment. Using a regression methodology, four basic scores are derived to be…

  18. On the Scoring of Cloze Tests.

    ERIC Educational Resources Information Center

    Clausing, Gerhard; Senko, Donna

    1978-01-01

    Cloze testing and language performance is discussed as are two techniques for awarding partial credit: the quick performance measurement and feedback technique and the three-stage scoring hierarchy for partial credit. A figure and tables are included. (EJS)

  19. What We Lose in Winning the Test Score Race

    ERIC Educational Resources Information Center

    Jorgenson, Olaf

    2012-01-01

    To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools…

  20. What We Lose in Winning the Test Score Race

    ERIC Educational Resources Information Center

    Jorgenson, Olaf

    2012-01-01

    To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools…

  1. Teacher Greetings Increase College Students' Test Scores

    ERIC Educational Resources Information Center

    Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

    2009-01-01

    The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…

  2. Teacher Greetings Increase College Students' Test Scores

    ERIC Educational Resources Information Center

    Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

    2009-01-01

    The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…

  3. Pre-Professional Skills Test Scores as College of Education Admission Criteria.

    ERIC Educational Resources Information Center

    Mikitovics, Audrey; Crehan, Kevin D.

    2002-01-01

    Examined the validity of Pre-Professional Skills Test (PPST) scores as college admission criteria. Examination of teacher education program graduates' American College Test (ACT) scores, PPST subtest scores, PPST scores, undergraduate grade point averages (GPA), and student teaching ratings highlighted a concurrent relationship between…

  4. Concordance between ACT and ITED Scores from Different Populations

    ERIC Educational Resources Information Center

    Yin, Ping; Brennan, Robert L.; Kolen, Michael J.

    2004-01-01

    The use of educational tests in practical situations often requires that tests that are built to different content and/or statistical specifications be statistically linked. When equating methods are applied to tests that differ in content, difficulty, and/or reliability, the resulting scores cannot be used interchangeably. This study examined…

  5. Cautions on the Use of State Aggregate ACT Scores.

    ERIC Educational Resources Information Center

    American Coll. Testing Program, Iowa City, IA.

    The American College Testing (ACT) Assessment consists of four curriculum-based achievement tests designed to assess critical reasoning and higher-order thinking skills in English, mathematics, reading, and science. These tests reflect students' skills and achievement levels as products of their high school experience and serve as measures of…

  6. What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

    ERIC Educational Resources Information Center

    McIntosh, James; Munk, Martin D.

    2014-01-01

    Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…

  7. What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

    ERIC Educational Resources Information Center

    McIntosh, James; Munk, Martin D.

    2014-01-01

    Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…

  8. Does parental physical violence reduce children's standardized test score performance?

    PubMed

    Peek-Asa, Corinne; Maxwell, Leah; Stromquist, Ann; Whitten, Paul; Limbos, Mary Ann; Merchant, James

    2007-11-01

    Many negative cognitive and behavioral outcomes have been identified among children living in households with parental violence, but few studies have examined academic performance. In a rural population-based cohort, we examine the role of parental violence on standardized test score performance. The cohort included 306 children ages 6 through 17. Parents responded to a health interview that included questions about physical violence. Children's standardized test scores were collected prospectively for 5 years after the parent interview. Hierarchical multivariate models clustering on school, household, and repeated individual test scores and controlling for children's and parent's characteristics were run to predict test score performance. One in five children lived in a household in which parents reported at least one act of physical violence. Children whose parents reported intimate partner violence (IPV) performed an average of 12.2 percentile points lower than children whose parents reported no IPV (95% CI, -19.2--5.2; p < 0.001). Parent-reported IPV led to larger test score reductions for girls than for boys and for children less than 12 years old than for older children. Parental physical violence was common, and children in homes with violence had significantly poorer performance on standardized test scores.

  9. Critical Thinking: More than Test Scores

    ERIC Educational Resources Information Center

    Smith, Vernon G.; Szymanski, Antonia

    2013-01-01

    This article is for practicing or aspiring school administrators. The demand for excellence in public education has lead to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…

  10. Indicators of Usefulness of Test Scores

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2007-01-01

    Current thinking on validity suggests that educational institutions and individuals should evaluate their uses of test scores in the context of their fundamental goals. Regression coefficients and other traditional criterion-related validity statistics provide relevant information, but often do not, by themselves, address the fundamental reasons…

  11. Teacher Use of Achievement Test Score Data

    ERIC Educational Resources Information Center

    Miller, Steven C.

    2012-01-01

    The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which eighth-grade…

  12. The Black-White Test Score Gap.

    ERIC Educational Resources Information Center

    Jencks, Christopher, Ed.; Phillips, Meredith, Ed.

    The 15 chapters of this book address issues related to the continuing test score gap between black and white students. The editors argue against traditional explanations which emphasize differences in economic resources and demographic factors, and they urge that more emphasis be put on psychological and cultural factors. The book suggests studies…

  13. Teacher Use of Achievement Test Score Data

    ERIC Educational Resources Information Center

    Miller, Steven C.

    2012-01-01

    The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which eighth-grade…

  14. Misidentifying Factors Underlying Singapore's High Test Scores

    ERIC Educational Resources Information Center

    Usiskin, Zalman

    2012-01-01

    Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…

  15. The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

    ERIC Educational Resources Information Center

    Shaw, Tommetta

    2010-01-01

    Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…

  16. The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

    ERIC Educational Resources Information Center

    Shaw, Tommetta

    2010-01-01

    Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…

  17. Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

    ERIC Educational Resources Information Center

    Cizek, Gregory J.

    2016-01-01

    Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

  18. Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

    ERIC Educational Resources Information Center

    Cizek, Gregory J.

    2016-01-01

    Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

  19. Grades--Scores--Predictions: A Study of the Efficiency of High School Grades and American College Test Scores in Predicting Academic Achievement at Montgomery College.

    ERIC Educational Resources Information Center

    Gell, Robert L.; Bleil, David F.

    This report analyzes the relationship between high school grades, American College Test (ACT) scores, and first-semester college grades. Based on the Standard Research Service of the ACT program, 1,379 students in the fall 1969 freshman class of Montgomery College (Maryland) were studied. Measures of academic background used ACT scores in English,…

  20. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  1. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  2. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    ERIC Educational Resources Information Center

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  3. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    ERIC Educational Resources Information Center

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  4. ACT Test Preparation Course and Its Impact on Students' College- and Career-Readiness

    ERIC Educational Resources Information Center

    Parrott, Timothy Nolan

    2012-01-01

    This study examined the effectiveness of an ACT intervention course developed for high school juniors at Anderson County High School during the 2011-2012 school year. This study compared the ACT composite test scores of the treatment group to the ACT composite test scores of the control group by using their PLAN scores as a baseline, to determine…

  5. A Study of the Predictability of Praxis I Examination Scores from ACT Scores and Teacher Education Program Prerequisite Courses

    ERIC Educational Resources Information Center

    Henderson, Allen R.

    2013-01-01

    This study investigated the relationship between student enrollment in certain college courses and Praxis I scores. Specifically, the study examined the predictive nature of the relationships between students' grades in college algebra, their freshman English course of choice, their ACT scores, and their Praxis I scores. The subjects consisted of…

  6. Validating the Interpretations and Uses of Test Scores

    ERIC Educational Resources Information Center

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  7. On the Confidentiality of Student Test Scores. Report No. 32.

    ERIC Educational Resources Information Center

    Read, Peter B.

    A discussion of the limited meaning of test scores, testing as an invasion of privacy, the abuse of test scores as confidential information and privileged communication, recording and storing of test results, access to test scores, and the demand for accountability forms the basis for recommendations for the release of individual and group test…

  8. Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

    ERIC Educational Resources Information Center

    Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

    2017-01-01

    The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…

  9. From High School to the Future: ACT Preparation--Too Much, Too Late. Why ACT Scores Are Low in Chicago and What It Means for Schools

    ERIC Educational Resources Information Center

    Allensworth, Elaine; Correa, Macarena; Ponisciak, Steve

    2008-01-01

    The majority of Chicago Public Schools (CPS) students are not attaining the ACT scores they are aiming for, which they need to qualify for scholarships and college acceptance. This report looks at the reasons behind students' low performance and what matters for doing well on this test. CPS students are highly motivated to do well on the ACT, and…

  10. A Procedure for Linear Polychotomous Scoring of Test Items

    DTIC Science & Technology

    1993-10-01

    associated with the response categories of test items . When tests are scored using these scoring weights, test reliability increases. The new procedure is...program POLY. The example demonstrates how polyweighting can be used to calibrate and score test items drawn from an item bank that is too large to

  11. Access, Attrition, Test Scores, and Grades of College Entrants and Persisters: 1965-1973

    ERIC Educational Resources Information Center

    Rever, Philip R.; Kojaku, Lawrence K.

    1976-01-01

    The distributions of indices of academic abilities and achievements of entering freshmen classes and of classes at the end of the first year of college over an eight-year period are compared. Data show a decline in ACT test scores, increases in high school grades of college entrants, and the stability of test scores and increases in college grades…

  12. Genotype-Based Score Test for Association Testing in Families.

    PubMed

    Uh, Hae-Won; Beekman, Marian; Meulenbelt, Ingrid; Houwing-Duistermaat, Jeanine J

    The multiplex-case and control design in which multiple cases are sampled from the same family is considered. In such studies phenotype information of the un-genotyped relatives might be available. We intend to use additional family information when performing genetic association tests. A score test is revisited to provide a flexible framework to accommodate various genetic models and to improve power of the association test by adding available family information. The proposed test accounts for correlations induced by multiple cases from the same pedigree, directly deals with X-linked SNPs in mixed-sex-related samples, and incorporates additional phenotypic information such as the number of (un-genotyped) siblings and parents with similar symptoms by assigning the weights to (genotyped) multiplex cases. In addition, the score test directly incorporates posterior probabilities of imputed genotypes, which leads to an efficiency measure that reflects imputation uncertainty on the test conducted. The proposed test is applied to real applications for illustration. Its efficiency is demonstrated via simulations.

  13. The Design and Field Test of the ACT Portfolio System.

    ERIC Educational Resources Information Center

    Reckase, Mark D.

    The American College Testing Program (ACT) is field testing a portfolio assessment model. The field test is designed to determine whether it is possible to implement a portfolio assessment model on a national level that will result in scores that are of sufficient reliability and validity that they can be used for decisions at the student level.…

  14. Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    1980-01-01

    The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)

  15. Testing Intelligently Includes Double-Checking Wechsler IQ Scores

    ERIC Educational Resources Information Center

    Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

    2011-01-01

    The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…

  16. Testing Intelligently Includes Double-Checking Wechsler IQ Scores

    ERIC Educational Resources Information Center

    Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

    2011-01-01

    The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…

  17. Identifying Speech Acts in E-Mails: Toward Automated Scoring of the "TOEIC"® E-Mail Task. Research Report. ETS RR-12-16

    ERIC Educational Resources Information Center

    De Felice, Rachele; Deane, Paul

    2012-01-01

    This study proposes an approach to automatically score the "TOEIC"® Writing e-mail task. We focus on one component of the scoring rubric, which notes whether the test-takers have used particular speech acts such as requests, orders, or commitments. We developed a computational model for automated speech act identification and tested it…

  18. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    ERIC Educational Resources Information Center

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  19. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    ERIC Educational Resources Information Center

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  20. Does Test Preparation Work? Implications for Score Validity

    ERIC Educational Resources Information Center

    Xie, Qin

    2013-01-01

    This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…

  1. Does Test Preparation Work? Implications for Score Validity

    ERIC Educational Resources Information Center

    Xie, Qin

    2013-01-01

    This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…

  2. School accountability and the black-white test score gap.

    PubMed

    Gaddis, S Michael; Lauen, Douglas Lee

    2014-03-01

    Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality.

  3. Good Instructional Leadership: Principals' Actions to Increase Composite ACT School Scores

    ERIC Educational Resources Information Center

    Xu, Bo; Liu, Dongfang

    2016-01-01

    Due to increased college admission requirements and a 20-year flat-lined trend in ACT scores, it is imperative for education leaders across the nation to implement effective strategies to increase ACT composite scores. High school principals, as instructional leaders and decision makers, are the major stakeholders who are vested in the outcomes of…

  4. Interpreting Nelson-Denny Reading Test Scores.

    ERIC Educational Resources Information Center

    Blai, Boris, Jr.

    Reading test results and their interpretation are stressed because of their importance in student achievement. The Nelson-Denny Reading Test used at Harcum Junior College is a useful measuring instrument for predicting academic achievement, screening students, and diagnosing reading and learning problems. General hints for interpretation of the…

  5. Black Students Are Also Losing Ground on the ACT Standardized Admission Test.

    ERIC Educational Resources Information Center

    Journal of Blacks in Higher Education, 2002

    2002-01-01

    Nearly as many black students now take the American College Testing Program (ACT) college admission test as sign up for the competing Scholastic Assessment Test (SAT). In common with the SAT, the scoring gap between blacks and whites is widening. Almost no Blacks score at the very highest level of ACT performance, which is generally necessary to…

  6. The Generalizability of Motivation Filtering in Improving Test Score Validity

    ERIC Educational Resources Information Center

    Wise, Vicki L.; Wise, Steven L.; Bhola, Dennison S.

    2006-01-01

    Accountability for educational quality is a priority at all levels of education. Low-stakes testing is one way to measure the quality of education that students receive and make inferences about what students know and can do. Aggregate test scores from low-stakes testing programs are suspect, however, to the degree that these scores are influenced…

  7. A Review of Scoring Algorithms for Ability and Aptitude Tests.

    ERIC Educational Resources Information Center

    Chevalier, Shirley A.

    In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…

  8. The Test Score Decline: A Review and Annotated Bibliography

    DTIC Science & Technology

    1981-08-01

    40. Champagne, D., & Roberts, E., An Exercise in Freedom: A Place Where Test Scores Appear to Be Rising. = 3. Acland , H., If Reading Scores Are...of the nation’s young teachers. Scientific, Engineering, Tech- nical Manpower Comments, November 1979. 3. Acland , Henry, If reading scores are

  9. Prediction of Metropolitan Readiness Test Scores

    ERIC Educational Resources Information Center

    Blowers, E. A.

    1977-01-01

    The efficiency of several visual and auditory predictors of the Metropolitan Readiness Test was examined utilizing 106 grade 1 subjects considered by their teachers to show learning difficulties. (Author/JC)

  10. Pictures Speak Louder than Test Scores.

    ERIC Educational Resources Information Center

    McCabe, Deborah; Hilmo, Joellen

    1985-01-01

    The Goodenough-Harris Draw-a-Person Test, if given at regular intervals during periods of remediation, may show clear evidence of improvement in behavior and attitude of learning disabled students. (CL)

  11. Understanding and Interpreting Pharmacy College Admission Test Scores

    PubMed Central

    2017-01-01

    To fairly and accurately interpret candidates’ Pharmacy College Admission Test (PCAT) scores as listed on their official transcripts, it is important to understand how these scores reflect candidates’ performances on cognitive tasks involving the identification, interpretation, analysis, and evaluation of information assumed to have been covered in pre-pharmacy science, math, and general education coursework. This paper attempts to facilitate this understanding by explaining how candidates’ responses to PCAT test items relate to their scaled scores and percentile ranks and how their writing scores reflect their performance. This paper also suggests how differences between candidates’ PCAT subtest scores may reflect different personal experiences, educational backgrounds, and cognitive abilities. PMID:28289307

  12. Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the ll0-item Stanford Mental…

  13. Fuzzy Math: A Meditation on Test Scoring

    ERIC Educational Resources Information Center

    Jacks, Meredith

    2011-01-01

    As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…

  14. Improving Scores on the IELTS Speaking Test

    ERIC Educational Resources Information Center

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  15. Accountability Is More than a Test Score

    ERIC Educational Resources Information Center

    Turnipseed, Stephan; Darling-Hammond, Linda

    2015-01-01

    The number one quality business leaders look for in employees is creativity and yet the U.S. education system undermines the development of the higher-order skills that promote creativity by its dogged focus on multiple-choice tests. Stephan Turnipseed and Linda DarlingHammond discuss the kind of rich accountability system that will help students…

  16. Fuzzy Math: A Meditation on Test Scoring

    ERIC Educational Resources Information Center

    Jacks, Meredith

    2011-01-01

    As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…

  17. Improving Scores on the IELTS Speaking Test

    ERIC Educational Resources Information Center

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  18. Student Achievement, Behavior, Perceptions, and Other Factors Affecting ACT Scores. ACT Research Report Series, 2006-1

    ERIC Educational Resources Information Center

    Noble, Julie P.; Roberts, William L.; Sawyer, Richard L.

    2006-01-01

    In this study, we investigated factors related to students' performance on the ACT. We examined how students' background characteristics, academic achievement in high school, education-related accomplishments and activities, and perceptions of self and others affect their ACT scores. Of particular interest were the direct and indirect effects of…

  19. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Washington

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…

  20. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arkansas

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Arkansas's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) went up in math at grades 4 and 8. In reading, the percentages scoring proficient on the state test went up at…

  1. Estimating the Reliability of a Test Battery Composite or a Test Score Based on Weighted Item Scoring

    ERIC Educational Resources Information Center

    Feldt, Leonard S.

    2004-01-01

    In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.

  2. Test Score Reporting Referenced to Doubly-Moderated Cut Scores Using Splines

    ERIC Educational Resources Information Center

    Schafer, William D.; Hou, Xiaodong

    2011-01-01

    This study discusses and presents an example of a use of spline functions to establish and report test scores using a moderated system of any number of cut scores. Our main goals include studying the need for and establishing moderated standards and creating a reporting scale that is referenced to all the standards. Our secondary goals are to make…

  3. Talent Search Qualifying: Comparisons between Talent Search Students Qualifying via Scores on Standardized Tests and via Parent Nomination

    ERIC Educational Resources Information Center

    Lee, Seon-Young; Olszewski-Kubilius, Paula

    2006-01-01

    This study examined differences between students who qualified for talent search testing via scores on standardized tests and via parent nomination in their performances on the SAT or ACT and some demographic characteristics. Overall, the standardized testing group earned higher scores on the off-level tests than the parent nominated group. Asian…

  4. Educational Improvement Act: Diagnostic Testing.

    ERIC Educational Resources Information Center

    Kentucky State Dept. of Education, Frankfort. Office of Research and Planning.

    The Kentucky Department of Education has a responsibility to provide technical assistance and consultative services to local school districts. Descriptions of the state selected diagnostic reading test, the Prescriptive Reading Inventory (PRI) and the diagnostic math test, the Diagnostic Math Inventory (DMI), are explained. Each school district in…

  5. Test Scores and What They Mean. Sixth Edition.

    ERIC Educational Resources Information Center

    Lyman, Howard B.

    The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…

  6. Development and testing of a portfolio evaluation scoring tool.

    PubMed

    Karlowicz, Karen A

    2010-02-01

    This study focused on development of a portfolio evaluation tool to guide the assignment of valid and reliable scores. Tool development was facilitated by a literature review, guidance of a faculty committee, and validation by content experts. Testing involved a faculty team that evaluated 60 portfolios. Calculation of interrater reliability and a paired-samples t test were used to judge effectiveness. Interrater reliability was 0.78 for overall scores, 0.81 for the seven program outcomes criteria scores, and more than 0.65 for scores assigned by 11 of 13 pairs of raters. There were no significant differences between raters' scores in 10 of 13 pairs. The portfolio evaluation tool demonstrated high reliability and should be tested by other schools using portfolio evaluation. Copyright 2010, SLACK Incorporated.

  7. Partial-Credit Scoring Methods for Multiple-Choice Tests.

    ERIC Educational Resources Information Center

    Frary, Robert B.

    1989-01-01

    Multiple-choice response and scoring methods that attempt to determine an examinee's degree of knowledge about each item in order to produce a total test score are reviewed. There is apparently little advantage to such schemes; however, they may have secondary benefits such as providing feedback to enhance learning. (SLD)

  8. A Review of Scoring Algorithms for Multiple-Choice Tests.

    ERIC Educational Resources Information Center

    Kurz, Terri Barber

    Multiple-choice tests are generally scored using a conventional number right scoring method. While this method is easy to use, it has several weaknesses. These weaknesses include decreased validity due to guessing and failure to credit partial knowledge. In an attempt to address these weaknesses, psychometricians have developed various scoring…

  9. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    ERIC Educational Resources Information Center

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  10. Evaluating the Predictive Validity of Graduate Management Admission Test Scores

    ERIC Educational Resources Information Center

    Sireci, Stephen G.; Talento-Miller, Eileen

    2006-01-01

    Admissions data and first-year grade point average (GPA) data from 11 graduate management schools were analyzed to evaluate the predictive validity of Graduate Management Admission Test[R] (GMAT[R]) scores and the extent to which predictive validity held across sex and race/ethnicity. The results indicated GMAT verbal and quantitative scores had…

  11. Missing the Mark: What Test Scores Really Tell Us

    ERIC Educational Resources Information Center

    Tanner, John R.

    2011-01-01

    State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…

  12. A Comparison of an Introductory Course to SAT/ACT Scores in Predicting Student Performance

    ERIC Educational Resources Information Center

    Marsh, Crystale M.; Vandehey, Michael A.; Diekhoff, George M.

    2008-01-01

    We assessed students in General Psychology classes and examined their SAT/ACT scores, GPAs, and attempted and earned hours. Exams in General Psychology were superior to the SAT/ACT in predicting GPA, supporting the use of an introductory course as a "gateway" for identifying at-risk students and engaging them in academic services.…

  13. Test Scores: Can Year-Round School Raise Them?

    ERIC Educational Resources Information Center

    Alcorn, Richard D.

    1992-01-01

    Findings from a study of San Diego (California) elementary schools indicate that students in year-round schools outperformed those in traditional schools in the degree of achievement in reading, language, and math test scores. (MLF)

  14. Counselor Simulation by Film in Test Score Reporting Interviews

    ERIC Educational Resources Information Center

    Collins, Tom

    1972-01-01

    The responsible and innovative utilization of media, not only in test score reporting but also in other guidance functions, may assist the counselor in permitting him more time to function with clients in counseling relationships. (Author)

  15. Usefulness of High School Average and ACT Scores in Making College Admission Decisions. ACT Research Report Series 2010-2

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2010-01-01

    Ample correlational evidence indicates that high school GPA is usually better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. Many people conclude that this correlational evidence translates directly to usefulness in making admission decisions. The issue of usefulness is…

  16. Making Sense of Test Scores. Assessment Brief. Number 10

    ERIC Educational Resources Information Center

    Bergman, Lincoln

    2004-01-01

    It is challenging for parents and the general public to make sense of the reports on test scores that appear in the mass media. This article offers some things for readers to consider as they bring a critical eye to what is read in the papers. Usually reports on test scores in the media are quite short and focus on one or two aspects of test…

  17. Wage and Test Score Dispersion: Some International Evidence.

    ERIC Educational Resources Information Center

    Bedard, Kelly; Ferrall, Christopher

    2003-01-01

    Compares the distribution of test scores at age 13 in 1964 and 1982 and wages later in life across 11 countries. Finds that wage dispersion later in life is never greater than test-score dispersion. For three countries (U.S., UK, and Japan), finds evidence of skill-biased changes in wage dispersion between the early 1970s and the late 1980s.…

  18. Making Sense of Test Scores. Assessment Brief. Number 10

    ERIC Educational Resources Information Center

    Bergman, Lincoln

    2004-01-01

    It is challenging for parents and the general public to make sense of the reports on test scores that appear in the mass media. This article offers some things for readers to consider as they bring a critical eye to what is read in the papers. Usually reports on test scores in the media are quite short and focus on one or two aspects of test…

  19. Wage and Test Score Dispersion: Some International Evidence.

    ERIC Educational Resources Information Center

    Bedard, Kelly; Ferrall, Christopher

    2003-01-01

    Compares the distribution of test scores at age 13 in 1964 and 1982 and wages later in life across 11 countries. Finds that wage dispersion later in life is never greater than test-score dispersion. For three countries (U.S., UK, and Japan), finds evidence of skill-biased changes in wage dispersion between the early 1970s and the late 1980s.…

  20. Bewitched, Bothered, and Bewildering: The Use and Misuse of State SAT and ACT Scores.

    ERIC Educational Resources Information Center

    Powell, Brian; Steelman, Lala Carr

    1996-01-01

    Updates an earlier study by reanalyzing interstate variations in Scholastic Aptitude Test and American College Test scores. Reaffirms the conclusion that state rankings based on such scores change dramatically when adjusted for number of test takers or class rank of test-taking population. Finds that public expenditures are positively related to…

  1. Interpreting Test Scores: More Complicated than You Think

    ERIC Educational Resources Information Center

    Tully, Susannah

    2008-01-01

    As more colleges move to "test optional" admissions policies, the debate over the utility and interpretation of standardized-test scores continues. In this article, the author interviews Daniel Koretz, a professor of education at Harvard University and author of "Measuring Up: What Educational Testing Really Tells Us". Koretz…

  2. Observed-Score Equating as a Test Assembly Problem.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.; Luecht, Richard M.

    1998-01-01

    Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)

  3. Machine-Scored Testing, Part II: Creativity and Item Analysis.

    ERIC Educational Resources Information Center

    Leuba, Richard J.

    1986-01-01

    Explains how multiple choice test items can be devised to measure higher-order learning, including engineering problem solving. Discusses the value and information provided in item analysis procedures with machine-scored tests. Suggests elements to consider in test design. (ML)

  4. Interpreting Test Scores: More Complicated than You Think

    ERIC Educational Resources Information Center

    Tully, Susannah

    2008-01-01

    As more colleges move to "test optional" admissions policies, the debate over the utility and interpretation of standardized-test scores continues. In this article, the author interviews Daniel Koretz, a professor of education at Harvard University and author of "Measuring Up: What Educational Testing Really Tells Us". Koretz…

  5. Test Takers and the Validity of Score Interpretations

    ERIC Educational Resources Information Center

    Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

    2016-01-01

    This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…

  6. A Monte Carlo Comparison of Three Optimal Test Scoring Procedures,

    DTIC Science & Technology

    theory, with conventional scoring. The effect of the COWS procedures is to differentially weight test items as a function of examine ability and the...item characteristics. Low ability examinees are given very low weights on difficult test items , lowering the effects of guessing, and decreasing test error.

  7. Dearborn 1981-82 Achievement Test Scores (Fifth Annual Report).

    ERIC Educational Resources Information Center

    Dearborn Public Schools, MI.

    The purpose of the fifth annual Dearborn Achievement Test Score report is to summarize and to help interpret the test results so that Dearborn citizens and educators will have a better understanding of the educational achievements of Dearborn students. The District-wide Testing Program assesses reading readiness, scholastic aptitude, academic…

  8. Distinctive Feature Scoring of the California Consonant Test.

    ERIC Educational Resources Information Center

    Feeney, M. Patrick

    1990-01-01

    The study evaluated a distinctive feature scoring technique for List 1 of the California Consonant Test for the purpose of improving test reliability in this test used to identify errors in speech recognition made by adult listeners (N=50) with high frequency sensorineural hearing loss. (DB)

  9. Grades and Test Scores: Accounting for Observed Differences.

    ERIC Educational Resources Information Center

    Willingham, Warren W.; Pollack, Judith M.; Lewis, Charles

    2002-01-01

    Proposed a framework of possible differences between grades and test scores and tested the framework with data on 8,454 high school seniors from the National Education Longitudinal Study. Identified differences and correlations among achievement factors. Differences between grades and tests give these measures complementary strengths in…

  10. Prediction of true test scores from observed item scores and ancillary data.

    PubMed

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability.

  11. Relationships between EPAS Scores and College Preparatory Course Work in High School. ACT Research Report Series.

    ERIC Educational Resources Information Center

    Woodruff, David J.

    2003-01-01

    The Educational Planning and Assessment System (EPAS) of the American College Testing Program consists of three testing programs: EXPLORE, PLAN, and the ACT Assessment Program (ACT). EXPLORE is administered in the 8th grade, PLAN in the 10th grade, and the ACT in the 11th or 12th grade. This study investigated relationships among the composite…

  12. Do elderly people score better on cognitive tests at home?

    PubMed Central

    Shievitz, A. L.; Tudiver, F.; Araujo, A.; Sanghe, P.; Boyle, E.

    1998-01-01

    OBJECTIVE: To determine whether Mini-Mental State Examination (MMSE) scores of elderly family medicine patients are different when the test is administered at home rather than at the clinic. DESIGN: Cross-sectional comparison study. SETTING: University family practice unit in an urban area. PARTICIPANTS: A convenience sample of family practice clinic patients 70 years or older were referred to the study in the sequence seen at the clinic. Of 171 patients approached in person or by telephone, 77 agreed to participate. METHOD: The MMSE was administered at home and at the clinic on the same day for all subjects. Testing site order was randomized across patients. MAIN FINDINGS: Of the 77 patients who agreed to be subjects, only 13 (16.9%) had low MMSE scores (< or = 24). Five (41.7%) of these had normal scores (> 24) at home, but low scores in the clinic. Subjects had significantly higher scores on MMSEs administered at home (P < .01) on the same day. CONCLUSIONS: Previous research has shown patients achieve higher MMSE scores at home; this study demonstrated it in a representative family medicine population. Primary care physicians should be cautious about classifying elderly patients as possibly cognitively impaired based on clinic testing alone. Testing at home could avoid many unnecessary referrals to specialist services for further assessment and diagnostic tests that use up precious health care resources. PMID:9721421

  13. "No Child" Effect on English-Learners Mulled: Teachers Welcome Attention, Fault Focus on Test Scores

    ERIC Educational Resources Information Center

    Zehr, Mary Ann

    2006-01-01

    Educators who specialize in teaching English-language learners agree that the 4-year-old No Child Left Behind Act has brought unprecedented attention to those students by requiring schools to isolate test-score data for them. They disagree, though, on whether changes in instruction spurred by the law have been positive or negative overall. Such…

  14. How to Set Cutoff Scores for Knowledge Tests Used in Promotion, Training, Certification, and Licensing.

    ERIC Educational Resources Information Center

    Biddle, Richard E.

    1993-01-01

    Suggests a process for setting a cutoff score on knowledge tests for promotion, certification, and licensing: a modified Angoff method, in which a competency estimate is determined by subject matter experts, is combined with analysis of potential impact on any groups protected by the Civil Rights Act. (SK)

  15. Which Advanced Mathematics Courses Influence ACT Score? A State Level Analysis of the Iowa Class of 2012

    ERIC Educational Resources Information Center

    Grinstead, Mary L.

    2013-01-01

    This study explores the relationship between specific advanced mathematics courses and college readiness (as determined by ACT score). The ACT organization has found a consistent relationship between taking a minimum core number of mathematics courses and higher ACT scores (mathematics and composite) (ACT, Inc., 2012c). However, the extent to…

  16. Which Advanced Mathematics Courses Influence ACT Score? A State Level Analysis of the Iowa Class of 2012

    ERIC Educational Resources Information Center

    Grinstead, Mary L.

    2013-01-01

    This study explores the relationship between specific advanced mathematics courses and college readiness (as determined by ACT score). The ACT organization has found a consistent relationship between taking a minimum core number of mathematics courses and higher ACT scores (mathematics and composite) (ACT, Inc., 2012c). However, the extent to…

  17. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Louisiana

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Louisiana's test score trends through 2008-09. Between 2005 and 2009, trends on state tests and NAEP (National Assessment of Educational Progress) sometimes differed. On the state test, the percentages of students reaching the proficient level increased at grades 4 and 8 in both reading and math. On NAEP, the percentage of…

  18. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Tennessee

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Tennessee's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. At grade 4, trends on the state test and NAEP differed somewhat. In…

  19. Effect of self-assessment on test scores: student perceptions.

    PubMed

    Ramirez, Beatriz U

    2010-09-01

    After a sudden increase in most of the individual grades in a multiple-choice test, students were asked to rank the three most relevant factors responsible for this outcome. Among eight others, the availability of a test for self-assessment before the final test was by far the most frequently mentioned (82.4% of the students). Questions applied during different course activities did not have the same effect on student scores as the "online" self-assessment test.

  20. High Test Scores: The Wrong Road to National Economic Success

    ERIC Educational Resources Information Center

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  1. High Test Scores: The Wrong Road to National Economic Success

    ERIC Educational Resources Information Center

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  2. Effort Analysis: Individual Score Validation of Achievement Test Data

    ERIC Educational Resources Information Center

    Wise, Steven L.

    2015-01-01

    Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

  3. Effects of Coaching on SAT I: Reasoning Test Scores.

    ERIC Educational Resources Information Center

    Powers, Donald E.; Rock, Donald A.

    1999-01-01

    Used several alternative analytic methods to estimate the effects of coaching on Scholastic Assessment Test I scores in a sample of more than 4,000 examinees, approximately 500 of whom had attended formal coaching programs. All estimates suggest that the effects of coaching are far less than claimed by major commercial test-preparation companies.…

  4. The Impact of School Resources on Student Achievement Test Scores.

    ERIC Educational Resources Information Center

    Nyhan, Ronald C.; Alkadry, Mohamad G.

    1999-01-01

    A statistical analysis tested the relationship of class size, expenditures per student, and socioeconomic status on student achievement test scores in three south Florida counties. Poverty is a primary determinant of student achievement. There is modest support for targeted expenditures for smaller classes above 20 students. (Contains 62…

  5. Motivating High School Students to Score Proficient on State Tests

    ERIC Educational Resources Information Center

    Brown, Sarah Lee

    2015-01-01

    The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…

  6. Equating Test Scores Using the Linear Method: A Primer.

    ERIC Educational Resources Information Center

    Tanguma, Jesus

    This paper describes four commonly used designs in equating test scores. These designs are: (1) single-group; (2) random-group; (3) equivalent-group; and (4) anchor-test. Each design requires that its data be collected according to specific guidelines. Three of the four methods are illustrated through hypothetical examples. All four methods try to…

  7. Effort Analysis: Individual Score Validation of Achievement Test Data

    ERIC Educational Resources Information Center

    Wise, Steven L.

    2015-01-01

    Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

  8. The Probability of Obtaining Two Statistically Different Test Scores as a Test Index

    ERIC Educational Resources Information Center

    Muller, Jorg M.

    2006-01-01

    A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…

  9. A prognostic scoring system for arm exercise stress testing

    PubMed Central

    Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

    2016-01-01

    Objective Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. Methods In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Results Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs−10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77–0.79 before and 0.82–0.86 after adjustment for significant covariates versus 0.64–0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Conclusions Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise. PMID:26835142

  10. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Massachusetts

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…

  11. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Nebraska

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Nebraska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the percentages reaching the basic level on NAEP (National Assessment of Educational Progress) increased at grade 4 in both reading and math. At grade 8, however, the percentages…

  12. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Wisconsin

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Wisconsin's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in math at grades 4 and 8 and in reading at grade 8. In grade 4 reading, the percentage scoring…

  13. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. New Mexico

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles New Mexico's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 math and grade 8 reading and math. In grade 4 reading, the percentage basic on NAEP …

  14. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Alaska

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Alaska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in math and grade 8 in reading. In grade 4 reading, the percentage reaching the…

  15. Does weight affect children's test scores and teacher assessments differently?

    PubMed

    Zavodny, Madeline

    2013-06-01

    The prevalence of childhood overweight and obesity increased dramatically in the United States during the past three decades. This increase has adverse public health implications, but its implication for children's academic outcomes is less clear. This paper uses data from five waves of the Early Childhood Longitudinal Study-Kindergarten to examine how children's weight is related to their scores on standardized tests and to their teachers' assessments of their academic ability. The results indicate that children's weight is more negatively related to teacher assessments of their academic performance than to test scores.

  16. Between-District Test Score Variation, 2009-2012

    ERIC Educational Resources Information Center

    Fahle, Erin; Reardon, Sean

    2016-01-01

    Describing the variation in test scores between and within school districts is critical for: (1) for policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) for methodological work planning future experiments or interventions. Intraclass…

  17. Commentary on "Validating the Interpretations and Uses of Test Scores"

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  18. Assessment Test Scores of Incoming Students, Fall 2001.

    ERIC Educational Resources Information Center

    Negron, Maggie; Breindel, Matthew

    This assessment of placement test scores in reading, math, and sentence skills from incoming students at College of the Desert (California) shows that students are overwhelmingly underprepared for study at the college. Only 15% of students were prepared in sentence skills, 27% in reading skills, 7% in math skills; only 3% were prepared in all 3…

  19. Study Finds Link between Quality Music Programs, Test Scores

    ERIC Educational Resources Information Center

    Teaching Music, 2007

    2007-01-01

    A recent study found that students in high-quality school music education programs score higher on standardized tests compared to students in schools with deficient music education programs. The study, which was published in the Winter 2006 issue of MENC's Journal for Research in Music Education, is the first to examine the quality of school music…

  20. Using Student Test Scores to Measure Principal Performance

    ERIC Educational Resources Information Center

    Grissom, Jason A.; Kalogrides, Demetra; Loeb, Susanna

    2015-01-01

    Expansion of the use of student test score data to measure teacher performance has fueled recent policy interest in using those data to measure the effects of school administrators as well. However, little research has considered the capacity of student performance data to uncover principal effects. Filling this gap, this article identifies…

  1. Using Test Scores from Students with Disabilities in Teacher Evaluation

    ERIC Educational Resources Information Center

    Buzick, Heather M.; Jones, Nathan D.

    2015-01-01

    Much of the recent focus of educational policymakers has been on improving the measurement of teacher effectiveness. Linking student growth to teacher effects has been a large part of reform efforts. To date, neither researchers nor practitioners have arrived at a consensus on how to treat test scores from students with disabilities in…

  2. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    ERIC Educational Resources Information Center

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  3. Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

    ERIC Educational Resources Information Center

    Minor, Elizabeth Covay

    2016-01-01

    Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…

  4. School Choice in Suburbia: Test Scores, Race, and Housing Markets

    ERIC Educational Resources Information Center

    Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane

    2009-01-01

    Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…

  5. Student Laptop Use and Scores on Standardized Tests

    ERIC Educational Resources Information Center

    Kposowa, Augustine J.; Valdez, Amanda D.

    2013-01-01

    Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…

  6. The Correlational Relationship between Homeschooling Demographics and High Test Scores.

    ERIC Educational Resources Information Center

    Burns, Johnna

    Homeschooling, one of the fastest growing educational alternatives, is enjoying increasing respect from educators and parents alike. This is partly because homeschooling children score as well and often better on standardized tests than their publicly schooled counterparts. However, the vast majority of homeschooled students come from the…

  7. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    ERIC Educational Resources Information Center

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  8. A Latent Class Approach to Estimating Test-Score Reliability

    ERIC Educational Resources Information Center

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  9. Student Laptop Use and Scores on Standardized Tests

    ERIC Educational Resources Information Center

    Kposowa, Augustine J.; Valdez, Amanda D.

    2013-01-01

    Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…

  10. Commentary on "Validating the Interpretations and Uses of Test Scores"

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  11. School Choice in Suburbia: Test Scores, Race, and Housing Markets

    ERIC Educational Resources Information Center

    Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane

    2009-01-01

    Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…

  12. Study Finds Link between Quality Music Programs, Test Scores

    ERIC Educational Resources Information Center

    Teaching Music, 2007

    2007-01-01

    A recent study found that students in high-quality school music education programs score higher on standardized tests compared to students in schools with deficient music education programs. The study, which was published in the Winter 2006 issue of MENC's Journal for Research in Music Education, is the first to examine the quality of school music…

  13. Assessment Test Scores of Incoming Students, Fall 2001.

    ERIC Educational Resources Information Center

    Negron, Maggie; Breindel, Matthew

    This assessment of placement test scores in reading, math, and sentence skills from incoming students at College of the Desert (California) shows that students are overwhelmingly underprepared for study at the college. Only 15% of students were prepared in sentence skills, 27% in reading skills, 7% in math skills; only 3% were prepared in all 3…

  14. Experiential Awareness of the Effects of Test Score Reports.

    ERIC Educational Resources Information Center

    Bender, Robert C.

    Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…

  15. Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

    ERIC Educational Resources Information Center

    Minor, Elizabeth Covay

    2016-01-01

    Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…

  16. High Profile Football Players' Reading at a Research University: ACT Scores, Interview Responses, and Personal Preferences

    ERIC Educational Resources Information Center

    Townsend, Martha

    2013-01-01

    This qualitative case study examines the reading acumen of a cohort of twenty-six senior football players at a Midwestern public research university. Data related to three indices--ACT scores, interview responses, and personal preferences--were collected as part of a larger IRB-approved study aimed at determining the factors that led to the entire…

  17. Secondary School Scheduling Models: How Do Types of Models Compare to the ACT Scores?

    ERIC Educational Resources Information Center

    Hackmann, Donald G.; Hecht, Janet E.; Harmston, Matt T.; Pliska, Ann-Maureen; Ziomek, Robert L.

    This study examined the relationship between school scheduling format and average composite scores on the ACT Assessment after controlling for lifestyle factors, gender, school enrollment levels, number of examinees, and years under the scheduling model. The participants were 38,089 high schools seniors in 568 public high schools in Iowa and…

  18. The Relative Predictive Validity of ACT Scores and High School Grades in Making College Admission Decisions. Issues in College Success

    ERIC Educational Resources Information Center

    ACT, Inc., 2008

    2008-01-01

    Postsecondary institutions often consider students' high school grades and ACT scores when making admission decisions. This issue brief summarizes ACT research on the relative weights of ACT scores and high school grades for predicting college persistence as well as selected indicators of academic success in college. (Contains 1 table and 3…

  19. Flow and diffusion of high-stakes test scores

    NASA Astrophysics Data System (ADS)

    Marder, M.; Bansal, D.

    2009-10-01

    We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades.

  20. The prognostic value of a QRS score during exercise testing.

    PubMed

    Michaelides, Andreas P; Aigyprladou, Maria-Niki K; Andrikopoulos, George K; Richter, Dimitris J; Kartalis, Athanasios; Tapanlis, Evstratios; Fourlas, Christros A; Stefanadis, Christodoulos I

    2005-08-01

    Values of a QRS score have been positively related to the number of narrowed coronary arteries and to the extent of myocardial ischemia in radionuclide imaging techniques. This study was conducted to evaluate the potential prognostic information of abnormal values of this QRS score during treadmill exercise testing in patients with established coronary artery disease (CAD). In all, 309 patients (258 men, 51 women, mean age 56.1 +/- 10.0 years) with documented CAD, underwent a treadmill exercise test and coronary arteriography at baseline. Subsequently, they were prospectively followed to a maximum of 36 months (mean follow-up 23 +/- 13 months, median 25 months). During the follow-up period, 20 patients (6.5%) died from acute myocardial infarction. Abnormal QRS score values were found to be significantly and independently associated with cardiac mortality (QRS < or = -4: relative risk 11.7; 95% confidence interval = 2.5-55.4; p = 0.002). Taking into consideration the importance of exercise testing in the management of ischemic heart disease, the use of this QRS score could be of clinical value in predicting the outcome of patients with documented CAD.

  1. ACT/SAT Test Preparation and Coaching Programs. What Works Clearinghouse Intervention Report

    ERIC Educational Resources Information Center

    What Works Clearinghouse, 2016

    2016-01-01

    Most colleges and universities in the United States require students to take the SAT or ACT as part of the college application process. These tests are high stakes in at least three ways. First, most universities factor scores on these tests into admissions decisions. Second, higher scores can increase a student's chances of being admitted to…

  2. Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

    PubMed

    Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

    2011-08-15

    Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.

  3. The validity of self-reported physical fitness test scores.

    PubMed

    Jones, Sarah B; Knapik, Joseph J; Sharp, Marilyn A; Darakjy, Salima; Jones, Bruce H

    2007-02-01

    Epidemiological studies often have to rely on a participant's self-reporting of information. The validity of the self-report instrument is an important consideration in any study. The purpose of this investigation was to determine the validity of self-reported Army Physical Fitness Test (APFT) scores. The APFT is administered to all soldiers in the U.S. Army twice a year and consists of the maximum number of push-ups completed in 2 minutes, the maximum number of sit-ups completed in 2 minutes, and a 2-mile run for time. Army mechanics responded to a questionnaire in March and June 2004 asking them to report the exact scores of each event on their most recent APFT. Actual APFT scores were obtained from the soldier's military unit. The mean +/- standard deviation (SD) of actual and self-reported numbers of push-ups was 61 +/- 14 and 65 +/- 13, respectively. The mean +/- SD of actual and self-reported numbers of sit-ups were 66 +/- 10 and 68 +/- 10, respectively. The mean +/- SD of actual and self-reported run times (minutes) were 14.8 +/- 1.4 and 14.6 +/- 1.4, respectively. Correlations between actual and self-reported push-ups, sit-ups, and run were 0.83, 0.71, and 0.85, respectively. On average, soldiers tended to slightly over-report performance on all APFT events and individual self-reported scores could vary widely from actual scores based on Bland-Altman plots. Despite this, the close correlations between the actual and self-reported scores suggest that self-reported values are adequate for most epidemiological military studies involving larger sample sizes.

  4. Do Teacher Unions Hinder Educational Performance? Lessons Learned from State SAT and ACT Scores.

    ERIC Educational Resources Information Center

    Steelman, Lala Carr; Powell, Brian; Carini, Robert M.

    2000-01-01

    Comparison of standardized test scores and degree of teacher unionization in states found a statistically significant and positive relationship between the presence of teacher unions and stronger state performance on tests. Taking into account the percentage of students taking the tests, states with greater percentages of teachers in unions…

  5. Score-based tests of measurement invariance: use in practice

    PubMed Central

    Wang, Ting; Merkle, Edgar C.; Zeileis, Achim

    2014-01-01

    In this paper, we consider a family of recently-proposed measurement invariance tests that are based on the scores of a fitted model. This family can be used to test for measurement invariance w.r.t. a continuous auxiliary variable, without pre-specification of subgroups. Moreover, the family can be used when one wishes to test for measurement invariance w.r.t. an ordinal auxiliary variable, yielding test statistics that are sensitive to violations that are monotonically related to the ordinal variable (and less sensitive to non-monotonic violations). The paper is specifically aimed at potential users of the tests who may wish to know (1) how the tests can be employed for their data, and (2) whether the tests can accurately identify specific models parameters that violate measurement invariance (possibly in the presence of model misspecification). After providing an overview of the tests, we illustrate their general use via the R packages lavaan and strucchange. We then describe two novel simulations that provide evidence of the tests' practical abilities. As a whole, the paper provides researchers with the tools and knowledge needed to apply these tests to general measurement invariance scenarios. PMID:24936190

  6. An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments

    ERIC Educational Resources Information Center

    Dimitrov, Dimiter M.

    2016-01-01

    This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…

  7. An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments

    ERIC Educational Resources Information Center

    Dimitrov, Dimiter M.

    2016-01-01

    This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…

  8. Haplotype-based score test for linkage in nuclear families

    PubMed Central

    Nam, Chung Mo; Kang, Dae Ryong; Kim, Jinheum

    2007-01-01

    Background To look for genetic linkage between angiotensin-I converting enzyme(ACE) gene and hypertension in a Korean adolescent cohort, we developed a powerful test using the covariances between marginal differences and their variances in a transmission/non-transmission table. Results We estimated haplotype frequencies using the parental and affected offspring's genotypes and then constructed a transmission/non-transmission table for the parental haplotypes transmitted to the offspring. We then proposed a test for checking the marginal homogeneity in the table. Because the cells in the table were dependent due to the uncertainty of the parental haplotypes, we adopted a randomization procedure to estimate the significance of the observed test statistic. Simulations show that our test performs well on a nominal level and has a monotone power, which increases as the relative risk increases. With our test, there was no evidence of genetic linkage between the ACE gene and hypertension in the Korean adolescent cohort. Conclusion We developed a score test for linkage and used simulations to demonstrate that our test performs well at a nominal level. Under some situations where the diversity of haplotypes is low, the proposed test gained a little power over the method based on only variances between marginal differences in a transmission/non-transmission table. PMID:17672913

  9. ACT Test Performance by Advanced Placement Students in Memphis City Schools

    ERIC Educational Resources Information Center

    Mo, Lun; Yang, Fang; Hu, Xiangen; Calaway, Florance; Nickey, John

    2011-01-01

    The authors investigated the extent to which taking specific types of Advanced Placement (AP) courses and the number of courses taken predicts the likelihood of passing subject benchmarks and earning a score of 19 on the composite score on the ACT test, and examined the role gender plays in the projection. They found evidence that taking an AP…

  10. Recent Trends in Mean Scores and Characteristics of Test-Takers on "Praxis II" Licensure Tests

    ERIC Educational Resources Information Center

    Gitomer, Drew H.; Qi, Yi

    2010-01-01

    This study concerns the "highly qualified teacher" provisions of the "Elementary and Secondary Education Act" ("ESEA," 2002), as reauthorized, and other policies at the federal, state and local levels, which have aimed to elevate the content knowledge of teachers. This examination of "Praxis II" score trends…

  11. Generation of GHS Scores from TEST and online sources ...

    EPA Pesticide Factsheets

    Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat toxicity, developmental toxicity, endocrine activity, and mutagenicity. It can be used to evaluate ecotoxicity (in terms of acute fathead minnow toxicity) and fate (in terms of bioconcentration factor). It also be used to estimate a variety of key physicochemical properties such as melting point, boiling point, vapor pressure, water solubility, and bioconcentration factor. A web-based version of T.E.S.T. is currently being developed to allow predictions to be made from other web tools. Online data sources such as from NCCT’s Chemistry Dashboard, REACH dossiers, or from ChemHat.org can also be utilized to obtain GHS (Global Harmonization System) scores for comparing alternatives. The purpose of this talk is to show how GHS (Global Harmonization Score) data can be obtained from literature sources and from T.E.S.T. (Toxicity Estimation Software Tool). This data will be used to compare chemical alternatives in the alternatives assessment dashboard (a 2018 CSS product).

  12. Validity Issues for Performance-Based Tests Scored with Computer-Automated Scoring Systems.

    ERIC Educational Resources Information Center

    Clauser, Brian E.; Kane, Michael T.; Swanson, David B.

    2002-01-01

    Attempts to place the issues associated with computer-automated scoring within the context of current validity theory and presents a taxonomy of automated scoring procedures as a framework for discussing threats to validity that may take on increased importance for specific approaches to automated scoring. (SLD)

  13. Correlation of the Scores on Barron's Ego Strength Scale with the Scores on the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1979-01-01

    The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)

  14. Correlation of the Scores on Barron's Ego Strength Scale with the Scores on the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1979-01-01

    The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)

  15. ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

    ERIC Educational Resources Information Center

    Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

    2016-01-01

    Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…

  16. Computerized Scoring Algorithms for the Autobiographical Memory Test.

    PubMed

    Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

    2017-04-03

    Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record

  17. The Weighted Airman Promotion System: Standardizing Test Scores

    DTIC Science & Technology

    2008-01-01

    u th o ri ze d Top 3/E6 ratio, inventory 1401206040 100 70 130 5R 2F 2G 3N 2M 2A 4J 4C 4P 4T 4B 1W 2T 3P 1T 4A 2S 5J 1A 1S1C 6F 4N 7S 4R 4E 1N 3A 3V...mitigated by standardizing WAPS test scores. There is also a concern about AFSC mix in the pool of future enlisted leaders. The Air Force fills about 400...impact of longevity. Because AFSCs comprise members with different mixes of gen- eral and specialty knowledge, because tests can vary in difficulty

  18. Digit symbol substitution test score and hyperhomocysteinemia in older adults.

    PubMed

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-08-01

    Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions.Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments.This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = -1.6, 95% confidence interval [CI] = -2.8∼-0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = -1.1, 95% CI = -2.1∼-0.1, P = 0.04). We did not find an association between hyperhomocysteinemia and other

  19. Digit symbol substitution test score and hyperhomocysteinemia in older adults

    PubMed Central

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-01-01

    Abstract Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions. Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments. This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = −1.6, 95% confidence interval [CI] = −2.8∼−0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = −1.1, 95% CI = −2.1∼−0.1, P = 0.04). We did not find an association between

  20. Hypothesis Testing as an Act of Rationality

    NASA Astrophysics Data System (ADS)

    Nearing, Grey

    2017-04-01

    Statistical hypothesis testing is ad hoc in two ways. First, setting probabilistic rejection criteria is, as Neyman (1957) put it, an act of will rather than an act of rationality. Second, physical theories like conservation laws do not inherently admit probabilistic predictions, and so we must use what are called epistemic bridge principles to connect model predictions with the actual methods of hypothesis testing. In practice, these bridge principles are likelihood functions, error functions, or performance metrics. I propose that the reason we are faced with these problems is because we have historically failed to account for a fundamental component of basic logic - namely the portion of logic that explains how epistemic states evolve in the presence of empirical data. This component of Cox' (1946) calculitic logic is called information theory (Knuth, 2005), and adding information theory our hypothetico-deductive account of science yields straightforward solutions to both of the above problems. This also yields a straightforward method for dealing with Popper's (1963) problem of verisimilitude by facilitating a quantitative approach to measuring process isomorphism. In practice, this involves data assimilation. Finally, information theory allows us to reliably bound measures of epistemic uncertainty, thereby avoiding the problem of Bayesian incoherency under misspecified priors (Grünwald, 2006). I therefore propose solutions to four of the fundamental problems inherent in both hypothetico-deductive and/or Bayesian hypothesis testing. - Neyman (1957) Inductive Behavior as a Basic Concept of Philosophy of Science. - Cox (1946) Probability, Frequency and Reasonable Expectation. - Knuth (2005) Lattice Duality: The Origin of Probability and Entropy. - Grünwald (2006). Bayesian Inconsistency under Misspecification. - Popper (1963) Conjectures and Refutations: The Growth of Scientific Knowledge.

  1. Normal test scores in the Farnsworth-Munsell 100 hue test.

    PubMed

    Mantyjarvi, M

    2001-01-01

    One hundred and sixty persons aged from 10 to 69 years (106 women, 54 men) with healthy eyes were studied with the Farnsworth-Munsell 100 hue (FM100) test. The mean of the results in the total scores and in the individual box scores in the right and left eye were calculated. The total score was also separately calculated in women and men. The test was administered under the illumination of Macbeth Easel lamp, 1000 lux, and the right eye was tested first. The results were calculated in six different age groups, 10-19 years, 20-29 years, etc. The mean of the total scores in the right eye varied from 7.44+/-2.46 (SD) to 10.07+/-2.03 in different age groups and in the left eye from 7.56+/-2.36 to 10.16+/-2.68. The scores changed significantly with the age: the correlation between the age and the test scores by linear regression gave significant results, in the right eye (R = 0.308, P = 0.0001), and in the left eye (R = 0.246, P = 0.0021). The present study with the normal error scores in the FM100 test and its individual boxes in persons aged 10-69 years gives clinicians working with colour vision defects a possibility to estimate the normality or abnormality of the results in their patients.

  2. The Visual Aural Digit Span Test and Bender Gestalt Test as Predictors of Wide Range Achievement Test-Revised Scores.

    ERIC Educational Resources Information Center

    Smith, Teresa C.; Smith, Billy L.

    1988-01-01

    Examined Visual Aural Digit Span Test (VADS) and Bender-Gestalt (BG) scores as predictors of Wide Range Achievement Test-Revised (WRAT-R) scores among 115 elementary school students referred for low academic achievement. Divided children into three age groups. Results suggest BG and VADS Test can be effective screening devices for young children…

  3. The Visual Aural Digit Span Test and Bender Gestalt Test as Predictors of Wide Range Achievement Test-Revised Scores.

    ERIC Educational Resources Information Center

    Smith, Teresa C.; Smith, Billy L.

    1988-01-01

    Examined Visual Aural Digit Span Test (VADS) and Bender-Gestalt (BG) scores as predictors of Wide Range Achievement Test-Revised (WRAT-R) scores among 115 elementary school students referred for low academic achievement. Divided children into three age groups. Results suggest BG and VADS Test can be effective screening devices for young children…

  4. The Graduate Management Admission Test: Technical Report on Test Development and Score Interpretation for GMAT Users.

    ERIC Educational Resources Information Center

    Schrader, William B.

    This report provides information on test development, test administration, and score interpretation for the Graduate Management Admission Test (GMAT). The GMAT, first administered in 1954, provides objective measures of an applicant's abilities for use in admissions decisions by graduate management schools. It is currently composed of five…

  5. Use of Practice Tests in the Prediction of GED Test Scores.

    ERIC Educational Resources Information Center

    Herring, Warren

    1999-01-01

    Analyzes the relationship between success on the two new practice-test forms (EE and FF) developed by Steck-Vaughn and success on the General Educational Development (GED) test. Success with practice-form EE correlated with GED test scores; form FF failed to correlate. (JOW)

  6. The Effect of Sex on General Aptitude Test Battery Validity and Test Scores.

    ERIC Educational Resources Information Center

    Synk, David J.

    This study used meta-analysis research techniques to determine if there are differences in General Aptitude Test Battery (GATB) validities and test scores between males and females. The sample consisted of 26,111 subjects from 122 Specific Aptitude Test Battery (SATB) validation or revalidation studies analyzed since 1972. Four approaches were…

  7. Your move: The effect of chess on mathematics test scores

    PubMed Central

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1–3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students. PMID:28494023

  8. Your move: The effect of chess on mathematics test scores.

    PubMed

    Rosholm, Michael; Mikkelsen, Mai Bjørnskov; Gumede, Kamilla

    2017-01-01

    We analyse the effect of substituting a weekly mathematics lesson in primary school grades 1-3 with a lesson in mathematics based on chess instruction. We use data from the City of Aarhus in Denmark, combining test score data with a comprehensive data set obtained from administrative registers. We use two different methodological approaches to identify and estimate treatment effects and we tend to find positive effects, indicating that knowledge acquired through chess play can be transferred to the domain of mathematics. We also find larger impacts for unhappy children and children who are bored in school, perhaps because chess instruction facilitates learning by providing an alternative approach to mathematics for these children. The results are encouraging and suggest that chess may be an important and effective tool for improving mathematical capacity in young students.

  9. Discrepancies between modified Medical Research Council dyspnea score and COPD assessment test score in patients with COPD

    PubMed Central

    Rhee, Chin Kook; Kim, Jin Woo; Hwang, Yong Il; Lee, Jin Hwa; Jung, Ki-Suck; Lee, Myung Goo; Yoo, Kwang Ha; Lee, Sang Haak; Shin, Kyeong-Cheol; Yoon, Hyoung Kyu

    2015-01-01

    Background and objective According to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines, either a modified Medical Research Council (mMRC) dyspnea score of ≥2 or a chronic obstructive pulmonary disease (COPD) assessment test (CAT) score of ≥10 is considered to represent COPD patients who are more symptomatic. We aimed to identify the ideal CAT score that exhibits minimal discrepancy with the mMRC score. Methods A receiver operating characteristic curve of the CAT score was generated for an mMRC scores of 1 and 2. A concordance analysis was applied to quantify the association between the frequencies of patients categorized into GOLD groups A–D using symptom cutoff points. A κ-coefficient was calculated. Results For an mMRC score of 2, a CAT score of 15 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.70 and 0.66, respectively (area under the receiver operating characteristic curve [AUC] 0.74; 95% confidence interval [CI], 0.70–0.77). For an mMRC score of 1, a CAT score of 10 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.77 and 0.65, respectively (AUC 0.77; 95% CI, 0.72–0.83). The κ value for concordance was highest between an mMRC score of 1 and a CAT score of 10 (0.66), followed by an mMRC score of 2 and a CAT score of 15 (0.56), an mMRC score of 2 and a CAT score of 10 (0.47), and an mMRC score of 1 and a CAT score of 15 (0.43). Conclusion A CAT score of 10 was most concordant with an mMRC score of 1 when classifying patients with COPD into GOLD groups A–D. However, a discrepancy remains between the CAT and mMRC scoring systems. PMID:26316736

  10. Characteristics of College and Non-College Attendants Who Submitted ACT Scores to a Black College.

    ERIC Educational Resources Information Center

    Berry, Charles A.

    The purpose of the study was to analyze selected characteristics of 3,385 high school graduates submitting American College Tests (ACT) profile reports to a black college. Additional data was obtained via questionnaire from 1,059 graduates that were non-attendants. These samples were subdivided into the following groups: attendants at black and…

  11. Score Gains on "g"-Loaded Tests: No "g"

    ERIC Educational Resources Information Center

    te Nijenhuis, Jan; van Vianen, Annelies E. M.; van der Flier, Henk

    2007-01-01

    IQ scores provide the best general predictor of success in education, job training, and work. However, there are many ways in which IQ scores can be increased, for instance by means of retesting or participation in learning potential training programs. What is the nature of these score gains? Jensen [Jensen, A. R. (1998a). "The g factor: The…

  12. Money Improves Test Scores--Even State-Level SATs.

    ERIC Educational Resources Information Center

    Bracey, Gerald W.

    1996-01-01

    Three former secretaries of education--William Bennett, Lauro Cavazos, and Terrel Bell--have touted state-level SAT scores as proof that educational financing does not matter. Recently, Brian Powell and Lala Carr Steelman adjusted scores for participation rate and detected a very strong relationship between expenditures and SAT scores. Bigger…

  13. Score Gains on "g"-Loaded Tests: No "g"

    ERIC Educational Resources Information Center

    te Nijenhuis, Jan; van Vianen, Annelies E. M.; van der Flier, Henk

    2007-01-01

    IQ scores provide the best general predictor of success in education, job training, and work. However, there are many ways in which IQ scores can be increased, for instance by means of retesting or participation in learning potential training programs. What is the nature of these score gains? Jensen [Jensen, A. R. (1998a). "The g factor: The…

  14. An Investigation into the Relationships Between Cloze Test Scores and Informal Reading Inventory Scores of Fifth Grade Pupils.

    ERIC Educational Resources Information Center

    Walter, Richard Barry

    This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…

  15. A method for increasing the scoring efficiency of the Farnsworth-Munsell 100-Hue test.

    PubMed

    Craven, B J

    1997-03-01

    This paper describes a method for scoring the Farnsworth-Munsell 100-Hue test, based on maximum-likelihood estimation, which in theory reduces test-to-test variability in scores and which is therefore better able to discriminate between different levels of overall colour discrimination than is the original Farnsworth scoring system. Error scores produced by the method are directly comparable to error scores produced by the traditional scoring system. It is hoped that this work will provoke further consideration of the efficiency of the scoring system as far as test-to-test variability is concerned, including the efficient detection of polarity in the subject's hue discrimination function.

  16. Empirical Bayes Estimates of Domain Scores under Binomial and Hypergeometric Distributions for Test Scores.

    ERIC Educational Resources Information Center

    Lin, Miao-Hsiang; Hsiung, Chao A.

    1994-01-01

    Two simple empirical approximate Bayes estimators are introduced for estimating domain scores under binomial and hypergeometric distributions respectively. Criteria are established regarding use of these functions over maximum likelihood estimation counterparts. (SLD)

  17. The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

    ERIC Educational Resources Information Center

    Retnawati, Heri

    2015-01-01

    This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…

  18. Volatility in School Test Scores: Implications for Test-Based Accountability Systems

    ERIC Educational Resources Information Center

    Kane, Thomas J.; Staiger, Douglas O.

    2002-01-01

    By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…

  19. The Relationship between Deductive Reasoning Ability, Test Anxiety, and Standardized Test Scores in a Latino Sample

    ERIC Educational Resources Information Center

    Rich, John D., Jr.; Fullard, William; Overton, Willis

    2011-01-01

    One Hundred and Twelve Latino students from Philadelphia participated in this study, which examined the development of deductive reasoning across adolescence, and the relation of reasoning to test anxiety and standardized test scores. As predicted, 11th and ninth graders demonstrated significantly more advanced reasoning than seventh graders.…

  20. Volatility in School Test Scores: Implications for Test-Based Accountability Systems

    ERIC Educational Resources Information Center

    Kane, Thomas J.; Staiger, Douglas O.

    2002-01-01

    By the spring of 2000, forty states had begun using student test scores to rate school performance. Twenty states have gone a step further and are attaching explicit monetary rewards or sanctions to a school's test performance. In this paper, the authors focus on accountability programs in which states measure the effectiveness of individual…

  1. State Test Score Trends through 2007-08, Part 3: Are Achievement Gaps Closing and Is Achievement Rising for All?

    ERIC Educational Resources Information Center

    Chudowsky, Naomi; Chudowsky, Victor; Kober, Nancy

    2009-01-01

    A main goal of the No Child Left Behind Act (NCLB) is to close gaps in test scores between different groups of students while raising achievement for all groups. Of particular concern are the persistent achievement gaps between African American, Latino, and Native American students and their white and Asian counterparts, and between students from…

  2. Test/score/report: Simulation techniques for automating the test process

    NASA Technical Reports Server (NTRS)

    Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

    1994-01-01

    A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary

  3. Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

    ERIC Educational Resources Information Center

    Meijer, Rob R.

    2004-01-01

    Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…

  4. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    ERIC Educational Resources Information Center

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  5. Directions for Scoring Typing Tests Taken Either on a Typewriter or a Computer.

    ERIC Educational Resources Information Center

    Kump, Ann

    Directions are given for scoring typing tests taken on a typewriter or on a computer using special software. The speed score (gross words per minute) is obtained by determining the total number of strokes typed, and dividing by 25. The accuracy score is obtained by comparing the examinee's test paper to the appropriate scoring key and counting the…

  6. A new scoring system for the Spraings Multiple Choice Bender Gestalt Test.

    PubMed

    Friedman, A F; Wakefield, J A; Sasek, J; Schroeder, D

    1977-01-01

    A new scoring procedure to be used with Spraings' technique for administering the Bender-Gestalt test in a multiple choice format is presented. Scoring weights are used instead of simply scoring each item right or wrong. The evidence presented suggests that this method of scoring would increase the value of Spraings' test in the diagnosis of perceptual deficits.

  7. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    ERIC Educational Resources Information Center

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  8. Estimating the Consistency and Accuracy of Classifications Based on Test Scores.

    ERIC Educational Resources Information Center

    Livingston, Samuel A.; Lewis, Charles

    This paper presents a method for estimating the accuracy and consistency of classifications based on test scores. The scores can be produced by any scoring method, including the formation of a weighted composite. The estimates use data from a single form. The reliability of the score is used to estimate its effective test length in terms of…

  9. A Procedure for Linear Polychotomous Scoring of Test Items (Computer Diskette).

    DTIC Science & Technology

    weights that are then associated with the response categories of test items . When tests are scored using these scoring weights, test reliability...program poly. The example demonstrates how polyweighting can be used to calibrate and score test items drawn from an item bank that is too large to

  10. Using Subpopulation Invariance to Assess Test Score Equity

    ERIC Educational Resources Information Center

    Dorans, Neil J.

    2004-01-01

    Score equity assessment (SEA) is introduced, and placed within a fair assessment context that includes differential prediction or fair selection and differential item functioning. The notion of subpopulation invariance of linking functions is central to the assessment of score equity, just as it has been for differential item functioning and…

  11. Variability of Test Scores and the Split-Half Reliability Coefficient

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.

    1970-01-01

    Results of this study indicate that the correlation between half-test scores over repeated splits, over persons, and over repeated testings resulting in different sets of observed scores, is given by Kuder-Richardson Formula 21. (RF)

  12. The Effect of Schooling and Ability on Achievement Test Scores. NBER Working Paper Series.

    ERIC Educational Resources Information Center

    Hansen, Karsten; Heckman, James J.; Mullen, Kathleen J.

    This study developed two methods for estimating the effect of schooling on achievement test scores that control for the endogeneity of schooling by postulating that both schooling and test scores are generated by a common unobserved latent ability. The methods were applied to data on schooling and test scores. Estimates from the two methods are in…

  13. Effects of white noise on Callsign Acquisition Test and Modified Rhyme Test scores.

    PubMed

    Blue-Terry, Misty; Letowski, Tomasz

    2011-02-01

    The Callsign Acquisition Test (CAT) is a speech intelligibility test developed by the US Army Research Laboratory. The test has been used to evaluate speech transmission through various communication systems but has not been yet sufficiently standardised and validated. The aim of this study was to compare CAT and Modified Rhyme Test (MRT) performance in the presence of white noise across a range of signal-to-noise ratios (SNRs). A group of 16 normal-hearing listeners participated in the study. The speech items were presented at 65 dB(A) in the background of white noise at SNRs of -18, -15, -12, -9 and -6 dB. The results showed a strong positive association (75.14%) between the two tests, but significant differences between the CAT and MRT absolute scores in the range of investigated SNRs. Based on the data, a function to predict CAT scores based on existing MRT scores and vice versa was formulated. STATEMENT OF RELEVANCE: This work compares performance data of a common speech intelligibility test (MRT) with a new test (CAT) in the presence of white noise. The results here can be used as a part of the standardisation procedures and provide insights to the predictive capabilities of the CAT to quantify speech intelligibility communication in high-noise military environments.

  14. How Parents Can Help Kids Improve Test Scores: Taking the Stakes out of Literacy Testing

    ERIC Educational Resources Information Center

    Schneider, Steven

    2006-01-01

    In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…

  15. Evaluating the Impact of Test Accommodations on Test Scores of LEP Students & Non-LEP Students.

    ERIC Educational Resources Information Center

    Hafner, Anne L.

    Using a quasi-experimental analysis of variance (ANOVA) design, this project examined the effects of the use of accommodations with students of limited English proficiency (LEP) and non-LEP students and whether the use of accommodations affected the validity of test score interpretations. Major accommodations examined were extra time, and extra…

  16. Predicting Teacher Certification Success: The Effect of Cumulative Grade Point Average and Preprofessional Academic Skills Test Scores on Testing Performance

    ERIC Educational Resources Information Center

    Hernandez, Barbara L. Michiels; Ward, Susan; Strickland, George

    2006-01-01

    Legislative mandates and reforms hold universities accountable for student certification test performance. The purpose of this investigation was to determine if cumulative grade point average scores and the preprofessional academic skills test scores predict performance on elementary certification test (professional development) scores of…

  17. Evidence-Based Decision about Test Scoring Rules in Clinical Anatomy Multiple-Choice Examinations

    ERIC Educational Resources Information Center

    Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia

    2015-01-01

    In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…

  18. A Diet Score Assessing Norwegian Adolescents’ Adherence to Dietary Recommendations—Development and Test-Retest Reproducibility of the Score

    PubMed Central

    Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

    2016-01-01

    Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312

  19. Allometric scaling of Wingate anaerobic power test scores in men.

    PubMed

    Stickley, Christopher D; Hetzler, Ronald K; Wages, Jennifer J; Freemyer, Bret G; Kimura, Iris F

    2013-09-01

    This study examined the appropriate magnitude of allometric scaling of the Wingate anaerobic test (WAnT) power data for body mass (BM) and established normative data for the WAnT for adult men. Eighty-three men completed a standard WAnT using 0.1 kg·kg(-1) BM resistance. Allometric exponents and percentile ranks for 1-second peak power (PP), 5-second PP, and mean power (MP) were established. The Predicted Residual Sum of Squares (PRESS) procedure was used to assess external validity while avoiding data splitting. The mean 1-second PP, 5-second PP, and MP were 1,049.1 ± 168.8 W, 1,013.4 ± 158.6 W, and 777.9 ± 105.0 W, respectively. Allometric exponents for 1-second PP, 5-second PP, and MP scaled for BM were b = 0.89, 0.88, and 0.86, respectively. Correlations between allometrically scaled 1-second PP, 5-second PP, and MP, and BM were r = -0.03, -0.03, and -0.02, respectively, suggesting that the allometric exponents derived were effective in partialling out the effect of BM on WAnT values. The PRESS procedure values resulted in small decreases in R² (0.03, 0.04, and 0.02 for 1-second PP, 5-second PP, and MP, respectively) suggesting acceptable levels of external validity when applied to independent samples. The allometric exponents and normative values provide a useful tool for comparing WAnT scores in college-aged females without the confounding effect of BM. It is suggested that exponents of b = 0.89 (1-second PP), b = 0.88 (5-second PP), and b = 0.86 (MP) be used for allometrically scaling WAnT power values in healthy adult men and that the confidence limits for these allometric exponents be considered as 0.66-1.0 for PP and 0.69-1.0 for MP. The use of these exponents in allometric scaling of male WAnT power values provide coaches and practitioners with valid means for comparing power production between individuals without the confounding influence of BM.

  20. A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

    ERIC Educational Resources Information Center

    Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu

    2013-01-01

    Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

  1. Determining the Relationship of Nursing Test Scores and Test Anxiety Levels before and after a Test-Taking Strategy Seminar.

    ERIC Educational Resources Information Center

    Carraway, Cassandra T.

    A study was conducted to determine whether participation in a test-taking strategy seminar significantly decreased test anxiety in first-year nursing students. The study also sought to compare nursing test scores of first-year nursing students who participated in the seminar with those who did not. The sample consisted of 30 first-year nursing…

  2. The Effect of Geographic Area on General Aptitude Test Battery Validity and Test Scores. USES Test Research Report No. 48.

    ERIC Educational Resources Information Center

    Swarthout, David; And Others

    Meta-analysis techniques were used to determine whether or not there are differences in General Aptitude Test Battery (GATB) validity and test scores between samples from the Northern, Southern, and Western United States census areas. The sample consisted of 26,111 subjects from 122 validation studies of the Specific Aptitude Test Battery (SATB).…

  3. Why African American College Students Miss the Perfect Test Score

    ERIC Educational Resources Information Center

    Gentry, Ruben; Stokes, Dorothy

    2016-01-01

    Many African Americans were imbued with the cliché that they must work twice as hard as others to be a success in life. Entering college, students with this belief put extensive effort into earning top grades to ensure quality preparation for their chosen career; yet, some fail to earn top scores. Why? This is the million dollar question, but the…

  4. High-Stakes Testing and Student Achievement: Problems for the No Child Left Behind Act

    ERIC Educational Resources Information Center

    Nichols, Sharon L.; Glass, Gene V.; Berliner, David C.

    2005-01-01

    Under the federal No Child Left Behind Act of 2001 (NCLB), standardized test scores are the indicator used to hold schools and school districts accountable for student achievement. Each state is responsible for constructing an accountability system, attaching consequences--or stakes--for student performance. The theory of action implied by this…

  5. Using Heteroskedastic Ordered Probit Models to Recover Moments of Continuous Test Score Distributions from Coarsened Data

    ERIC Educational Resources Information Center

    Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D.

    2017-01-01

    Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…

  6. Effects of Order for the Administration of Subtests on Standardized Reading Test Scores.

    ERIC Educational Resources Information Center

    Paradis, Edward; Peterson, Joe

    This study, involving 131 students in grades ten, eleven, and twelve, investigated the effects of order of administration of subtests on scores from the Nelson-Denny Reading Test. Results indicated that order of administration had no significant effect on scores from the vocabulary subtest or on the total test score, but subjects taking the…

  7. Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

    ERIC Educational Resources Information Center

    Matton, Nadine; Vautier, Stephane; Raufaste, Eric

    2009-01-01

    Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…

  8. Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

    ERIC Educational Resources Information Center

    Matton, Nadine; Vautier, Stephane; Raufaste, Eric

    2009-01-01

    Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…

  9. Difficulty and Discriminating Indices of Three-Multiple Choice Tests Using the Confidence Scoring Procedure

    ERIC Educational Resources Information Center

    Omirin, M. S.

    2007-01-01

    The study investigated the comparison of the difficulty and discrimination incides of three multiple choice tests using the confidence scoring procedure (CSP). The study was also set to determine whether or not the difficulty and discrimination indices would be improved, if the tests were scored by the confidence scoring procedure. Two null…

  10. Using Heteroskedastic Ordered Probit Models to Recover Moments of Continuous Test Score Distributions from Coarsened Data

    ERIC Educational Resources Information Center

    Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D.

    2017-01-01

    Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…

  11. Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats

    PubMed Central

    Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

    2016-01-01

    Background Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. Objective To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. Methods This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. Results 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Conclusions Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions. PMID:27900181

  12. Improving Test Score Reporting: Perspectives from the ETS Score Reporting Conference. Research Report. ETS RR-11-45

    ERIC Educational Resources Information Center

    Zapata-Rivera, Diego, Ed.; Zwick, Rebecca, Ed.

    2011-01-01

    This volume includes 3 papers based on presentations at a workshop on communicating assessment information to particular audiences, held at Educational Testing Service (ETS) on November 4th, 2010, to explore some issues that influence score reports and new advances that contribute to the effectiveness of these reports. Jessica Hullman, Rebecca…

  13. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 21 Food and Drugs 8 2014-04-01 2014-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum or...

  14. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 21 Food and Drugs 8 2013-04-01 2013-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum or...

  15. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 21 Food and Drugs 8 2012-04-01 2012-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum or...

  16. The Role of Test Scores in Explaining Race and Gender Differences in Wages

    ERIC Educational Resources Information Center

    Blackburn, McKinley L.

    2004-01-01

    Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…

  17. Interpreting Changes over Time in High School Average ACT® College Readiness Assessment Composite Scores and ACT College Readiness Benchmark Attainment Rates. ACT Research Report Series, 2013 (9)

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Education officials and journalists frequently track changes over time in the average ACT® College Readiness Assessment Composite scores and ACT College Readiness Benchmark attainment rates of individual high schools. Using standard statistical methods, I examined how often changes in these statistics are unambiguously positive or negative, rather…

  18. A "Rearrangement Procedure" for Scoring Adaptive Tests with Review Options

    ERIC Educational Resources Information Center

    Papanastasiou, Elena C.; Reckase, Mark D.

    2007-01-01

    Because of the increased popularity of computerized adaptive testing (CAT), many admissions tests, as well as certification and licensure examinations, have been transformed from their paper-and-pencil versions to computerized adaptive versions. A major difference between paper-and-pencil tests and CAT from an examinee's point of view is that in…

  19. An Error Score Model for Time-Limit Tests

    ERIC Educational Resources Information Center

    Ven, A. H. G. S. van der

    1976-01-01

    A more generalized error model for time-limit tests is developed. Model estimates are derived for right-attempted and wrong-attempted correlations both within the same test and between different tests. A comparison is made between observed correlations and their model counterparts and a fair agreement is found between observed and expected…

  20. End-box scoring artefact evaluation of the Farnsworth-Munsell 100-Hue colour vision test.

    PubMed

    Viliūnas, V; Lukauskiene, R; Svegzda, A; Zukauskas, A

    2006-11-01

    The scoring artefact in the Farnsworth-Munsell 100-Hue test, arising from the grouping of the caps into four boxes, was investigated. The traditional method of scoring performed with the numbers of the anchor caps disregarded and the alternative scoring performed with the numbers of the anchor caps employed, were compared. For the traditional method of scoring, we revealed an increase of the error score of the outside (end-box) caps when the total error score was above 240. On the contrary for scoring performed with the numbers of the anchor caps employed, the difference between the error score of the outside caps and the average error per cap is not significant. To mitigate the end-box artefact and to improve the reliability of the Farnsworth-Munsell 100-Hue test, corrections to the traditional method of scoring are proposed.

  1. Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

    PubMed

    Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

    2001-01-01

    Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.

  2. Validity Evidence for ACT Compass® Placement Tests. ACT Research Report Series 2014 (2)

    ERIC Educational Resources Information Center

    Westrick, Paul A.; Allen, Jeff

    2014-01-01

    We examined the validity of using Compass® test scores and high school grade point average (GPA) for placing students in first-year college courses and for identifying students at risk of not succeeding. Consistent with other research, the combination of high school GPA and Compass scores performed better than either measure used alone. Results…

  3. [The five-word test in three age-groups of mild Alzheimer's disease (60, 70 and 80 year-old patients): Utility of the Total Score, Total Weighted Score, Learning Score and Memory Score].

    PubMed

    Croisile, B; Astier, J-L; Beaumont, C; Mollion, H

    2010-01-01

    The five-word test (5WT) is a serial verbal memory test with semantic cuing. It is proposed to rapidly evaluate memory of aging people and has previously shown its sensitivity and its specificity in identifying patients with AD. It measures the efficacy of free and cued recalls during a procedure of immediate and delayed recalls. The 5WT was compared in a group of 202 normal subjects and a group of 302 mild AD patients (MMS of 20 or more) aged from 60 to 92 years, in three age classes (60 years, 70 years, 80 years). Nine scores were measured (Total Score, Total Weighted Score, Free Immediate Recall, Learning Score=total of Immediate Recalls, Free Delayed Recall, Memory Score=total of Delayed Recalls, Forgetting Rate, Percentage of Immediate Cuing, Percentage of Delayed Cuing) as well as the presence of intrusions. For each age class, Receiver Operating Characteristic curves determined the most significant cut-off scores. For each score of the 5WT, AD patients differed significantly from controls. The cut-off scores were not the same according to age. For the Total Score, the cut-off scores were 10 (60 years), 9 (70 years) and 8 (80 years), whereas the cut-off scores of the Total Weighted Score were 17 (60 years), 16 (70 years) and 14 (80 years). As suggested by Cowppli-Bony et al. (2005), the Total Weighted Score (which gives a higher coefficient to free recalls) was better than the Total Score for discriminating mild AD. The 5WT is useful to discriminate normal controls and mild AD patients. Normal aged subjects displayed good encoding, efficient stocking and consolidation (few forgetting, efficient cued recall), intrusions were rare. Mild AD patients were characterized by weak encoding of words and severe deficit for stocking and consolidation (important forgetting, impaired cued recall), they made numerous intrusions. This psychometric profile is characteristic of the amnestic hippocampal syndrome found in AD. The 5WT is a simple and reliable test for

  4. Construct Validity and Test Re-Test Reliability of the Forgotten Joint Score.

    PubMed

    Thompson, Simon M; Salmon, Lucy J; Webb, Justin M; Pinczewski, Leo A; Roe, Justin P

    2015-11-01

    Consecutive patients undergoing knee arthroplasty completed questionnaires: FJS, Knee Injury and Osteoarthritis Outcome Score (KOOS) and WOMAC Score (mean 39 months after surgery), and were mailed a repeat questionnaire after 4 to 6 weeks. The test-retest reliability was almost perfect for the FJS (ICC = 0.97), and the FJS subdomains (ICC > 0.8). Convergent construct validity of the FJS was correlated with the KOOS Subscores of Quality of Life (0.63, P = 0.001), Symptom (0.33, P = 0.001), Pain (0.68, P = 0.001) and ADL (0.66, P = 0.001) and the Total WOMAC (0.70, P = 0.001). The FJS demonstrates high test-retest reliability and construct validity compared to the Normalised WOMAC and KOOS Subscales. The FJS does not demonstrate the ceiling effect of the WOMAC or KOOS pain scores so may have greater discriminatory ability following TKR. Copyright © 2015 Elsevier Inc. All rights reserved.

  5. TCAP Scores and per Pupil Expenditures: Statewide Changes before and after Tennessee's First to the Top Act

    ERIC Educational Resources Information Center

    Cantrell, Martha Ely

    2013-01-01

    The purpose of this study was to investigate the relationships between the changes in Tennessee Comprehensive Assessment Program (TCAP) scores and the changes in Per Pupil Expenditures (PPE) after the enactment of "First to the Top Act of 2010" and the receipt of $501,000,000 in federal Race to the Top (RTTT) grant monies. Half of that…

  6. A Preliminary Attempt at Teaching Abstract Mathematics to Freshmen with an A.C.T. Score of Less Than 15.

    ERIC Educational Resources Information Center

    Schremmer, A. G.

    This experiment attempted to teach abstract mathematics fo college freshmen with A.C.T. scores less than 15 in a three semester terminal course sequence. The course content included a formal mathematical language, set theory, Boolean Algebra, relations and functions, operations, cardinals and ordinals, the rational numbers, and college algebra.…

  7. Alignment, High Stakes, and the Inflation of Test Scores

    ERIC Educational Resources Information Center

    Koretz, Daniel

    2005-01-01

    No one can dispute that tests should measure important content, and for many (but not all) purposes, tests should be aligned with curricular goals. Thus in many cases, alignment is clearly better than the alternative, and nothing that follows here argues otherwise. Unfortunately, however, this does not imply that alignment is sufficient protection…

  8. The Scoring of Matching Questions Tests: A Closer Look

    ERIC Educational Resources Information Center

    Jancarík, Antonín; Kostelecká, Yvona

    2015-01-01

    Electronic testing has become a regular part of online courses. Most learning management systems offer a wide range of tools that can be used in electronic tests. With respect to time demands, the most efficient tools are those that allow automatic assessment. The presented paper focuses on one of these tools: matching questions in which one…

  9. Printing Performance School Readiness Test: Administration and Scoring Manual.

    ERIC Educational Resources Information Center

    Simner, Marvin L.

    The Printing Performance School Readiness Test is an empirically derived instrument designed to aid in the early identification of preschool children who are at risk for school failure. The test is based on the outcome of a research program dealing with various aspects of children's printing that involved over 400 normal, non-repeating, native…

  10. Mantoux tests. Performing, interpreting, and acting upon them.

    PubMed Central

    Pennie, R. A.

    1995-01-01

    Interpreting and acting upon results of the tuberculin skin test (Mantoux test) can be confusing because the test is inherently ambiguous and published recommendations for investigation and prevention of tuberculosis are complex. This article presents a step-by-step guide to enable family physicians to make effective use of the Mantoux test. PMID:7780314

  11. Noncognitive Skills and the Gender Disparities in Test Scores and Teacher Assessments: Evidence from Primary School

    ERIC Educational Resources Information Center

    Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica

    2013-01-01

    Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…

  12. Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

    ERIC Educational Resources Information Center

    Klesch, Heather S.

    2010-01-01

    The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…

  13. Discrepancies between Score Trends from NAEP and State Tests: A Scale-Invariant Perspective

    ERIC Educational Resources Information Center

    Ho, Andrew D.

    2007-01-01

    State test score trends are widely interpreted as indicators of educational improvement. To validate these interpretations, state test score trends are often compared to trends on other tests such as the National Assessment of Educational Progress (NAEP). These comparisons raise serious technical and substantive concerns. Technically, the most…

  14. School Inputs, Household Substitution, and Test Scores. NBER Working Paper No. 16830

    ERIC Educational Resources Information Center

    Das, Jishnu; Dercon, Stefan; Habyarimana, James; Krishnan, Pramila; Muralidharan, Karthik; Sundararaman, Venkatesh

    2011-01-01

    Empirical studies of the relationship between school inputs and test scores typically do not account for the fact that households will respond to changes in school inputs. We present a dynamic household optimization model relating test scores to school and household inputs, and test its predictions in two very different low-income country…

  15. Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

    ERIC Educational Resources Information Center

    Klesch, Heather S.

    2010-01-01

    The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…

  16. Further Validation of the Qualitative Scoring System for the Modified Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Brannigan, Gary G.; And Others

    1995-01-01

    Compares the Qualitative Scoring System and the Developmental Scoring Systems, both Bender-Gestalt tests, in predicting achievement on the Metropolitan Achievement Test (MAT). In this study, first through fourth graders (n=409) from regular elementary schools were subjected to both tests; both systems correlated significantly with school…

  17. School Inputs, Household Substitution, and Test Scores. NBER Working Paper No. 16830

    ERIC Educational Resources Information Center

    Das, Jishnu; Dercon, Stefan; Habyarimana, James; Krishnan, Pramila; Muralidharan, Karthik; Sundararaman, Venkatesh

    2011-01-01

    Empirical studies of the relationship between school inputs and test scores typically do not account for the fact that households will respond to changes in school inputs. We present a dynamic household optimization model relating test scores to school and household inputs, and test its predictions in two very different low-income country…

  18. Further Validation of the Qualitative Scoring System for the Modified Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Brannigan, Gary G.; And Others

    1995-01-01

    Compares the Qualitative Scoring System and the Developmental Scoring Systems, both Bender-Gestalt tests, in predicting achievement on the Metropolitan Achievement Test (MAT). In this study, first through fourth graders (n=409) from regular elementary schools were subjected to both tests; both systems correlated significantly with school…

  19. Using Expected Growth Size Estimates To Summarize Test Score Changes. ERIC/AE Digest.

    ERIC Educational Resources Information Center

    Russell, Michael

    An earlier Digest described the shortcomings of three methods commonly used to summarize changes in test scores. This Digest describes two less commonly used approaches for examining changes in test scores, those of Standardized Growth Estimates and Effect Sizes. Aspects of these two approaches are combined and applied to the Iowa Test of Basic…

  20. Stability of scores for the Slosson Full-Range Intelligence Test.

    PubMed

    Williams, Thomas O; Eaves, Ronald C; Woods-Groves, Suzanne; Mariano, Gina

    2007-08-01

    The test-retest stability of the Slosson Full-Range Intelligence Test by Algozzine, Eaves, Mann, and Vance was investigated with test scores from a sample of 103 students. With a mean interval of 13.7 mo. and different examiners for each of the two test administrations, the test-retest reliability coefficients for the Full-Range IQ, Verbal Reasoning, Abstract Reasoning, Quantitative Reasoning, and Memory were .93, .85, .80, .80, and .83, respectively. Mean differences from the test-retest scores were not statistically significantly different for any of the scales. Results suggest that Slosson scores are stable over time even when different examiners administer the test.

  1. D.C. Student Test Scores Show Uneven Progress. Data Snapshot

    ERIC Educational Resources Information Center

    DuPre, Mary

    2011-01-01

    Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…

  2. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  3. A Seven-Year Follow-Up of Intelligence Test Scores of Foster Grandparents

    ERIC Educational Resources Information Center

    Troll, Lillian E.; And Others

    1976-01-01

    After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…

  4. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  5. Characteristics of Patients with High Lie Scores in a Personality Test.

    PubMed

    Kasai, Yuichi; Sakakibara, Toshihiko; Mizuno, Tetsutaro

    2017-01-01

    It is reported that persons with high Lie score (L score) of a personality test are aggressively self-confident and are also related to depression or schizophrenia In this study, we examined the characteristics of patients with high L scores on the Maudsley Personality Inventory (MPI) and examined the significance of the L score. We collected the data of 10789 subjects and examined the relationship between L score or the number of characteristic biased persons and the parameters of age, sex, education level, occupation, and degree of pain. Furthermore, we examined the changes in extraversion-introversion (E score), neuroticism (N score), and L scores at approximately 1 year after surgery in 1711 patients who underwent surgery at our university hospital or affiliated hospitals. L score was significantly higher among persons with a high degree of pain, and ratio of the characteristic biased persons in L score was significantly high among persons in their 40s to 60s, healthcare professionals and those with a high degree of pain. Moreover, L score scarcely changed between before and after surgery when compared with E score and N score. L score is not greatly influenced by an individual's state of mind or situation at different times, and may indicate the personality traits proper to the person. It is shown that L score may indicate the personality trait characteristics of persons who want to make themselves look good in the eyes of other.

  6. The value of Bayes' theorem for interpreting abnormal test scores in cognitively healthy and clinical samples.

    PubMed

    Gavett, Brandon E

    2015-03-01

    The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed.

  7. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    ERIC Educational Resources Information Center

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  8. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    ERIC Educational Resources Information Center

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  9. Effects of a Coaching Class on the ACT Scores of Students at a Large Midwest High School

    ERIC Educational Resources Information Center

    Moss, Gary L.; Chippendale, Ene Kaja; Mershon, Clark W.; Carney, Trisha

    2012-01-01

    The role of standardized tests in general and college admission tests in particular is a hot topic for educators, administrators and others evaluating the outcomes of educational institutions. The importance of the test scores for both students and institutions is acknowledged, and the test preparation industry has spawned into a multi-billion…

  10. Clinical Importance of the Heel Drop Test and a New Clinical Score for Adult Appendicitis

    PubMed Central

    Ahn, Shin; Lee, Hyeji; Choi, Wookjin; Ahn, Ryeok; Hong, Jung-Suk; Sohn, Chang Hwan; Seo, Dong Woo; Lee, Yoon-Seon; Lim, Kyung Soo; Kim, Won Young

    2016-01-01

    Objective We tried to evaluate the accuracy of the heel drop test in patients with suspected appendicitis and tried to develop a new clinical score, which incorporates the heel drop test and other parameters, for the diagnosis of this condition. Methods We performed a prospective observational study on adult patients with suspected appendicitis at two academic urban emergency departments between January and August 2015. The predictive characteristics of each parameter, along with heel drop test results were calculated. A composite score was generated by logistic regression analysis. The performance of the generated score was compared to that of the Alvarado score. Results Of the 292 enrolled patients, 165 (56.5%) had acute appendicitis. The heel drop test had a higher predictive value than rebound tenderness. Variables and their points included in the new (MESH) score were pain migration (2), elevated white blood cell (WBC) >10,000/μL (3), shift to left (2), and positive heel drop test (3). The MESH score had a higher AUC than the Alvarado score (0.805 vs. 0.701). Scores of 5 and 11 were chosen as cut-off values; a MESH score ≥5 compared to an Alvarado score ≥5, and a MESH score ≥8 compared to an Alvarado score ≥7 showed better performance in diagnosing appendicitis. Conclusion MESH (migration, elevated WBC, shift to left, and heel drop test) is a simple clinical scoring system for assessing patients with suspected appendicitis and is more accurate than the Alvarado score. Further validation studies are needed. PMID:27723842

  11. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    ERIC Educational Resources Information Center

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  12. Are Increasing Test Scores in Texas Really a Myth?

    ERIC Educational Resources Information Center

    Toenjes, Lawrence A.; Dworkin, A. Gary

    2002-01-01

    Used the same methodology and data used by W. Haney in his study of achievement gains on the Texas high school exit examination to demonstrate that his conclusion about the increased pass rate was not correct. None of the 20% improvement in the exit test pass rate was explained by combined increases in dropout rates or special education…

  13. Examining the Effect of Proctoring on Online Test Scores

    ERIC Educational Resources Information Center

    Alessio, Helaine M.; Malay, Nancy; Maurer, Karsten; Bailer, A. John; Rubin, Beth

    2017-01-01

    Online education continues to grow, bringing opportunities and challenges for students and instructors. One challenge is the perception that academic integrity associated with online tests is compromised due to undetected cheating that yields artificially higher grades. To address these concerns, proctoring software has been developed to address…

  14. Allometric Scaling of Wingate Anaerobic Power Test Scores in Women

    ERIC Educational Resources Information Center

    Hetzler, Ronald K.; Stickley, Christopher D.; Kimura, Iris F.

    2011-01-01

    In this study, we developed allometric exponents for scaling Wingate anaerobic test (WAnT) power data that are reflective in controlling for body mass (BM) and lean body mass (LBM) and established a normative WAnT data set for college-age women. One hundred women completed a standard WAnT. Allometric exponents and percentile ranks for peak (PP)…

  15. Generation of GHS Scores from TEST and online sources

    EPA Science Inventory

    Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat tox...

  16. Prediction of WAIS Scores from Group Ability Tests

    ERIC Educational Resources Information Center

    Watson, Charles G.; Klett, William G.

    1973-01-01

    In a search for an adequate but efficient substitute, the authors have instituted three evaluations of the relationships between potential WAIS-substitutes and the WAIS itself. The present report describes the first of these researches-- a study of the relationships between the four group ability tests and the WAIS in a mental hospital setting.…

  17. The Effect of Black Peers on Black Test Scores

    ERIC Educational Resources Information Center

    Armor, David J.; Duck, Stephanie

    2007-01-01

    Recent studies have used increasingly complex methodologies to estimate the effect of peer characteristics--race, poverty, and ability--on student achievement. A paper by Hanushek, Kain, and Rivkin using Texas state testing data has received particularly wide attention because it found a large negative effect of school percent black on black math…

  18. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    ERIC Educational Resources Information Center

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  19. Allometric Scaling of Wingate Anaerobic Power Test Scores in Women

    ERIC Educational Resources Information Center

    Hetzler, Ronald K.; Stickley, Christopher D.; Kimura, Iris F.

    2011-01-01

    In this study, we developed allometric exponents for scaling Wingate anaerobic test (WAnT) power data that are reflective in controlling for body mass (BM) and lean body mass (LBM) and established a normative WAnT data set for college-age women. One hundred women completed a standard WAnT. Allometric exponents and percentile ranks for peak (PP)…

  20. MSFC/Ball Space-Act Test Results of SBMD

    NASA Technical Reports Server (NTRS)

    Hadaway, James; Brown, Bob; Eng, Ron; Stahl, Phil

    2002-01-01

    The results of two cryo tests of the SBMD that were funded by Ball Aerospace through a Space-Act Agreement with MSFC will be discussed. These tests followed the formal completion of the SBMD program. The PhaseCam interferometer, rather than the Wavescope Shack-Hartmann sensor, was used during these tests.

  1. Assessing College Readiness: Should We Be Satisfied with Act or Other Threshold Scores?

    ERIC Educational Resources Information Center

    Maruyama, Geoffrey

    2012-01-01

    This article examines the logic underlying different models for assessing the college readiness of high school students. It focuses on benchmark scores that purportedly identify students who are college ready and presents the challenges of using threshold scores from a single assessment instrument to represent readiness. As well as providing…

  2. Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects

    ERIC Educational Resources Information Center

    Lockwood, J. R.; McCaffrey, Daniel F.

    2014-01-01

    A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…

  3. STABILITY OF ACADEMIC APTITUDE AND READING TEST SCORES OF MOBILE AND NON-MOBILE DISADVANTAGED CHILDREN.

    ERIC Educational Resources Information Center

    JUSTMAN, JOSEPH

    CHANGES IN ACADEMIC APTITUDE AND ACHIEVEMENT TEST SCORES OF PUPILS ATTENDING PUBLIC SCHOOLS IN DISADVANTAGED AREAS IN NEW YORK CITY WERE INVESTIGATED. AN ATTEMPT WAS MADE TO DETERMINE WHETHER VARYING DEGREES OF MOBILITY WERE ASSOCIATED WITH VARIATION IN CHANGES IN TEST SCORES. THE CUMULATIVE RECORD CARDS OF SIXTH-GRADE PUPILS WERE EXAMINED TO…

  4. The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

    ERIC Educational Resources Information Center

    Silles, Mary A.

    2010-01-01

    This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…

  5. Sex Differences in Cognitive Abilities Test Scores: A UK National Picture

    ERIC Educational Resources Information Center

    Strand, Steve; Deary, Ian J.; Smith, Pauline

    2006-01-01

    Background and aims: There is uncertainty about the extent or even existence of sex differences in the mean and variability of reasoning test scores ( Jensen, 1998; Lynn, 1994, ; Mackintosh, 1996). This paper analyses the Cognitive Abilities Test (CAT) scores of a large and representative sample of UK pupils to determine the extent of any sex…

  6. Increasing Racial Isolation and Test Score Gaps in Mathematics: A 30-Year Perspective

    ERIC Educational Resources Information Center

    Berends, Mark; Penaloza, Roberto V.

    2010-01-01

    Background/Context: Although there has been progress in closing the test score gaps among student groups over past decades, that progress has stalled. Many researchers have speculated why the test score gaps closed between the early 1970s and the early 1990s, but only a few have been able to empirically study how changes in school factors and…

  7. Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…

  8. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    ERIC Educational Resources Information Center

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  9. Using Raters from India to Score a Large-Scale Speaking Test

    ERIC Educational Resources Information Center

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  10. Kindergarten Black-White Test Score Gaps: Replicating and Updating Previous Findings with New National Data

    ERIC Educational Resources Information Center

    Quinn, David

    2014-01-01

    A substantial body of evidence has shown large academic test score gaps between black and white students in early childhood. These gaps remain, and probably grow, as students progress through school. Many researchers have sought to explain these persistent test score gaps, and particularly, to understand the role of students' socio-economic status…

  11. Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

    ERIC Educational Resources Information Center

    Pellicer-Sanchez, Ana; Schmitt, Norbert

    2012-01-01

    Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

  12. Increasing Racial Isolation and Test Score Gaps in Mathematics: A 30-Year Perspective

    ERIC Educational Resources Information Center

    Berends, Mark; Penaloza, Roberto V.

    2010-01-01

    Background/Context: Although there has been progress in closing the test score gaps among student groups over past decades, that progress has stalled. Many researchers have speculated why the test score gaps closed between the early 1970s and the early 1990s, but only a few have been able to empirically study how changes in school factors and…

  13. Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

    ERIC Educational Resources Information Center

    Sakellariou, Chris

    2008-01-01

    This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…

  14. The Dynamics of the Evolution of the Black-White Test Score Gap

    ERIC Educational Resources Information Center

    Sohn, Kitae

    2012-01-01

    We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…

  15. Linking Scores From Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  16. Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  17. Interpreting Standardized Test Scores: Strategies for Data-Driven Instructional Decision Making

    ERIC Educational Resources Information Center

    Mertler, Craig A.

    2007-01-01

    This book is designed to help K-12 teachers and administrators understand the nature of standardized tests and, in particular, the scores that result from them. This useful manual helps teachers develop the skills necessary to incorporate these test scores into various types of instructional decision making--a process known as "data-driven…

  18. Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

    ERIC Educational Resources Information Center

    King, Molly Elizabeth

    2016-01-01

    The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…

  19. Comparison of Two Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Schachter, Steven; And Others

    1991-01-01

    Examined relative utility of two scoring systems for Modified Version of Bender-Gestalt Test in predicting performance on Developmental Test of Visual-Motor Integration. Findings from 53 kindergarten and 47 first grade students indicated that Qualitative Scoring System was significantly better predictor of visual-motor integration skills than…

  20. Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…

  1. The Effects of Developmental Placement and Early Retention on Children's Later Scores on Standardized Tests.

    ERIC Educational Resources Information Center

    May, Deborah C.; Welch, Edward L.

    1984-01-01

    Examined the relationship between early school retention as a result of preschool and kindergarten developmental testing and children's later academic achievement (N=223). Results showed children who scored as immature on the Gesell Screening Test and who were retained a year had the lowest scores on all measures. (JAC)

  2. The Dynamics of the Evolution of the Black-White Test Score Gap

    ERIC Educational Resources Information Center

    Sohn, Kitae

    2012-01-01

    We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…

  3. Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

    ERIC Educational Resources Information Center

    Sakellariou, Chris

    2008-01-01

    This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…

  4. Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

    ERIC Educational Resources Information Center

    Pellicer-Sanchez, Ana; Schmitt, Norbert

    2012-01-01

    Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

  5. Comparison of Two Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Schachter, Steven; And Others

    1991-01-01

    Examined relative utility of two scoring systems for Modified Version of Bender-Gestalt Test in predicting performance on Developmental Test of Visual-Motor Integration. Findings from 53 kindergarten and 47 first grade students indicated that Qualitative Scoring System was significantly better predictor of visual-motor integration skills than…

  6. Effects of Scoring by Section and Independent Scorers' Patterns on Scorer Reliability in Biology Essay Tests

    ERIC Educational Resources Information Center

    Ebuoh, Casmir N.; Ezeudu, S. A.

    2015-01-01

    The study investigated the effects of scoring by section, use of independent scorers and conventional patterns on scorer reliability in Biology essay tests. It was revealed from literature review that conventional pattern of scoring all items at a time in essay tests had been criticized for not being reliable. The study was true experimental study…

  7. Using Raters from India to Score a Large-Scale Speaking Test

    ERIC Educational Resources Information Center

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  8. Life Stress and Reading Comprehension Test Scores in the Middle School Student.

    ERIC Educational Resources Information Center

    Jones, Maryann Clementi

    A study determined the relationship between life stress and reading comprehension test scores on the IOWA Tests of Basic Skills. Subjects, 41 middle-school students attending Lincoln School in Garwood, New Jersey, were surveyed as to the amount of life stress prevalent in their lives. In addition, the Iowa scores for reading comprehension were…

  9. The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

    ERIC Educational Resources Information Center

    Silles, Mary A.

    2010-01-01

    This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…

  10. Individual Part Score Profiles of Children with Intellectual Disability: A Descriptive Analysis across Three Intelligence Tests

    ERIC Educational Resources Information Center

    Bergeron, Renee; Floyd, Randy G.

    2013-01-01

    This study examined the group- and individual-level part score profiles of children with intellectual disability (ID) who participated in clinical validity studies supporting three individually administered intelligence tests. Across tests, children with ID produced group-level profiles comprising mean part scores that fell in the Low to Very Low…

  11. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    ERIC Educational Resources Information Center

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test test records. (Author)

  12. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    ERIC Educational Resources Information Center

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test test records. (Author)

  13. The non-credible score of the Rey Auditory Verbal Learning Test: is it better at predicting non-credible neuropsychological test performance than the RAVLT recognition score?

    PubMed

    Whitney, Kriscinda A; Davis, Jeremy J

    2015-03-01

    The ability of both the non-credible score of the Rey Auditory Verbal Learning Test (RAVLT NC) and the recognition score of the RAVLT (RAVLT Recog) to predict credible versus non-credible neuropsychological test performance was examined. Credible versus non-credible group membership was determined according to diagnostic criteria with consideration of performance on two stand-alone performance validity tests. Findings from this retrospective data analysis of outpatients seen for neuropsychological testing within a Veterans Affairs Medical Center (N = 175) showed that RAVLT Recog demonstrated better classification accuracy than RAVLT NC in predicting credible versus non-credible neuropsychological test performance. Specifically, an RAVLT Recog cutoff of ≤9 resulted in reasonable sensitivity (48%) and acceptable specificity (91%) in predicting non-credible neuropsychological test performance. Implications for clinical practice are discussed. Note: The views contained here within are those of the authors and not representative of the institutions with which they are associated.

  14. Motor impairment influences Farnsworth-Munsell 100 Hue test error scores in Parkinson's disease patients.

    PubMed

    Müller, Thomas; Meisel, Margareta; Russ, Herrmann; Przuntek, Horst

    2003-09-15

    Farnsworth-Munsell 100 Hue test (FMT) error scores and peg insertion abilities significantly differ between Parkinson's disease (PD) patients and controls. Both tasks ask for performance of voluntary movements. The objective of this study was to demonstrate a relation between FMT error scores and peg insertion outcomes. We successively performed both tasks in 28 previously untreated PD patients. The FMT error score was significantly (p=0.016) lower in patients with better peg insertion outcome. A significant (Spearman R=0.47, p=0.012) correlation between peg insertion results and the FMT error scores appeared. Motor impairment influences FMT error scores in PD patients.

  15. Does IQ = IQ? Comparability of Intelligence Test Scores in Typically Developing Children.

    PubMed

    Hagmann-von Arx, Priska; Lemola, Sakari; Grob, Alexander

    2016-08-05

    Numerous intelligence tests are available to psychological diagnosticians to assess children's intelligence, but whether they yield comparable test results has been little studied. We examined test scores of 206 typically developing children aged 6 to 11 years on five German intelligence tests (Reynolds Intellectual Assessment Scales; Snijders Oomen Nonverbal Intelligence Test; Intelligence and Development Scales; Wechsler Intelligence Scale for Children, 4th edition; Culture Fair Intelligence Test Scale 2), which were individually administered. On a sample level, the test scores showed strong correlation and little or no mean difference. These results indicate that the tests measure a similar underlying construct, which is interpreted as general intelligence. On an individual level, however, test scores significantly differed across tests for 12% to 38% of the children. Differences did not depend on which test was used but rather on unexplained error. Implications for the application of intelligence assessment in psychological practice are discussed. © The Author(s) 2016.

  16. The Relationship between Scores on the Graduate Management Admission Test and the Test of English as a Foreign Language.

    ERIC Educational Resources Information Center

    Powers, Donald E.

    In addition to scores on the Graduate Management Admission Test (GMAT), which are required of applicants to a substantial number of graduate management schools, foreign candidates may also be required to submit scores on the Test of English as a Foreign Language (TOEFL) as an indication of English language proficiency. The present study provides…

  17. A weighted generalized score statistic for comparison of predictive values of diagnostic tests

    PubMed Central

    Kosinski, Andrzej S.

    2013-01-01

    Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343

  18. Characteristics of patients with increasing COPD assessment test scores within 3 years.

    PubMed

    Irie, Hidehiro; Chubachi, Shotaro; Sato, Minako; Tsutsumi, Akihiro; Nakachi, Ichiro; Miyao, Naoki; Nishio, Kazumi; Nakamura, Hidetoshi; Asano, Koichiro; Betsuyaku, Tomoko

    2017-10-01

    The chronic obstructive pulmonary disease (COPD) Assessment Test (CAT) is a subjective measure of quality of life. The aim of this study was to examine the characteristics of COPD patients with increasing CAT scores within 3 years. Keio University and its affiliate hospitals conducted an observational COPD cohort study over 3 years. St. George's Respiratory Questionnaire (SGRQ) and CAT were completed at baseline and annually thereafter. Patients who had at least 3 CAT scores were included (n = 315). The ΔCAT score/year and ΔSGRQ score/year were calculated by the slope between each of the measures and the date of measurement. The median ΔCAT score/year was 0.4, and ΔCAT score/year was significantly correlated with ΔSGRQ total score/year. Using an annual cut-off CAT score of +2 points, patients who deteriorated (n = 79) were older, had lower %FEV1, and more severe emphysema on computed tomography scan at baseline than patients who did not deteriorate. The baseline value was not a determinant of subsequent changes in the CAT score. Longitudinal changes in the CAT score were positively correlated with those in the SGRQ score. Old age and severe COPD, not the CAT score at one time point, predicted worsening quality of life. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

    ERIC Educational Resources Information Center

    Farnsworth, Tim

    2013-01-01

    This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…

  20. Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

    ERIC Educational Resources Information Center

    Farnsworth, Tim

    2013-01-01

    This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…

  1. Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

    ERIC Educational Resources Information Center

    Livingston, Samuel A.; Chen, Haiwen H.

    2015-01-01

    Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…

  2. Correlation Between Students' Dental Admission Test Scores and Performance on a Dental School's Competency Exam.

    PubMed

    Carroll, Alexander M; Schuster, Gregory M

    2015-11-01

    The aim of this study was to investigate whether there was a statistically significant positive correlation between dental students' Dental Admission Test (DAT) scores, particularly on the Perceptual Ability Test (PAT), and their performance on a dental school's competency exam. Scores from the written and clinical competency exam administered in the fall quarter of the fourth year of the curriculum at Midwestern University College of Dental Medicine-Arizona were compared to DAT scores of all 216 members of the graduating classes of 2012 and 2013. It was hypothesized that students who performed highly on one or more sections of the DAT would perform highly on the competency exam. Backward stepwise regression analyses were used to analyze the data. The results showed that the PAT scores were most strongly correlated with the competency exam scores and were a positive predictor for all three clinical sections of the exam (operative dentistry, periodontics, and endodontics). Positive predictors for the written portion of the exam were total DAT score for patient assessment and treatment planning and the DAT reading comprehension score for prosthodontics; there were no predictors for periodontics. The total variance explained by the results ranged from 4% to 15%. While statistically significant relationships were found between the students' PAT scores and clinical performance, DAT scores explained relatively little variance in the competency exam scores. According to these findings, neither the PAT nor any of the DAT components contributed to predicting these students' clinical performance.

  3. 76 FR 16350 - Medical Devices; Ovarian Adnexal Mass Assessment Score Test System; Labeling; Black Box Restrictions

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-03-23

    ...-alone diagnostic test, or as a test to determine whether or not to proceed with surgery. Elsewhere in... assessment score test system is intended for use in those patients for whom surgery is planned, and should not be used to decide whether or not a patient should receive surgery. The test is used in...

  4. Spanish children's scores on Children's Embedded Figures and Rod and Frame Tests.

    PubMed

    Amador-Campos, J A; Kirchner-Nebot, T

    1993-06-01

    The Children's Embedded Figures Test and the Rod and Frame Test were administered to 179 boys and 110 girls of an average age of 9.03 years to measure field dependence-independence. No significant gender-related differences were found on either test. Scores on these tests were moderately and significantly correlated.

  5. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    ERIC Educational Resources Information Center

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This article introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  6. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    ERIC Educational Resources Information Center

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This paper introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  7. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    ERIC Educational Resources Information Center

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This article introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  8. Loanwords and Vocabulary Size Test Scores: A Case of Different Estimates for Different L1 Learners

    ERIC Educational Resources Information Center

    Laufer, Batia; McLean, Stuart

    2016-01-01

    The article investigated how the inclusion of loanwords in vocabulary size tests affected the test scores of two L1 groups of EFL learners: Hebrew and Japanese. New BNC- and COCA-based vocabulary size tests were constructed in three modalities: word form recall, word form recognition, and word meaning recall. Depending on the test modality, the…

  9. Loanwords and Vocabulary Size Test Scores: A Case of Different Estimates for Different L1 Learners

    ERIC Educational Resources Information Center

    Laufer, Batia; McLean, Stuart

    2016-01-01

    The article investigated how the inclusion of loanwords in vocabulary size tests affected the test scores of two L1 groups of EFL learners: Hebrew and Japanese. New BNC- and COCA-based vocabulary size tests were constructed in three modalities: word form recall, word form recognition, and word meaning recall. Depending on the test modality, the…

  10. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm.

    PubMed

    Beketayev, Kenes; Runco, Mark A

    2016-05-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the "Many Uses Test" of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered.

  11. Computerized scoring and graphing of the Farnsworth-Munsell 100-hue color vision test.

    PubMed

    Lugo, M; Tiedeman, J S

    1986-04-15

    The Farnsworth-Munsell 100-hue test is a sensitive and accurate test of color discrimination. A major disadvantage of the test is the laborious and time-consuming calculation needed to score the results and plot them on a chart for interpretation. We present a computer program, written in Microsoft's BASIC language, that performs the calculation and reports both the individual color cap error scores (from which the graph is plotted) and the total error score. If used with an IBM personal computer (or compatible) capable of graphics, the program plots a graph in a modified polar coordinate format that can be printed on a dot-matrix printer.

  12. Test Score Stability and Construct Validity of the Adult Manifest Anxiety Scale-College Version Scores among College Students: A Brief Report

    ERIC Educational Resources Information Center

    Lowe, Patricia A.; Papanastasiou, Elena C.; DeRuyck, Kimberly A.; Reynolds, Cecil R.

    2005-01-01

    In this study, the authors investigated the temporal stability and construct validity of the Adult Manifest Anxiety Scale-College Version (AMAS-C; C. R. Reynolds, B. O. Richmond, & P. A. Lowe, 2003b) scores. Results indicated that the AMAS-C scores had adequate to excellent test score stability, and evidence supported the construct validity of the…

  13. The Effect of Mobility on Texas Assessment of Knowledge and Skills Test Scores

    ERIC Educational Resources Information Center

    Alvarez, Ray

    2006-01-01

    This research studies the effects of mobility on the high-stakes test scores of a Title I South Central Texas school district. The study involved 10, 5th-grade elementary feeder school populations graduating to the 6th grade in 3 middle schools. The researcher compared the 1st administration scores of the Texas Assessment of Knowledge and Skills…

  14. Assessing the Relationship among Defining Issues Test Scores and Crystallised and Fluid Intellectual Indices

    ERIC Educational Resources Information Center

    Derryberry, W. Pitt; Jones, Kristy L.; Grieve, Frederick G.; Barger, Brian

    2007-01-01

    Differing findings exist on how Defining Issues Test (DIT) scores relate to intelligence. Further study is needed in order to address aspects of intellect not previously considered and to address how these relationships rival studies that have compared indices of intellect with constructs similar to DIT scores. In the present study, a sample of…

  15. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    ERIC Educational Resources Information Center

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  16. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    ERIC Educational Resources Information Center

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  17. From #2 Pencils to the World Wide Web: A History of Test Scoring

    ERIC Educational Resources Information Center

    Zytowski, Donald G.

    2008-01-01

    The present highly developed status of psychological and educational testing in the United States is in part the result of many efforts over the past 100 years to develop economical and reliable methods of scoring. The present article traces a number of methods, ranging from hand scoring to present-day computer applications, stimulated by the need…

  18. Comparison of the Koppitz and Watkins Scoring Systems for the Bender Gestalt Test.

    ERIC Educational Resources Information Center

    Johnston, Cris W.; Lanak, Brenda

    1985-01-01

    The Bender Gestalt Test was administered to 25 children (7-10 years old) referred for neuropsychological assessment and scored using the Koppitz system and the Watkins system. Although the scores obtained using the two different sets of criteria were highly correlated, the Watkins rules produced generally better performance. (Author/CL)

  19. Language Variation and Score Variation in the Testing of English Language Learners, Native Spanish Speakers

    ERIC Educational Resources Information Center

    Solano-Flores, Guillermo; Li, Min

    2009-01-01

    We investigated language variation and score variation in the testing of English language learners, native Spanish speakers. We gave students the same set of National Assessment of Educational Progress mathematics items in both their first language and their second language. We examined the amount of score variation due to the main and interaction…

  20. Relationship of Sentence Skills Test Scores and Final Course Grades in Marketing 100.

    ERIC Educational Resources Information Center

    Ryan, Nancy

    1996-01-01

    Describes a study examining the relationship between scores on a Sentence Skills component of an English placement test and final course grades in a community college marketing course. Finds a significant positive correlation between scores and final grades, but one not strong enough to be used for predictive purposes. (13 citations) (BCY)

  1. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  2. Using Test Scores from Students with Disabilities in Teacher Effectiveness Indicators

    ERIC Educational Resources Information Center

    Buzick, Heather M.; Jones, Nathan D.

    2015-01-01

    The increased emphasis on using student growth measures in teacher evaluation has raised questions about how to treat test scores from students with disabilities. This study explores the consequences of three common approaches for treating scores from students with disabilities in statistical approaches to estimating teacher effectiveness: (1)…

  3. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  4. Comparison of the Koppitz and Watkins Scoring Systems for the Bender Gestalt Test.

    ERIC Educational Resources Information Center

    Johnston, Cris W.; Lanak, Brenda

    1985-01-01

    The Bender Gestalt Test was administered to 25 children (7-10 years old) referred for neuropsychological assessment and scored using the Koppitz system and the Watkins system. Although the scores obtained using the two different sets of criteria were highly correlated, the Watkins rules produced generally better performance. (Author/CL)

  5. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    ERIC Educational Resources Information Center

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  6. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    ERIC Educational Resources Information Center

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  7. The Score Comparability of Computerized and Paper-and-Pencil Formats for K-3 Reading Tests

    ERIC Educational Resources Information Center

    Pomplun, Mark; Custer, Michael

    2005-01-01

    This study investigated the equivalence of scores from computerized and paper-and-pencil formats of a series of K-3 reading screening tests. Concerns about score equivalence on the computerized formats were warranted because of the use of reading passages, computer unfamiliarity of primary school students, and teacher versus computer…

  8. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    ERIC Educational Resources Information Center

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  9. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    ERIC Educational Resources Information Center

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  10. Using Scholastic Aptitude Test Scores as Indicators of State Educational Performance.

    ERIC Educational Resources Information Center

    Dynarski, Mark; Gleason, Philip

    1993-01-01

    The Scholastic Aptitude Test (SAT) is often used to measure educational performance at national, state, and local levels. Because participation rates differ considerably, such comparisons are invalid. This article proposes a regression model framework for adjusting SAT scores. Results are validated by comparing adjusted SAT scores with state…

  11. A comparison of likelihood ratio tests and Rao's score test for three separable covariance matrix structures.

    PubMed

    Filipiak, Katarzyna; Klein, Daniel; Roy, Anuradha

    2017-01-01

    The problem of testing the separability of a covariance matrix against an unstructured variance-covariance matrix is studied in the context of multivariate repeated measures data using Rao's score test (RST). The RST statistic is developed with the first component of the separable structure as a first-order autoregressive (AR(1)) correlation matrix or an unstructured (UN) covariance matrix under the assumption of multivariate normality. It is shown that the distribution of the RST statistic under the null hypothesis of any separability does not depend on the true values of the mean or the unstructured components of the separable structure. A significant advantage of the RST is that it can be performed for small samples, even smaller than the dimension of the data, where the likelihood ratio test (LRT) cannot be used, and it outperforms the standard LRT in a number of contexts. Monte Carlo simulations are then used to study the comparative behavior of the null distribution of the RST statistic, as well as that of the LRT statistic, in terms of sample size considerations, and for the estimation of the empirical percentiles. Our findings are compared with existing results where the first component of the separable structure is a compound symmetry (CS) correlation matrix. It is also shown by simulations that the empirical null distribution of the RST statistic converges faster than the empirical null distribution of the LRT statistic to the limiting χ(2) distribution. The tests are implemented on a real dataset from medical studies. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Estimating the Relationship between Use of Test-Preparation Methods and Scores on the Graduate Management Admission Test.

    ERIC Educational Resources Information Center

    Leary, Linda F.; Wightman, Lawrence E.

    This study sought to examine the relationship between five methods of test preparation and test performance as measured by Graduate Management Admission Test (GMAT) Verbal, Quantitative and Total scores. Data on method of test preparation were obtained through voluntary examinee response to five questions which appeared on the answer sheets. One…

  13. Estimating the Relationship between Use of Test-Preparation Methods and Scores on the Graduate Management Admission Test.

    ERIC Educational Resources Information Center

    Leary, Linda F.; Wightman, Lawrence E.

    This study sought to examine the relationship between five methods of test preparation and test performance as measured by Graduate Management Admission Test (GMAT) Verbal, Quantitative and Total scores. Data on method of test preparation were obtained through voluntary examinee response to five questions which appeared on the answer sheets. One…

  14. Rocking at 81 and Rolling at 34: ROC Cut-Off Scores for the Negative Acts Questionnaire–Revised in Serbia

    PubMed Central

    Petrović, Ivana B.; Vukelić, Milica; Čizmić, Svetlana

    2017-01-01

    Researchers are still searching for the ways to identify different categories of employees according to their exposure to negative acts and psychological experience of workplace bullying. We followed Notelaers and Einarsen’s application of the ROC analysis to determine the NAQ-R cut-off scores applying a “lower” and “higher” threshold. The main goal of this research was to develop and test different gold standards of personal and organizational relevance in determining the NAQ-R cut-off scores in a specific cultural and economic context of Serbia. Apart from combining self-labeling as a victim with self-perceived health, the objectives were to test the gold standards developed as a combination of self-labeling with life satisfaction, self-labeling with intention to leave and a complex gold standard based on self-labeling, self-perceived health, life satisfaction and intention to leave taken together. The ROC analysis on Serbian workforce data supports applying of different gold standards. For identifying employees in a preliminary stage of bullying, the most applicable was the gold standard based on self-labeling and intention to leave (score 34 and higher). The most accurate identification of victims could be based on the most complex gold standard (score 81 and higher). This research encourages further investigation of gold standards in different cultures. PMID:28119652

  15. Relationships between spatial activities and scores on the mental rotation test as a function of sex.

    PubMed

    Ginn, Sheryl R; Pickens, Stefanie J

    2005-06-01

    Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.

  16. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm

    PubMed Central

    Beketayev, Kenes; Runco, Mark A.

    2016-01-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the “Many Uses Test” of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered. PMID:27298632

  17. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    ERIC Educational Resources Information Center

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  18. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    ERIC Educational Resources Information Center

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  19. Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score

    PubMed Central

    Hung, Man; Hon, Shirley D.; Cheng, Christine; Franklin, Jeremy D.; Aoki, Stephen K.; Anderson, Mike B.; Kapron, Ashley L.; Peters, Christopher L.; Pelt, Christopher E.

    2014-01-01

    Background: The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. Purpose: To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. Results: All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. Conclusion: The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior

  20. Effect of Two Types of Footwear on Physical Fitness Test Scores.

    DTIC Science & Technology

    A total of 53 U.S. Marines took the Physical Fitness Test (less push-ups and sit-ups) in both Marine Corps issue boots and tennis shoes. Scores were significantly better when tennis shoes were worn. (Author)

  1. Relationship of the Nelson Denny Reading Test to scores on the National Physical Therapy Licensure Examination.

    PubMed

    Aldridge, Roy Lee; Keith, Becky; Sloas, Stacy; Mott-Murphree, Ashley

    2010-01-01

    This study examined the relationship between scaled scores of reading comprehension as measured by the Nelson Denny Reading Test and scaled scores on the National Physical Therapy Licensure Examination (NPTE). Sixty-seven graduate students from Arkansas State University participated in this study. The Nelson Denny Reading Test was given at the initiation of the physical therapy program, and the NPTE was taken after successful completion of the graduate program in physical therapy. A correlation analysis examined the scaled scores of these graduates from the Reading Test and the NPTE. Analysis revealed that a moderate positive relationship and statistically significant difference existed between scaled scores on the Reading Test and the NPTE for those graduates who passed the NPTE their first time.

  2. Accuracy of Test-Score-Difference Decisions and the Jagged-ProfileEffect

    ERIC Educational Resources Information Center

    Clawar, Harry J.; Hopkins, Thomas F.

    1975-01-01

    The present paper emphasizes the importance of making interpretations regarding differences among an individual's scores on a test battery based on this same concern for the appropriateness of the reference group. (Author)

  3. Accuracy of Test-Score-Difference Decisions and the Jagged-ProfileEffect

    ERIC Educational Resources Information Center

    Clawar, Harry J.; Hopkins, Thomas F.

    1975-01-01

    The present paper emphasizes the importance of making interpretations regarding differences among an individual's scores on a test battery based on this same concern for the appropriateness of the reference group. (Author)

  4. NBME Obstetrics and Gynecology clerkship final examination scores: predictive value of standardized tests and demographic factors.

    PubMed

    Ogunyemi, Dotun; De Taylor-Harris, Shawn

    2004-12-01

    To assess the predictive value of standardized tests and demographic factors on performance on the National Board of Medical Examiners (NBME) Obstetrics and Gynecology (OBGYN) examination. The study included 171 students who rotated through obstetrics and gynecology from 1992 to 2001. Correlations between NBME OBGYN scores and United States Medical Licensing Examination (USMLE) step 1, MCAT and GPA scores, and temporal and demographic factors were analyzed. The mean Medical College Admission Test (MCAT), NBME and USMLE step 1 scores were 24.03, 67.47 and 194.53, respectively. Significant correlations with NMBE OBGYN were USMLE step 1 scores (r = .517, p < 0.001), MCAT scores (r = .481, p < 0.001), faculty evaluation grade (r=.223, p <0.01), OBGYN residency interest (r= .179, p < 0.05) and female gender (r=.157, p< 0.05) with a significant negative correlation with clerkship duration (r = -.208, p < 0. 01). On logistic regression analysis, USMLE, MCAT, academic year and faculty evaluation grade were independently predictive of NBME OBGYN scores. All preclinical standardized tests showed positive correlation with progressive academic years and decreasing student age. and decreasing student age. USMLE step 1, MCAT scores and faculty evaluation grades were predictive of NMBE OBGYN scores. They may be helpful in assessing third-year medical students in need of special supervision and assistance.

  5. Correcting for prematurity affects developmental test scores in infants born late and moderately preterm.

    PubMed

    Parekh, Shalin A; Boyle, Elaine M; Guy, Alexa; Blaggan, Samarita; Manktelow, Bradley N; Wolke, Dieter; Johnson, Samantha

    2016-03-01

    Corrected age is typically applied when assessing the development of children born <32 weeks of gestation. There is no consensus as to whether corrected age should be applied when assessing children born late/moderately preterm (LMPT; 32-36 weeks of gestation). This study explored the impact of corrected age on developmental test scores in infants born LMPT. 221 LMPT infants were assessed at two years corrected age using the Bayley-III cognitive and language scales, from which cognitive and language composite scores were derived (Normative Mean 100; SD 15). Assessments were then re-scored using chronological age. Bayley-III composite scores <80 were used to define developmental delay. Paired sample t-tests were used to assess the difference in mean test scores derived using corrected versus chronological age, and McNemar's tests to assess the difference in the proportion of infants with developmental delay using corrected versus chronological age. Mean corrected age scores were significantly higher than chronological age scores (cognitive: 2.1 points; 95% CI 1.6, 2.5; language 2.5; 95% CI 2.1, 2.8). Overall, significantly more LMPT infants were classified with developmental delay when chronological (18.3%) versus corrected (15.0%) age was used (p=0.016). Correcting for prematurity results in significantly higher developmental test scores and a significantly lower prevalence of developmental delay in LMPT infants and may affect eligibility for intervention services. Researchers and clinicians should be aware that the use of corrected age may impact on developmental test scores at both an individual and population level among infants born LMPT. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  6. Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

    ERIC Educational Resources Information Center

    Looney, Marilyn A.; Gilbert, Jennie

    2012-01-01

    The purpose of the study was to determine if currently used FITNESSGRAM[R] cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…

  7. Trends in Black-White Test-Score Differentials. Discussion Paper.

    ERIC Educational Resources Information Center

    Hauser, Robert M.; Huang, Min-Hsiung

    Until the 1970s, there were few signs of change in the historic difference of one standard deviation between average ability or achievement test scores of black and white students. From 1970 to the mid-to-late 1980s, there was a substantial convergence of the average achievement test scores of blacks and whites. From the mid-to-late 1980s to 1992,…

  8. A Quantitative Analysis of the Effect of Resistance Training on Strength Test Score Variability

    DTIC Science & Technology

    2009-10-02

    Bemben, D. A., Loftiss, D. D., & Knehans, A. W. (2001). Creatine supplementation during resistance training in college football athletes. Med Sci...Naval Health Research Center A Quantitative Analysis of the Effect of Resistance Training on Strength Test Score Variability Ross R. Vickers, Jr...the Effect of Resistance Training on Strength Test Score Variability Ross R. Vickers Jr. Amanda C. Barnard Linda K. Hervig Naval Health

  9. Evidence-based decision about test scoring rules in clinical anatomy multiple-choice examinations.

    PubMed

    Severo, Milton; Gaio, A Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia

    2015-01-01

    In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an evidence-based decision about test scoring rules. Two hundred and ninety-eight students completed an examination in clinical anatomy which included 40 multiple-choice questions with five response options each. Among these, 245 (82.2%) examinees were assessed according to the number-right scoring method (group A) while 53 (17.8%) were assessed according to the formula scoring method (group B). The prevalence of passing was significantly higher in group A than in group B, after correction of the pass and fail cutoffs for guessing (84.9% vs. 62.3%, P = 0.005), keeping a similar reliability in both groups (0.7 vs. 0.8, P = 0.094). Pearson Correlation coefficients between practical and theoretical examination scores were 0.66 [95%CI = (0.58-0.73)] and 0.72 [95%CI = (0.56-0.83)] for groups A and B, respectively. Based solely on the reliability and validity assessments, the test-maker could therefore use either scoring rules; however, if the test-maker also takes into account the students' ability to deduce answers with partial knowledge, then the number-right score rule is most appropriate. © 2014 American Association of Anatomists.

  10. Reliability of the Koppitz scoring system for the Bender Gestalt Test.

    PubMed

    Hustak, T L; Dinning, W D; Andert, J N

    1976-04-01

    This study investigated the test-retest reliability of the Koppitz scoring system with Bender Gestalt protocols of adult retardates. Results of a sample of 74 adult retardates yielded a correlation of .80 over an interval of 8 to 146 months. A directional measure of change between error scores on the first and second administrations was not significant, which suggests that the test-retest reliability coefficient is an accurate estimate of the Koppitz scoring system for adult retardates. Scorer reliability for three independent scorers ranged from .92 to .95, which suggests comparability to other investigations with different populations.

  11. The effect of the use of Christian-published science textbooks on the ACT Science Reasoning subtest scores of midwest Christian high schools

    NASA Astrophysics Data System (ADS)

    Guthrie, Janice Marie

    Statistics indicate that students in conservative Christian schools earn higher standardized test scores in mathematics, reading, science, and writing compared to their public school counterparts, while many are criticized for using curricular materials deemed inferior in quality and for employing uncertified and ill-trained teachers. This study investigates the effectiveness of Christian-published science textbooks in preparing students for college-level science courses as measured by the Science Reasoning subtest of the ACT college entrance examination. A questionnaire was sent to conservative Christian high schools in five Midwest states which are affiliated with the American Association of Christian Schools, Association of Christian Schools International, and Oral Roberts University Educational Fellowship. Information gathered on the schools, teachers, facilities, and ACT Science Reasoning scores and the resulting descriptive statistics provide a sketch of the typical Midwest Christian high school. Hypothesis testing resulted in acceptance of the Null Hypothesis: There is no difference between the mean ACT Science Reasoning scores of Midwest Christian high schools using Christian-published science textbooks as compared to those using secular-published science textbooks. Multiple regression analysis on the two publishers represented by the sample statistics, A Beka and Bob Jones University Press, showed no significant difference in the effectiveness of one publisher over the other. Analysis of the one open-ended question asking why each school chooses to use the type of publisher it does led to the issues of worldview instruction and academic rigor as the deciding factors for the selection of one type of publisher over the other.

  12. On Reporting IRT Ability Scores When the Test Is Not Unidimensional.

    ERIC Educational Resources Information Center

    Dirir, Mohamed A.; Sinclair, Norma

    The purpose of this study was to examine the effect of test dimensionality on the stability of examinee ability estimates and item response theory (IRT) based score reports. A simulation procedure based on W. F. Stout's Essential Unidimensionality was used to generate test data with one dominant trait for the whole test and three minor traits…

  13. Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

    ERIC Educational Resources Information Center

    La Marca, Paul M.

    2006-01-01

    Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…

  14. Self Adapted Testing as Formative Assessment: Effects of Feedback and Scoring on Engagement and Performance

    ERIC Educational Resources Information Center

    Arieli-Attali, Meirav

    2016-01-01

    This dissertation investigated the feasibility of self-adapted testing (SAT) as a formative assessment tool with the focus on learning. Under two different orientation goals--to excel on a test (performance goal) or to learn from the test (learning goal)--I examined the effect of different scoring rules provided as interactive feedback, on test…

  15. Self Adapted Testing as Formative Assessment: Effects of Feedback and Scoring on Engagement and Performance

    ERIC Educational Resources Information Center

    Arieli-Attali, Meirav

    2016-01-01

    This dissertation investigated the feasibility of self-adapted testing (SAT) as a formative assessment tool with the focus on learning. Under two different orientation goals--to excel on a test (performance goal) or to learn from the test (learning goal)--I examined the effect of different scoring rules provided as interactive feedback, on test…

  16. The 5-Step Way to Raise Test Scores: Using the Data to Drive Your Instruction

    ERIC Educational Resources Information Center

    East, Pam C.

    2005-01-01

    Many teachers look at standardized tests as something to be dreaded. This author and teacher looks at standardized-test scores and sees a tool to bring students learning to new heights. This is a way for teachers to target instruction exactly where it's needed. A way to get students looking forward to end-of-the-year tests (really!) as a way to…

  17. Interpretation and Utilization of Scores on the Air Force Officer Qualifying Test.

    ERIC Educational Resources Information Center

    Miller, Robert E.

    The report summarizes a large body of data relevant to the proper interpretation and use of aptitude scores on the Air Force Officer Qualifying Test (AFOQT). Included are descriptions of the AFOQT testing program and the test itself. Technical data include an extensive sampling of validation studies covering predictors of success in pilot…

  18. Methods for Evaluating the Validity of Test Scores for English Language Learners

    ERIC Educational Resources Information Center

    Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S.

    2008-01-01

    In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…

  19. The 5-Step Way to Raise Test Scores: Using the Data to Drive Your Instruction

    ERIC Educational Resources Information Center

    East, Pam C.

    2005-01-01

    Many teachers look at standardized tests as something to be dreaded. This author and teacher looks at standardized-test scores and sees a tool to bring students learning to new heights. This is a way for teachers to target instruction exactly where it's needed. A way to get students looking forward to end-of-the-year tests (really!) as a way to…

  20. A Guide for Setting the Cut-Scores to Minimize Weighted Classification Errors in Test Batteries

    ERIC Educational Resources Information Center

    Grabovsky, Irina; Wainer, Howard

    2017-01-01

    In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…

  1. The Estimation of True Scores for Tests Not Taken: A Simulation Study.

    ERIC Educational Resources Information Center

    Bekhuis, Tanja C. H. M.

    An Educational Testing Service (ETS) procedure was evaluated, which is based on item response theory and estimates true scores on tests not taken. The reading, vocabulary, and mathematics tests of high school seniors from the National Longitudinal Study (NLS) of 1972 and the High School and Beyond (HSB) seniors of 1980 and 1982 were found to share…

  2. The Relationship between Career Maturity Test Scores and Appropriateness of Career Choices: A Replication.

    ERIC Educational Resources Information Center

    Westbrook, Bert W.; And Others

    1990-01-01

    Attempted to replicate study determining relationship between appropriateness of career choices and career maturity test scores in rural ninth grade students (N=112) using Goal Selection scale of Career Maturity Inventory Competence Test and American College Testing Program Career Planning Program. Found two career maturity measures correlated…

  3. Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

    ERIC Educational Resources Information Center

    La Marca, Paul M.

    2006-01-01

    Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…

  4. Methods for Evaluating the Validity of Test Scores for English Language Learners

    ERIC Educational Resources Information Center

    Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S.

    2008-01-01

    In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…

  5. A Study of the Relationship between Scores and Time on Tests.

    ERIC Educational Resources Information Center

    Kennedy, Rob

    The purpose of this study was to investigate the relationship between the scores students earned on multiple choice tests and the number of minutes students required to complete the tests. The 5 tests were made up of 20 randomly drawn questions from a large pool of questions about research methods. Students were allowed an unlimited amount of time…

  6. Effects of Grade Level and Subject on Student Test Score Predictions.

    ERIC Educational Resources Information Center

    Barnett, Jerrold E.; Hixon, Jon E.

    1997-01-01

    Interviews with elementary students before and after tests in three subjects investigated how grade level and subject affected students' ability to predict test scores. Results found a significant grade-subject area interaction for predictions prior to testing. Posttest predictions differed only slightly from pretest. Prediction accuracy was…

  7. Preliminary test of effects of cognitive ability, experience, and teaching methods on Verbal Analogy Test scores.

    PubMed

    Rosenberg, D; Willson-Quayle, A; Pasnak, R

    2000-06-01

    The methods from which one can choose when preparing for the GRE Verbal Analogies include books, software, audiotapes, and formal classroom instruction. What teaching method will work best for a given individual? To begin the search for an answer, Gray's test of reasoning ability was given to 28 undergraduates who also answered a questionnaire detailing their experience with analogies. They were randomly assigned to teaching conditions ranging from self-directed workbook study to intensive interactive assistance. No teaching method was superior overall, but interactions showed that (1) students who scored worst on the pretest improved the most, (2) those higher in cognitive functioning and experience performed better after intensive interactive assistance, and (3) those lower in both cognitive functioning and experience did significantly better with self-paced workbooks. This preliminary work suggests that it may be profitable to assess the prior experience and reasoning of potential students and adopt the methods for teaching formal operational thought found empirically to be most suitable.

  8. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    ERIC Educational Resources Information Center

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  9. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    ERIC Educational Resources Information Center

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  10. Predicting Long-Term College Success through Degree Completion Using ACT[R] Composite Score, ACT Benchmarks, and High School Grade Point Average. ACT Research Report Series, 2012 (5)

    ERIC Educational Resources Information Center

    Radunzel, Justine; Noble, Julie

    2012-01-01

    This study compared the effectiveness of ACT[R] Composite score and high school grade point average (HSGPA) for predicting long-term college success. Outcomes included annual progress towards a degree (based on cumulative credit-bearing hours earned), degree completion, and cumulative grade point average (GPA) at 150% of normal time to degree…

  11. A noninvasive test of sphincter of Oddi dysfunction in postcholecystectomy patients: the scintigraphic score.

    PubMed

    Sostre, S; Kalloo, A N; Spiegler, E J; Camargo, E E; Wagner, H N

    1992-06-01

    The ideal noninvasive test of sphincter of Oddi dysfunction (SOD) does not exist and the diagnosis of patients with postcholecystectomy pain often relies on invasive procedures. In this paper we describe a scintigraphic test for SOD: the scintigraphic score. This score combines quantitative and visual criteria for interpretation of hepatobiliary scans. Twenty-six consecutive postcholecystectomy patients underwent hepatobiliary imaging, ERCP, and sphincter manometry. Twelve patients had SOD and 14 had normal sphincters determined by clinical findings, ERCP, and manometric studies. All patients with normal sphincter had scores of 0-4, while patients with SOD had values of 5-12 for a perfect sensitivity and specificity of 100%. Hepatobiliary scans scored in this fashion may become the noninvasive test of choice to screen postcholecystectomy patients with suspected SOD.

  12. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    ERIC Educational Resources Information Center

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  13. Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

    ERIC Educational Resources Information Center

    Kim, Sooyeon; Moses, Tim

    2013-01-01

    The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

  14. Test Score Stability and the Relationship of Adult Manifest Anxiety Scale-College Version Scores to External Variables among Graduate Students

    ERIC Educational Resources Information Center

    Lowe, Patricia A.; Peyton, Vicki; Reynolds, Cecil R.

    2007-01-01

    A sample of 79 individuals participated in the present study to evaluate the test score stability (8-week test-retest interval) and construct validity of the scores of the Adult Manifest Anxiety Scale-College Version, a new measure used to assess anxiety in college students, for application to graduate-level students. Results of the study…

  15. Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

    ERIC Educational Resources Information Center

    Kim, Sooyeon; Moses, Tim

    2013-01-01

    The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

  16. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    ERIC Educational Resources Information Center

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  17. Improvement in Intelligence Test Scores from 6 to 10 years in Children of Teenage Mothers

    PubMed Central

    Cornelius, Marie D.; Goldschmidt, Lidush; De Genna, Natacha M.; Richardson, Gale A.; Leech, Sharon L.; Day, Richard

    2010-01-01

    Objective This study investigates change in IQ scores among 290 children born to teenage mothers and identifies social, economic, and environmental variables that may be associated with change in intelligence test performance. Methods The children of 290 teenage mothers (72% African American and 28% European American) were assessed with the Stanford-Binet Intelligence Scale-4th Edition (SBIS) at ages 6 and 10. Results The mean composite score at age 6 was 84.8 and was 91.2 at age 10, an improvement of 6.4 points. Significant cross-sectional predictors at both ages 6 and 10 of higher SBIS scores were maternal cognitive ability, school grade, Caucasian ethnicity, and caregiver education. Having more children in the household significantly predicted lower SBIS scores at age 6. Higher satisfaction with maternal social support predicted higher SBIS scores at age 10. Change in IQ scores was not related to maternal socioeconomic status, social support, home environment, ethnicity, or family interactions. Custodial stability was associated with an improvement in IQ scores, while increase in caregiver depression was related to decline in IQ scores. Conclusions Our findings suggest that improvement in IQ scores of offspring of teenage mothers may be related to stability of maternal custody. More research is needed to determine the impact of the maturation of adolescent mothers' parenting and the role of early education on improvement in cognitive abilities. PMID:20495472

  18. Does breastfeeding contribute to the racial gap in reading and math test scores?

    PubMed

    Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

    2013-10-01

    The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. Copyright © 2013 Elsevier Inc. All rights reserved.

  19. How Does Emergency Department Crowding Affect Medical Student Test Scores and Clerkship Evaluations?

    PubMed

    Wei, Grant; Arya, Rajiv; Ritz, Z Trevor; He, Albert S; Ohman-Strickland, Pamela A; McCoy, Jonathan V

    2015-11-01

    The effect of emergency department (ED) crowding has been recognized as a concern for more than 20 years; its effect on productivity, medical errors, and patient satisfaction has been studied extensively. Little research has reviewed the effect of ED crowding on medical education. Prior studies that have considered this effect have shown no correlation between ED crowding and resident perception of quality of medical education. To determine whether ED crowding, as measured by the National ED Overcrowding Scale (NEDOCS) score, has a quantifiable effect on medical student objective and subjective experiences during emergency medicine (EM) clerkship rotations. We collected end-of-rotation examinations and medical student evaluations for 21 EM rotation blocks between July 2010 and May 2012, with a total of 211 students. NEDOCS scores were calculated for each corresponding period. Weighted regression analyses examined the correlation between components of the medical student evaluation, student test scores, and the NEDOCS score for each period. When all 21 rotations are included in the analysis, NEDOCS scores showed a negative correlation with medical student tests scores (regression coefficient= -0.16, p=0.04) and three elements of the rotation evaluation (attending teaching, communication, and systems-based practice; p<0.05). We excluded an outlying NEDOCS score from the analysis and obtained similar results. When the data were controlled for effect of month of the year, only student test score remained significantly correlated with NEDOCS score (p=0.011). No part of the medical student rotation evaluation attained significant correlation with the NEDOCS score (p≥0.34 in all cases). ED overcrowding does demonstrate a small but negative association with medical student performance on end-of-rotation examinations. Additional studies are recommended to further evaluate this effect.

  20. Intermediate acting non-depolarizing neuromuscular blocking agents and risk of postoperative respiratory complications: prospective propensity score matched cohort study

    PubMed Central

    Grosse-Sundrup, Martina; Henneman, Justin P; Sandberg, Warren S; Bateman, Brian T; Uribe, Jose Villa; Nguyen, Nicole Thuy; Ehrenfeld, Jesse M; Martinez, Elizabeth A; Kurth, Tobias

    2012-01-01

    Objective To determine whether use of intermediate acting neuromuscular blocking agents during general anesthesia increases the incidence of postoperative respiratory complications. Design Prospective, propensity score matched cohort study. Setting General teaching hospital in Boston, Massachusetts, United States, 2006-10. Participants 18 579 surgical patients who received intermediate acting neuromuscular blocking agents during surgery were matched by propensity score to 18 579 reference patients who did not receive such agents. Main outcome measures The main outcome measures were oxygen desaturation after extubation (hemoglobin oxygen saturation <90% with a decrease in oxygen saturation after extubation of >3%) and reintubations requiring unplanned admission to an intensive care unit within seven days of surgery. We also evaluated effects on these outcome variables of qualitative monitoring of neuromuscular transmission (train-of-four ratio) and reversal of neuromuscular blockade with neostigmine to prevent residual postoperative neuromuscular blockade. Results The use of intermediate acting neuromuscular blocking agents was associated with an increased risk of postoperative desaturation less than 90% after extubation (odds ratio 1.36, 95% confidence interval 1.23 to 1.51) and reintubation requiring unplanned admission to an intensive care unit (1.40, 1.09 to 1.80). Qualitative monitoring of neuromuscular transmission did not decrease this risk and neostigmine reversal increased the risk of postoperative desaturation to values less than 90% (1.32, 1.20 to 1.46) and reintubation (1.76, 1.38 to 2.26). Conclusion The use of intermediate acting neuromuscular blocking agents during anesthesia was associated with an increased risk of clinically meaningful respiratory complications. Our data suggest that the strategies used in our trial to prevent residual postoperative neuromuscular blockade should be revisited. PMID:23077290

  1. Validation of the Brazilian version of the childhood asthma control test (c-ACT).

    PubMed

    Oliveira, Suelen G; Sarria, Edgar E; Roncada, Cristian; Stein, Renato T; Pitrez, Paulo M; Mattiello, Rita

    2016-04-01

    Children's perception of their symptoms has proved reliable and relevant to disease management and should be considered when assessing their asthma control. The aim of the study is to validate the Brazilian Portuguese version of the Childhood Asthma Control Test (c-ACT) in children aged 4-11 years. This is a cross-sectional study in children diagnosed with asthma undergoing treatment in a pediatric pulmonology outpatient clinic in Porto Alegre, Brazil. The translation and linguistic adaptation of the instrument were performed in accordance with international recommendations for questionnaire validation. A total of 105 participants were included, aged 4-11 years. all correlations between the total score and items on the questionnaire were significant and obtained values of r ≥ 0.3, and c-ACT means showed statistically significant differences between the GINA categories (P < 0.01). The controlled asthma group showed significantly higher c-ACT scores than those of uncontrolled asthma group (controlled 22.0 ± 2.9 vs. uncontrolled 16.3 ± 5.3 P < 0.01); and partially controlled asthma group showed significantly higher c-ACT scores than those of uncontrolled asthma group (partially controlled 20.0 ± 4.0 vs. uncontrolled 16.3 ± 5.3 P = 0.03). Correlations between the c-ACT total score and spirometry and nitric oxide were poor (r = 0.020; P = 0.866 and r = 0.035; P = 0.753, respectively). Reliability: the α-C coefficient for the c-ACT total score was 0.677 (95%CI 0.573-0763). Sensitivity to change had an effect size of 0.8 and an intraclass correlation coefficient of 0.598. No floor or ceiling effects were observed. The Brazilian version of the Childhood Asthma Control Test proved to be valid and reliable in children aged 4-11 years. © 2015 Wiley Periodicals, Inc.

  2. The reciprocal internal/external frame of reference model using grades and test scores.

    PubMed

    Möller, Jens; Zimmermann, Friederike; Köller, Olaf

    2014-12-01

    The reciprocal I/E model (RI/EM) combines the internal/external frame of reference model (I/EM) with the reciprocal effects model (REM). The RI/EM extends the I/EM longitudinally and the REM across domains. The model predicts that, within domains, mathematics and verbal achievement (VACH) and academic self-concept have positive effects on subsequent mathematics and VACH and academic self-concept within domains but have negative effects across domains. The main purpose is to validate the RI/EM and extend it using objective achievement indicators and grades. Two waves of data collection from grade 5 to grade 9 with N = 1,045 secondary school students were used. Test scores, grades, and self-concept data were obtained. The main analyses were conducted using hierarchical linear modelling. The positive longitudinal effects of grades and test scores on subsequent grades, test scores and academic self-concept within domains and the negative effects of grades and test scores on subsequent academic self-concept across domains supported the RI/EM. The effects of academic self-concept on subsequent grades and test scores across domains were near zero when prior achievement indicators were controlled for. Overall, the results using school grades as achievement measures were replicated using standardized achievement test scores. The results serve to highlight the importance of the combination of common theories, which are mostly investigated individually, to enhance our understanding of the complexity of within- and across-domain relations between academic self-concepts and achievement using grades as well as test scores. © 2014 The British Psychological Society.

  3. Neighborhood Social Context and Individual Polycyclic Aromatic Hydrocarbon Exposures Associated with Child Cognitive Test Scores

    PubMed Central

    Eldred-Skemp, Nicolia; Quinn, James W.; Chang, Hsin-wen; Rauh, Virginia A.; Rundle, Andrew; Orjuela, Manuela A.; Perera, Frederica P.

    2013-01-01

    Childhood cognitive and test-taking abilities have long-term implications for educational achievement and health, and may be influenced by household environmental exposures and neighborhood contexts. This study evaluates whether age 5 scores on the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R, administered in English) are associated with polycyclic aromatic hydrocarbon (PAH) exposure and neighborhood context variables including poverty, low educational attainment, low English language proficiency, and inadequate plumbing. The Columbia Center for Children’s Environmental Health enrolled African-American and Dominican-American New York City women during pregnancy, and conducted follow-up for subsequent childhood health outcomes including cognitive test scores. Individual outcomes were linked to data characterizing 1-km network buffers around prenatal addresses, home observations, interviews, and prenatal PAH exposure data from personal air monitors. Prenatal PAH exposure above the median predicted 3.5 point lower total WPPSI-R scores and 3.9 point lower verbal scores; the association was similar in magnitude across models with adjustments for neighborhood characteristics. Neighborhood-level low English proficiency was independently associated with 2.3 point lower mean total WPPSI-R score, 1.2 point lower verbal score, and 2.7 point lower performance score per standard deviation. Low neighborhood-level educational attainment was also associated with 2.0 point lower performance scores. In models examining effect modification, neighborhood associations were similar or diminished among the high PAH exposure group, as compared with the low PAH exposure group. Early life exposure to personal PAH exposure or selected neighborhood-level social contexts may predict lower cognitive test scores. However, these results may reflect limited geographic exposure variation and limited generalizability. PMID:24994947

  4. Preservice Elementary and Secondary Science Methods Teachers: Comparison of Formal Reasoning, ACT Science, Process Skill, and Physical Science Misconceptions Scores.

    ERIC Educational Resources Information Center

    Bitner, Betty L.

    The purpose of this causal-comparative study was to compare reasoning level, American College Test (ACT) science, process skills, and physical science misconceptions of preservice elementary and secondary science teachers and to investigate gender differences. The stratified randomly drawn sample (n=68) consisted of preservice elementary and…

  5. 76 FR 38169 - Toxic Substances Control Act Chemical Testing; Receipt of Test Data

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-06-29

    ...This notice announces EPA's receipt of test data on five chemicals listed in the Toxic Substances Control Act (TSCA) section 4 test rule titled ``In Vitro Dermal Absorption Rate Testing of Certain Chemicals of Interest to the Occupational Safety and Health Administration,'' amended by the final rule titled ``Revocation of the TSCA Section 4 Testing Requirements for Certain Chemical Substances.''

  6. Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

    PubMed

    Curtis, David

    2017-09-11

    Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.

  7. The Effects of Testing Circumstance and Education Level on MMPI-2 Correction Scale Scores

    DTIC Science & Technology

    2010-02-01

    Report The Effects of Testing Circumstance and Education Level on MMPI -2 Correction Scale Scores DOT/FAA/AM-10/3 Office of Aerospace Medicine Washington...A-1 1 The effecTs of TesTing circumsTance and educaTion LeveL on mmPi -2 correcTion scaLe scores...could compromise aviation safety. Therefore, tbe FAA has selected tbe MMPI -2 as the new psychological screening test . b. The Vice Presidents for

  8. Examining alternative scoring rubrics on a statewide test: The impact of different scoring methods on science and social studies performance assessments

    NASA Astrophysics Data System (ADS)

    Creighton, Susan Dabney

    There is no consensus regarding the most reliable and valid scoring methods for the assessment of higher order thinking skills. Most of the research on alternative formats has focused on the scoring of writing ability. This study examined the value of different types of performance assessment scoring guides on state mandated science and social studies tests. A proportional stratified sample of raters were randomly assigned to one of four scoring groups: checklist, analytic rubric, holistic rubric, and generic rubrics. A fifth method, the weighted analytic rubric, was included by applying an algorithmic formula to the scores assigned by raters using the analytic rubric. A comparison of the mean scores for the five scoring groups suggests that there may be a difference in the way raters applied the rubric for each group. Although the literature suggests that it is possible to achieve high levels of inter-rater reliability, across forms of scoring, phi coefficients of moderate strength were obtained for three of the four constructed-response items. Results for each scoring group were compared indicating that item complexity may impact the level of inter-rate, reliability and the selection of the most reliable rubric for each discipline. Analytic rubrics appear to achieve more reliable results with less complex items. A multitrait-multimethod approach was utilized to investigate the external validity of the social studies and science tasks. As expected, there tended to be a stronger association between the PACT science constructed-response scores with scores based on science multiple-choice scores than between the science constructed-response scores and the writing ability subtest scores. A similar pattern was seen with social studies items. These results provide some evidence for the validity of the performance assessments. A post study survey completed by raters provided qualitative information regarding their thought processes and their primary focus during the

  9. An electrophysiological correlate of Eating Attitudes Test scores in female college students.

    PubMed

    Wilson, J F; Mercer, J C

    1990-11-01

    Eating Attitudes Test (EAT) scores of forty female college students were compared to their electrodermal activity (EDA) responses when offered a plate of chocolate chip cookies. A significant positive correlation was detected between the EAT scores and the skin conductivity measures associated with the presentation of food. Women with the highest EAT scores also exhibited the greatest sympathetic nervous system responses to a plate of cookies. This finding supports the conclusion that the EAT is capable of identifying individuals who are preoccupied with food or anxious about eating.

  10. A new assessment of the normal ranges of the Farnsworth-Munsell 100-hue test scores.

    PubMed

    Verriest, G; Van Laethem, J; Uvijls, A

    1982-05-01

    We gave the Farnsworth-Munsell 100-hue color vision test to 232 normal subjects between 10 and 80 years of age. One half the subjects underwent binocular testing followed by monocular testing. In the other half monocular testing preceded binocular testing. Performance was better with both eyes than with either eye alone. The worst performance occurred on monocular tests in subjects without previous experience with the task (that is, those for whom this was the first test). The well-known age trend was apparent (children and elderly have the worst color vision). New data are provided for judging the point at which the total error score may be considered pathologic.

  11. Piloting a Polychotomous Partial-Credit Scoring Procedure in a Multiple-Choice Test

    ERIC Educational Resources Information Center

    Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna

    2014-01-01

    Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…

  12. Investigation and Treatment of Missing Item Scores in Test and Questionnaire Data

    ERIC Educational Resources Information Center

    Sijtsma, Klaas; van der Ark, L. Andries

    2003-01-01

    This article first discusses a statistical test for investigating whether or not the pattern of missing scores in a respondent-by-item data matrix is random. Since this is an asymptotic test, we investigate whether it is useful in small but realistic sample sizes. Then, we discuss two known simple imputation methods, person mean (PM) and two-way…

  13. Comparison of Standardized Test Scores from Traditional Classrooms and Those Using Problem-Based Learning

    ERIC Educational Resources Information Center

    Needham, Martha Elaine

    2010-01-01

    This research compares differences between standardized test scores in problem-based learning (PBL) classrooms and a traditional classroom for 6th grade students using a mixed-method, quasi-experimental and qualitative design. The research shows that problem-based learning is as effective as traditional teaching methods on standardized tests. The…

  14. Stochastic Processes as True-Score Models for Highly Speeded Mental Tests.

    ERIC Educational Resources Information Center

    Moore, William E.

    The previous theoretical development of the Poisson process as a strong model for the true-score theory of mental tests is discussed, and additional theoretical properties of the model from the standpoint of individual examinees are developed. The paper introduces the Erlang process as a family of test theory models and shows in the context of…

  15. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    ERIC Educational Resources Information Center

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  16. Beating the Odds: A Low Equalized Assessed Valuation Elementary School with High Standardized Test Scores

    ERIC Educational Resources Information Center

    Levin, Brian

    2011-01-01

    This mixed methods study examines what makes Bluffview Elementary School a success as measured by the ISAT, the mandated state test of Illinois. Despite national reports of achievement gaps and low test scores, Bluffview Elementary has shown sustained success in educating children. This paper reviews how Bluffview Elementary students are achieving…

  17. Comparison of Standardized Test Scores from Traditional Classrooms and Those Using Problem-Based Learning

    ERIC Educational Resources Information Center

    Needham, Martha Elaine

    2010-01-01

    This research compares differences between standardized test scores in problem-based learning (PBL) classrooms and a traditional classroom for 6th grade students using a mixed-method, quasi-experimental and qualitative design. The research shows that problem-based learning is as effective as traditional teaching methods on standardized tests. The…

  18. An Investigation of the Effectiveness of Vocabulary Learning Strategies on Iranian EFL Learners' Vocabulary Test Score

    ERIC Educational Resources Information Center

    Rahimy, Ramin; Shams, Kiana

    2012-01-01

    This study aims to investigate the effectiveness of vocabulary learning strategies on Iranian EFL learners' vocabulary test score. To achieve this aim, fifty Intermediate level students from Kish English Institute were randomly selected from among fifteen classes after administering the Oxford Placement Test (OPT). Then, an intermediate level…

  19. The Relationship of Motivational Values of Math and Reading Teachers to Student Test Score Gains

    ERIC Educational Resources Information Center

    Loewen, David Allen

    2013-01-01

    This exploratory correlational study seeks to answer the question of whether a relationship exists between student average test score gains on state exams and teachers' rating of values on the Schwartz Values Survey. Eighty-seven randomly selected Kansas teachers of math and/or reading, grades four through eight, participated. Student test score…

  20. Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

    ERIC Educational Resources Information Center

    Lee, Guemin; Lee, Won-Chan

    2016-01-01

    The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

  1. Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

    ERIC Educational Resources Information Center

    Livingston, Samuel A.

    2014-01-01

    In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…

  2. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    ERIC Educational Resources Information Center

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  3. Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

    ERIC Educational Resources Information Center

    Green, Susan

    2016-01-01

    This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…

  4. Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

    ERIC Educational Resources Information Center

    Green, Susan

    2016-01-01

    This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…

  5. Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

    ERIC Educational Resources Information Center

    Han, Chao

    2016-01-01

    As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

  6. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    ERIC Educational Resources Information Center

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  7. CSCOPE's Effect on Texas' State Mandated Standardized Test Scores in Mathematics

    ERIC Educational Resources Information Center

    Merritt, Brent Ross

    2011-01-01

    The purpose of the study was to examine standardized test scores of school districts in the state of Texas that have implemented CSCOPE, a popular curriculum management system, in an effort to determine what effect, if any, its implementation has had. The standardized test used in the state of the Texas is titled the Texas Assessment of Knowledge…

  8. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    ERIC Educational Resources Information Center

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  9. Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

    ERIC Educational Resources Information Center

    Han, Chao

    2016-01-01

    As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

  10. Score Reliability of a Test Composed of Passage-Based Testlets: A Generalizability Theory Perspective.

    ERIC Educational Resources Information Center

    Lee, Yong-Won

    The purpose of this study was to investigate the impact of local item dependence (LID) in passage-based testlets on the test score reliability of an English as a Foreign Language (EFL) reading comprehension test from the perspective of generalizability (G) theory. Definitions and causes of LID in passage-based testlets are reviewed within the…

  11. Pragmatism or Gaming the System? One School District's Solution to Low Test Scores

    ERIC Educational Resources Information Center

    McKenzie, Kathryn Bell

    2009-01-01

    In this era of accountability and high stakes testing, district and school administrators are vigilant in their attention to student test scores and the ramifications these have for district and school performance labels. In other words, no school or district wants to be labeled "low performing." This case, based on a real situation, demonstrates…

  12. A Defensible Model for Determining a Minimal Cut-Off Score for Criterion Referenced Tests.

    ERIC Educational Resources Information Center

    Bernknopf, Stan; And Others

    The effectiveness of a model for determining a minimal cut-off score for criterion-referenced tests was examined. The model, based upon techniques presented originally by Nedelsky and by Angoff, was first used in conjunction with a multiple choice test developed for use in certifying school counselors in Georgia. A "knowledge estimation panel" was…

  13. Score Reliability of a Test Composed of Passage-Based Testlets: A Generalizability Theory Perspective.

    ERIC Educational Resources Information Center

    Lee, Yong-Won

    The purpose of this study was to investigate the impact of local item dependence (LID) in passage-based testlets on the test score reliability of an English as a Foreign Language (EFL) reading comprehension test from the perspective of generalizability (G) theory. Definitions and causes of LID in passage-based testlets are reviewed within the…

  14. A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

    PubMed

    Kosinski, Andrzej S

    2013-03-15

    Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.

  15. Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study

    PubMed Central

    Strom, Suzanne L.; Anderson, Craig L.; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C. Eric; Langdorf, Mark I.

    2015-01-01

    Introduction Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. Objective To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. Methods We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. Results The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6–14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Conclusion Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation. PMID:26594288

  16. Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study.

    PubMed

    Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I

    2015-11-01

    Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.

  17. Clinical experience of scoring criteria for Familial Hypercholesterolaemia (FH) genetic testing in Wales.

    PubMed

    Haralambos, K; Whatley, S D; Edwards, R; Gingell, R; Townsend, D; Ashfield-Watt, P; Lansberg, P; Datta, D B N; McDowell, I F W

    2015-05-01

    Familial Hypercholesterolaemia (FH) is caused by mutations in genes of the Low Density Lipoprotein (LDL) receptor pathway. A definitive diagnosis of FH can be made by the demonstration of a pathogenic mutation. The Wales FH service has developed scoring criteria to guide selection of patients for DNA testing, for those referred to clinics with hypercholesterolaemia. The criteria are based on a modification of the Dutch Lipid Clinic scoring criteria and utilise a combination of lipid values, physical signs, personal and family history of premature cardiovascular disease. They are intended to provide clinical guidance and enable resources to be targeted in a cost effective manner. 623 patients who presented to lipid clinics across Wales had DNA testing following application of these criteria. The proportion of patients with a pathogenic mutation ranged from 4% in those scoring 5 or less up to 85% in those scoring 15 or more. LDL-cholesterol was the strongest discriminatory factor. Scores gained from physical signs, family history, coronary heart disease, and triglycerides also showed a gradient in mutation pick-up rate according to the score. These criteria provide a useful tool to guide selection of patients for DNA testing when applied by health professionals who have clinical experience of FH. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  18. Estimating the percentage of the population with abnormally low scores (or abnormally large score differences) on standardized neuropsychological test batteries: a generic method with applications.

    PubMed

    Crawford, John R; Garthwaite, Paul H; Gault, Catherine B

    2007-07-01

    Information on the rarity or abnormality of an individual's test scores (or test score differences) is fundamental in interpreting the results of a neuropsychological assessment. If a standardized battery of tests is administered, the question arises as to what percentage of the healthy population would be expected to exhibit one or more abnormally low test scores (and, in general, j or more abnormally low scores). Similar issues arise when the concern is with the number of abnormal pairwise differences between an individual's scores on the battery, or when an individual's scores on each component of the battery are compared with the individual's mean score. A generic Monte Carlo simulation method for tackling such problems is described (it requires only that the matrix of correlations between tests be available) and is contrasted with the use of binomial probabilities. The method is then applied to Index scores for the Wechsler Adult Intelligence Scale--Third Edition (WAIS-III; D. Wechsler, 1997) and Wechsler Intelligence Scale for Children--Fourth Edition (WISC-IV; D. Wechsler, 2003). Three computer programs that implement the methods are made available.

  19. A Score Based on Screening Tests to Differentiate Mild Cognitive Impairment from Subjective Memory Complaints

    PubMed Central

    de Gobbi Porto, Fábio Henrique; Spíndola, Lívia; de Oliveira, Maira Okada; Figuerêdo do Vale, Patrícia Helena; Orsini, Marco; Nitrini, Ricardo; Dozzi Brucki, Sonia Maria

    2013-01-01

    It is not easy to differentiate patients with mild cognitive impairment (MCI) from subjective memory complainers (SMC). Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE) and the Brief Cognitive Battery (BCB). We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR), and also a phonemic fluency test of letter P fluency (LPF). A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine area under the curve (AUC), the sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reach statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively). Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29); LDR: 56%, 62% and 0.62 (cut off <3); LPF: 71%, 71% and 0.71 (cut off <14); delayed recall of BCB: 56%, 82% and 0.68 (cut off <9). The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. AUC were 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accurancy. PMID:24147213

  20. High-Stakes Testing and Student Achievement: Problems for the No Child Left Behind Act. Executive Summary

    ERIC Educational Resources Information Center

    Nichols, Sharon L.; Glass, Gene V.; Berliner, David C.

    2005-01-01

    Under the federal No Child Left Behind Act of 2001 (NCLB), standardized test scores are the indicator used to hold schools and school districts accountable for student achievement. Each state is responsible for constructing an accountability system, attaching consequences--or stakes--for student performance. The theory of action implied by this…

  1. Do Standardized Tests Penalize Deep-Thinking, Creative, or Conscientious Students?: Some Personality Correlates of Graduate Record Examinations Test Scores

    ERIC Educational Resources Information Center

    Powers, Donald E.; Kaufman, James C.

    2004-01-01

    The objective of the study reported here was to explore the relationship of Graduate Record Examinations (GRE) General Test scores to selected personality traits--conscientiousness, rationality, ingenuity, quickness, creativity, and depth. A sample of 342 GRE test takers completed short personality inventory scales for each trait. Analyses…

  2. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    ERIC Educational Resources Information Center

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  3. Improvement in national test reading scores at Key Stage 1; grade inflation or better achievement?

    PubMed

    Meadows, Sara; Herrick, David; Feiler, Anthony

    2007-02-01

    The aim of the UK National Literacy Strategy is to raise standards in literacy. Strong evidence for its success has, however, been lacking: most of the available data comes from performance on tests administered in schools or from Office for Standards in Education reports and is vulnerable to suggestions of bias. An opportunistic analysis of data from a population cohort study extending over three school years compares school-based scores at school entry and at age 7-8 with independently administered scores on similar tests. The results show a small but statistically significant rise between 1998 and 1999 and between 1998 and 2000 in scores on both Key Stage 1 Reading Standard Assessment Tasks taken in schools and the reading component of the WORD test taken independently. This is clear evidence for a real rise in reading attainment over this period, which may be attributable to the children's experience of the National Literacy Strategy.

  4. Predictive effects of teachers and schools on test scores, college attendance, and earnings

    PubMed Central

    Chamberlain, Gary E.

    2013-01-01

    I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference. PMID:24101492

  5. Structured didactic teaching sessions improve medical student neurology clerkship test scores: a pilot study.

    PubMed

    Menkes, Daniel L; Reed, Mary

    2008-01-01

    To determine the effectiveness of didactic case-based instruction methodology to improve medical student comprehension of common neurological illnesses and neurological emergencies. Neurology department, academic university. 415 third and fourth year medical students performing a required four week neurology clerkship. Raw test scores on a 1 hour, 50-item clinical vignette based examination and open-ended questions in a post-clerkship feedback session. There was a statistically significant improvement in overall test scores (p<0.001). Didactic teaching sessions have a significant positive impact on neurology student clerkship test score performance and perception of their educational experience. Confirmation of these results across multiple specialties in a multi-center trial is warranted.

  6. Impact of a standardized test package on exit examination scores and NCLEX-RN outcomes.

    PubMed

    Homard, Catherine M

    2013-03-01

    The purpose of this ex post facto correlational study was to compare exit examination scores and NCLEX-RN(®) pass rates of baccalaureate nursing students who differed in level of participation in a standardized test package. Three cohort groups emerged as a standardized test package was introduced: (a) students who did not participate in a standardized test package; (b) students with two semesters of a standardized test package; and (c) students with four semesters of a standardized test package. Benner's novice-to-expert theory framed the study in the belief that students best acquire knowledge and skills through practice and reflection. Students participating in four semesters of a standardized test package demonstrated higher exit examination scores and NCLEX-RN pass rates compared with students who did not participate in this package. This study's results could inform nurse educators about strategies to facilitate nursing student success on exit examinations and the NCLEX-RN. Copyright 2013, SLACK Incorporated.

  7. Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

    PubMed

    Hayes, Taylor R; Petrov, Alexander A; Sederberg, Per B

    2015-01-01

    Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training-now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven's Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future.

  8. Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

    PubMed Central

    Hayes, Taylor R.; Petrov, Alexander A.; Sederberg, Per B.

    2014-01-01

    Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training—now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven’s Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future. PMID:25395695

  9. Online pre-race education improves test scores for volunteers at a marathon.

    PubMed

    Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

    2017-09-01

    This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p < 0.001). While subjects with all levels of volunteer experience demonstrated improvement, only change among first time marathon volunteers was significantly different from the others. Subjects reporting all degree/certification levels demonstrated improvement, but no difference in improvement was found between degree/certification levels. In this follow-up to the previous year's pilot study, online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.

  10. Opportunity to learn: Investigating possible predictors for pre-course Test Of Astronomy STandards TOAST scores

    NASA Astrophysics Data System (ADS)

    Berryhill, Katie J.

    As astronomy education researchers become more interested in experimentally testing innovative teaching strategies to enhance learning in introductory astronomy survey courses ("ASTRO 101"), scholars are placing increased attention toward better understanding factors impacting student gain scores on the widely used Test Of Astronomy STandards (TOAST). Usually used in a pre-test and post-test research design, one might naturally assume that the pre-course differences observed between high- and low-scoring college students might be due in large part to their pre-existing motivation, interest, experience in science, and attitudes about astronomy. To explore this notion, 11 non-science majoring undergraduates taking ASTRO 101 at west coast community colleges were interviewed in the first few weeks of the course to better understand students' pre-existing affect toward learning astronomy with an eye toward predicting student success. In answering this question, we hope to contribute to our understanding of the incoming knowledge of students taking undergraduate introductory astronomy classes, but also gain insight into how faculty can best meet those students' needs and assist them in achieving success. Perhaps surprisingly, there was only weak correlation between students' motivation toward learning astronomy and their pre-test scores. Instead, the most fruitful predictor of TOAST pre-test scores was the quantity of pre-existing, informal, self-directed astronomy learning experiences.

  11. Individual differences in left parietal white matter predict math scores on the Preliminary Scholastic Aptitude Test.

    PubMed

    Matejko, Anna A; Price, Gavin R; Mazzocco, Michèle M M; Ansari, Daniel

    2013-02-01

    Mathematical skills are of critical importance, both academically and in everyday life. Neuroimaging research has primarily focused on the relationship between mathematical skills and functional brain activity. Comparatively few studies have examined which white matter regions support mathematical abilities. The current study uses diffusion tensor imaging (DTI) to test whether individual differences in white matter predict performance on the math subtest of the Preliminary Scholastic Aptitude Test (PSAT). Grades 10 and 11 PSAT scores were obtained from 30 young adults (ages 17-18) with wide-ranging math achievement levels. Tract based spatial statistics was used to examine the correlation between PSAT math scores, fractional anisotropy (FA), radial diffusivity (RD) and axial diffusivity (AD). FA in left parietal white matter was positively correlated with math PSAT scores (specifically in the left superior longitudinal fasciculus, left superior corona radiata, and left corticospinal tract) after controlling for chronological age and same grade PSAT critical reading scores. Furthermore, RD, but not AD, was correlated with PSAT math scores in these white matter microstructures. The negative correlation with RD further suggests that participants with higher PSAT math scores have greater white matter integrity in this region. Individual differences in FA and RD may reflect variability in experience dependent plasticity over the course of learning and development. These results are the first to demonstrate that individual differences in white matter are associated with mathematical abilities on a nationally administered scholastic aptitude measure.

  12. Does weight affect children’s test scores and teacher assessments differently?✩

    PubMed Central

    Zavodny, Madeline

    2013-01-01

    The prevalence of childhood overweight and obesity increased dramatically in the United States during the past three decades. This increase has adverse public health implications, but its implication for children’s academic outcomes is less clear. This paper uses data from five waves of the Early Childhood Longitudinal Study-Kindergarten to examine how children’s weight is related to their scores on standardized tests and to their teachers’ assessments of their academic ability. The results indicate that children’s weight is more negatively related to teacher assessments of their academic performance than to test scores. PMID:24014932

  13. Prognostic value of Pheochromocytoma of the Adrenal Gland Scaled Score (Pass score) tests to separate benign from malignant neoplasms.

    PubMed

    Mlika, Mona; Kourda, Nadia; Zorgati, Mohamed Majdi; Bahri, Sonia; Ben Ammar, Slim; Zermani, Rachida

    2013-03-01

    Differentiating malignant from benign pheochromocytoma has been challenging when based on histologic features. This is due to the definition of malignant pheochromocytoma which are defined by the presence of metastases. A PASS score was developed and according to many authors, a PASS score> =4 identified potentially malignant tumors. To assess the prognostic value of PASS score in differentiating benign pheochromocytomas from malignant ones. The records of 11 patients with tumors diagnosed as "pheochromocytoma" were identified from 1970 to 2010 in the files of the pathology, intern medicine and biochemistry departments of the Charles Nicolle hospital and Pasteur Institute. Receiver operating characteristics (ROC) curve analysis was performed to evaluate the diagnostic performance of PASS. The logistic model was developed using the 11 predictive variables. Its performance was evaluated by calculating the area under the ROC curve and comparing it with that of the PASS. In benign tumors, The PASS score was <4 in 3 cases and >=4 in 6 cases. In malignant tumors, the PASS score was >=4 in both cases. According to the ROC curve analysis, a PASS equal or superior to 4 identifies malignant pheochromocytoma with a sensitivity of 50% and a specificity of 45%. I think that PASS score, despite its low sensitivity, may help to reserve the more aggressive treatment and narrow follow up for potentially malignant tumors. Widespread of this called score with complete clinical data will help to validate these findings and to add other prognostic factors of value that could be a part of this scaled score such as immunohistochemical findings.

  14. The Relationship between Scholastic Aptitude Test Scores and the Academic Success of Industrial Arts/Technology Education Majors.

    ERIC Educational Resources Information Center

    Wescott, Jack W.

    1989-01-01

    A study of 107 graduates of industrial arts teacher education programs found statistically significant relationships between Scholastic Aptitude Test (SAT) scores and grade point average (GPA); between SAT scores and major GPA; especially between combined math and verbal scores and major GPA. It was concluded that SAT scores are important factors…

  15. Use of max and min scores for trend tests for association when the genetic model is unknown.

    PubMed

    Zheng, Gang

    2003-08-30

    In case-control studies, the Cochran-Armitage (CA) trend test is powerful for detection of an association between a risk allele and a marker. To apply this test, a score should be assigned to the genotypes based on the genetic model. When the underlying genetic model is unknown, the trend test statistic is a function of the score. In this paper, simple procedures are given to obtain two scores (max and min), which respectively maximize and minimize the CA trend test statistics for genetic associations. These two scores can be used to examine the effect of the choice of scores on the test of no association. When the CA trend test statistic with the max (or min) score is less (or greater) than a prespecified value, the conclusion is clear: we will accept (or reject) the null hypothesis of no association for any scores used. When this value is less than the CA trend test statistic with the max score but greater than the one with the min score, the decision of whether or not to reject the null hypothesis depends on the choice of scores. In this situation, the CA trend test with a prespecified score cannot be used without careful scientific justification of the choice of scores. The use of max and min scoring schemes is applied to a real data set.

  16. [Validation of a Spanish version of the Childhood Asthma Control Test (Sc-ACT) for use in Spain].

    PubMed

    Pérez-Yarza, E G; Castro-Rodriguez, J A; Villa Asensi, J R; Garde Garde, J; Hidalgo Bermejo, F J

    2015-08-01

    The Childhood Asthma Control Test (c-ACT) is a validated tool for determining pediatric asthma control. However, it is not validated in the Spanish language in Spain. We evaluated the psychometric properties of the Spanish version of the Childhood Asthma Control Test (Sc-ACT) for assessing asthma control in children ages 4 to11. This national, multicentre, prospective study was conducted in Spain with asthmatic children and their caregivers. Patients were assessed at 3 visits (Baseline, 2 Weeks, and 4 Months). Clinical variables included: symptoms, exacerbations, FEV1, asthma classification, PAQLQ and PACQLQ questionnaire scores, and asthma control as perceived by physicians, patients and caregivers. The Sc-ACT feasibility, validity, reliability, and sensitivity to change were assessed. A total of 394 children were included; mean (SD) time to complete the Sc-ACT was 5.3 (4.4) minutes. Sc-ACT score was correlated with asthma control as perceived by physician (-0.52), patient (-0.53), and caregiver (-0.51) and with the PAQLQ (0.56) and PACQLQ (0.55) scores. Sc-ACT was found to be significantly related to intensity and frequency of asthma symptoms. Cronbach alpha coefficient α was 0.81 and intraclass correlation coefficient was ≥0.85 for all of the items. The global effect size of Sc-ACT was 0.55. The cutoff point scores of 21 or higher indicated a good asthma control and their MCID was 4 points. The Spanish version of the c-ACT was found to be a reliable and valid questionnaire for evaluating asthma control in Spanish-speaking children ages 4 to 11 in Spain. Copyright © 2014 Asociación Española de Pediatría. Published by Elsevier España, S.L.U. All rights reserved.

  17. Score test for familial aggregation in probands studies: application to Alzheimer's disease.

    PubMed

    Commenges, D; Jacqmin, H; Letenneur, L; Van Duijn, C M

    1995-06-01

    When studying familial aggregation of a disease, the following two-stage design is often used: first select index subjects (cases and controls); then record data on their relatives. The likelihood corresponding to this design is derived and a score test of homogeneity is proposed for testing the hypothesis of no-aggregation. This test takes into account the selection procedure and allows adjustment to be made for explanatory variables. It appears as the sum of three terms: a pure test of homogeneity, a test of comparison of observed minus expected cases in the two groups, and a term which adjusts for the possible unequal probabilities of disease of the index subjects. Asymptotic efficiency and a simulation study show that the proposed test is superior to either the pure homogeneity test or tests based on the comparison of numbers of affected in the two groups. The test statistic, which has an asymptotically standard normal distribution, is applied to a study of familial aggregation of early-onset Alzheimer's disease for which a highly significant value (9.46) is obtained: this is the highest value among the three tests compared, in agreement with the simulation study. A logistic normal model is fitted to the data, taking account of the selection procedure: it allows to estimate the regression parameters and the variance of the random effect; the likelihood ratio test for familial aggregation seems less powerful than the score test.

  18. Skewness and transformations of Farnsworth-Munsell 100-hue test scores.

    PubMed

    Dain, S J

    1998-11-01

    In the past, suggested transformations of Farnsworth-Munsell 100-Hue Test (FM 100-Hue) test scores distributions have been limited to a square root transformation. In this study, the choice of transformations of total error scores (TES) are considered by identifying a possible source of skewness. Several distributions of FM100-Hue Test TES were assessed for skewness (third moment). The error score (ES) distributions for the 85 individual caps in each of the populations were also analysed for skewness (Figs. 3 and 4). There is no single transformation which will normalise all TES distributions. The single cap ES distributions with low mean ES (such as those achieved normals and, for some regions of the test, by anomalous trichromats and dichromats) are symmetrical because most subjects can organise the cap perfectly (and could do even better given smaller colour differences). The distributions of ESs where the mean ES is in the moderate range (such as those achieved by diabetics) are skewed because some ESs at the lower end of the range represent performance which could also be better than the test allows. ES distributions with a high mean (such as random distributions and some regions of the test by congenital dichromats) are symmetrical being unaffected by the limitations of the test. TES distributions of diabetics are asymmetrical and comprise skewed cap ES distributions. A suggestion for a transformation is made.

  19. Utility of a scoring balloon for a severely calcified lesion: bench test and finite element analysis.

    PubMed

    Kawase, Yoshiaki; Saito, Naritatsu; Watanabe, Shin; Bao, Bingyuan; Yamamoto, Erika; Watanabe, Hiroki; Higami, Hirooki; Matsuo, Hitoshi; Ueno, Katsumi; Kimura, Takeshi

    2014-04-01

    We aimed to investigate the effectiveness of a scoring balloon catheter in expanding a circumferentially calcified lesion compared to a conventional balloon catheter using an in vitro experiment setting and elucidate the underlying mechanisms of this ability using a finite element analysis. True efficacy of the scoring device and the underlying mechanisms for heavily calcified coronary lesions are unclear. We employed a Scoreflex scoring balloon catheter (OrbusNeich, Hong Kong, China). The ability of Scoreflex to dilate a calcified lesion was compared with a conventional balloon catheter using 3 different sized calcium tubes. The thickness of the calcium tubes were 2.0, 2.25, and 2.5 mm. The primary endpoints were the successful induction of cracks in the calcium tubes and the inflation pressures required for inducing cracks. The inflation pressure required for cracking the calcium tubes were consistently lower with Scoreflex (p < 0.05, Student t test). The finite element analysis revealed that the first principal stress applied to the calcified plaque was higher by at least threefold when applying the balloon catheter with scoring elements. A scoring balloon catheter can expand a calcified lesion with lower pressure than that of a conventional balloon. The finite element analysis revealed that the concentration of the stress observed in the outside of the calcified plaque just opposite to the scoring element is the underlying mechanism of the increased ability of Scoreflex to dilate the calcified lesion.

  20. Verbal fluency tests: Developing a new model of administration and scoring for Spanish language.

    PubMed

    Olabarrieta-Landa, Laiene; Torre, Esther Landa; López-Mugartza, Juan Carlos; Bialystok, Ellen; Arango-Lasprilla, Juan Carlos

    2017-09-16

    To conduct a literature review of the administration and scoring criteria used in normative studies of verbal fluency tests (VFT), and to propose a new model for the administration and scoring of phonological, semantic, and action VFT for use in Spanish-speakers. A literature search was performed using four databases Dialnet, ProQuest (PsycINFO, PsycArticles), Science Direct (Elsevier), and PubMed and 47 articles met the following criteria: 1) articles which contained normative data of phonological, semantic, or action VFT, 2) published between 2000 and 2015, 3) published in English or Spanish, 4) used healthy population. Of 2087 citations retrieved, 47 eligible studies were reviewed. The majority of the studies have been conducted in the USA, and with English and Spanish speakers. Only 12 studies provided the instruction, and 23 clearly describe the scoring guidelines. Moreover, among the studies that provided these information important discrepancies were found. Therefore, a new administration and scoring guidelines are presented, which may resolve this problem and be utilized in Spanish speaking countries. This review showed that still there is no consensus regarding the administration and scoring of VFT. A new method of administration and scoring is presented that can be use with Spanish-speakers.

  1. Supplemental Educational Services and Student Test Score Gains: Evidence from a Large, Urban School District

    ERIC Educational Resources Information Center

    Springer, Matthew G.; Pepper, Matthew J.; Ghosh-Dastidar, Bonnie

    2014-01-01

    This study examines the effect of supplemental education services (SES) on student test score gains and whether particular subgroups of students benefit more from NCLB tutoring services. Our sample includes information on students enrolled in third through eighth grades nested in 121 elementary and middle schools over a five-year period comprising…

  2. Raise Test Scores without Selling Your Soul: An Interview with Scott Mandel

    ERIC Educational Resources Information Center

    Curriculum Review, 2006

    2006-01-01

    With his 10th book, Improving Test Scores: A Practical Approach for Teachers and Administrators, Scott Mandel outlines steps educators can take to boost achievement on standardized exams while maintaining the integrity of their day-to-day teaching. Mandel, who holds a Ph.D. in curriculum and instruction from USC, teaches history and English at…

  3. Intelligence Test Scores and Birth Order among Young Norwegian Men (Conscripts) Analyzed within and between Families

    ERIC Educational Resources Information Center

    Bjerkedal, Tor; Kristensen, Petter; Skjeret, Geir A.; Brevik, John I.

    2007-01-01

    The present paper reports the results of a within and between family analysis of the relation between birth order and intelligence. The material comprises more than a quarter of a million test scores for intellectual performance of Norwegian male conscripts recorded during 1984-2004. Conscripts, mostly 18-19 years of age, were born to women for…

  4. An Investigation of Procedures for Computerized Adaptive Testing Using Partial Credit Scoring.

    ERIC Educational Resources Information Center

    Koch, William R.; Dodd, Barbara G.

    1989-01-01

    Various aspects of the computerized adaptive testing (CAT) procedure for partial credit scoring were manipulated, focusing on the effects of the manipulations on operational characteristics of the CAT. The effects of item-pool size, item-pool information, and stepsizes used along the trait continuum were assessed. (TJH)

  5. Defending the Quality of Links between Scores from Different Tests and Exams

    ERIC Educational Resources Information Center

    Cresswell, Mike

    2010-01-01

    Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance."…

  6. Detecting Dissimulation in Personality Test Scores: A Comparison between Person-Fit Indices and Detection Scales.

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Chico, Eliseo

    2001-01-01

    Examined whether a procedure based on item response theory (IRT) for assessing the scalability of response patterns could detect deliberate dissimulation (faking good) on scores from three tests of the Eysenck Personality Questionnaire Revised. Results for 489 and 140 undergraduates show that IRT measures were not powerful enough to detect…

  7. Changes in Student Populations and Average Test Scores of Dutch Primary Schools

    ERIC Educational Resources Information Center

    Luyten, Hans; de Wolf, Inge

    2011-01-01

    This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…

  8. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    ERIC Educational Resources Information Center

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take…

  9. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    ERIC Educational Resources Information Center

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  10. Improving African American Youth's Standardized Test Scores: A Program of Study in Science, Mathematics, and Technology.

    ERIC Educational Resources Information Center

    Repa, J. Theodore; Miller, LaMar P.

    More than half the minority group students who completed the 1988/89 Science and Technology Entry Program (STEP) at Manhattanville College (New York) scored in the 90-95 percentile on tests of critical thinking, suggesting ability to do well in mathematics, science, and other studies. STEP comprises the following goals: (1) enable students to…

  11. Florida Defeats the Skeptics: Test Scores Show Genuine Progress in the Sunshine State

    ERIC Educational Resources Information Center

    Winters, Marcus

    2012-01-01

    Among the 50 states, Florida's gains on the National Assessment of Educational Progress (NAEP) between 1992 and 2011 ranked second only to Maryland's. Florida's progress has been particularly impressive in the early grades. In 1998, Florida scored about one grade level below the national average on the 4th-grade NAEP reading test, but it was…

  12. End of Course Grades and Standardized Test Scores: Are Grades Predictive of Student Achievement?

    ERIC Educational Resources Information Center

    Ricketts, Christine R.

    2010-01-01

    This study examined the extent to which end-of-course grades are predictive of Virginia Standards of Learning test scores in nine high school content areas. It also analyzed the impact of the variables school cluster attended, gender, ethnicity, disability status, Limited English Proficiency status, and socioeconomic status on the relationship…

  13. Validating Score Interpretations and Uses: Messick Lecture, Language Testing Research Colloquium, Cambridge, April 2010

    ERIC Educational Resources Information Center

    Kane, Michael

    2012-01-01

    The argument-based approach to validation involves two steps; specification of the proposed interpretations and uses of the test scores as an interpretive argument, and the evaluation of the plausibility of the proposed interpretive argument. More ambitious interpretations and uses tend to involve an extended network of inferences and assumptions…

  14. The Effect of Stakes on Accountability Test Scores and Pass Rates

    ERIC Educational Resources Information Center

    Steedle, Jeffrey T.; Grochowalski, Joseph

    2017-01-01

    Students may not fully demonstrate their knowledge and skills on accountability tests if there are no stakes attached to individual performance. In that case, assessment results may not accurately reflect student achievement, so the validity of score interpretations and uses suffers. For this study, matched samples of students taking state…

  15. Florida Defeats the Skeptics: Test Scores Show Genuine Progress in the Sunshine State

    ERIC Educational Resources Information Center

    Winters, Marcus

    2012-01-01

    Among the 50 states, Florida's gains on the National Assessment of Educational Progress (NAEP) between 1992 and 2011 ranked second only to Maryland's. Florida's progress has been particularly impressive in the early grades. In 1998, Florida scored about one grade level below the national average on the 4th-grade NAEP reading test, but it was…

  16. Comparing State and District Test Results to National Norms: Interpretations of Scoring "Above the National Average."

    ERIC Educational Resources Information Center

    Linn, Robert L.; And Others

    Norm-referenced test results reported by states and school districts and factors related to those scores were studied through mail and telephone surveys of 35 states and a nationally representative sample of 153 school districts to determine the degree to which "above average" results were being reported. Part of the stimulus for this…

  17. Permanent Income and the Black-White Test Score Gap. NBER Working Paper No. 17610

    ERIC Educational Resources Information Center

    Rothstein, Jesse; Wozny, Nathan

    2011-01-01

    Analysts often examine the black-white test score gap conditional on family income. Typically only a current income measure is available. We argue that the gap conditional on permanent income is of greater interest, and we describe a method for identifying this gap using an auxiliary data set to estimate the relationship between current and…

  18. Identifying Local Dependence with a Score Test Statistic Based on the Bifactor Logistic Model

    ERIC Educational Resources Information Center

    Liu, Yang; Thissen, David

    2012-01-01

    Local dependence (LD) refers to the violation of the local independence assumption of most item response models. Statistics that indicate LD between a pair of items on a test or questionnaire that is being fitted with an item response model can play a useful diagnostic role in applications of item response theory. In this article, a new score test…

  19. The Effect of Maturation and Educational Experience on Air Force Officer Qualifying Test Scores.

    ERIC Educational Resources Information Center

    Gregg, George

    Maturation and education improve Air Force Officer Qualifying Test (AFOQT) scores. Since the AFOQT is given at different educational levels for the several commissioning programs, differences, largely spurious, exist between the programs. To assess differences produced by maturation and education, the AFOQT was given to 415 Air Force Reserve…

  20. The Adaptation of Naval Enlistees Scoring in Mental Group 4 on the Armed Forces Qualification Test.

    ERIC Educational Resources Information Center

    Plag, John A.; And Others

    This report presents findings from a study evaluating differences in the adaptation of "average" and mentally marginal sailors during four years of military service. Sailors with Armed Forces Qualification Test (AFQT) scores of 50 are significantly superior to Category 4 enlistees on military performance measures which stress cognitive…