Science.gov

Sample records for act test scores

  1. The Relationship of Scores on Elizur's Hostility System on the Rorschach to the Acting-Out Score on the Hand Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1978-01-01

    The relationship between Elizur's Hostility Scoring on the Rorschach Test and the Acting-Out Score on the Hand Test was examined. Correlations between the two measures (using several scoring procedures) ranged from .40 to .64. (JKS)

  2. The Valid Use of NAEP Achievement Level Scores to Confirm State Test Results in the No Child Left Behind Act

    ERIC Educational Resources Information Center

    Stoneberg, Bert D.

    2007-01-01

    The No Child Left Behind Act sanctions the use of NAEP scores to confirm state testing results. The U.S. Department of Education, as test developer, is responsible to set forth how NAEP scores are to be interpreted and used. Thus far, the Department has not published a clear set of guidelines for using NAEP achievement level scores to conduct a…

  3. Estimating Reading Skill from ACT Assessment Scores.

    ERIC Educational Resources Information Center

    Noble, Julie

    1986-01-01

    A study correlating Nelson-Denny Reading Test scores with American College Testing Program Assessments (ACT) indicates that reading skill can be predicted accurately from the ACT social studies reading and English usage subtests. (MSE)

  4. The validity of ACT-PEP test scores for predicting academic performance of registered nurses in BSN programs.

    PubMed

    Yang, J C; Noble, J

    1990-01-01

    This study investigated the validity of three American College Testing-Proficiency Examination Program (ACT-PEP) tests (Maternal and Child Nursing, Psychiatric/Mental Health Nursing, Adult Nursing) for predicting the academic performance of registered nurses (RNs) enrolled in bachelor's degree BSN programs nationwide. This study also examined RN students' performance on the ACT-PEP tests by their demographic characteristics: student's age, sex, race, student status (full- or part-time), and employment status (full- or part-time). The total sample for the three tests comprised 2,600 students from eight institutions nationwide. The median correlation coefficients between the three ACT-PEP tests and the semester grade point averages ranged from .36 to .56. Median correlation coefficients increased over time, supporting the stability of ACT-PEP test scores for predicting academic performance over time. The relative importance of selected independent variables for predicting academic performance was also examined; the most important variable for predicting academic performance was typically the ACT-PEP test score. Across the institutions, student demographic characteristics did not contribute significantly to explaining academic performance, over and above ACT-PEP scores. PMID:2254527

  5. Predicting Grades in Specific College Freshman Courses from ACT Test Scores and Self-Reported High School Grades. November 1987.

    ERIC Educational Resources Information Center

    Noble, Julie P.; Sawyer, Richard

    The validity of American College Testing Program (ACT) test scores and self-reported high school grades for predicting grades in specific college freshman courses was studied. Specific course grades are typically used to place students in remedial, standard, or advanced classes. These placement decisions, in turn, have immediate implications for…

  6. The Effects of the Tennessee Diploma Project English/Language Arts Standards on the ACT Reading Test Scores of High School Seniors

    ERIC Educational Resources Information Center

    Medling, Michelle

    2012-01-01

    This research examined the effects of the new English/Language Arts standards, brought about by the Tennessee Diploma Project, on the ACT Reading test scores of high school seniors. The effects on student achievement, perceptions of secondary and postsecondary educators, as well as the effects on the test scores of at-risk students were measured.…

  7. ACT Test

    MedlinePlus

    ... this page helpful? Also known as: ACT; Activated Coagulation Time Formal name: Activated Clotting Time Related tests: ... in the blood called platelets and proteins called coagulation factors are activated in a sequence of steps ...

  8. Improving Test Scores. Research Brief

    ERIC Educational Resources Information Center

    Walker, Karen

    2003-01-01

    What strategies can improve test scores? According to research done by Amrein and Berliner, who studied 18 states with high stakes testing, their conclusion was that students did not necessarily score higher and often remained at the same level prior to the introduction of the high stakes testing. In other research done by Carnoy and Loeb, their…

  9. Development of Predicted Paths for ACT Aspire Score Reports. WP-2014-06

    ERIC Educational Resources Information Center

    Allen, Jeff

    2014-01-01

    This report focuses on the initial development of the predicted score paths for ACT Aspire reporting. The paths provide predicted score ranges for the next two years--as well as predicted ACT score ranges for tests administered at grades 9 and 10. Longitudinal ACT Aspire data for students tested in spring 2013 and spring 2014, as well as…

  10. Estimating Reading Skill from ACT Assessment Scores. Research Report No. 88.

    ERIC Educational Resources Information Center

    Noble, Julie

    A study examined whether ACT subtest scores can be used to predict reading skills, as measured by the Nelson-Denny Reading Test, with a degree of accuracy that would support their use as a screening device for college placement. ACT test scores of 2,431 students were used to predict Form C Nelson-Denny raw scores. ACT test scores from 3,016…

  11. More than Just Test Scores

    ERIC Educational Resources Information Center

    Levin, Henry M.

    2012-01-01

    Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…

  12. Smoothing Methods for Estimating Test Score Distributions.

    ERIC Educational Resources Information Center

    Kolen, Michael J.

    1991-01-01

    Estimation/smoothing methods that are flexible enough to fit a wide variety of test score distributions are reviewed: kernel method, strong true-score model-based method, and method that uses polynomial log-linear models. Applications of these methods include describing/comparing test score distributions, estimating norms, and estimating…

  13. Using Test Score Data to Focus Instruction

    ERIC Educational Resources Information Center

    Trimble, Susan; Gay, Anne; Matthews, Jan

    2005-01-01

    Advances in technology available to access test data coupled with the challenges of No Child Left Behind (NCLB) are pushing schools to grapple with the complexities of test score data. With the current frenzy to raise test scores, there is little attention being paid to teacher development in learning to use data to improve learning. For the past…

  14. Using Educational Test Scores To Evaluate Children.

    ERIC Educational Resources Information Center

    Pandiani, John A.; Simon, Monica M.; Banks, Steven M.

    This paper reports on an ongoing effort of the Vermont Mental Health Performance Indicator Project (PIP) to examine the relevance and utility of standardized test scores for evaluating community mental health programs. This analysis is of test scores from Vermont's first four years of statewide testing. The study is examining anonymous…

  15. Interpreting Standardized Test Scores: Some Fine Points.

    ERIC Educational Resources Information Center

    Hunter, William J.

    1980-01-01

    An essential function of the school guidance worker is the translation of test results into plain language and/or concrete recommendations. To do so requires a thorough understanding of the various test scores publishers provide. (Author)

  16. Paying for Better Test Scores

    ERIC Educational Resources Information Center

    Eisenkopf, Gerald

    2011-01-01

    The paper investigates if the provision of financial incentives has an impact on the performance of students in educational tests. The analysis is based on data from an experiment with high school students who answered multiple-choice items from the Third International Mathematics and Science Study (TIMSS). As in TIMSS, the setup did not…

  17. Deriving Enrollment Management Scores from ACT Data. IR Applications, Volume 16, November 15, 2008

    ERIC Educational Resources Information Center

    Saupe, Joe L.; Curs, Bradley R.

    2008-01-01

    This study is an investigation of the derivation of scores that predict whether or not prospective first-time freshmen will apply or will enroll and whether or not first-time freshman enrollees will graduate using data from the ACT (American College Testing) assessment. Using a regression methodology, four basic scores are derived to be…

  18. The Uses and Misuses of Test Scores: Technical Assistance Perspective.

    ERIC Educational Resources Information Center

    Echternacht, Gary

    The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…

  19. Equating Scores from Adaptive to Linear Tests

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2006-01-01

    Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

  20. Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?

    ERIC Educational Resources Information Center

    Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.

    2011-01-01

    This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…

  1. Concordance between ACT and ITED Scores from Different Populations

    ERIC Educational Resources Information Center

    Yin, Ping; Brennan, Robert L.; Kolen, Michael J.

    2004-01-01

    The use of educational tests in practical situations often requires that tests that are built to different content and/or statistical specifications be statistically linked. When equating methods are applied to tests that differ in content, difficulty, and/or reliability, the resulting scores cannot be used interchangeably. This study examined…

  2. What We Lose in Winning the Test Score Race

    ERIC Educational Resources Information Center

    Jorgenson, Olaf

    2012-01-01

    To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools across California for…

  3. Teacher Greetings Increase College Students' Test Scores

    ERIC Educational Resources Information Center

    Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

    2009-01-01

    The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…

  4. What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

    ERIC Educational Resources Information Center

    McIntosh, James; Munk, Martin D.

    2014-01-01

    Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…

  5. Using ACT Assessment Scores to Set Benchmarks for College Readiness. ACT Research Report Series, 2005-3

    ERIC Educational Resources Information Center

    Allen, Jeff; Sconing, Jim

    2005-01-01

    In this report, we establish benchmarks of readiness for four common first-year college courses: English Composition, College Algebra, Social Science, and Biology. Using grade data from a large sample of colleges, we modeled the probability of success in these courses as a function of ACT test scores. Success was defined as a course grade of B or…

  6. Teacher Use of Achievement Test Score Data

    ERIC Educational Resources Information Center

    Miller, Steven C.

    2012-01-01

    The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which…

  7. Misidentifying Factors Underlying Singapore's High Test Scores

    ERIC Educational Resources Information Center

    Usiskin, Zalman

    2012-01-01

    Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…

  8. Test Scores, Creativity, and Global Competitiveness.

    ERIC Educational Resources Information Center

    Bracey, Gerald W.

    2002-01-01

    Examines correlation between national test scores in mathematics from the Third International Mathematics and Science Study (TIMSS) and the Current Competitiveness Index (CCI). Finds, for example, that while the United States ranks 29th in TIMSS mathematics, it ranks second in competitiveness on the CCI. Korea ranks 3rd in mathematics, but 27th in…

  9. Critical Thinking: More than Test Scores

    ERIC Educational Resources Information Center

    Smith, Vernon G.; Szymanski, Antonia

    2013-01-01

    This article is for practicing or aspiring school administrators. The demand for excellence in public education has lead to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…

  10. The Black-White Test Score Gap.

    ERIC Educational Resources Information Center

    Jencks, Christopher, Ed.; Phillips, Meredith, Ed.

    The 15 chapters of this book address issues related to the continuing test score gap between black and white students. The editors argue against traditional explanations which emphasize differences in economic resources and demographic factors, and they urge that more emphasis be put on psychological and cultural factors. The book suggests studies…

  11. Leveraging Gender Differences to Boost Test Scores

    ERIC Educational Resources Information Center

    Costello, Bill

    2008-01-01

    According to the 2004 National Assessment of Educational Progress, males who have made it through 12 years of school have significantly poorer reading skills than their female peers. In every age group, boys have been scoring lower than girls annually for more than three decades on U.S. Department of Education reading tests. The longer boys are in…

  12. The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

    ERIC Educational Resources Information Center

    Shaw, Tommetta

    2010-01-01

    Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…

  13. A Study of the Predictability of Praxis I Examination Scores from ACT Scores and Teacher Education Program Prerequisite Courses

    ERIC Educational Resources Information Center

    Henderson, Allen R.

    2013-01-01

    This study investigated the relationship between student enrollment in certain college courses and Praxis I scores. Specifically, the study examined the predictive nature of the relationships between students' grades in college algebra, their freshman English course of choice, their ACT scores, and their Praxis I scores. The subjects…

  14. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  15. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    ERIC Educational Resources Information Center

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  16. On Interpreting Test Scores as Social Indicators: Statistical Considerations.

    ERIC Educational Resources Information Center

    Spencer, Bruce D.

    1983-01-01

    Because test scores are ordinal not cordinal attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…

  17. Validating the Interpretations and Uses of Test Scores

    ERIC Educational Resources Information Center

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  18. ACT Test Preparation Course and Its Impact on Students' College- and Career-Readiness

    ERIC Educational Resources Information Center

    Parrott, Timothy Nolan

    2012-01-01

    This study examined the effectiveness of an ACT intervention course developed for high school juniors at Anderson County High School during the 2011-2012 school year. This study compared the ACT composite test scores of the treatment group to the ACT composite test scores of the control group by using their PLAN scores as a baseline, to determine…

  19. From High School to the Future: ACT Preparation--Too Much, Too Late. Why ACT Scores Are Low in Chicago and What It Means for Schools

    ERIC Educational Resources Information Center

    Allensworth, Elaine; Correa, Macarena; Ponisciak, Steve

    2008-01-01

    The majority of Chicago Public Schools (CPS) students are not attaining the ACT scores they are aiming for, which they need to qualify for scholarships and college acceptance. This report looks at the reasons behind students' low performance and what matters for doing well on this test. CPS students are highly motivated to do well on the ACT, and…

  20. Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    1980-01-01

    The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)

  1. Testing Intelligently Includes Double-Checking Wechsler IQ Scores

    ERIC Educational Resources Information Center

    Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

    2011-01-01

    The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…

  2. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    ERIC Educational Resources Information Center

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  3. Does Test Preparation Work? Implications for Score Validity

    ERIC Educational Resources Information Center

    Xie, Qin

    2013-01-01

    This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…

  4. A Review of Scoring Algorithms for Ability and Aptitude Tests.

    ERIC Educational Resources Information Center

    Chevalier, Shirley A.

    In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…

  5. Reliability of Total Test Scores When Considered as Ordinal Measurements

    ERIC Educational Resources Information Center

    Biswas, Ajoy Kumar

    2006-01-01

    This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

  6. The Generalizability of Motivation Filtering in Improving Test Score Validity

    ERIC Educational Resources Information Center

    Wise, Vicki L.; Wise, Steven L.; Bhola, Dennison S.

    2006-01-01

    Accountability for educational quality is a priority at all levels of education. Low-stakes testing is one way to measure the quality of education that students receive and make inferences about what students know and can do. Aggregate test scores from low-stakes testing programs are suspect, however, to the degree that these scores are influenced…

  7. Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the ll0-item Stanford Mental…

  8. The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

    ERIC Educational Resources Information Center

    Longenbecker, Sueann; Wood, Peter H.

    1984-01-01

    Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)

  9. School accountability and the black-white test score gap.

    PubMed

    Gaddis, S Michael; Lauen, Douglas Lee

    2014-03-01

    Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. PMID:24468431

  10. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Washington

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…

  11. Mental Test Performance as a Function of Various Scoring Cutoffs

    ERIC Educational Resources Information Center

    Quereshi, M. Y.; Veeser, William R.

    1970-01-01

    Investigates the influence of various scoring cutoffs on mental test performance as measured by the Michell General Ability Test (MGAT) and develops a rationale for selecting the optimum cutoff based on raw scores, internal consistency, stability, parallel-form reliability and concurrent validity estimates. (MB)

  12. Motivational Effects on Test Scores of Elementary Students.

    ERIC Educational Resources Information Center

    Brown, Steven M.; Walberg, Herbert J.

    1993-01-01

    To examine the effect of motivational manipulated conditions on students' mathematics scores, elementary students received either ordinary standardized test instructions or special instructions (do as well as possible for themselves, parents, and teachers). Those given special instructions scored significantly higher in the test, implying that…

  13. Math/FCS Class Boosts Test Scores

    ERIC Educational Resources Information Center

    Sanden, Jan

    2004-01-01

    Integrating mathematics with family and consumer sciences (FCS) has enabled youth to pass the Minnesota 8th Grade Math Basic Skills test. The test focuses on the eight content areas: (1) problem solving with whole numbers and fractions; (2) problem solving with percentage/ratio; (3) number sense; (4) estimation; 5) measurement; (6) tables and…

  14. Interpreting Nelson-Denny Reading Test Scores.

    ERIC Educational Resources Information Center

    Blai, Boris, Jr.

    Reading test results and their interpretation are stressed because of their importance in student achievement. The Nelson-Denny Reading Test used at Harcum Junior College is a useful measuring instrument for predicting academic achievement, screening students, and diagnosing reading and learning problems. General hints for interpretation of the…

  15. Talent Search Qualifying: Comparisons between Talent Search Students Qualifying via Scores on Standardized Tests and via Parent Nomination

    ERIC Educational Resources Information Center

    Lee, Seon-Young; Olszewski-Kubilius, Paula

    2006-01-01

    This study examined differences between students who qualified for talent search testing via scores on standardized tests and via parent nomination in their performances on the SAT or ACT and some demographic characteristics. Overall, the standardized testing group earned higher scores on the off-level tests than the parent nominated group. Asian…

  16. Fuzzy Math: A Meditation on Test Scoring

    ERIC Educational Resources Information Center

    Jacks, Meredith

    2011-01-01

    As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…

  17. Accountability Is More than a Test Score

    ERIC Educational Resources Information Center

    Turnipseed, Stephan; Darling-Hammond, Linda

    2015-01-01

    The number one quality business leaders look for in employees is creativity and yet the U.S. education system undermines the development of the higher-order skills that promote creativity by its dogged focus on multiple-choice tests. Stephan Turnipseed and Linda DarlingHammond discuss the kind of rich accountability system that will help students…

  18. Bewitched, Bothered, and Bewildering: The Use and Misuse of State SAT and ACT Scores.

    ERIC Educational Resources Information Center

    Powell, Brian; Steelman, Lala Carr

    1996-01-01

    Updates an earlier study by reanalyzing interstate variations in Scholastic Aptitude Test and American College Test scores. Reaffirms the conclusion that state rankings based on such scores change dramatically when adjusted for number of test takers or class rank of test-taking population. Finds that public expenditures are positively related to…

  19. ACT Scores as Predictors of Success: A Correlation Study between ACT Scores and the Lindenwood University Business Capstone Course

    ERIC Educational Resources Information Center

    Lapinska-Schneider, Karolina

    2010-01-01

    Business schools across the nation, to meet the accreditation standards of the Association of Collegiate Business Schools and Programs (ACBSP), must incorporate results of certain outcomes assessments for evaluation of programs efficacy and continuous improvement purposes. The Educational Testing Service Major Field Test in Business Administration…

  20. Missing the Mark: What Test Scores Really Tell Us

    ERIC Educational Resources Information Center

    Tanner, John R.

    2011-01-01

    State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…

  1. Mathematical SAT Test Scores and College Chemistry Grades.

    ERIC Educational Resources Information Center

    Spencer, Harry E.

    1996-01-01

    Explores the comparative performance of various segments of the student sample in general chemistry courses relative to their scores on the mathematical SAT test. Results indicate that mathematical skill measured by the SAT scores is an important factor in determining grades while factors that are not important in determining grades are gender…

  2. A Study of Methods for Estimating Distributions of Test Scores.

    ERIC Educational Resources Information Center

    Cope, Ronald T.; Kolen, Michael J.

    This study compared five density estimation techniques applied to samples from a population of 272,244 examinees' ACT English Usage and Mathematics Usage raw scores. Unsmoothed frequencies, kernel method, negative hypergeometric, four-parameter beta compound binomial, and Cureton-Tukey methods were applied to 500 replications of random samples of…

  3. Test Scores and What They Mean. Sixth Edition.

    ERIC Educational Resources Information Center

    Lyman, Howard B.

    The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…

  4. A Comparison of an Introductory Course to SAT/ACT Scores in Predicting Student Performance

    ERIC Educational Resources Information Center

    Marsh, Crystale M.; Vandehey, Michael A.; Diekhoff, George M.

    2008-01-01

    We assessed students in General Psychology classes and examined their SAT/ACT scores, GPAs, and attempted and earned hours. Exams in General Psychology were superior to the SAT/ACT in predicting GPA, supporting the use of an introductory course as a "gateway" for identifying at-risk students and engaging them in academic services. (Contains 3…

  5. Prediction of true test scores from observed item scores and ancillary data.

    PubMed

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. PMID:25773314

  6. Relationship of Scores and Times of Test Administration via Computer.

    ERIC Educational Resources Information Center

    Gorney, Barbara; Maury, Marcia

    The purpose of this study was to determine if a relationship exists between scores and the times that medical students choose to take a computer-administered test. The results indicate that students who choose to take a test later within a given time period tend to perform less well than students who take the test earlier. Although the magnitude…

  7. Test Takers and the Validity of Score Interpretations

    ERIC Educational Resources Information Center

    Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

    2016-01-01

    This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…

  8. Interpreting Test Scores: More Complicated than You Think

    ERIC Educational Resources Information Center

    Tully, Susannah

    2008-01-01

    As more colleges move to "test optional" admissions policies, the debate over the utility and interpretation of standardized-test scores continues. In this article, the author interviews Daniel Koretz, a professor of education at Harvard University and author of "Measuring Up: What Educational Testing Really Tells Us". Koretz shares his thoughts…

  9. Observed-Score Equating as a Test Assembly Problem.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.; Luecht, Richard M.

    1998-01-01

    Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)

  10. Which Advanced Mathematics Courses Influence ACT Score? A State Level Analysis of the Iowa Class of 2012

    ERIC Educational Resources Information Center

    Grinstead, Mary L.

    2013-01-01

    This study explores the relationship between specific advanced mathematics courses and college readiness (as determined by ACT score). The ACT organization has found a consistent relationship between taking a minimum core number of mathematics courses and higher ACT scores (mathematics and composite) (ACT, Inc., 2012c). However, the extent to…

  11. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Maryland

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Maryland's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased at grades 4 and 8 in both reading and math. Average annual gains were larger on the state test than…

  12. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Pennsylvania

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Pennsylvania's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in…

  13. Methods for Improving Test Scores: The Good, the Bad, and the Ugly

    ERIC Educational Resources Information Center

    Wright, Robert J.

    2009-01-01

    The No Child Left Behind Act (NCLB 2001) has the faculties of every public and charter school scrambling to drive test scores of seven identified groups of children (African-American children, Anglo-White children, children with disabilities, Hispanic children, children of poverty, children with English language limitations, and Native-American…

  14. "No Child" Effect on English-Learners Mulled: Teachers Welcome Attention, Fault Focus on Test Scores

    ERIC Educational Resources Information Center

    Zehr, Mary Ann

    2006-01-01

    Educators who specialize in teaching English-language learners agree that the 4-year-old No Child Left Behind Act has brought unprecedented attention to those students by requiring schools to isolate test-score data for them. They disagree, though, on whether changes in instruction spurred by the law have been positive or negative overall. Such…

  15. The Consequences of Ignorance Can Be More Serious than Low Test Scores.

    ERIC Educational Resources Information Center

    Erb, Tom

    2002-01-01

    Asserts that concern for academic success across the curriculum should extend beyond raising test scores. Contends that in the wake of the terrorist acts of 9/11/01, issues of public security and civil liberties present an opportunity for teachers to teach the Constitution and its amendments in an effort to fight citizen ignorance. (SD)

  16. High Test Scores: The Wrong Road to National Economic Success

    ERIC Educational Resources Information Center

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  17. Effort Analysis: Individual Score Validation of Achievement Test Data

    ERIC Educational Resources Information Center

    Wise, Steven L.

    2015-01-01

    Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

  18. Motivating High School Students to Score Proficient on State Tests

    ERIC Educational Resources Information Center

    Brown, Sarah Lee

    2015-01-01

    The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…

  19. A prognostic scoring system for arm exercise stress testing

    PubMed Central

    Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

    2016-01-01

    Objective Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. Methods In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Results Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs−10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77–0.79 before and 0.82–0.86 after adjustment for significant covariates versus 0.64–0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Conclusions Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise. PMID:26835142

  20. Does weight affect children's test scores and teacher assessments differently?

    PubMed

    Zavodny, Madeline

    2013-06-01

    The prevalence of childhood overweight and obesity increased dramatically in the United States during the past three decades. This increase has adverse public health implications, but its implication for children's academic outcomes is less clear. This paper uses data from five waves of the Early Childhood Longitudinal Study-Kindergarten to examine how children's weight is related to their scores on standardized tests and to their teachers' assessments of their academic ability. The results indicate that children's weight is more negatively related to teacher assessments of their academic performance than to test scores. PMID:24014932

  1. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Massachusetts

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…

  2. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Texas

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Texas' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in reading at grades 4 and 8 and in math at grade 8. In grade 4 math, however, the percentage scoring…

  3. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    ERIC Educational Resources Information Center

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  4. A Bad Idea: National Standards Based on Test Scores

    ERIC Educational Resources Information Center

    Baker, Keith

    2010-01-01

    The justification for national standards is that test scores predict a nation's future economic success. There is no evidence that supports this assumption. There is evidence that it is wrong. For more than half a century, reformers have been trying to fix our schools with little success. The obvious conclusion is that something that can't be…

  5. What Do Test Scores in Texas Tell Us?

    ERIC Educational Resources Information Center

    Klein, Stephen P.; Hamilton, Laura S.; McCaffrey, Daniel F.; Stecher, Brian M.

    2000-01-01

    Compared results on the Texas Assessment of Academic Skills to Texas (TAAS) score changes on the National Assessment of Educational Progress (NAEP). Texas fourth graders did improve significantly more on the NAEP mathematics test than their counterparts nationally, but this gain was smaller than their TAAS gains, and a similar gain was not seen…

  6. Between-District Test Score Variation, 2009-2012

    ERIC Educational Resources Information Center

    Fahle, Erin; Reardon, Sean

    2016-01-01

    Describing the variation in test scores between and within school districts is critical for: (1) for policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) for methodological work planning future experiments or interventions. Intraclass…

  7. Student Laptop Use and Scores on Standardized Tests

    ERIC Educational Resources Information Center

    Kposowa, Augustine J.; Valdez, Amanda D.

    2013-01-01

    Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…

  8. The Correlational Relationship between Homeschooling Demographics and High Test Scores.

    ERIC Educational Resources Information Center

    Burns, Johnna

    Homeschooling, one of the fastest growing educational alternatives, is enjoying increasing respect from educators and parents alike. This is partly because homeschooling children score as well and often better on standardized tests than their publicly schooled counterparts. However, the vast majority of homeschooled students come from the…

  9. Using Student Test Scores to Measure Principal Performance

    ERIC Educational Resources Information Center

    Grissom, Jason A.; Kalogrides, Demetra; Loeb, Susanna

    2015-01-01

    Expansion of the use of student test score data to measure teacher performance has fueled recent policy interest in using those data to measure the effects of school administrators as well. However, little research has considered the capacity of student performance data to uncover principal effects. Filling this gap, this article identifies…

  10. Experiential Awareness of the Effects of Test Score Reports.

    ERIC Educational Resources Information Center

    Bender, Robert C.

    Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…

  11. Benefits of Coaching on Test Scores Seen as Negligible.

    ERIC Educational Resources Information Center

    Report on Education Research, 1983

    1983-01-01

    THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write Rebecca…

  12. Commentary on "Validating the Interpretations and Uses of Test Scores"

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  13. A Latent Class Approach to Estimating Test-Score Reliability

    ERIC Educational Resources Information Center

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  14. Univariate and Bivariate Loglinear Models for Discrete Test Score Distributions.

    ERIC Educational Resources Information Center

    Holland, Paul W.; Thayer, Dorothy T.

    2000-01-01

    Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…

  15. School Choice in Suburbia: Test Scores, Race, and Housing Markets

    ERIC Educational Resources Information Center

    Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane

    2009-01-01

    Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…

  16. Source Country Differences in Test Score Gaps: Evidence from Denmark

    ERIC Educational Resources Information Center

    Rangvid, Beatrice Schindler

    2010-01-01

    We combine data from three studies for Denmark in the PISA 2000 framework to investigate differences in the native-immigrant test score gap by country of origin. In addition to the controls available from PISA data sources, we use student-level data on home background and individual migration histories linked from administrative registers. We find…

  17. The Validity of Reading Comprehension Test Scores: Evidence of Generalizability across Different Test Administration Conditions.

    ERIC Educational Resources Information Center

    Bishop, N. Scott

    This study examined the effects of different test administration conditions on reading comprehension test scores. Evidence of performance differences across district testing conditions might imply that the meanings and interpretations associated with the corresponding test scores have limited generalizability (i.e., knowing how well one reads…

  18. Flow and diffusion of high-stakes test scores

    PubMed Central

    Marder, M.; Bansal, D.

    2009-01-01

    We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades. PMID:19805049

  19. Simplifying multivariate survival analysis using global score test methodology

    NASA Astrophysics Data System (ADS)

    Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz

    2015-12-01

    In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve multiple endpoints, and this situation further complicates the analysis of survival data. In the case of tumor patients, endpoints concerning survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For each patient, these endpoints are correlated, and the estimation of the correlation between two score statistics is fundamental in derivation of overall treatment advantage. In this paper, the bivariate survival analysis method using the global score test methodology is extended to multivariate setting.

  20. Do Teacher Unions Hinder Educational Performance? Lessons Learned from State SAT and ACT Scores.

    ERIC Educational Resources Information Center

    Steelman, Lala Carr; Powell, Brian; Carini, Robert M.

    2000-01-01

    Comparison of standardized test scores and degree of teacher unionization in states found a statistically significant and positive relationship between the presence of teacher unions and stronger state performance on tests. Taking into account the percentage of students taking the tests, states with greater percentages of teachers in unions…

  1. Correlation of the Scores on Barron's Ego Strength Scale with the Scores on the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1979-01-01

    The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)

  2. TOXIC SUBSTANCES CONTROL ACT TEST SUBMISSIONS (TSCATS)

    EPA Science Inventory

    Toxic Substances Control Act Test Submissions (TSCATS) is an online index to unpublished, nonconfidential studies covering chemical testing results and adverse effects of chemicals on health and ecological systems. The studies are submitted by U.S. industry to EPA under several s...

  3. Improving ACT and SAT Scores: Making Progress, Facing Challenges. A Focus Report in the "Challenge to Lead" Series

    ERIC Educational Resources Information Center

    Jackson, Jenny; Lord, Joan

    2007-01-01

    The Southern Regional Education Board's (SREB) "Challenge to Lead" Goals for Education call for more students to take college admission tests, to score higher on those tests and for SREB states to reduce gaps in test scores among all groups of students. For states, achieving higher statewide scores is one sign that more students can go on to…

  4. Digit symbol substitution test score and hyperhomocysteinemia in older adults.

    PubMed

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-08-01

    Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions.Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments.This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = -1.6, 95% confidence interval [CI] = -2.8∼-0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = -1.1, 95% CI = -2.1∼-0.1, P = 0.04). We did not find an association between hyperhomocysteinemia and other

  5. Digit symbol substitution test score and hyperhomocysteinemia in older adults

    PubMed Central

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-01-01

    Abstract Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions. Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments. This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = −1.6, 95% confidence interval [CI] = −2.8∼−0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = −1.1, 95% CI = −2.1∼−0.1, P = 0.04). We did not find an association between

  6. An Investigation into the Relationships Between Cloze Test Scores and Informal Reading Inventory Scores of Fifth Grade Pupils.

    ERIC Educational Resources Information Center

    Walter, Richard Barry

    This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…

  7. Relationship of Reading Fluency Assessment Data with State Accountability Test Scores: A Longitudinal Comparison of Grade Levels

    ERIC Educational Resources Information Center

    Silberglitt, Benjamin; Burns, Matthew K.; Madyun, Na'im H.; Lail, Kathryn E.

    2006-01-01

    The No Child Left Behind Act of 2001 mandated statewide accountability testing and focused the accountability conversation on reading. Therefore, the current study examined the relationship between curriculum-based measurement for reading (R-CBM) and state accountability test scores, potential grade differences in relationship magnitude, and…

  8. Recent Trends in Mean Scores and Characteristics of Test-Takers on "Praxis II" Licensure Tests

    ERIC Educational Resources Information Center

    Gitomer, Drew H.; Qi, Yi

    2010-01-01

    This study concerns the "highly qualified teacher" provisions of the "Elementary and Secondary Education Act" ("ESEA," 2002), as reauthorized, and other policies at the federal, state and local levels, which have aimed to elevate the content knowledge of teachers. This examination of "Praxis II" score trends was not meant to serve as an evaluation…

  9. Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

    ERIC Educational Resources Information Center

    Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

    2010-01-01

    Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…

  10. Teacher Education Students: A Look at Basic Skills Admission Tests and National Teacher Examination Scores.

    ERIC Educational Resources Information Center

    Clawson, Kenneth

    This study examined the relationship between teacher education students' scores on basic skills admission tests and graduating seniors' scores on the National Teacher Examinations (NTE) at Eastern Kentucky University. The 1981-82 basic skills test scores for 262 teacher education students were compared with their NTE scores taken in 1984-85 during…

  11. Money Improves Test Scores--Even State-Level SATs.

    ERIC Educational Resources Information Center

    Bracey, Gerald W.

    1996-01-01

    Three former secretaries of education--William Bennett, Lauro Cavazos, and Terrel Bell--have touted state-level SAT scores as proof that educational financing does not matter. Recently, Brian Powell and Lala Carr Steelman adjusted scores for participation rate and detected a very strong relationship between expenditures and SAT scores. Bigger…

  12. The Graduate Management Admission Test: Technical Report on Test Development and Score Interpretation for GMAT Users.

    ERIC Educational Resources Information Center

    Schrader, William B.

    This report provides information on test development, test administration, and score interpretation for the Graduate Management Admission Test (GMAT). The GMAT, first administered in 1954, provides objective measures of an applicant's abilities for use in admissions decisions by graduate management schools. It is currently composed of five…

  13. Variability of Test Scores and the Split-Half Reliability Coefficient

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.

    1970-01-01

    Results of this study indicate that the correlation between half-test scores over repeated splits, over persons, and over repeated testings resulting in different sets of observed scores, is given by Kuder-Richardson Formula 21. (RF)

  14. A Comparison of Three Methods for Computing Scale Score Conditional Standard Errors of Measurement. ACT Research Report Series, 2013 (7)

    ERIC Educational Resources Information Center

    Woodruff, David; Traynor, Anne; Cui, Zhongmin; Fang, Yu

    2013-01-01

    Professional standards for educational testing recommend that both the overall standard error of measurement and the conditional standard error of measurement (CSEM) be computed on the score scale used to report scores to examinees. Several methods have been developed to compute scale score CSEMs. This paper compares three methods, based on…

  15. State Test Score Trends through 2007-08, Part 3: Are Achievement Gaps Closing and Is Achievement Rising for All?

    ERIC Educational Resources Information Center

    Chudowsky, Naomi; Chudowsky, Victor; Kober, Nancy

    2009-01-01

    A main goal of the No Child Left Behind Act (NCLB) is to close gaps in test scores between different groups of students while raising achievement for all groups. Of particular concern are the persistent achievement gaps between African American, Latino, and Native American students and their white and Asian counterparts, and between students from…

  16. Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

    ERIC Educational Resources Information Center

    Zenisky, April L.; Hambleton, Ronald K.

    2012-01-01

    Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…

  17. The Effect of Schooling and Ability on Achievement Test Scores. NBER Working Paper Series.

    ERIC Educational Resources Information Center

    Hansen, Karsten; Heckman, James J.; Mullen, Kathleen J.

    This study developed two methods for estimating the effect of schooling on achievement test scores that control for the endogeneity of schooling by postulating that both schooling and test scores are generated by a common unobserved latent ability. The methods were applied to data on schooling and test scores. Estimates from the two methods are in…

  18. The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

    ERIC Educational Resources Information Center

    Retnawati, Heri

    2015-01-01

    This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…

  19. The Relationship between Deductive Reasoning Ability, Test Anxiety, and Standardized Test Scores in a Latino Sample

    ERIC Educational Resources Information Center

    Rich, John D., Jr.; Fullard, William; Overton, Willis

    2011-01-01

    One Hundred and Twelve Latino students from Philadelphia participated in this study, which examined the development of deductive reasoning across adolescence, and the relation of reasoning to test anxiety and standardized test scores. As predicted, 11th and ninth graders demonstrated significantly more advanced reasoning than seventh graders.…

  20. Test Scores Count! A Handbook for Teaching Test-Taking Skills.

    ERIC Educational Resources Information Center

    Koenigs, Sharon

    This handbook provides guidelines for teaching test-taking skills to students of all grade levels to help the students raise their standardized test scores. Topics covered include: understanding instructions and following directions, efficient use of time, intelligent guessing, and application of special strategies for multiple-choice and…

  1. The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

    ERIC Educational Resources Information Center

    Molenaar, Dylan; Borsboom, Denny

    2013-01-01

    Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…

  2. Evidence-Based Decision about Test Scoring Rules in Clinical Anatomy Multiple-Choice Examinations

    ERIC Educational Resources Information Center

    Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia

    2015-01-01

    In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…

  3. A Diet Score Assessing Norwegian Adolescents' Adherence to Dietary Recommendations-Development and Test-Retest Reproducibility of the Score.

    PubMed

    Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

    2016-01-01

    Assessment of adolescents' dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents' adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen's Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14-15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312

  4. A Diet Score Assessing Norwegian Adolescents’ Adherence to Dietary Recommendations—Development and Test-Retest Reproducibility of the Score

    PubMed Central

    Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

    2016-01-01

    Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312

  5. Test/score/report: Simulation techniques for automating the test process

    NASA Technical Reports Server (NTRS)

    Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

    1994-01-01

    A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary

  6. Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

    ERIC Educational Resources Information Center

    Matton, Nadine; Vautier, Stephane; Raufaste, Eric

    2009-01-01

    Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…

  7. Relationships between Gender and Alberta Achievement Test Scores during a Four-Year Period

    ERIC Educational Resources Information Center

    Pope, Gregory A.; Wentzel, Carolyn; Braden, Brigitta; Anderson, Jordan

    2006-01-01

    The purpose of this study was to investigate statistical relationships between gender and Alberta Achievement Testing Program scores. Achievement test scores from grades 3, 6, and 9 in all subject areas were investigated during a four-year period. Results showed statistically significant positive correlations between gender and scores in most…

  8. The Persisting Racial Scoring Gap on Graduate and Professional School Admission Tests.

    ERIC Educational Resources Information Center

    Journal of Blacks in Higher Education, 2003

    2003-01-01

    Discusses the racial scoring gap on tests for admission to medical, business, law, and other graduate programs, noting that in the highest-scoring brackets on the Medical College Admission Test (MCAT), the racial gap is even larger. Whites are five times, twelve times, and seven times more likely, respectively, to score higher on the MCAT, Law…

  9. Construct Validity and Test Re-Test Reliability of the Forgotten Joint Score.

    PubMed

    Thompson, Simon M; Salmon, Lucy J; Webb, Justin M; Pinczewski, Leo A; Roe, Justin P

    2015-11-01

    Consecutive patients undergoing knee arthroplasty completed questionnaires: FJS, Knee Injury and Osteoarthritis Outcome Score (KOOS) and WOMAC Score (mean 39 months after surgery), and were mailed a repeat questionnaire after 4 to 6 weeks. The test-retest reliability was almost perfect for the FJS (ICC = 0.97), and the FJS subdomains (ICC > 0.8). Convergent construct validity of the FJS was correlated with the KOOS Subscores of Quality of Life (0.63, P = 0.001), Symptom (0.33, P = 0.001), Pain (0.68, P = 0.001) and ADL (0.66, P = 0.001) and the Total WOMAC (0.70, P = 0.001). The FJS demonstrates high test-retest reliability and construct validity compared to the Normalised WOMAC and KOOS Subscales. The FJS does not demonstrate the ceiling effect of the WOMAC or KOOS pain scores so may have greater discriminatory ability following TKR. PMID:26027525

  10. Evaluation of ACT/SAT scores as predictors of student success in general chemistry

    NASA Astrophysics Data System (ADS)

    Scott, Kerri Dalita

    Prediction of student success in a single course such as general chemistry has traditionally been the purview of the department which owns the course such as the department of chemistry. In the analysis of data such as ACT or SAT scores, chemists have traditionally used the statistic techniques common to their research pursuits. For many chemists, this is typically been ordinary linear regression where letter grades that students receive are treated as the dependent variable while factors of interest were explored as potential predictive variables. Evaluation of the underlying mathematical basis of linear regression reveals that its use in success predication violates one of more of the assumptions that must be made about the type and quality of data. The most commonly violated assumption is homoscedacity (or constant variance of the errors). Additionally, the ultimate goal of success prediction is to determine if a student will either pass or fail a course based on the predictor variable---that is---only 2 possible outcomes. With the increasing availability of personal computing, other means of data analysis are now commonly available via statistical software packages such as SPSS. One alternative technique that continues to grow in the interest of chemists' evaluation of student success is logistic regression. Logistic regression is a general linearized model in which the natural logarithm of the odds is linearly related to the predictor variable. This study investigates the use of the ACT and SAT to predict student success in general chemistry at The University of Mississippi, CHEM 105, over a four year period. Additionally this study seeks to determine a typical student profile in order to determine if the widely accepted notions of the typical CHEM 105 student are reasonably accurate.

  11. Mathematical SAT Test Scores and College Chemistry Grades

    NASA Astrophysics Data System (ADS)

    Spencer, Harry E.

    1996-12-01

    The relationships between mathematical SAT scores (SAT-M) and grades earned by students in eight consecutive years of first- and second-semester general chemistry courses at Oberlin College are reported. The academic years surveyed are 1987-1988 through 1994-1995. SAT-M scores are grouped within seven ranges from 450 and less to 710-800. Within any range of scores, students in both courses earned a wide variety of grades, but those within the higher ranges tended to earn higher grades and fewer failures relative to students in the lower ranges. For all students within each range of SAT-M scores, the fraction earning each grade are calculated. These fractions along with the numbers of students and their SAT-M scores in a subset are used to calculate grades expected for that subset. In the first-semester course, the expected and actual grades for subsets of males, females, first-year students, non-first-year students, Asians, Blacks, and Latinos are not significantly different. Those who eventually majored in chemistry or biochemistry attained grades very significantly higher than expected. Most students tended to achieve grades in the second-semester course that were similar to those earned in the first-semester course.

  12. How Parents Can Help Kids Improve Test Scores: Taking the Stakes out of Literacy Testing

    ERIC Educational Resources Information Center

    Schneider, Steven

    2006-01-01

    In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…

  13. End-box scoring artefact evaluation of the Farnsworth-Munsell 100-Hue colour vision test.

    PubMed

    Viliūnas, V; Lukauskiene, R; Svegzda, A; Zukauskas, A

    2006-11-01

    The scoring artefact in the Farnsworth-Munsell 100-Hue test, arising from the grouping of the caps into four boxes, was investigated. The traditional method of scoring performed with the numbers of the anchor caps disregarded and the alternative scoring performed with the numbers of the anchor caps employed, were compared. For the traditional method of scoring, we revealed an increase of the error score of the outside (end-box) caps when the total error score was above 240. On the contrary for scoring performed with the numbers of the anchor caps employed, the difference between the error score of the outside caps and the average error per cap is not significant. To mitigate the end-box artefact and to improve the reliability of the Farnsworth-Munsell 100-Hue test, corrections to the traditional method of scoring are proposed. PMID:17040422

  14. The Role of Test Scores in Explaining Race and Gender Differences in Wages

    ERIC Educational Resources Information Center

    Blackburn, McKinley L.

    2004-01-01

    Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…

  15. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... 21 Food and Drugs 8 2011-04-01 2011-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...

  16. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2012 CFR

    2012-04-01

    ... 21 Food and Drugs 8 2012-04-01 2012-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...

  17. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2013 CFR

    2013-04-01

    ... 21 Food and Drugs 8 2013-04-01 2013-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...

  18. 21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

    Code of Federal Regulations, 2014 CFR

    2014-04-01

    ... 21 Food and Drugs 8 2014-04-01 2014-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...

  19. Principles and Practices of Test Score Equating. Research Report. ETS RR-10-29

    ERIC Educational Resources Information Center

    Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R.

    2010-01-01

    Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…

  20. Adaptive Testing with Equated Number-Correct Scoring. Research Report 99-02.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    A constrained computerized adaptive testing (CAT) algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived from a set of…

  1. Transiency, Test Scores, and the Public: One School District's Story

    ERIC Educational Resources Information Center

    Sanderson, D.R.

    2004-01-01

    Research documents that transient students who change schools frequently oftensuffer from low academic achievement. This article investigates standardized group measures by disentangling elementary achievement scores. Located in a highly transient area outside of Philadelphia, Pennsylvania, Main Street School had their fifth grade Pennsylvania…

  2. Why African American College Students Miss the Perfect Test Score

    ERIC Educational Resources Information Center

    Gentry, Ruben; Stokes, Dorothy

    2016-01-01

    Many African Americans were imbued with the cliché that they must work twice as hard as others to be a success in life. Entering college, students with this belief put extensive effort into earning top grades to ensure quality preparation for their chosen career; yet, some fail to earn top scores. Why? This is the million dollar question, but the…

  3. Interpreting Changes over Time in High School Average ACT® College Readiness Assessment Composite Scores and ACT College Readiness Benchmark Attainment Rates. ACT Research Report Series, 2013 (9)

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Education officials and journalists frequently track changes over time in the average ACT® College Readiness Assessment Composite scores and ACT College Readiness Benchmark attainment rates of individual high schools. Using standard statistical methods, I examined how often changes in these statistics are unambiguously positive or negative, rather…

  4. Determining the Relationship of Nursing Test Scores and Test Anxiety Levels before and after a Test-Taking Strategy Seminar.

    ERIC Educational Resources Information Center

    Carraway, Cassandra T.

    A study was conducted to determine whether participation in a test-taking strategy seminar significantly decreased test anxiety in first-year nursing students. The study also sought to compare nursing test scores of first-year nursing students who participated in the seminar with those who did not. The sample consisted of 30 first-year nursing…

  5. Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

    PubMed

    Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

    2001-01-01

    Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms. PMID:11827093

  6. A Seven-Year Follow-Up of Intelligence Test Scores of Foster Grandparents

    ERIC Educational Resources Information Center

    Troll, Lillian E.; And Others

    1976-01-01

    After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…

  7. A New Method for Administering and Scoring Multiple-Choice Tests: Theoretical Considerations and Empirical Results.

    ERIC Educational Resources Information Center

    Cross, Lawrence H.; And Others

    A new scoring procedure for multiple choice tests attempts to assess partial knowledge and to restrict guessing. It is a variant of Coombs' elimination scoring method, adapted for use with the carbon-shield answer sheets commonly used with answer-until-correct scoring. Examinees are directed to erase the carbon shields of choices they are certain…

  8. Can Machine Scoring Deal with Broad and Open Writing Tests as Well as Human Readers?

    ERIC Educational Resources Information Center

    McCurry, Doug

    2010-01-01

    This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…

  9. D.C. Student Test Scores Show Uneven Progress. Data Snapshot

    ERIC Educational Resources Information Center

    DuPre, Mary

    2011-01-01

    Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…

  10. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  11. TCAP Scores and per Pupil Expenditures: Statewide Changes before and after Tennessee's First to the Top Act

    ERIC Educational Resources Information Center

    Cantrell, Martha Ely

    2013-01-01

    The purpose of this study was to investigate the relationships between the changes in Tennessee Comprehensive Assessment Program (TCAP) scores and the changes in Per Pupil Expenditures (PPE) after the enactment of "First to the Top Act of 2010" and the receipt of $501,000,000 in federal Race to the Top (RTTT) grant monies. Half of that…

  12. A Preliminary Attempt at Teaching Abstract Mathematics to Freshmen with an A.C.T. Score of Less Than 15.

    ERIC Educational Resources Information Center

    Schremmer, A. G.

    This experiment attempted to teach abstract mathematics fo college freshmen with A.C.T. scores less than 15 in a three semester terminal course sequence. The course content included a formal mathematical language, set theory, Boolean Algebra, relations and functions, operations, cardinals and ordinals, the rational numbers, and college algebra.…

  13. Effects of a Coaching Class on the ACT Scores of Students at a Large Midwest High School

    ERIC Educational Resources Information Center

    Moss, Gary L.; Chippendale, Ene Kaja; Mershon, Clark W.; Carney, Trisha

    2012-01-01

    The role of standardized tests in general and college admission tests in particular is a hot topic for educators, administrators and others evaluating the outcomes of educational institutions. The importance of the test scores for both students and institutions is acknowledged, and the test preparation industry has spawned into a multi-billion…

  14. The value of Bayes' theorem for interpreting abnormal test scores in cognitively healthy and clinical samples.

    PubMed

    Gavett, Brandon E

    2015-03-01

    The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed. PMID:25784058

  15. Note on the Scoring of Foreign Language Speaking and Writing Fluency Tests.

    ERIC Educational Resources Information Center

    Carroll, John B.

    The problem of determining relative weights for quantity and quality in scoring foreign language speaking and writing fluency tests is studied. French speaking and writing fluency tests were administered to students of French in several schools in England. Data from these tests was analyzed to support the suggestion that scoring formulas should…

  16. Noncognitive Skills and the Gender Disparities in Test Scores and Teacher Assessments: Evidence from Primary School

    ERIC Educational Resources Information Center

    Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica

    2013-01-01

    Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…

  17. Discrepancies between Score Trends from NAEP and State Tests: A Scale-Invariant Perspective

    ERIC Educational Resources Information Center

    Ho, Andrew D.

    2007-01-01

    State test score trends are widely interpreted as indicators of educational improvement. To validate these interpretations, state test score trends are often compared to trends on other tests such as the National Assessment of Educational Progress (NAEP). These comparisons raise serious technical and substantive concerns. Technically, the most…

  18. Relationship of Achievement Test Scores and State Board Performance in a Diploma Nursing Program.

    ERIC Educational Resources Information Center

    Washburn, Gail

    The relationship between the National League for Nursing (NLN) achievement test scores and performance on the State Board Test Pool Examination (SBTPE) was studied with 166 graduates of a diploma degree school of nursing between 1976 and 1978. It was found that NLN achievement test scores had a highly significant correlation with SBTPE results.…

  19. Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

    ERIC Educational Resources Information Center

    Klesch, Heather S.

    2010-01-01

    The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…

  20. School Inputs, Household Substitution, and Test Scores. NBER Working Paper No. 16830

    ERIC Educational Resources Information Center

    Das, Jishnu; Dercon, Stefan; Habyarimana, James; Krishnan, Pramila; Muralidharan, Karthik; Sundararaman, Venkatesh

    2011-01-01

    Empirical studies of the relationship between school inputs and test scores typically do not account for the fact that households will respond to changes in school inputs. We present a dynamic household optimization model relating test scores to school and household inputs, and test its predictions in two very different low-income country…

  1. Delaware Student Testing Program: A Score Results Guide for Parents.

    ERIC Educational Resources Information Center

    Delaware State Dept. of Education, Dover. Assessment and Accountability Branch.

    This guide is intended to help parents understand the Delaware Student Testing Program (DSTP) and the reports it generates. The DSTP tests are administered to provide an accurate measure of how well students are doing relative to Delaware's rigorous content standards. DSTP tests are administered in reading, writing, mathematics, science, and…

  2. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    ERIC Educational Resources Information Center

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  3. Graduate Students' Administration and Scoring Errors on the Woodcock-Johnson III Tests of Cognitive Abilities

    ERIC Educational Resources Information Center

    Ramos, Erica; Alfonso, Vincent C.; Schermerhorn, Susan M.

    2009-01-01

    The interpretation of cognitive test scores often leads to decisions concerning the diagnosis, educational placement, and types of interventions used for children. Therefore, it is important that practitioners administer and score cognitive tests without error. This study assesses the frequency and types of examiner errors that occur during the…

  4. TEST OF ADULT COLLEGE APTITUDE (TACA). MANUAL FOR ADMINISTRATION, SCORING AND INTERPRETATION.

    ERIC Educational Resources Information Center

    DUBOIS, PHILIP H.; WIENTGE, KING M.

    THIS PRELIMINARY MANUAL OUTLINES CONTENT, ADMINISTRATIVE AND SCORING PROCEDURES, ANTECEDENT RESEARCH, AND AVAILABLE NORM DATA FOR THE TEST OF ADULT COLLEGE APTITUDE (TACA). THE TACA, A COMBINED TEST AND ANSWER SHEET ADAPTED FOR VISUAL SCORING BY AN OPTICAL SCANNER, CONSISTS OF 22 ITEMS ON BIOGRAPHICAL DATA (AGE, SEX, OCCUPATION, FAMILY AND MARITAL…

  5. Using Raters from India to Score a Large-Scale Speaking Test

    ERIC Educational Resources Information Center

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  6. Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…

  7. Sex Differences in Cognitive Abilities Test Scores: A UK National Picture

    ERIC Educational Resources Information Center

    Strand, Steve; Deary, Ian J.; Smith, Pauline

    2006-01-01

    Background and aims: There is uncertainty about the extent or even existence of sex differences in the mean and variability of reasoning test scores ( Jensen, 1998; Lynn, 1994, ; Mackintosh, 1996). This paper analyses the Cognitive Abilities Test (CAT) scores of a large and representative sample of UK pupils to determine the extent of any sex…

  8. AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

    ERIC Educational Resources Information Center

    Cech, Scott J.

    2008-01-01

    More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…

  9. Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

    ERIC Educational Resources Information Center

    Pellicer-Sanchez, Ana; Schmitt, Norbert

    2012-01-01

    Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

  10. The Effect of Misinformation, Partial Information, and Guessing on Expected Multiple-Choice Test Item Scores.

    ERIC Educational Resources Information Center

    Frary, Robert B.

    1980-01-01

    Six scoring methods for assigning weights to right or wrong responses according to various instructions given to test takers are analyzed with respect to expected change scores and the effect of various levels of information and misinformation. Three of the methods provide feedback to the test taker. (Author/CTM)

  11. Alignment, High Stakes, and the Inflation of Test Scores. CSE Report 655

    ERIC Educational Resources Information Center

    Koretz, Daniel

    2005-01-01

    There are many reasons to align tests with curricular standards, but this alignment is not sufficient to protect against score inflation. This report explains the relationship between alignment and score inflation by clarifying what is meant by inappropriate test preparation. It provides a concrete, hypothetical example that illustrates a process…

  12. Effects of Scoring by Section and Independent Scorers' Patterns on Scorer Reliability in Biology Essay Tests

    ERIC Educational Resources Information Center

    Ebuoh, Casmir N.; Ezeudu, S. A.

    2015-01-01

    The study investigated the effects of scoring by section, use of independent scorers and conventional patterns on scorer reliability in Biology essay tests. It was revealed from literature review that conventional pattern of scoring all items at a time in essay tests had been criticized for not being reliable. The study was true experimental study…

  13. Effects of Homework Completion on Test Scores in Introductory Spanish Courses.

    ERIC Educational Resources Information Center

    Brender, John R.

    This research investigated the effects of homework completion on test scores for 401 undergraduate students, 94 percent African American, at an urban university in 2 levels of introductory Spanish, all with the same instructor. Five to six teacher-generated exams were administered during the course; the lowest test score for each student was…

  14. Identification of Statistically Significant Differences between Standard Scores on the Woodcock Reading Mastery Tests.

    ERIC Educational Resources Information Center

    Simpson, Robert G.

    1981-01-01

    Occasionally, differences in test scores seem to indicate that a student performs much better in one reading area than in another when, in reality, the differences may not be statistically significant. The author presents a table in which statistically significant differences between Woodcock test standard scores are identified. (Author)

  15. Linking Scores From Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  16. Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  17. The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

    ERIC Educational Resources Information Center

    Silles, Mary A.

    2010-01-01

    This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…

  18. An Investigation of Methods for Improving Estimation of Test Score Distributions.

    ERIC Educational Resources Information Center

    Hanson, Bradley A.

    Three methods of estimating test score distributions that may improve on using the observed frequencies (OBFs) as estimates of a population test score distribution are considered: the kernel method (KM); the polynomial method (PM); and the four-parameter beta binomial method (FPBBM). The assumption each method makes about the smoothness of the…

  19. 76 FR 16350 - Medical Devices; Ovarian Adnexal Mass Assessment Score Test System; Labeling; Black Box Restrictions

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-03-23

    ... HUMAN SERVICES Food and Drug Administration 21 CFR Part 866 Medical Devices; Ovarian Adnexal Mass... regulation classifying ovarian adnexal mass assessment score test systems to restrict these devices so that a... mass assessment score test system into class II (special controls). DATES: Submit either electronic...

  20. Difference in Standard Scores of Adults on the Peabody Picture Vocabulary Test (Revised and Third Edition)

    ERIC Educational Resources Information Center

    Pankratz, Mary; Morrison, Andrea; Plante, Elena

    2004-01-01

    Differences in the standard scores for the Peabody Picture Vocabulary Test-Revised (PPVT-R; L. M. Dunn & L. M. Dunn, 1981) and the PPVT-Third Edition (PPVT-III; Dunn & Dunn, 1997b) are known to exist for children, with typically higher scores occurring on the PPVT-III. However, these tests are administered into adulthood as well, and score…

  1. STABILITY OF ACADEMIC APTITUDE AND READING TEST SCORES OF MOBILE AND NON-MOBILE DISADVANTAGED CHILDREN.

    ERIC Educational Resources Information Center

    JUSTMAN, JOSEPH

    CHANGES IN ACADEMIC APTITUDE AND ACHIEVEMENT TEST SCORES OF PUPILS ATTENDING PUBLIC SCHOOLS IN DISADVANTAGED AREAS IN NEW YORK CITY WERE INVESTIGATED. AN ATTEMPT WAS MADE TO DETERMINE WHETHER VARYING DEGREES OF MOBILITY WERE ASSOCIATED WITH VARIATION IN CHANGES IN TEST SCORES. THE CUMULATIVE RECORD CARDS OF SIXTH-GRADE PUPILS WERE EXAMINED TO…

  2. Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects

    ERIC Educational Resources Information Center

    Lockwood, J. R.; McCaffrey, Daniel F.

    2014-01-01

    A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…

  3. The Influence of an NCLB Accountability Plan on the Distribution of Student Test Score Gains

    ERIC Educational Resources Information Center

    Springer, Matthew G.

    2008-01-01

    Previous research on the effect of accountability programs on the distribution of student test score gains is decidedly mixed. This study examines the issue by estimating an educational production function in which test score gains are a function of the incentives schools have to focus instruction on below-proficient students. NCLB's threat of…

  4. Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

    ERIC Educational Resources Information Center

    Sakellariou, Chris

    2008-01-01

    This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…

  5. Kindergarten Black-White Test Score Gaps: Replicating and Updating Previous Findings with New National Data

    ERIC Educational Resources Information Center

    Quinn, David

    2014-01-01

    A substantial body of evidence has shown large academic test score gaps between black and white students in early childhood. These gaps remain, and probably grow, as students progress through school. Many researchers have sought to explain these persistent test score gaps, and particularly, to understand the role of students' socio-economic status…

  6. The Dynamics of the Evolution of the Black-White Test Score Gap

    ERIC Educational Resources Information Center

    Sohn, Kitae

    2012-01-01

    We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…

  7. Are Mathematics and Science Test Scores Good Indicators of Labor-Force Quality?

    ERIC Educational Resources Information Center

    Chen, Shiu-Sheng; Luoh, Ming-Ching

    2010-01-01

    Using data from the Programme for International Student Assessment (PISA) and the Trends in International Mathematics and Science Study (TIMSS), we investigate the link between test scores (mathematics and science) and cross-country income differences. We would like to know whether test scores are good indicators of labor-force quality. The…

  8. How Changes in Families and Schools Are Related to Trends in Black-White Test Scores

    ERIC Educational Resources Information Center

    Berends, Mark; Lucas, Samuel R.; Penaloza, Roberto V.

    2008-01-01

    Through several decades of research, a great deal has been written about trends in black-white test scores and the factors that may explain the gaps in different subject areas. Only a few studies have examined the changing relationships between gaps in students' test scores and family and school measures in nationally representative data over…

  9. Increasing Racial Isolation and Test Score Gaps in Mathematics: A 30-Year Perspective

    ERIC Educational Resources Information Center

    Berends, Mark; Penaloza, Roberto V.

    2010-01-01

    Background/Context: Although there has been progress in closing the test score gaps among student groups over past decades, that progress has stalled. Many researchers have speculated why the test score gaps closed between the early 1970s and the early 1990s, but only a few have been able to empirically study how changes in school factors and…

  10. Freudenfreude and Schadenfreude Test (FAST) scores of depressed and non-depressed undergraduates.

    PubMed

    Chambliss, Catherine; Cattai, Ashley; Benton, Peter; Elghawy, Ahmed; Fan, Madde; Thompson, Kayleigh; Scavicchio, Daniel; Tanenbaum, Joshua

    2012-08-01

    The Freudenfreude and Schadenfreude Test (FAST) had moderate test-retest reliability in an undergraduate sample. Freudenfreude scores were lower and Schadenfreude scores were higher among mildly depressed than nondepressed students. Distinctive reactions to personal success and failure were associated with depression. Responses to others' success and failure may also be related to depression. PMID:23045853

  11. The Scoring of Matching Questions Tests: A Closer Look

    ERIC Educational Resources Information Center

    Jancarík, Antonín; Kostelecká, Yvona

    2015-01-01

    Electronic testing has become a regular part of online courses. Most learning management systems offer a wide range of tools that can be used in electronic tests. With respect to time demands, the most efficient tools are those that allow automatic assessment. The presented paper focuses on one of these tools: matching questions in which one…

  12. High-Stakes Testing and Student Achievement: Problems for the No Child Left Behind Act

    ERIC Educational Resources Information Center

    Nichols, Sharon L.; Glass, Gene V.; Berliner, David C.

    2005-01-01

    Under the federal No Child Left Behind Act of 2001 (NCLB), standardized test scores are the indicator used to hold schools and school districts accountable for student achievement. Each state is responsible for constructing an accountability system, attaching consequences--or stakes--for student performance. The theory of action implied by this…

  13. Validity Evidence for ACT Compass® Placement Tests. ACT Research Report Series 2014 (2)

    ERIC Educational Resources Information Center

    Westrick, Paul A.; Allen, Jeff

    2014-01-01

    We examined the validity of using Compass® test scores and high school grade point average (GPA) for placing students in first-year college courses and for identifying students at risk of not succeeding. Consistent with other research, the combination of high school GPA and Compass scores performed better than either measure used alone. Results…

  14. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    ERIC Educational Resources Information Center

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  15. What Do Test Scores in Texas Tell Us? Issue Paper.

    ERIC Educational Resources Information Center

    Klein, Stephen P.; Hamilton, Laura S.; McCaffrey, Daniel F.; Stecher, Brian M.

    Texas students have made extraordinarily large gains on statewide achievement tests, the Texas Assessment of Academic Skills (TAAS), gains so dramatic that they have been dubbed the "Texas miracle." There is general agreement that these gains are attributable to the high stakes accountability system in Texas, but there is some question about what…

  16. Test Scores and the Rural School and Community Trust.

    ERIC Educational Resources Information Center

    Rural School and Community Trust, Washington, DC.

    A number of studies suggests that the small size of many rural schools gives their students, especially the poorest, a leg up on academic achievement. This notion is supported by the standardized test results presented in this report, from a sample of the primarily small schools participating in the Rural School and Community Trust, a national…

  17. Allometric Scaling of Wingate Anaerobic Power Test Scores in Women

    ERIC Educational Resources Information Center

    Hetzler, Ronald K.; Stickley, Christopher D.; Kimura, Iris F.

    2011-01-01

    In this study, we developed allometric exponents for scaling Wingate anaerobic test (WAnT) power data that are reflective in controlling for body mass (BM) and lean body mass (LBM) and established a normative WAnT data set for college-age women. One hundred women completed a standard WAnT. Allometric exponents and percentile ranks for peak (PP)…

  18. Personnel Test Battery and Scoring Procedures. Memorandum No. L.S. 15.

    ERIC Educational Resources Information Center

    Berson, Barry L.

    The purpose of this memo is to present tests that comprise the test battery used to select Navy personnel to train marine mammals, and to describe the scoring procedures of the tests. The test battery consists of: Biosystems General Information Test (BGIT), Personnel History Questionnaire (PHQ), Gordon Personal Inventory, Gordon Personal Profile,…

  19. Correlation Between Students' Dental Admission Test Scores and Performance on a Dental School's Competency Exam.

    PubMed

    Carroll, Alexander M; Schuster, Gregory M

    2015-11-01

    The aim of this study was to investigate whether there was a statistically significant positive correlation between dental students' Dental Admission Test (DAT) scores, particularly on the Perceptual Ability Test (PAT), and their performance on a dental school's competency exam. Scores from the written and clinical competency exam administered in the fall quarter of the fourth year of the curriculum at Midwestern University College of Dental Medicine-Arizona were compared to DAT scores of all 216 members of the graduating classes of 2012 and 2013. It was hypothesized that students who performed highly on one or more sections of the DAT would perform highly on the competency exam. Backward stepwise regression analyses were used to analyze the data. The results showed that the PAT scores were most strongly correlated with the competency exam scores and were a positive predictor for all three clinical sections of the exam (operative dentistry, periodontics, and endodontics). Positive predictors for the written portion of the exam were total DAT score for patient assessment and treatment planning and the DAT reading comprehension score for prosthodontics; there were no predictors for periodontics. The total variance explained by the results ranged from 4% to 15%. While statistically significant relationships were found between the students' PAT scores and clinical performance, DAT scores explained relatively little variance in the competency exam scores. According to these findings, neither the PAT nor any of the DAT components contributed to predicting these students' clinical performance. PMID:26522638

  20. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm.

    PubMed

    Beketayev, Kenes; Runco, Mark A

    2016-05-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the "Many Uses Test" of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered. PMID:27298632

  1. Can Percentiles Replace Raw Scores in the Statistical Analysis of Test Data?

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.; Zumbo, Bruno D.

    2005-01-01

    Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…

  2. Score Generalizability of Academic Writing Tasks: Does One Test Method Fit It All?

    ERIC Educational Resources Information Center

    Gebril, Atta

    2009-01-01

    Generalizability of writing scores has always been a longstanding concern in L2 writing assessment. A number of studies have been conducted to investigate this topic during the last two decades. However, with the introduction of new test methods, such as reading-to-write tasks, generalizability studies need to focus on the score accuracy of…

  3. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    ERIC Educational Resources Information Center

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  4. From #2 Pencils to the World Wide Web: A History of Test Scoring

    ERIC Educational Resources Information Center

    Zytowski, Donald G.

    2008-01-01

    The present highly developed status of psychological and educational testing in the United States is in part the result of many efforts over the past 100 years to develop economical and reliable methods of scoring. The present article traces a number of methods, ranging from hand scoring to present-day computer applications, stimulated by the need…

  5. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    ERIC Educational Resources Information Center

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  6. Language Variation and Score Variation in the Testing of English Language Learners, Native Spanish Speakers

    ERIC Educational Resources Information Center

    Solano-Flores, Guillermo; Li, Min

    2009-01-01

    We investigated language variation and score variation in the testing of English language learners, native Spanish speakers. We gave students the same set of National Assessment of Educational Progress mathematics items in both their first language and their second language. We examined the amount of score variation due to the main and interaction…

  7. Multinomial and Compound Multinomial Error Models for Tests with Complex Item Scoring

    ERIC Educational Resources Information Center

    Lee, Won-Chan

    2007-01-01

    This article introduces a multinomial error model, which models an examinee's test scores obtained over repeated measurements of an assessment that consists of polytomously scored items. A compound multinomial error model is also introduced for situations in which items are stratified according to content categories and/or prespecified numbers of…

  8. Assessing the Relationship among Defining Issues Test Scores and Crystallised and Fluid Intellectual Indices

    ERIC Educational Resources Information Center

    Derryberry, W. Pitt; Jones, Kristy L.; Grieve, Frederick G.; Barger, Brian

    2007-01-01

    Differing findings exist on how Defining Issues Test (DIT) scores relate to intelligence. Further study is needed in order to address aspects of intellect not previously considered and to address how these relationships rival studies that have compared indices of intellect with constructs similar to DIT scores. In the present study, a sample of…

  9. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    ERIC Educational Resources Information Center

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  10. The Effects of Diverse Test Score Distribution Characteristics on the Estimation of the Rasch Measurement Model.

    ERIC Educational Resources Information Center

    Cypress, Beulah K.

    The potential of the Rasch model to develop scores, on a ratio scale, suitable for interindividual comparisons, from intact groups with disparate distribution characteristics was investigated. The specific problems studied were: (1) the effects of skewed test score distributions on the ability parameter of the Rasch measurement model; (2) the…

  11. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  12. Using Test Scores from Students with Disabilities in Teacher Effectiveness Indicators

    ERIC Educational Resources Information Center

    Buzick, Heather M.; Jones, Nathan D.

    2015-01-01

    The increased emphasis on using student growth measures in teacher evaluation has raised questions about how to treat test scores from students with disabilities. This study explores the consequences of three common approaches for treating scores from students with disabilities in statistical approaches to estimating teacher effectiveness: (1)…

  13. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    ERIC Educational Resources Information Center

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  14. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm

    PubMed Central

    Beketayev, Kenes; Runco, Mark A.

    2016-01-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the “Many Uses Test” of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered. PMID:27298632

  15. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    ERIC Educational Resources Information Center

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This article introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  16. Using Microcomputer Software to Score Placement Tests--An Example from the University of California, Irvine.

    ERIC Educational Resources Information Center

    Shoemaker, Judith S.; And Others

    This article describes the placement testing program, the use of microcomputer software for scoring and analyzing test results, and the integration of the computerized test results into a comprehensive microcomputer-based student information system at the University of California, Irvine (UCI). UCI offers placement tests in five academic fields:…

  17. Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

    ERIC Educational Resources Information Center

    Farnsworth, Tim

    2013-01-01

    This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…

  18. Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score

    PubMed Central

    Hung, Man; Hon, Shirley D.; Cheng, Christine; Franklin, Jeremy D.; Aoki, Stephen K.; Anderson, Mike B.; Kapron, Ashley L.; Peters, Christopher L.; Pelt, Christopher E.

    2014-01-01

    Background: The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. Purpose: To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. Results: All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. Conclusion: The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior

  19. Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

    ERIC Educational Resources Information Center

    Kim, Sooyeon; Moses, Tim

    2013-01-01

    The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

  20. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    ERIC Educational Resources Information Center

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  1. Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

    ERIC Educational Resources Information Center

    Looney, Marilyn A.; Gilbert, Jennie

    2012-01-01

    The purpose of the study was to determine if currently used FITNESSGRAM[R] cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…

  2. Coefficient α as a Measure of Test Score Reliability: Review of 3 Popular Misconceptions.

    PubMed

    Morera, Osvaldo F; Stokes, Sonya M

    2016-03-01

    We discuss 3 popular misconceptions about Cronbach α or coefficient α, traditionally used in public health and the behavioral sciences as an index of test score reliability. We also review several other indices of test score reliability. We encourage researchers to thoughtfully consider the nature of their data and the options when choosing an index of reliability, and to clearly communicate this choice and its implications to their audiences. PMID:26885962

  3. Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

    PubMed

    Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

    2015-11-01

    In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. PMID:25867168

  4. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    ERIC Educational Resources Information Center

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  5. The effect of the use of Christian-published science textbooks on the ACT Science Reasoning subtest scores of midwest Christian high schools

    NASA Astrophysics Data System (ADS)

    Guthrie, Janice Marie

    Statistics indicate that students in conservative Christian schools earn higher standardized test scores in mathematics, reading, science, and writing compared to their public school counterparts, while many are criticized for using curricular materials deemed inferior in quality and for employing uncertified and ill-trained teachers. This study investigates the effectiveness of Christian-published science textbooks in preparing students for college-level science courses as measured by the Science Reasoning subtest of the ACT college entrance examination. A questionnaire was sent to conservative Christian high schools in five Midwest states which are affiliated with the American Association of Christian Schools, Association of Christian Schools International, and Oral Roberts University Educational Fellowship. Information gathered on the schools, teachers, facilities, and ACT Science Reasoning scores and the resulting descriptive statistics provide a sketch of the typical Midwest Christian high school. Hypothesis testing resulted in acceptance of the Null Hypothesis: There is no difference between the mean ACT Science Reasoning scores of Midwest Christian high schools using Christian-published science textbooks as compared to those using secular-published science textbooks. Multiple regression analysis on the two publishers represented by the sample statistics, A Beka and Bob Jones University Press, showed no significant difference in the effectiveness of one publisher over the other. Analysis of the one open-ended question asking why each school chooses to use the type of publisher it does led to the issues of worldview instruction and academic rigor as the deciding factors for the selection of one type of publisher over the other.

  6. Improvement in Intelligence Test Scores from 6 to 10 years in Children of Teenage Mothers

    PubMed Central

    Cornelius, Marie D.; Goldschmidt, Lidush; De Genna, Natacha M.; Richardson, Gale A.; Leech, Sharon L.; Day, Richard

    2010-01-01

    Objective This study investigates change in IQ scores among 290 children born to teenage mothers and identifies social, economic, and environmental variables that may be associated with change in intelligence test performance. Methods The children of 290 teenage mothers (72% African American and 28% European American) were assessed with the Stanford-Binet Intelligence Scale-4th Edition (SBIS) at ages 6 and 10. Results The mean composite score at age 6 was 84.8 and was 91.2 at age 10, an improvement of 6.4 points. Significant cross-sectional predictors at both ages 6 and 10 of higher SBIS scores were maternal cognitive ability, school grade, Caucasian ethnicity, and caregiver education. Having more children in the household significantly predicted lower SBIS scores at age 6. Higher satisfaction with maternal social support predicted higher SBIS scores at age 10. Change in IQ scores was not related to maternal socioeconomic status, social support, home environment, ethnicity, or family interactions. Custodial stability was associated with an improvement in IQ scores, while increase in caregiver depression was related to decline in IQ scores. Conclusions Our findings suggest that improvement in IQ scores of offspring of teenage mothers may be related to stability of maternal custody. More research is needed to determine the impact of the maturation of adolescent mothers' parenting and the role of early education on improvement in cognitive abilities. PMID:20495472

  7. Examining alternative scoring rubrics on a statewide test: The impact of different scoring methods on science and social studies performance assessments

    NASA Astrophysics Data System (ADS)

    Creighton, Susan Dabney

    There is no consensus regarding the most reliable and valid scoring methods for the assessment of higher order thinking skills. Most of the research on alternative formats has focused on the scoring of writing ability. This study examined the value of different types of performance assessment scoring guides on state mandated science and social studies tests. A proportional stratified sample of raters were randomly assigned to one of four scoring groups: checklist, analytic rubric, holistic rubric, and generic rubrics. A fifth method, the weighted analytic rubric, was included by applying an algorithmic formula to the scores assigned by raters using the analytic rubric. A comparison of the mean scores for the five scoring groups suggests that there may be a difference in the way raters applied the rubric for each group. Although the literature suggests that it is possible to achieve high levels of inter-rater reliability, across forms of scoring, phi coefficients of moderate strength were obtained for three of the four constructed-response items. Results for each scoring group were compared indicating that item complexity may impact the level of inter-rate, reliability and the selection of the most reliable rubric for each discipline. Analytic rubrics appear to achieve more reliable results with less complex items. A multitrait-multimethod approach was utilized to investigate the external validity of the social studies and science tasks. As expected, there tended to be a stronger association between the PACT science constructed-response scores with scores based on science multiple-choice scores than between the science constructed-response scores and the writing ability subtest scores. A similar pattern was seen with social studies items. These results provide some evidence for the validity of the performance assessments. A post study survey completed by raters provided qualitative information regarding their thought processes and their primary focus during the

  8. How Does Emergency Department Crowding Affect Medical Student Test Scores and Clerkship Evaluations?

    PubMed Central

    Wei, Grant; Arya, Rajiv; Ritz, Z. Trevor; He, Albert S.; Ohman-Strickland, Pamela A.; McCoy, Jonathan V.

    2015-01-01

    Introduction The effect of emergency department (ED) crowding has been recognized as a concern for more than 20 years; its effect on productivity, medical errors, and patient satisfaction has been studied extensively. Little research has reviewed the effect of ED crowding on medical education. Prior studies that have considered this effect have shown no correlation between ED crowding and resident perception of quality of medical education. Objective To determine whether ED crowding, as measured by the National ED Overcrowding Scale (NEDOCS) score, has a quantifiable effect on medical student objective and subjective experiences during emergency medicine (EM) clerkship rotations. Methods We collected end-of-rotation examinations and medical student evaluations for 21 EM rotation blocks between July 2010 and May 2012, with a total of 211 students. NEDOCS scores were calculated for each corresponding period. Weighted regression analyses examined the correlation between components of the medical student evaluation, student test scores, and the NEDOCS score for each period. Results When all 21 rotations are included in the analysis, NEDOCS scores showed a negative correlation with medical student tests scores (regression coefficient= −0.16, p=0.04) and three elements of the rotation evaluation (attending teaching, communication, and systems-based practice; p<0.05). We excluded an outlying NEDOCS score from the analysis and obtained similar results. When the data were controlled for effect of month of the year, only student test score remained significantly correlated with NEDOCS score (p=0.011). No part of the medical student rotation evaluation attained significant correlation with the NEDOCS score (p≥0.34 in all cases). Conclusion ED overcrowding does demonstrate a small but negative association with medical student performance on end-of-rotation examinations. Additional studies are recommended to further evaluate this effect. PMID:26594289

  9. Instructions for additional qualitative scoring of the initial-letter Word-association Test.

    PubMed

    Zivković, M

    1994-04-01

    An additional scoring method is based on grouping test-words according to whether the same sign is given by subjects to the test-words. In this way five test-word categories are formed, Eros (test-words with double plus signs), demi-Eros (single plus sign), demi-Thanatos (single minus), Thanatos (double minus), and Deviant (+/- and theta signs). The next step in scoring is to count the number of test-words in a given scoring category whose meanings do not conform. The greater the discrepancy between the test-word category and its meaning, the less well adapted is the subject. Several illustrative protocols are discussed. PMID:8022674

  10. Methods for Evaluating the Validity of Test Scores for English Language Learners

    ERIC Educational Resources Information Center

    Sireci, Stephen G.; Han, Kyung T.; Wells, Craig S.

    2008-01-01

    In the United States, when English language learners (ELLs) are tested, they are usually tested in English and their limited English proficiency is a potential cause of construct-irrelevant variance. When such irrelevancies affect test scores, inaccurate interpretations of ELLs' knowledge, skills, and abilities may occur. In this article, we…

  11. The 5-Step Way to Raise Test Scores: Using the Data to Drive Your Instruction

    ERIC Educational Resources Information Center

    East, Pam C.

    2005-01-01

    Many teachers look at standardized tests as something to be dreaded. This author and teacher looks at standardized-test scores and sees a tool to bring students learning to new heights. This is a way for teachers to target instruction exactly where it's needed. A way to get students looking forward to end-of-the-year tests (really!) as a way to…

  12. Effects of Student Self-Corrective Measures on Learning and Standardized Test Scores

    ERIC Educational Resources Information Center

    Poplin, Beth D.

    2010-01-01

    This study examined whether students who graded and corrected their own test papers improved their learning and standardized test scores on the North Carolina end-of-course test in United States History. Four preexisting, intact classrooms of 11th grade United States History students in two different high schools formed the basis of this…

  13. Demands on Users for Interpretation of Achievement Test Scores: Implications for the Evaluation Profession

    ERIC Educational Resources Information Center

    Della-Piana, Gabriel Mario; Gardner, Michael

    2011-01-01

    Background: Professional standards for validity of achievement tests have long reflected a consensus that validity is the degree to which evidence and theory support interpretations of test scores entailed by the intended uses of tests. Yet there are convincing lines of evidence that the standards are not adequately followed in practice, that…

  14. Colleges and Universities That Do Not Require SAT® or ACT Scores. Research Notes. RN-18

    ERIC Educational Resources Information Center

    Milewski, Glenn B.; Camara, Wayne J.

    2002-01-01

    In recent years, conflicting reports have circulated about the number of colleges and universities that have made standardized tests optional, rather than required, for admission. The National Center for Fair and Open Testing, or FairTest, has widely publicized a list of 391 colleges and universities that, it claims, do not require admissions…

  15. Does breastfeeding contribute to the racial gap in reading and math test scores?

    PubMed Central

    Peters, Kristen E.; Huang, Jin; Vaughn, Michael G.; Witko, Christopher

    2013-01-01

    Purpose The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. Methods The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0–12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. Results African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Conclusions Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. PMID:23880156

  16. Predicting Long-Term College Success through Degree Completion Using ACT[R] Composite Score, ACT Benchmarks, and High School Grade Point Average. ACT Research Report Series, 2012 (5)

    ERIC Educational Resources Information Center

    Radunzel, Justine; Noble, Julie

    2012-01-01

    This study compared the effectiveness of ACT[R] Composite score and high school grade point average (HSGPA) for predicting long-term college success. Outcomes included annual progress towards a degree (based on cumulative credit-bearing hours earned), degree completion, and cumulative grade point average (GPA) at 150% of normal time to degree…

  17. A Cochran-Armitage-type and a score-free global test for multivariate ordinal data.

    PubMed

    Jelizarow, Monika; Mansmann, Ulrich; Goeman, Jelle J

    2016-07-20

    We propose a Cochran-Armitage-type and a score-free global test that can be used to assess the presence of an association between a set of ordinally scaled covariates and an outcome variable within the range of generalized linear models. Both tests are developed within the framework of the well-established 'global test' methodology and as such are feasible in high-dimensional data situations under any correlation and enable adjustment for covariates. The Cochran-Armitage-type test, for which an intimate connection with the traditional score-based Cochran-Armitage test is shown, rests upon explicit assumptions on the distances between the covariates' ordered categories. The score-free test, in contrast, parametrizes these distances and thus keeps them flexible, rendering it ideally suited for covariates measured on an ordinal scale. As confirmed by means of simulations, the Cochran-Armitage-type test focuses its power on set-outcome relationships where the distances between the covariates' categories are equal or close to those assumed, whereas the score-free test spreads its power over a wide range of possible set-outcome relationships, putting more emphasis on monotonic than on non-monotonic ones. Based on the tests' power properties, it is discussed when to favour one or the other, and the practical merits of both of them are illustrated by an application in the field of rehabilitation medicine. Our proposed tests are implemented in the R package globaltest. Copyright © 2016 John Wiley & Sons, Ltd. PMID:26924287

  18. Neighborhood Social Context and Individual Polycyclic Aromatic Hydrocarbon Exposures Associated with Child Cognitive Test Scores.

    PubMed

    Lovasi, Gina S; Eldred-Skemp, Nicolia; Quinn, James W; Chang, Hsin-Wen; Rauh, Virginia A; Rundle, Andrew; Orjuela, Manuela A; Perera, Frederica P

    2014-07-01

    Childhood cognitive and test-taking abilities have long-term implications for educational achievement and health, and may be influenced by household environmental exposures and neighborhood contexts. This study evaluates whether age 5 scores on the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R, administered in English) are associated with polycyclic aromatic hydrocarbon (PAH) exposure and neighborhood context variables including poverty, low educational attainment, low English language proficiency, and inadequate plumbing. The Columbia Center for Children's Environmental Health enrolled African-American and Dominican-American New York City women during pregnancy, and conducted follow-up for subsequent childhood health outcomes including cognitive test scores. Individual outcomes were linked to data characterizing 1-km network buffers around prenatal addresses, home observations, interviews, and prenatal PAH exposure data from personal air monitors. Prenatal PAH exposure above the median predicted 3.5 point lower total WPPSI-R scores and 3.9 point lower verbal scores; the association was similar in magnitude across models with adjustments for neighborhood characteristics. Neighborhood-level low English proficiency was independently associated with 2.3 point lower mean total WPPSI-R score, 1.2 point lower verbal score, and 2.7 point lower performance score per standard deviation. Low neighborhood-level educational attainment was also associated with 2.0 point lower performance scores. In models examining effect modification, neighborhood associations were similar or diminished among the high PAH exposure group, as compared with the low PAH exposure group. Early life exposure to personal PAH exposure or selected neighborhood-level social contexts may predict lower cognitive test scores. However, these results may reflect limited geographic exposure variation and limited generalizability. PMID:24994947

  19. Neighborhood Social Context and Individual Polycyclic Aromatic Hydrocarbon Exposures Associated with Child Cognitive Test Scores

    PubMed Central

    Eldred-Skemp, Nicolia; Quinn, James W.; Chang, Hsin-wen; Rauh, Virginia A.; Rundle, Andrew; Orjuela, Manuela A.; Perera, Frederica P.

    2013-01-01

    Childhood cognitive and test-taking abilities have long-term implications for educational achievement and health, and may be influenced by household environmental exposures and neighborhood contexts. This study evaluates whether age 5 scores on the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R, administered in English) are associated with polycyclic aromatic hydrocarbon (PAH) exposure and neighborhood context variables including poverty, low educational attainment, low English language proficiency, and inadequate plumbing. The Columbia Center for Children’s Environmental Health enrolled African-American and Dominican-American New York City women during pregnancy, and conducted follow-up for subsequent childhood health outcomes including cognitive test scores. Individual outcomes were linked to data characterizing 1-km network buffers around prenatal addresses, home observations, interviews, and prenatal PAH exposure data from personal air monitors. Prenatal PAH exposure above the median predicted 3.5 point lower total WPPSI-R scores and 3.9 point lower verbal scores; the association was similar in magnitude across models with adjustments for neighborhood characteristics. Neighborhood-level low English proficiency was independently associated with 2.3 point lower mean total WPPSI-R score, 1.2 point lower verbal score, and 2.7 point lower performance score per standard deviation. Low neighborhood-level educational attainment was also associated with 2.0 point lower performance scores. In models examining effect modification, neighborhood associations were similar or diminished among the high PAH exposure group, as compared with the low PAH exposure group. Early life exposure to personal PAH exposure or selected neighborhood-level social contexts may predict lower cognitive test scores. However, these results may reflect limited geographic exposure variation and limited generalizability. PMID:24994947

  20. Mantoux tests. Performing, interpreting, and acting upon them.

    PubMed Central

    Pennie, R. A.

    1995-01-01

    Interpreting and acting upon results of the tuberculin skin test (Mantoux test) can be confusing because the test is inherently ambiguous and published recommendations for investigation and prevention of tuberculosis are complex. This article presents a step-by-step guide to enable family physicians to make effective use of the Mantoux test. PMID:7780314

  1. An electrophysiological correlate of Eating Attitudes Test scores in female college students.

    PubMed

    Wilson, J F; Mercer, J C

    1990-11-01

    Eating Attitudes Test (EAT) scores of forty female college students were compared to their electrodermal activity (EDA) responses when offered a plate of chocolate chip cookies. A significant positive correlation was detected between the EAT scores and the skin conductivity measures associated with the presentation of food. Women with the highest EAT scores also exhibited the greatest sympathetic nervous system responses to a plate of cookies. This finding supports the conclusion that the EAT is capable of identifying individuals who are preoccupied with food or anxious about eating. PMID:2284404

  2. Use and interpretation of test scores from limited cognitive test batteries: how g + Gc can equal g.

    PubMed

    Annell, Stefan; Sjöberg, Anders; Sverke, Magnus

    2014-10-01

    Single scores from limited and unbalanced test batteries of cognitive ability can be ambiguous to interpret theoretically. In this study, a limited verbally and knowledge-loaded cognitive test battery, from applicants to the Swedish police academies (N = 1,344), was examined to provide foundations for the use and interpretation of test scores. Three measurement models were compared: one single factor model and two bifactor models, which decomposed the variance of the battery into orthogonal components. The models were evaluated by fit indices and omega coefficients, and then applied to the prediction of academic performance. The overall prediction of all models was similar, although specific abilities also were found to provide substantial predictive validity over and above general intelligence (g). The findings provide support for the use of single scores in applied settings (selection), but suggest that it may be more appropriate to interpret such scores as composites of substantive components, and not just as measures of g. PMID:25040205

  3. Students' Attitudes toward Institutional Accountability Testing in Higher Education: Implications for the Validity of Test Scores

    ERIC Educational Resources Information Center

    Zilberberg, Anna

    2013-01-01

    Recent calls for an increase in educational accountability in K-16 resulted in an uptick of low-stakes testing and, consequently, an increased need for ensuring that students' test scores are reliable and valid representations of their true ability. Focusing on accountability testing in higher education, the current program of research was…

  4. Stochastic Processes as True-Score Models for Highly Speeded Mental Tests.

    ERIC Educational Resources Information Center

    Moore, William E.

    The previous theoretical development of the Poisson process as a strong model for the true-score theory of mental tests is discussed, and additional theoretical properties of the model from the standpoint of individual examinees are developed. The paper introduces the Erlang process as a family of test theory models and shows in the context of…

  5. Investigation and Treatment of Missing Item Scores in Test and Questionnaire Data

    ERIC Educational Resources Information Center

    Sijtsma, Klaas; van der Ark, L. Andries

    2003-01-01

    This article first discusses a statistical test for investigating whether or not the pattern of missing scores in a respondent-by-item data matrix is random. Since this is an asymptotic test, we investigate whether it is useful in small but realistic sample sizes. Then, we discuss two known simple imputation methods, person mean (PM) and two-way…

  6. Effects of School Characteristics upon Achievement Test Scores in New York State.

    ERIC Educational Resources Information Center

    Fowler, William J., Jr.

    The effects of school characteristics upon achievement test scores in New York State were studied. Data, composed of the 1975-76 Consolidated Data Base and Finance Tapes for all 705 school districts in the state, were supplied by the New York State Department of Education. Among the 24 variables of interest were: state pupil evaluation tests of…

  7. Detection of Invalid Test Scores: The Usefulness of Simple Nonparametric Statistics

    ERIC Educational Resources Information Center

    Tendeiro, Jorge N.; Meijer, Rob R.

    2014-01-01

    In recent guidelines for fair educational testing it is advised to check the validity of individual test scores through the use of person-fit statistics. For practitioners it is unclear on the basis of the existing literature which statistic to use. An overview of relatively simple existing nonparametric approaches to identify atypical response…

  8. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    ERIC Educational Resources Information Center

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  9. Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

    ERIC Educational Resources Information Center

    Green, Susan

    2016-01-01

    This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…

  10. Pragmatism or Gaming the System? One School District's Solution to Low Test Scores

    ERIC Educational Resources Information Center

    McKenzie, Kathryn Bell

    2009-01-01

    In this era of accountability and high stakes testing, district and school administrators are vigilant in their attention to student test scores and the ramifications these have for district and school performance labels. In other words, no school or district wants to be labeled "low performing." This case, based on a real situation, demonstrates…

  11. An Investigation of the Effectiveness of Vocabulary Learning Strategies on Iranian EFL Learners' Vocabulary Test Score

    ERIC Educational Resources Information Center

    Rahimy, Ramin; Shams, Kiana

    2012-01-01

    This study aims to investigate the effectiveness of vocabulary learning strategies on Iranian EFL learners' vocabulary test score. To achieve this aim, fifty Intermediate level students from Kish English Institute were randomly selected from among fifteen classes after administering the Oxford Placement Test (OPT). Then, an intermediate level…

  12. Beating the Odds: A Low Equalized Assessed Valuation Elementary School with High Standardized Test Scores

    ERIC Educational Resources Information Center

    Levin, Brian

    2011-01-01

    This mixed methods study examines what makes Bluffview Elementary School a success as measured by the ISAT, the mandated state test of Illinois. Despite national reports of achievement gaps and low test scores, Bluffview Elementary has shown sustained success in educating children. This paper reviews how Bluffview Elementary students are achieving…

  13. Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

    ERIC Educational Resources Information Center

    Lalande, John F.; Schweckendiek, Jurgen

    1986-01-01

    Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)

  14. Predicting Teacher Performance with Test Scores and Grade Point Average: A Meta-Analysis

    ERIC Educational Resources Information Center

    D'Agostino, Jerome V.; Powers, Sonya J.

    2009-01-01

    A meta-analysis was conducted to examine the degree to which teachers' test scores and their performance in preparation programs as measured by their collegiate grade point average (GPA) predicted their teaching competence. Results from 123 studies that yielded 715 effect sizes were analyzed, and the mediating effects of test and GPA type,…

  15. A Score Based on Screening Tests to Differentiate Mild Cognitive Impairment from Subjective Memory Complaints

    PubMed Central

    de Gobbi Porto, Fábio Henrique; Spíndola, Lívia; de Oliveira, Maira Okada; Figuerêdo do Vale, Patrícia Helena; Orsini, Marco; Nitrini, Ricardo; Dozzi Brucki, Sonia Maria

    2013-01-01

    It is not easy to differentiate patients with mild cognitive impairment (MCI) from subjective memory complainers (SMC). Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE) and the Brief Cognitive Battery (BCB). We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR), and also a phonemic fluency test of letter P fluency (LPF). A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine area under the curve (AUC), the sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reach statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively). Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29); LDR: 56%, 62% and 0.62 (cut off <3); LPF: 71%, 71% and 0.71 (cut off <14); delayed recall of BCB: 56%, 82% and 0.68 (cut off <9). The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. AUC were 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accurancy. PMID:24147213

  16. Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

    PubMed Central

    Hayes, Taylor R.; Petrov, Alexander A.; Sederberg, Per B.

    2014-01-01

    Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training—now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven’s Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future. PMID:25395695

  17. Concurrent Validity of New Subscale Scores for the Booklet Category Test.

    PubMed

    McNally, Shannon; Dsurney, John; McGovern, Jessica; DeFilippis, Nick; Chan, Leighton

    2016-06-01

    The Booklet Category Test (BCT) is a neuropsychological test of cognitive dysfunction that provides only one overall error score indicative of global impairment. It does not, however, delineate specific domains that might be impaired. The aim of this study is to concurrently validate 13 new BCT subscales using legacy instruments in patients with nonpenetrating traumatic brain injury (TBI). Eighty-nine patients with mild, moderate, and severe TBI completed a battery of neuropsychology tests. Partial correlations controlling for age were performed and there were significant correlations between the a priori selected scores from legacy measures of major cognitive domains and both BCT total errors and subscale scores. Additional analysis showed that several subscales were able to differentiate between performance levels on the legacy measures. Overall, our results showed that the subscales measured cognitive skills beyond global impairment, supporting the use of the BCT subscales in a population with TBI. PMID:26033113

  18. Individual differences in left parietal white matter predict math scores on the Preliminary Scholastic Aptitude Test.

    PubMed

    Matejko, Anna A; Price, Gavin R; Mazzocco, Michèle M M; Ansari, Daniel

    2013-02-01

    Mathematical skills are of critical importance, both academically and in everyday life. Neuroimaging research has primarily focused on the relationship between mathematical skills and functional brain activity. Comparatively few studies have examined which white matter regions support mathematical abilities. The current study uses diffusion tensor imaging (DTI) to test whether individual differences in white matter predict performance on the math subtest of the Preliminary Scholastic Aptitude Test (PSAT). Grades 10 and 11 PSAT scores were obtained from 30 young adults (ages 17-18) with wide-ranging math achievement levels. Tract based spatial statistics was used to examine the correlation between PSAT math scores, fractional anisotropy (FA), radial diffusivity (RD) and axial diffusivity (AD). FA in left parietal white matter was positively correlated with math PSAT scores (specifically in the left superior longitudinal fasciculus, left superior corona radiata, and left corticospinal tract) after controlling for chronological age and same grade PSAT critical reading scores. Furthermore, RD, but not AD, was correlated with PSAT math scores in these white matter microstructures. The negative correlation with RD further suggests that participants with higher PSAT math scores have greater white matter integrity in this region. Individual differences in FA and RD may reflect variability in experience dependent plasticity over the course of learning and development. These results are the first to demonstrate that individual differences in white matter are associated with mathematical abilities on a nationally administered scholastic aptitude measure. PMID:23108272

  19. MSFC/Ball Space-Act Test Results of SBMD

    NASA Technical Reports Server (NTRS)

    Hadaway, James; Brown, Bob; Eng, Ron; Stahl, Phil

    2002-01-01

    The results of two cryo tests of the SBMD that were funded by Ball Aerospace through a Space-Act Agreement with MSFC will be discussed. These tests followed the formal completion of the SBMD program. The PhaseCam interferometer, rather than the Wavescope Shack-Hartmann sensor, was used during these tests.

  20. Tests of executive functioning predict scores on the MacAndrew Alcoholism Scale.

    PubMed

    Deckel, A W

    1999-02-01

    1. Previous work reported that tests of executive functioning (EF) predict the risk of alcoholism in subject populations selected for a "high density" of a family history of alcoholism and/or the presence of sociopathic traits. The current experiment examined the ability of EF tests to predict the risk of alcoholism, as measured by the MacAndrew Alcoholism Scale (MAC), in outpatient subjects referred to a general neuropsychological testing service. 2. Sixty-eight male and female subjects referred for neuropsychological testing were assessed for their past drinking histories and administered the Wisconsin Card Sorting Test, the Wechsler Adult Intelligence Scale-Revised, the Trails (Part B) Test, and the MAC. Principal Components analysis (PCA) reduced the number of EF tests to two measures, including one that loaded on the WCST, and one that loaded on the Similarities, Picture Arrangement, and Trails tests. Multiple hierarchical regression first removed the variance from demographic variables, alcohol consumption, and verbal (i.e., Vocabulary) and non-verbal (i.e., Block Design) IQ, and then entered the executive functioning factors into the prediction of the MAC. 3. Seventy-six percent of the subjects were classified as either light, infrequent, or non-drinkers on the Quantity-Frequency-Variability scale. The factor derived from the WCST on PCA significantly added to the prediction of risk on the MAC (p = .0063), as did scores on Block Design (p = .033). Relatively more impaired scores on the WCST factor and Block Design were predictive of higher scores on the MAC. The other factors were not associated with MAC scores. 4. These results support the hypothesis that decrements in EF are associated with risk factors for alcoholism, even in populations where the density of alcoholic behaviors are not unusually high. When taken in conjunction with other findings, these results implicate EF test scores, and prefrontal brain functioning, in the neurobiology of the risk for

  1. Wisconsin Card Sorting Test scores and clinical and sociodemographic correlates in Schizophrenia: multiple logistic regression analysis

    PubMed Central

    Banno, Masahiro; Koide, Takayoshi; Aleksic, Branko; Okada, Takashi; Kikuchi, Tsutomu; Kohmura, Kunihiro; Adachi, Yasunori; Kawano, Naoko; Iidaka, Tetsuya; Ozaki, Norio

    2012-01-01

    Objectives This study investigated what clinical and sociodemographic factors affected Wisconsin Card Sorting Test (WCST) factor scores of patients with schizophrenia to evaluate parameters or items of the WCST. Design Cross-sectional study. Setting Patients with schizophrenia from three hospitals participated. Participants Participants were recruited from July 2009 to August 2011. 131 Japanese patients with schizophrenia (84 men and 47 women, 43.5±13.8 years (mean±SD)) entered and completed the study. Participants were recruited in the study if they (1) met DSM-IV criteria for schizophrenia; (2) were physically healthy and (3) had no mood disorders, substance abuse, neurodevelopmental disorders, epilepsy or mental retardation. We examined their basic clinical and sociodemographic factors (sex, age, education years, age of onset, duration of illness, chlorpromazine equivalent doses and the positive and negative syndrome scale (PANSS) scores). Primary and secondary outcome measures All patients carried out the WCST Keio version. Five indicators were calculated, including categories achieved (CA), perseverative errors in Milner (PEM) and Nelson (PEN), total errors (TE) and difficulties of maintaining set (DMS). From the principal component analysis, we identified two factors (1 and 2). We assessed the relationship between these factor scores and clinical and sociodemographic factors, using multiple logistic regression analysis. Results Factor 1 was mainly composed of CA, PEM, PEN and TE. Factor 2 was mainly composed of DMS. The factor 1 score was affected by age, education years and the PANSS negative scale score. The factor 2 score was affected by duration of illness. Conclusions Age, education years, PANSS negative scale score and duration of illness affected WCST factor scores in patients with schizophrenia. Using WCST factor scores may reduce the possibility of type I errors due to multiple comparisons. PMID:23135537

  2. Association between students' dental admission test scores and performance on comprehensive clinical exams.

    PubMed

    Allareddy, Veerasathpurush; Howell, T Howard; Karimbux, Nadeem Y

    2012-02-01

    This study examined the association between Dental Admission Test (DAT) scores and the comprehensive exams conducted at Harvard School of Dental Medicine. The authors hypothesized that students who scored high on the DAT would also perform well on the comprehensive examinations. Sixty-six students from the graduating classes of 2005 and 2006 were included. The outcome variable of interest was the final composite grade obtained by the students in the three comprehensive examinations. The main independent variable of interest was the individual component scores on the DAT. Multivariable logistic regression analysis using the maximum likelihood methods was used to examine the association between comprehensive exam grades and DAT scores. Effects of age, gender, and race/ethnicity were adjusted in the regression models. On the first comprehensive examination, seventeen students obtained an Honors grade, while thirteen did so on the second comprehensive examination and fifteen on the third comprehensive examination. None of the DAT component scores were significantly associated with Honors grades on the first comprehensive examination. On the second comprehensive examination, quantitative reasoning scores (OR=2.48, 95 percent CI=1.09-5.68, p=0.03) and total science scores (OR=14.17, 95 percent CI=1.89-106.80, p=0.01) were significantly associated with Honors grades. Reading comprehension score was associated with increased odds of obtaining Honors grade on the third comprehensive examination (OR=1.81, 95 percent CI=1.13-2.92, p=0.01). Students who scored well on the quantitative reasoning, total science, and reading comprehension sections of the DAT had higher odds of receiving an Honors grade on the second and third comprehensive examinations. These factors may be associated with the problem-solving/critical thinking components in the school's PBL curriculum. PMID:22319081

  3. Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study

    PubMed Central

    Strom, Suzanne L.; Anderson, Craig L.; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C. Eric; Langdorf, Mark I.

    2015-01-01

    Introduction Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. Objective To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. Methods We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. Results The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6–14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Conclusion Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation. PMID:26594288

  4. Ability to learn inhaler technique in relation to cognitive scores and tests of praxis in old age

    PubMed Central

    Allen, S; Ragab, S

    2002-01-01

    Clinical observations have shown that some older patients are unable to learn to use a metered dose inhaler (MDI) despite having a normal abbreviated mental test (AMT) score, possibly because of dyspraxia or unrecognised cognitive impairment. Thirty inhaler-naive inpatients (age 76–94) with an AMT score of 8–10 (normal) were studied. Standard MDI training was given and the level of competence reached was scored (inhalation score). A separate observer performed the minimental test (MMT), Barthel index, geriatric depression score (GDS), ideational dyspraxia test (IDT), and ideomotor dyspraxia test (IMD). No correlative or threshold relationship was found between inhalation score and Barthel index, GDS, or IDT. However, a significant correlation was found between inhalation score and IMD (r = 0.45, p = 0.039) and MMT (r = 0.48, p = 0.032) and threshold effects emerged in that no subject with a MMT score of less than 23/30 had an inhalation score of 5/10 or more (adequate technique requires 6/10 or more), and all 17/18 with an inhalation score of 6/10 or more had an IMD of 14/20 or more. The three patients with a MMT >22 and inhalation score <6 had abnormal IMD scores. Inability to learn an adequate inhaler technique in subjects with a normal AMT score appears to be due to unrecognised cognitive impairment or dyspraxia. The MMT is probably a more useful screening test than the AMT score in this context. PMID:11796871

  5. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    ERIC Educational Resources Information Center

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  6. Do Standardized Tests Penalize Deep-Thinking, Creative, or Conscientious Students?: Some Personality Correlates of Graduate Record Examinations Test Scores

    ERIC Educational Resources Information Center

    Powers, Donald E.; Kaufman, James C.

    2004-01-01

    The objective of the study reported here was to explore the relationship of Graduate Record Examinations (GRE) General Test scores to selected personality traits--conscientiousness, rationality, ingenuity, quickness, creativity, and depth. A sample of 342 GRE test takers completed short personality inventory scales for each trait. Analyses…

  7. The Black-White Scoring Gap on SAT II Achievement Tests: Some of the News Is Cheering.

    ERIC Educational Resources Information Center

    Journal of Blacks in Higher Education, 2003

    2003-01-01

    Academically accomplished applicants to the nation's top colleges usually take SAT II Achievement Tests. While scoring gaps between college-bound Blacks and Whites on these tests tend to be smaller than gaps on the basic SAT, a racial scoring gap persists. However, black students appear to be making progress in closing the racial scoring gap on…

  8. Dependence of Gene-by-Environment Interactions (GxE) on Scaling: Comparing the Use of Sum Scores, Transformed Sum Scores and IRT Scores for the Phenotype in Tests of GxE.

    PubMed

    Murray, Aja Louise; Molenaar, Dylan; Johnson, Wendy; Krueger, Robert F

    2016-07-01

    Estimates of gene-environment interactions (GxE) in behavior genetic models depend on how a phenotype is scaled. Inappropriately scaled phenotypes result in biased estimates of GxE and can sometimes even suggest GxE in the direction opposite to its true direction. Previously proposed solutions are mathematically complex, computationally demanding and may prove impractical for the substantive researcher. We, therefore, evaluated two simple-to-use alternatives: (1) straightforward non-linear transformation of sum scores and (2) factor scores from an appropriate item response theory (IRT) model. Within Purcell's (2002) GxM framework, both alternatives provided less biased parameter estimates, and improved false and true positive rates than using a raw sum score. These approaches are, therefore, recommended over using raw sum scores in tests of GxE. Circumstances under which IRT factor scores versus transformed sum scores should be preferred are discussed. PMID:26833465

  9. Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

    PubMed Central

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643

  10. California Standards Test Scores and Attendance Rates in an Afterschool Program

    ERIC Educational Resources Information Center

    Diamond, Sandra M.

    2013-01-01

    The Problem: The purpose of this study was to investigate whether or not there were any statistically significant differences in the Mathematics California Standard Test scores and attendance rates for African American and Latina high school girls who participated in an afterschool program. Method: A quasi-experimental design was conducted with…

  11. Relationship of Friends, Physical Education, and State Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Hollingsworth, Mary Ann

    2010-01-01

    This study examined the relationship between dimensions of wellness and academic performance for 634 third through fifth grade students in Title One schools in rural Mississippi, using composites of the Five Factor Wellness Inventory for Elementary Children and Reading, Language, and Math Scores of the Mississippi Curriculum Test (a state level…

  12. A Generalizability Approach To Evaluating the Reliability of Testlet-Based Test Scores.

    ERIC Educational Resources Information Center

    Lee, Guemin; Frisbie, David A.

    Previous studies have indicated that the reliability of test scores composed of testlets might be overestimated by conventional item-based reliability estimation methods (R. Thorndike, 1953; A. Anastasi, 1988; S. Sireci, D. Thissen, and H. Wainer, 1991; H. Wainer and D. Thissen, 1996). This study used generalizability theory to investigate the…

  13. Text of Dr. Kamin's Presentation Denying That Proof Exists That IQ Test Scores Are Hereditary

    ERIC Educational Resources Information Center

    Kamin, Leon

    1973-01-01

    A presentation of findings of the author's extensive research into the original studies on which some American social scientists have based writings which at least question whether environment has any effect on IQ test scores--suggesting that heredity may be the determinant. (Author/JM)

  14. The Effect of Sex-Dependent Norms on Aggregated Reading and Mathematics Test Scores.

    ERIC Educational Resources Information Center

    Gramenz, Gary W.; And Others

    The study reported in this paper examined whether a disproportionate number of boys and girls in a given grade level at various schools might have an inequitable effect on the mean achievement test scores of those schools. The student population of the Palm Beach County Schools, Florida, reflecting a cross-section of socio-economic backgrounds,…

  15. Development and Validation of Scores from an Instrument Measuring Student Test-Taking Motivation

    ERIC Educational Resources Information Center

    Eklof, Hanna

    2006-01-01

    Using the expectancy-value model of achievement motivation as a basis, this study's purpose is to develop, apply, and validate scores from a self-report instrument measuring student test-taking motivation. Sampled evidence of construct validity for the present sample indicates that a number of the items in the instrument could be used as an…

  16. The Fight's Not Always Fixed: Using Literary Response to Transcend Standardized Test Scores

    ERIC Educational Resources Information Center

    Avila, JuliAnna

    2012-01-01

    In 2004, the National Endowment for the Arts (NEA) concluded that "literature reading is fading as a meaningful activity, especially among younger people." How can educators continue to teach students about the power of literary response when the priority is for them to achieve proficiency on standardized tests, whose scores can only be narrowly…

  17. The Relationship of Motivational Values of Math and Reading Teachers to Student Test Score Gains

    ERIC Educational Resources Information Center

    Loewen, David Allen

    2013-01-01

    This exploratory correlational study seeks to answer the question of whether a relationship exists between student average test score gains on state exams and teachers' rating of values on the Schwartz Values Survey. Eighty-seven randomly selected Kansas teachers of math and/or reading, grades four through eight, participated. Student test…

  18. Score Change When Retaking the SAT® I: Reasoning Test. Research Notes RN-05

    ERIC Educational Resources Information Center

    Nathan, Julie S.; Camara, Wayne J.

    1998-01-01

    Each year over one million high school students complete the SAT[superscript R] I. Information on repeat testing and the probability that a student's SAT I score will change may help students and educators make better-informed decisions about retesting. This information would also be useful for high school counselors who advise students regarding…

  19. Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

    ERIC Educational Resources Information Center

    McEnroe, James D.

    2010-01-01

    The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…

  20. Florida Defeats the Skeptics: Test Scores Show Genuine Progress in the Sunshine State

    ERIC Educational Resources Information Center

    Winters, Marcus

    2012-01-01

    Among the 50 states, Florida's gains on the National Assessment of Educational Progress (NAEP) between 1992 and 2011 ranked second only to Maryland's. Florida's progress has been particularly impressive in the early grades. In 1998, Florida scored about one grade level below the national average on the 4th-grade NAEP reading test, but it was…

  1. Assessing the Discriminating Power of Item and Test Scores in the Linear Factor-Analysis Model

    ERIC Educational Resources Information Center

    Ferrando, Pere J.

    2012-01-01

    Model-based attempts to rigorously study the broad and imprecise concept of "discriminating power" are scarce, and generally limited to nonlinear models for binary responses. This paper proposes a comprehensive framework for assessing the discriminating power of item and test scores which are analyzed or obtained using Spearman's factor-analytic…

  2. A Confirmatory Factor Analysis of Cross Racial Identity Scale (CRIS) Scores: Testing the Expanded Nigrescence Model

    ERIC Educational Resources Information Center

    Worrell, Frank C.; Watson, Stevie

    2008-01-01

    In this study, the authors tested the viability of the expanded nigrescence (NT-E) model as operationalized by Cross Racial Identity Scale (CRIS) scores using confirmatory factor analyses. Participants were 594 Black college students from the Southeastern United States. Results indicated a good fit for NT-E's proposed six-factor structure.…

  3. States Eyeing Expense of Hand-Scored Tests in Light of NCLB Rules

    ERIC Educational Resources Information Center

    Archer, Jeff

    2005-01-01

    When students put down their pencils at the end of Connecticut's testing each year, another intensive process begins. Hundreds of trained evaluators work day and night for about a month to score the written responses. Although expensive, the use of open-ended questions drives the kind of instruction that state leaders say they want in their…

  4. Factors affecting milk ELISA scores of cows tested for Johne’s disease

    Technology Transfer Automated Retrieval System (TEKTRAN)

    Infection with Mycobacterium avium subsp. paratuberculosis (Johne’s disease) has been estimated to cost dairy producers over $1.5 billion per year. The objective of this study was to examine the influence a number of environmental and genetic factors have on ELISA milk test scores for Johne’s diseas...

  5. Creating a System of Accountability: The Impact of Instructional Assessment on Elementary Children's Achievement Test Scores.

    ERIC Educational Resources Information Center

    Meisels, Samuel J.; Atkins-Burnett, Sally; Xue, Yange; Bickel, Donna DiPrima; Son, Seung-Hee; Nicholson, Julie

    2003-01-01

    Examined the trajectory of change in scores on the Iowa Tests of Basic Skills of low-income, urban third and fourth graders enrolled in classrooms in which the Work Sampling System (WSS) had been used at least 3 years. Results for 2,564 students show academic gains associated with the WSS. (SLD)

  6. Test Score Gaps between Private and Government Sector Students at School Entry Age in India

    ERIC Educational Resources Information Center

    Singh, Abhijeet

    2014-01-01

    Various studies have noted that students enrolled in private schools in India perform better on average than students in government schools. In this paper, I show that large gaps in the test scores of children in private and public sector education are evident even at the point of initial enrollment in formal schooling and are associated with…

  7. The Effect of Four Intervention Programs on Standardized Test Scores by Gender

    ERIC Educational Resources Information Center

    Cryder, Rebecca E.

    2012-01-01

    This quantitative correlational study involved the analysis, by gender, of the effect of four intervention programs at an Arizona middle school as seen on Arizona's Instrument to Measure Standards (AIMS) test scores. These four intervention programs included: Advancement Via Individual Determination (AVID), a planner stamping system, a World…

  8. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

    ERIC Educational Resources Information Center

    Almond, Russell G.

    2014-01-01

    Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…

  9. Segregation and the Black-White Test Score Gap. NBER Working Paper No. 12988

    ERIC Educational Resources Information Center

    Vigdor, Jacob; Ludwig, Jens

    2007-01-01

    The mid-1980s witnessed breaks in two important trends related to race and schooling. School segregation, which had been declining, began a period of relative stasis. Black-white test score gaps, which had also been declining, also stagnated. The notion that these two phenomena may be related is also supported by basic cross-sectional evidence. We…

  10. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    ERIC Educational Resources Information Center

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take corrective action as soon as…

  11. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    ERIC Educational Resources Information Center

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  12. Estimated Effect of the Teacher Advancement Program on Student Test Score Gains

    ERIC Educational Resources Information Center

    Springer, Matthew G.; Ballou, Dale; Peng, Art

    2014-01-01

    This article presents findings from the first independent, third-party appraisal of the impact of the Teacher Advancement Program (TAP) on student test score gains in mathematics. TAP is a comprehensive school reform model designed to attract highly effective teachers, improve instructional effectiveness, and elevate student achievement. We use a…

  13. The Predictive Validity of Scholastic Aptitude Test Scores For Minority College Students.

    ERIC Educational Resources Information Center

    Hand, Carol A.; Prather, James E.

    This paper investigated the predictive validity of the Scholastic Aptitude Test (SAT) for members of different gender and minority status groups. The following data were obtainted on 45,067 undergraduates enrolled in 31 different institutions in a state college system: SAT Verbal (SAT-V) and SAT Mathematics (SAT-M) scores; high school average…

  14. Establishing Cut Scores on the New Jersey College Basic Skills Placement Test.

    ERIC Educational Resources Information Center

    New Jersey State Dept. of Higher Education, Trenton. New Jersey Basic Skills Council.

    This paper describes the meeting of the Reading and Writing Advisory Committee of the New Jersey Basic Skills Council with selected high school English teachers and administrators. This panel established cut scores for statewide interpretation of the New Jersey College Basic Skills Placement Test (NJCBSPT). The NJCBSPT is administered to all…

  15. Impact of School Reform on Dropout Rates and Test Scores in an Urban High School

    ERIC Educational Resources Information Center

    Bartlett, Lydia; Kupczynski, Lori; Holland, Glenda

    2011-01-01

    The primary focus of the study was to explore the effect of School Within a School reform on two components in the Adequate Yearly Progress Report by which Texas schools are measured--dropout rates and scores on the Texas Assessment of Knowledge and Skills test. The demographics of the particular study sample included 11th grade students…

  16. Comparing State and District Test Results to National Norms: Interpretations of Scoring "Above the National Average."

    ERIC Educational Resources Information Center

    Linn, Robert L.; And Others

    Norm-referenced test results reported by states and school districts and factors related to those scores were studied through mail and telephone surveys of 35 states and a nationally representative sample of 153 school districts to determine the degree to which "above average" results were being reported. Part of the stimulus for this study came…

  17. Raise Test Scores without Selling Your Soul: An Interview with Scott Mandel

    ERIC Educational Resources Information Center

    Curriculum Review, 2006

    2006-01-01

    With his 10th book, Improving Test Scores: A Practical Approach for Teachers and Administrators, Scott Mandel outlines steps educators can take to boost achievement on standardized exams while maintaining the integrity of their day-to-day teaching. Mandel, who holds a Ph.D. in curriculum and instruction from USC, teaches history and English at…

  18. Detecting Dissimulation in Personality Test Scores: A Comparison between Person-Fit Indices and Detection Scales.

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Chico, Eliseo

    2001-01-01

    Examined whether a procedure based on item response theory (IRT) for assessing the scalability of response patterns could detect deliberate dissimulation (faking good) on scores from three tests of the Eysenck Personality Questionnaire Revised. Results for 489 and 140 undergraduates show that IRT measures were not powerful enough to detect…

  19. The Relationship between Computer Use and Standardized Test Scores: Does Gender Play a Role?

    ERIC Educational Resources Information Center

    Kay, Rachel E.

    2010-01-01

    Over the past few decades, and especially in the past ten years, computer use in schools has increased dramatically; however there has been little research examining the effects of technology use on student achievement, specifically defined by standardized test scores. There is also concern as to how technology use differs by gender and if that…

  20. End of Course Grades and Standardized Test Scores: Are Grades Predictive of Student Achievement?

    ERIC Educational Resources Information Center

    Ricketts, Christine R.

    2010-01-01

    This study examined the extent to which end-of-course grades are predictive of Virginia Standards of Learning test scores in nine high school content areas. It also analyzed the impact of the variables school cluster attended, gender, ethnicity, disability status, Limited English Proficiency status, and socioeconomic status on the relationship…

  1. Supplemental Educational Services and Student Test Score Gains: Evidence from a Large, Urban School District

    ERIC Educational Resources Information Center

    Springer, Matthew G.; Pepper, Matthew J.; Ghosh-Dastidar, Bonnie

    2014-01-01

    This study examines the effect of supplemental education services (SES) on student test score gains and whether particular subgroups of students benefit more from NCLB tutoring services. Our sample includes information on students enrolled in third through eighth grades nested in 121 elementary and middle schools over a five-year period comprising…

  2. Defending the Quality of Links between Scores from Different Tests and Exams

    ERIC Educational Resources Information Center

    Cresswell, Mike

    2010-01-01

    Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance." His…

  3. Fitting the Normal-Ogive Factor Analytic Model to Scores on Tests.

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Lorenzo-Seva, Urbano

    2001-01-01

    Describes how the nonlinear factor analytic approach of R. McDonald to the normal ogive curve can be used to factor analyze test scores. Discusses the conditions in which this model is more appropriate than the linear model and illustrates the applicability of both models using an empirical example based on data from 1,769 adolescents who took the…

  4. Permanent Income and the Black-White Test Score Gap. NBER Working Paper No. 17610

    ERIC Educational Resources Information Center

    Rothstein, Jesse; Wozny, Nathan

    2011-01-01

    Analysts often examine the black-white test score gap conditional on family income. Typically only a current income measure is available. We argue that the gap conditional on permanent income is of greater interest, and we describe a method for identifying this gap using an auxiliary data set to estimate the relationship between current and…

  5. Intelligence Test Scores and Birth Order among Young Norwegian Men (Conscripts) Analyzed within and between Families

    ERIC Educational Resources Information Center

    Bjerkedal, Tor; Kristensen, Petter; Skjeret, Geir A.; Brevik, John I.

    2007-01-01

    The present paper reports the results of a within and between family analysis of the relation between birth order and intelligence. The material comprises more than a quarter of a million test scores for intellectual performance of Norwegian male conscripts recorded during 1984-2004. Conscripts, mostly 18-19 years of age, were born to women for…

  6. Estimating Teacher Effectiveness from Two-Year Changes in Students' Test Scores

    ERIC Educational Resources Information Center

    Leigh, Andrew

    2010-01-01

    Using a dataset covering over 10,000 Australian school teachers and over 90,000 pupils, I estimate how effective teachers are in raising students' test scores. Since the exams are biennial, it is necessary to take account of the teacher's work in the intervening year. Even adjusting for measurement error, the teacher fixed effects are widely…

  7. A Model for Incorporating Response-Time Data in Scoring Achievement Tests. Research Report No. 3.

    ERIC Educational Resources Information Center

    Tatsuoka, Kikumi; Tatsuoka, Maurice

    The differences in types of information-processing skills developed by different instructional backgrounds affect, negatively or positively, the learning of further advanced instructional materials. If prior and subsequent instructional methods are different, a proactive inhibition effect produces low achievement scores on a post test. This poses…

  8. Relationships between Inquiry-Based Teaching and Physical Science Standardized Test Scores

    ERIC Educational Resources Information Center

    Tretter, Thomas R.; Jones, M. Gail

    2003-01-01

    This exploratory case study investigates relationships between use of an inquiry-based instructional style and student scores on standardized multiple-choice tests. The study takes the form of a case study of physical science classes taught by one of the authors over a span of four school years. The first 2 years were taught using traditional…

  9. A Comparative Analysis of Career Development Test Scores of Eighth Grade Boys and Girls.

    ERIC Educational Resources Information Center

    McTeer, J. Hugh; And Others

    Eighth grade students' knowledge of the principles involved in career choice was investigated to determine whether sex stereotyped attitudes about jobs were present. Scores from the career development section of the Georgia Criterion-Referenced Tests were analyzed for 734 (379 females; 355 males) eighth grade students. The career development…

  10. Changes in Student Populations and Average Test Scores of Dutch Primary Schools

    ERIC Educational Resources Information Center

    Luyten, Hans; de Wolf, Inge

    2011-01-01

    This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…

  11. Selected Demographic Variables, School Music Participation, and Achievement Test Scores of Urban Middle School Students

    ERIC Educational Resources Information Center

    Kinney, Daryl W.

    2008-01-01

    Nontransient 6th- and 8th-grade urban middle school students' achievement test scores were examined before (4th grade) and during (6th or 8th grade) enrollment in a performing ensemble. Ensemble participation (band, choir, none) and subject variables of socioeconomic status (SES) and home environment were considered. Fourth- and 6th-grade…

  12. An NCME Instructional Module on Quality Control Procedures in the Scoring, Equating, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2007-01-01

    There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…

  13. Survival analysis of cancer patients with multiple endpoints using global score test methodology

    NASA Astrophysics Data System (ADS)

    Zain, Zakiyah; Whitehead, John

    2014-06-01

    Progression-free survival (PFS), time-to-progression (TTP) and overall survival (OS) are examples of multiple endpoints commonly used in clinical trials of cancer patients. PFS is increasingly used as a primary endpoint in evaluation of patients with solid tumors, while multiple endpoints are often analysed independently. These endpoints are indeed correlated and it is desirable to evaluate effectiveness of treatments by means of a single parameter. In this paper, a single overall treatment effect is provided by combining the univariate score statistics for comparing treatments with respect to each survival endpoint. This global score test methodology was applied in analysis of 330 patients with an aggressive cancer, each with two endpoints recorded, T1 and T2, relating to disease progression and death respectively. The values of score statistics obtained from the proposed method matched closely those from the logrank test. Meanwhile, the correlations between the two score test statistics were found to be similar to those computed using the established Wei, Lin and Weissfeld method. Simulations further confirmed the consistent performance of this new method in analysis of bivariate survival data.

  14. The comparison question polygraph test: a contrast of methods and scoring.

    PubMed

    Honts, Charles R; Reavy, Racheal

    2015-05-01

    We conducted a mock crime experiment with 250 paid participants (126 females, Mdn age = 30 years) contrasting the validity of the probable-lie and the directed-lie variants of the comparison question test (CQT) for the detection of deception. Subjects were assigned at random to one of eight conditions in a Guilt (Guilty/Innocent) × Test Type (Probable-Lie/Directed-Lie) × Stimulation (Between Repetition Stimulation/No Stimulation) factorial design. The data were scored by an experienced polygraph examiner who was unaware of subject assignment to conditions and with a computer algorithm known as the Objective Scoring System Version 2 (OSS2). There were substantial main effects of guilt in both the OSS2 computer scores F(1, 241) = 143.82, p < .001, η(p)(2) = 0.371, and in the human scoring, F(1, 242) = 98.92, p<.001, η(p)(2) = .29. There were no differences between the test types in the number of spontaneous countermeasure attempts made against them. Although under the controlled conditions of an experiment the probable-lie and the directed-lie variants of the CQT produced equivocal results in terms of detection accuracy, the directed-lie variant has much to recommend it as it is inherently more standardized in its administration and construction. PMID:25703188

  15. Agreement in the Scoring of Respiratory Events Among International Sleep Centers for Home Sleep Testing

    PubMed Central

    Magalang, Ulysses J.; Arnardottir, Erna S.; Chen, Ning-Hung; Cistulli, Peter A.; Gíslason, Thorarinn; Lim, Diane; Penzel, Thomas; Schwab, Richard; Tufik, Sergio; Pack, Allan I.

    2016-01-01

    Study Objectives: Home sleep testing (HST) is used worldwide to confirm the presence of obstructive sleep apnea (OSA). We sought to determine the agreement of HST scoring among international sleep centers. Methods: Fifteen HSTs, previously recorded using a type 3 monitor, were deidentified and saved in European Data Format. The studies were scored by nine technologists from the sleep centers of the Sleep Apnea Global Interdisciplinary Consortium (SAGIC) using the locally available software. Each study was scored separately using one of three different airflow signals: nasal pressure (NP), transformed (square root) nasal pressure signal (transformed NP), and uncalibrated respiratory inductive plethysmography (RIP) flow. Only one of the three airflow signals was visible to the scorer at each scoring session. The scoring procedure was repeated to determine the intrarater reliability. Results: The intraclass correlation coefficients (ICCs) using the NP were: apnea-hypopnea index (AHI) = 0.96 (95% confidence interval [CI]: 0.93–0.99); apnea index = 0.91 (0.83–0.96); and hypopnea index = 0.75 (0.59–0.89). The ICCs using the transformed NP were: AHI = 0.98 (0.96–0.99); apnea index = 0.95 (0.90–0.98); and hypopnea index = 0.90 (0.82–0.96). The ICCs using the RIP flow were: AH I = 0.98 (0.96–0.99); apnea index = 0.66 (0.48–0.84); and hypopnea index = 0.78 (0.63–0.90). The mean difference of first and second scoring sessions of the same respiratory variables ranged from −1.02 to 0.75/h. Conclusion: There is a strong agreement in the scoring of the respiratory events for HST among international sleep centers. Our results suggest that centralized scoring of HSTs may not be necessary in future research collaboration among international sites. Commentary: A commentary on this article appears in this issue on page 7. Citation: Magalang UJ, Arnardottir ES, Chen NH, Cistulli PA, Gíslason T, Lim D, Penzel T, Schwab R, Tufik S, Pack AI, SAGIC Investigators

  16. Mental health matters in elementary school: first-grade screening predicts fourth grade achievement test scores.

    PubMed

    Guzman, Maria Paz; Jellinek, Michael; George, Myriam; Hartley, Marcela; Squicciarini, Ana Maria; Canenguez, Katia M; Kuhlthau, Karen A; Yucel, Recai; White, Gwyne W; Guzman, Javier; Murphy, J Michael

    2011-08-01

    The objective of the study was to evaluate whether mental health problems identified through screens administered in first grade are related to poorer academic achievement test scores in the fourth grade. The government of Chile uses brief teacher- and parent-completed measures [Teacher Observation of Classroom Adaptation-Revised (TOCA-RR) and Pediatric Symptom Checklist (PSC-Cl)] to screen for mental health problems in about one-fifth of the country's elementary schools. In fourth grade, students take the national achievement tests (SIMCE) of language, mathematics and science. This study examined whether mental health problems identified through either or both screens predicted achievement test scores after controlling for student and family risk factors. A total of 17,252 students had complete first grade teacher forms and these were matched with fourth grade SIMCE data for 11,185 students, 7,903 of whom also had complete parent form data from the first grade. Students at risk on either the TOCA-RR or the PSC-Cl or both performed significantly worse on all SIMCE subtests. Even after controlling for covariates and adjusting for missing data, students with mental health problems on one screen in first grade had fourth grade achievement scores that were 14-18 points (~1/3 SD) lower than students screened as not at risk. Students at risk on both screens had scores that were on average 33 points lower than students at risk on either screen. Mental health problems in first grade were one of the strongest predictors of lower achievement test scores 3 years later, supporting the premise that for children mental health matters in the real world. PMID:21647553

  17. Pose prediction and virtual screening performance of GOLD scoring functions in a standardized test.

    PubMed

    Liebeschuetz, John W; Cole, Jason C; Korb, Oliver

    2012-06-01

    The performance of all four GOLD scoring functions has been evaluated for pose prediction and virtual screening under the standardized conditions of the comparative docking and scoring experiment reported in this Edition. Excellent pose prediction and good virtual screening performance was demonstrated using unmodified protein models and default parameter settings. The best performing scoring function for both pose prediction and virtual screening was demonstrated to be the recently introduced scoring function ChemPLP. We conclude that existing docking programs already perform close to optimally in the cognate pose prediction experiments currently carried out and that more stringent pose prediction tests should be used in the future. These should employ cross-docking sets. Evaluation of virtual screening performance remains problematic and much remains to be done to improve the usefulness of publically available active and decoy sets for virtual screening. Finally we suggest that, for certain target/scoring function combinations, good enrichment may sometimes be a consequence of 2D property recognition rather than a modelling of the correct 3D interactions. PMID:22371207

  18. Correcting Two-Sample "z" and "t" Tests for Correlation: An Alternative to One-Sample Tests on Difference Scores

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.

    2012-01-01

    In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student "t" test on difference scores. That procedure entails some loss of power, because it employs N - 1 degrees of freedom instead of the 2N - 2 degrees of freedom of the…

  19. Changes in scores on the Mental Rotations Test during the menstrual cycle.

    PubMed

    Moody, M S

    1997-06-01

    The purpose of the study was to examine changes in performance on Vandenberg's Mental Rotations Test during the menstrual cycles of college women. Participants were 12 male and 34 female students recruited from undergraduate educational psychology and nursing classes at a large southeastern university. Each woman was tested once during the menstrual phase and once during the luteal phase of her menstrual cycle. Phases in which the testings occurred were counterbalanced. Men were also tested twice. For all participants, the two testing sessions were held exactly 14 days apart. Women who were contraceptive pill users did not perform significantly differently during either phase from women who were nonusers, and there was no interaction for pill use by phase. Therefore, users and nonusers were combined for a paired-sample t test which indicated that women scored significantly higher during the menstrual phase (Days 2-7) than during the luteal phase (Days 16-22 for 31 women and Days 24-26 for three women with longer cycles). The 12 men scored significantly higher than the 34 women during the initial testing; but not significantly higher than the 17 women who were in the menstrual phase during the first testing. Therefore, that the effect of the phase of menstrual cycle influences the sex difference in performance on the Mental Rotations Test was supported. PMID:9172209

  20. A Comparison of Three Scoring Methods for Tests with Selected-Response and Constructed-Response Items

    ERIC Educational Resources Information Center

    Schaeffer, Gary A.; Henderson-Montero, Diane; Julian, Marc; Bene, Nancy H.

    2002-01-01

    A number of methods for scoring tests with selected-response (SR) and constructed-response (CR) items are available. The selection of a method depends on the requirements of the program, the particular psychometric model and assumptions employed in the analysis of item and score data, and how scores are to be used. This article compares 3 methods:…

  1. The Effect of Luck and Misinformation on the Discrepancy between Multiple-Choice Test Scores and True Ability.

    ERIC Educational Resources Information Center

    Lowry, Stephen R.

    The effects of luck and misinformation on ability of multiple-choice test scores to estimate examinee ability were investigated. Two measures of examinee ability were defined. Misinformation was shown to have little effect on ability of raw scores and a substantial effect on ability of corrected-for-guessing scores to estimate examinee ability.…

  2. High-Stakes Testing and Student Achievement: Problems for the No Child Left Behind Act. Executive Summary

    ERIC Educational Resources Information Center

    Nichols, Sharon L.; Glass, Gene V.; Berliner, David C.

    2005-01-01

    Under the federal No Child Left Behind Act of 2001 (NCLB), standardized test scores are the indicator used to hold schools and school districts accountable for student achievement. Each state is responsible for constructing an accountability system, attaching consequences--or stakes--for student performance. The theory of action implied by this…

  3. The effect of instructional methodology on high school students natural sciences standardized tests scores

    NASA Astrophysics Data System (ADS)

    Powell, P. E.

    Educators have recently come to consider inquiry based instruction as a more effective method of instruction than didactic instruction. Experience based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness on preparing students to perform well on standardized tests. The purpose of the study to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi experimental quantitative study was comprised of two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.

  4. Effect of Mindfulness Meditation on Perceived Stress Scores and Autonomic Function Tests of Pregnant Indian Women

    PubMed Central

    Jain, Reena; Kohli, Sangeeta; Batra, Swaraj

    2016-01-01

    Introduction Various pregnancy complications like hypertension, preeclampsia have been strongly correlated with maternal stress. One of the connecting links between pregnancy complications and maternal stress is mind-body intervention which can be part of Complementary and Alternative Medicine (CAM). Biologic measures of stress during pregnancy may get reduced by such interventions. Aim To evaluate the effect of Mindfulness meditation on perceived stress scores and autonomic function tests of pregnant Indian women. Materials and Methods Pregnant Indian women of 12 weeks gestation were randomised to two treatment groups: Test group with Mindfulness meditation and control group with their usual obstetric care. The effect of Mindfulness meditation on perceived stress scores and cardiac sympathetic functions and parasympathetic functions (Heart rate variation with respiration, lying to standing ratio, standing to lying ratio and respiratory rate) were evaluated on pregnant Indian women. Results There was a significant decrease in perceived stress scores, a significant decrease of blood pressure response to cold pressor test and a significant increase in heart rate variability in the test group (p< 0.05, significant) which indicates that mindfulness meditation is a powerful modulator of the sympathetic nervous system and can thereby reduce the day-to-day perceived stress in pregnant women. Conclusion The results of this study suggest that mindfulness meditation improves parasympathetic functions in pregnant women and is a powerful modulator of the sympathetic nervous system during pregnancy. PMID:27190795

  5. ACT and College Success

    ERIC Educational Resources Information Center

    Bleyaert, Barbara

    2010-01-01

    What is the relationship between ACT scores and success in college? For decades, admissions policies in colleges and universities across the country have required applicants to submit scores from a college entrance exam, most typically the ACT (American College Testing) or SAT (Scholastic Aptitude Test). This requirement suggests that high school…

  6. The Relationship between Academic Averages of Primary School Science and Technology Class and Test Sub-Test Scores of Placement Test of Science

    ERIC Educational Resources Information Center

    Guzeller, Cem Oktay

    2012-01-01

    In this research, the relationship between written exam scores of science and technology class of 6th, 7th, and 8th grades, project, participation in class activities and performance work, year-end academic success point averages and sub-test raw scores of LDT science of 6th, 7th and 8th grades. Academic success point averages were used as…

  7. Gender-Related Differences in Academically Talented Students' Scores and Use of Time on Tests of Spatial Ability.

    ERIC Educational Resources Information Center

    Stumpf, Heinrich

    1998-01-01

    A study of 1,283 academically talented junior high students found that males had higher scores on three of the four subtests of the Spatial Test Battery of the Institute for the Academic Advancement of Youth. Females scored higher on the visual memory test and spent more time on the tests. (Author/CR)

  8. Examining the Validity of GED[R] Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

    ERIC Educational Resources Information Center

    George-Ezzelle, Carol E.; Skaggs, Gary

    2004-01-01

    Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…

  9. Adults with poor reading skills: How lexical knowledge interacts with scores on standardized reading comprehension tests.

    PubMed

    McKoon, Gail; Ratcliff, Roger

    2016-01-01

    Millions of adults in the United States lack the necessary literacy skills for most living wage jobs. For students from adult learning classes, we used a lexical decision task to measure their knowledge of words and we used a decision-making model (Ratcliff's, 1978, diffusion model) to abstract the mechanisms underlying their performance from their RTs and accuracy. We also collected scores for each participant on standardized IQ tests and standardized reading tests used commonly in the education literature. We found significant correlations between the model's estimates of the strengths with which words are represented in memory and scores for some of the standardized tests but not others. The findings point to the feasibility and utility of combining a test of word knowledge, lexical decision, that is well-established in psycholinguistic research, a decision-making model that supplies information about underlying mechanisms, and standardized tests. The goal for future research is to use this combination of approaches to understand better how basic processes relate to standardized tests with the eventual aim of understanding what these tests are measuring and what the specific difficulties are for individual, low-literacy adults. PMID:26550803

  10. The effects of calculator-based laboratories on standardized test scores

    NASA Astrophysics Data System (ADS)

    Stevens, Charlotte Bethany Rains

    Nationwide, the goal of providing a productive science and math education to our youth in today's educational institutions is centering itself around the technology being utilized in these classrooms. In this age of digital technology, educational software and calculator-based laboratories (CBL) have become significant devices in the teaching of science and math for many states across the United States. Among the technology, the Texas Instruments graphing calculator and Vernier Labpro interface, are among some of the calculator-based laboratories becoming increasingly popular among middle and high school science and math teachers in many school districts across this country. In Tennessee, however, it is reported that this type of technology is not regularly utilized at the student level in most high school science classrooms, especially in the area of Physical Science (Vernier, 2006). This research explored the effect of calculator based laboratory instruction on standardized test scores. The purpose of this study was to determine the effect of traditional teaching methods versus graphing calculator teaching methods on the state mandated End-of-Course (EOC) Physical Science exam based on ability, gender, and ethnicity. The sample included 187 total tenth and eleventh grade physical science students, 101 of which belonged to a control group and 87 of which belonged to the experimental group. Physical Science End-of-Course scores obtained from the Tennessee Department of Education during the spring of 2005 and the spring of 2006 were used to examine the hypotheses. The findings of this research study suggested the type of teaching method, traditional or calculator based, did not have an effect on standardized test scores. However, the students' ability level, as demonstrated on the End-of-Course test, had a significant effect on End-of-Course test scores. This study focused on a limited population of high school physical science students in the middle Tennessee

  11. Are patient questionnaire scores as "scientific" as laboratory tests for rheumatology clinical care?

    PubMed

    Pincus, Theodore

    2010-01-01

    Modern medical care is based largely on laboratory advances, such as microbiological cultures giving rise to antibiotics and hemoglobin A1c leading to "tight control" of diabetes, among many others. Development of a "gold standard" laboratory test has appeared attractive for care of patients with rheumatoid arthritis (RA) since rheumatoid factor was identified in the 1940s. Indeed, rheumatoid factor, anti-cyclic citrullinated peptide (anti-CCP) antibodies, erythrocyte sedimentation rate (ESR), and C-reactive protein (CRP) are abnormal in most RA patients. However, each of these tests is normal in at least 30% of patients, and no laboratory test (or any other measure) can serve as a single "gold standard" measure for all individual RA patients. A new approach to quantitative assessment in rheumatic diseases involves patient self-report questionnaires as standardized, quantitative, cost-effective "scientific" data from a medical history, the primary source of RA management decisions. Patient questionnaires distinguish active from control treatments in RA clinical trials at levels similar to laboratory tests (or formal joint counts), and are far more significant in the prognosis of work disability, costs, and premature death than laboratory tests or radiographic scores. RAPID3 (routine assessment of patient index data) on an MDHAQ (multidimensional health assessment questionnaire) requires 5 seconds to score, compared to 114 seconds for a DAS28 (disease activity score). Patient questionnaires do not replace further medical history, physical examination, laboratory or other tests, and require physician interpretation for patient management, as do laboratory tests and all quantitative data. Advances in therapy require laboratory science, but patient questionnaires provide optimal "scientific" data for clinical care. PMID:20632989

  12. Physiologic Dysfunction Scores and Cognitive Function Test Performance in United States Adults

    PubMed Central

    Kobrosly, Roni W; Seplaki, Christopher L; Jones, Courtney M; van Wijngaarden, Edwin

    2013-01-01

    Objective To investigate the relationship between a measure of cumulative physiologic dysfunction and specific domains of cognitive function. Methods We examined a summary score measuring physiological dysfunction, a multisystem measure of the body’s ability to effectively adapt to physical and psychological demands, in relation to cognitive function deficits in a population of 4511 adults aged 20 to 59 who participated in the third National Health and Nutrition Examination Survey (1988–1994). Measures of cognitive function comprised three domains: working memory, visuomotor speed, and perceptual-motor speed. ‘Physiologic dysfunction’ scores summarizing measures of cardiovascular, immunologic, kidney, and liver function were explored. We used multiple linear regression models to estimate associations between cognitive function measures and physiological dysfunction scores, adjusting for socioeconomic factors, test conditions, and self-reported health factors. Results We noted a dose-response relationship between physiologic dysfunction and working memory (coefficient = 0.207, 95% CI = (0.066, 0.348), p < 0.0001) that persisted after adjustment for all covariates (p = 0.03). We did not observe any significant relationships between dysfunction scores and visuomotor (p = 0.37) or perceptual-motor ability (p = 0.33). Conclusions Our findings suggest that multisystem physiologic dysfunction is associated with working memory. Future longitudinal studies are needed to clarify the underlying mechanisms and explore the persistency of this association into later life. We suggest that such studies should incorporate physiologic data, neuroendocrine parameters, and a wide range of specific cognitive domains. PMID:22155941

  13. The FCE Speaking Test: Using rater Reports To Help Interpret Test Scores.

    ERIC Educational Resources Information Center

    Orr, Michael

    2002-01-01

    Reports research using oral examiner verbal reports that attempts to gain insights into the rating process in the Cambridge First Certificate in English Speaking Test. Raters' verbal reports of the decision-making process were analyzed and heeded aspects of the test performances were identified, with a view to better understanding how test scores…

  14. Scores on the Safe Functional Motion Test Are Associated with Prevalent Fractures and Fall History

    PubMed Central

    Recknor, Chris P.; Grant, Stephanie L.; Recknor, Julie C.

    2013-01-01

    ABSTRACT Purpose: The Safe Functional Motion test (SFM) was developed to measure observed body mechanics and functional motion associated with spine load, balance, strength, and flexibility during everyday tasks to profile modifiable risks for osteoporotic fracture. This cross-sectional study evaluated the associations between SFM score and history of vertebral compression fracture (VCF), hip fracture, and injurious falls, all established predictors of future risk. Method: An osteoporosis clinic database was queried for adults with an initial SFM score and corresponding data for prevalent VCF and/or hip fracture, femoral neck bone mineral density (fnBMD), and history of injurious fall (n=847). Multiple logistic regressions, adjusted for age, gender, and fnBMD (and injurious falls in the prevalent fracture analyses), were used to determine whether associations exist between SFM score and prevalent VCF, prevalent hip fracture, and history of injurious fall. Results: SFM score was associated with prevalent VCF (odds ratio [OR]=0.89; 95% CI, 0.79–0.99; p=0.036), prevalent hip fracture (OR=0.77; 95% CI, 0.65–0.92; p=0.004), and history of injurious fall (OR=0.80; 95% CI, 0.70–0.93; p=0.003) after adjusting for other important covariates. Conclusions: Adults with higher SFM scores (“safer motion” during performance of everyday tasks) were less likely to have a history of fracture or injurious fall. Further study is warranted to evaluate the predictive value of this tool. PMID:24381386

  15. Testing Students with Special Educational Needs in Large-Scale Assessments – Psychometric Properties of Test Scores and Associations with Test Taking Behavior

    PubMed Central

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H.; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are—at the same time—measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665

  16. Testing Students with Special Educational Needs in Large-Scale Assessments - Psychometric Properties of Test Scores and Associations with Test Taking Behavior.

    PubMed

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are-at the same time-measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665

  17. A score-type test for heterogeneity in zero-inflated models in a stratified population.

    PubMed

    Cao, Guanqun; Hsu, Wei-Wen; Todem, David

    2014-05-30

    We propose a score-type statistic to evaluate heterogeneity in zero-inflated models for count data in a stratified population, where heterogeneity is defined as instances in which the zero counts are generated from two sources. Evaluating heterogeneity in this class of models has attracted considerable attention in the literature, but existing testing procedures have primarily relied on the constancy assumption under the alternative hypothesis. In this paper, we extend the literature by describing a score-type test to evaluate homogeneity against general alternatives that do not neglect the stratification information under the alternative hypothesis. The limiting null distribution of the proposed test statistic is a mixture of chi-squared distributions that can be well approximated by a simple parametric bootstrap procedure. Our numerical simulation studies show that the proposed test can greatly improve efficiency over tests of heterogeneity that ignore the stratification information. An empirical application to dental caries data in early childhood further shows the importance and practical utility of the methodology in using the stratification profile to detect heterogeneity in the population. PMID:24488881

  18. Comparison of the Bender Gestalt Test for Both Black and White Brain-Damaged Patients Using Two Scoring Systems

    ERIC Educational Resources Information Center

    Butler, Oliver T.; And Others

    1976-01-01

    This study tested for cultural bias in the Bender Visual Motor Gestalt Test. Subjects were 72 black and white patients diagnosed as either brain damaged or psychiatric. Bender protocols were scored by Pascal-Suttell and Hain systems. No race effect appeared except for the Pascal-Suttell system for which blacks scored significantly better. (Author)

  19. Relationships of Teacher-Assigned Grades in High School Chemistry to Taxonomy-Type Objective Test Scores.

    ERIC Educational Resources Information Center

    Even, Alexander

    Reported is a study designed (1) to investigate the relationship between teacher-assigned chemistry grades and the scores obtained on a multiple-choice chemistry test built on taxonomic principles, and (2) to compare the contributions of various predictor variables to the explainable variance of the grades and the total test scores. The sample…

  20. On Further Examination: Report of the Advisory Panel on the Scholastic Aptitude Test Score Decline and Appendices.

    ERIC Educational Resources Information Center

    College Entrance Examination Board, New York, NY.

    After two years of investigation, a panel appointed to advise the College Entrance Examination Board and Educational Testing Service on the recent change in Scholastic Aptitude Test (SAT) scores concluded that the 14-year score decline is a complex phenomenon, yielding neither simple explanations nor easy solutions. As a result of looking at the…

  1. The Relationship Between Nelson-Denny Test Scores and Academic Performance of Educational Opportunity Program Students. EAC Reports.

    ERIC Educational Resources Information Center

    Yamagishi, Midori; Gillmore, Gerald M.

    The relationship of Nelson-Denny Reading Test scores and an English course placement recommendation to academic success of Educational Opportunity Program students at the University of Washington was studied. The placement recommendation was based on a writing sample and test scores. The 207 freshmen students who entered in either 1976 or 1978…

  2. Interpreting the "g" Loadings of Intelligence Test Composite Scores in Light of Spearman's Law of Diminishing Returns

    ERIC Educational Resources Information Center

    Reynolds, Matthew R.

    2013-01-01

    The linear loadings of intelligence test composite scores on a general factor ("g") have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the "g" loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of this study was to (a)…

  3. The Childhood Asperger Syndrome Test (CAST): Test-Retest Reliability in a High Scoring Sample

    ERIC Educational Resources Information Center

    Allison, Carrie; Williams, Jo; Scott, Fiona; Stott, Carol; Bolton, Patrick; Baron-Cohen, Simon; Brayne, Carol

    2007-01-01

    The Childhood Asperger Syndrome Test (CAST) is a 37-item parental self-completion questionnaire designed to screen for high-functioning autism spectrum conditions in epidemiological research. The CAST has previously demonstrated good accuracy for use as a screening test, with high sensitivity in studies with primary school aged children in…

  4. Increased correlation coefficient between the written test score and tutors’ performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia

    PubMed Central

    Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

    2016-01-01

    This paper is aimed at finding if there was a change of correlation between the written test score and tutors’ performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group’s tutors did not receive tutor training; while the second group’s tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors’ performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors’ scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course. PMID:26838577

  5. Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

    PubMed

    Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

    2016-03-01

    This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course. PMID:26838577

  6. An exposure-weighted score test for genetic associations integrating environmental risk factors.

    PubMed

    Han, Summer S; Rosenberg, Philip S; Ghosh, Arpita; Landi, Maria Teresa; Caporaso, Neil E; Chatterjee, Nilanjan

    2015-09-01

    Current methods for detecting genetic associations lack full consideration of the background effects of environmental exposures. Recently proposed methods to account for environmental exposures have focused on logistic regressions with gene-environment interactions. In this report, we developed a test for genetic association, encompassing a broad range of risk models, including linear, logistic and probit, for specifying joint effects of genetic and environmental exposures. We obtained the test statistics by maximizing over a class of score tests, each of which involves modified standard tests of genetic association through a weight function. This weight function reflects the potential heterogeneity of the genetic effects by levels of environmental exposures under a particular model. Simulation studies demonstrate the robust power of these methods for detecting genetic associations under a wide range of scenarios. Applications of these methods are further illustrated using data from genome-wide association studies of type 2 diabetes with body mass index and of lung cancer risk with smoking. PMID:26134142

  7. The Impact of Four-Year Participation in Music and/or Athletic Activities in South Dakota Public High Schools on GPA and ACT Scores

    ERIC Educational Resources Information Center

    Jones, Patrick S.

    2010-01-01

    Finding the key to success for high school students has long been the goal for both school personnel and parents. This study examined the impact of four-year participation in music and/or athletics activities in South Dakota public high schools on student GPA and ACT scores. The data in this study were collected from five Class A high schools…

  8. Effects of Knowledge of Cognitive-Moral Development and Request to Fake on Defining Issues Test P-Scores.

    ERIC Educational Resources Information Center

    Napier, John D.

    1979-01-01

    Support claims that the "Defining Issues Test" of cognitive-moral development cannot be faked higher. Finds that instruction about cognitive-moral development affected the scores of the teacher trainees who were tested. (RL)

  9. Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success

    PubMed Central

    Niu, Sunny X.; Tienda, Marta

    2012-01-01

    Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success—high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe. PMID:23788828

  10. The relationship between selected standardized test scores and performance in advanced placement math and science exams: Analyzing the differential effectiveness of scores for course identification and placement

    NASA Astrophysics Data System (ADS)

    Urbina, Josue N.

    There is a national need to increase the STEM-related workforce. Among factors leading towards STEM careers include the number of advanced high school mathematics and science courses students complete. Florida's enrollment patterns in STEM-related Advanced Placement (AP) courses, however, reveal that only a small percentage of students enroll into these classes. Therefore, screening tools are needed to find more students for these courses, who are academically ready, yet have not been identified. The purpose of this study was to investigate the extent to which scores from a national standardized test, Preliminary Scholastic Assessment Test/ National Merit Qualifying Test (PSAT/NMSQT), in conjunction with and compared to a state-mandated standardized test, Florida Comprehensive Assessment Test (FCAT), are related to selected AP exam performance in Seminole County Public Schools. An ex post facto correlational study was conducted using 6,189 student records from the 2010 - 2012 academic years. Multiple regression analyses using simultaneous Full Model testing showed differential moderate to strong relationships between scores in eight of the nine AP courses (i.e., Biology, Environmental Science, Chemistry, Physics B, Physics C Electrical, Physics C Mechanical, Statistics, Calculus AB and BC) examined. For example, the significant unique contribution to overall variance in AP scores was a linear combination of PSAT Math (M), Critical Reading (CR) and FCAT Reading (R) for Biology and Environmental Science. Moderate relationships for Chemistry included a linear combination of PSAT M, W (Writing) and FCAT M; a combination of FCAT M and PSAT M was most significantly associated with Calculus AB performance. These findings have implications for both research and practice. FCAT scores, in conjunction with PSAT scores, can potentially be used for specific STEM-related AP courses, as part of a systematic approach towards AP course identification and placement. For courses with

  11. A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

    ERIC Educational Resources Information Center

    Lee, Guemin; Park, In-Yong

    2012-01-01

    Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…

  12. Predicting First-Quarter Test Scores from the New Medical College Admission Test.

    ERIC Educational Resources Information Center

    Cullen, Thomas J.; And Others

    1980-01-01

    The predictive validity of the new Medical College Admission Test as it relates to end-of-quarter examinations in anatomy, histology, physiology, biochemistry, and "ages of man" is presented. Results indicate that the Science Knowledge assessment areas of chemistry and physics and the Science Problems subtest were most useful in predicting student…

  13. Relationship of Students' Prior Knowledge and Order of Questions on Tests to Students' Test Scores.

    ERIC Educational Resources Information Center

    Papp, Klara K.; And Others

    1987-01-01

    A study examined whether students beginning a cell biology course with prior knowledge of its three areas (genetics, histology, and biochemistry) would retain that advantage throughout the course and whether achievement was influenced by the order of questions in a test. (MSE)

  14. CT densitovolumetry in children with obliterative bronchiolitis: correlation with clinical scores and pulmonary function test results*,**

    PubMed Central

    Mocelin, Helena; Bueno, Gilberto; Irion, Klaus; Marchiori, Edson; Sarria, Edgar; Watte, Guilherme; Hochhegger, Bruno

    2013-01-01

    OBJECTIVE: To determine whether air trapping (expressed as the percentage of air trapping relative to total lung volume [AT%]) correlates with clinical and functional parameters in children with obliterative bronchiolitis (OB). METHODS: CT scans of 19 children with OB were post-processed for AT% quantification with the use of a fixed threshold of −950 HU (AT%950) and of thresholds selected with the aid of density masks (AT%DM). Patients were divided into three groups by AT% severity. We examined AT% correlations with oxygen saturation (SO2) at rest, six-minute walk distance (6MWD), minimum SO2 during the six-minute walk test (6MWT_SO2), FVC, FEV1, FEV1/FVC, and clinical parameters. RESULTS: The 6MWD was longer in the patients with larger normal lung volumes (r = 0.53). We found that AT%950 showed significant correlations (before and after the exclusion of outliers, respectively) with the clinical score (r = 0.72; 0.80), FVC (r = 0.24; 0.59), FEV1 (r = −0.58; −0.67), and FEV1/FVC (r = −0.53; r = −0.62), as did AT%DM with the clinical score (r = 0.58; r = 0.63), SO2 at rest (r = −0.40; r = −0.61), 6MWT_SO2 (r = −0.24; r = −0.55), FVC (r = −0.44; r = −0.80), FEV1 (r = −0.65; r = −0.71), and FEV1/FVC (r = −0.41; r = −0.52). CONCLUSIONS: Our results show that AT% correlates significantly with clinical scores and pulmonary function test results in children with OB. PMID:24473764

  15. Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

    ERIC Educational Resources Information Center

    Educational Testing Service, 2008

    2008-01-01

    The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…

  16. 78 FR 69414 - Toxic Substances Control Act Chemical Testing; Receipt of Test Data

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-11-19

    ... October 21, 2011 (76 FR 65385) (FRL-8885-5) (docket ID number EPA-HQ-OPPT-2009-0112). The table in this... AGENCY Toxic Substances Control Act Chemical Testing; Receipt of Test Data AGENCY: Environmental Protection Agency (EPA). ACTION: Notice. SUMMARY: This notice announces EPA's receipt of test data on...

  17. An Examination of the Relationship Between School Scores Derived From Commercial Achievement Tests and Those From Statewide Assessment.

    ERIC Educational Resources Information Center

    Blust, Ross S.; Kohr, Richard L.

    An apparent discrepancy between building level scores in basic skills produced by Pennsylvania's state assessment program (EQA) and building summary scores, generally a grade equivalent, provided by commercial standardized achievement tests is investigated. The impetus for the study came from occasional reports by school administrators that their…

  18. Validation of Automated Scores of TOEFL iBT Tasks against Non-Test Indicators of Writing Ability

    ERIC Educational Resources Information Center

    Weigle, Sara Cushing

    2010-01-01

    Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…

  19. Science Teacher Efficacy and Outcome Expectancy as Predictors of Students' End-of-Instruction (EOI) Biology I Test Scores

    ERIC Educational Resources Information Center

    Angle, Julie; Moseley, Christine

    2009-01-01

    The purpose of this study was to compare teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the statewide End-of-Instruction (EOI) Biology I test met or exceeded the state academic proficiency level (Proficient Group) to teacher efficacy beliefs of secondary Biology I teachers whose students' mean scores on the…

  20. Comparison between Dichotomous and Polytomous Scoring of Innovative Items in a Large-Scale Computerized Adaptive Test

    ERIC Educational Resources Information Center

    Jiao, Hong; Liu, Junhui; Haynie, Kathleen; Woo, Ada; Gorham, Jerry

    2012-01-01

    This study explored the impact of partial credit scoring of one type of innovative items (multiple-response items) in a computerized adaptive version of a large-scale licensure pretest and operational test settings. The impacts of partial credit scoring on the estimation of the ability parameters and classification decisions in operational test…

  1. AN INVESTIGATION OF NON-INDEPENDENCE OF COMPONENTS OF SCORES ON MULTIPLE-CHOICE TESTS. FINAL REPORT.

    ERIC Educational Resources Information Center

    ZIMMERMAN, DONALD W.; BURKHEIMER, GRAHAM J., JR.

    INVESTIGATION IS CONTINUED INTO VARIOUS EFFECTS OF NON-INDEPENDENT ERROR INTRODUCED INTO MULTIPLE-CHOICE TEST SCORES AS A RESULT OF CHANCE GUESSING SUCCESS. A MODEL IS DEVELOPED IN WHICH THE CONCEPT OF THEORETICAL COMPONENTS OF SCORES IS NOT INTRODUCED AND IN WHICH, THEREFORE, NO ASSUMPTIONS REGARDING ANY RELATIONSHIP BETWEEN SUCH COMPONENTS NEED…

  2. Sorting and Supporting: Why Double-Dose Algebra Led to Better Test Scores but More Course Failures

    ERIC Educational Resources Information Center

    Nomi, Takako; Allensworth, Elaine M.

    2013-01-01

    In 2003, Chicago schools required students entering ninth grade with below-average math scores to take two periods of algebra. This led to higher test scores for students with both above- and below-average skills, yet failure rates increased for above-average students. We examine the mechanisms behind these surprising results. Sorting by incoming…

  3. Effects of Instruction with Interactive Whiteboards versus Instruction without Interactive Whiteboards on End-of-Course Test Scores

    ERIC Educational Resources Information Center

    Evans, Kendra J.

    2010-01-01

    This study addressed the implementation of interactive whiteboards (IWBs) on academic achievement in a Georgia public high school. The researcher studied the effects of the implementation of IWBs on Georgia's end-of-course test (EOCT) scores and satisfaction of students with this method of instruction. The researcher compared EOCT scores of a…

  4. Improving Personality Facet Scores with Multidimensional Computer Adaptive Testing: An Illustration with the Neo Pi-R

    ERIC Educational Resources Information Center

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A. W.

    2013-01-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated…

  5. Assessing Growth in Young Children: A Comparison of Raw, Age-Equivalent, and Standard Scores Using the Peabody Picture Vocabulary Test

    ERIC Educational Resources Information Center

    Sullivan, Jeremy R.; Winter, Suzanne M.; Sass, Daniel A.; Svenkerud, Nicole

    2014-01-01

    Many tests provide users with several different types of scores to facilitate interpretation and description of students' performance. Common examples include raw scores, age- and grade-equivalent scores, and standard scores. However, when used within the context of assessing growth among young children, these scores should not be…

  6. A Brief Look at: Test Scores and the Standard Error of Measurement. E&R Report No. 10.13

    ERIC Educational Resources Information Center

    Holdzkom, David; Sumner, Brian; McMillen, Brad

    2010-01-01

    In the context of standardized testing, the standard error of measurement (SEM) is a measure of the factors other than the student's actual knowledge of the tested material that may affect the student's test score. Such factors may include distractions in the testing environment, fatigue, hunger, or even luck. This means that a student's observed…

  7. Testing for Accountability: A Balancing Act That Challenges Current Testing Practices and Theories

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2015-01-01

    Koretz, in his article published in this issue, provides compelling arguments that the high stakes currently associated with accountability testing lead to behavioral changes in students, teachers, and other stakeholders that often have negative consequences, such as inflated scores. Koretz goes on to argue that these negative consequences require…

  8. Performance on large-scale science tests: Item attributes that may impact achievement scores

    NASA Astrophysics Data System (ADS)

    Gordon, Janet Victoria

    Significant differences in achievement among ethnic groups persist on the eighth-grade science Washington Assessment of Student Learning (WASL). The WASL measures academic performance in science using both scenario and stand-alone question types. Previous research suggests that presenting target items connected to an authentic context, like scenario question types, can increase science achievement scores especially in underrepresented groups and thus help to close the achievement gap. The purpose of this study was to identify significant differences in performance between gender and ethnic subgroups by question type on the 2005 eighth-grade science WASL. MANOVA and ANOVA were used to examine relationships between gender and ethnic subgroups as independent variables with achievement scores on scenario and stand-alone question types as dependent variables. MANOVA revealed no significant effects for gender, suggesting that the 2005 eighth-grade science WASL was gender neutral. However, there were significant effects for ethnicity. ANOVA revealed significant effects for ethnicity and ethnicity by gender interaction in both question types. Effect sizes were negligible for the ethnicity by gender interaction. Large effect sizes between ethnicities on scenario question types became moderate to small effect sizes on stand-alone question types. This indicates the score advantage the higher performing subgroups had over the lower performing subgroups was not as large on stand-alone question types compared to scenario question types. A further comparison examined performance on multiple-choice items only within both question types. Similar achievement patterns between ethnicities emerged; however, achievement patterns between genders changed in boys' favor. Scenario question types appeared to register differences between ethnic groups to a greater degree than stand-alone question types. These differences may be attributable to individual differences in cognition

  9. Multiple tests for wind turbine fault detection and score fusion using two- level multidimensional scaling (MDS)

    NASA Astrophysics Data System (ADS)

    Ye, Xiang; Gao, Weihua; Yan, Yanjun; Osadciw, Lisa A.

    2010-04-01

    Wind is an important renewable energy source. The energy and economic return from building wind farms justify the expensive investments in doing so. However, without an effective monitoring system, underperforming or faulty turbines will cause a huge loss in revenue. Early detection of such failures help prevent these undesired working conditions. We develop three tests on power curve, rotor speed curve, pitch angle curve of individual turbine. In each test, multiple states are defined to distinguish different working conditions, including complete shut-downs, under-performing states, abnormally frequent default states, as well as normal working states. These three tests are combined to reach a final conclusion, which is more effective than any single test. Through extensive data mining of historical data and verification from farm operators, some state combinations are discovered to be strong indicators of spindle failures, lightning strikes, anemometer faults, etc, for fault detection. In each individual test, and in the score fusion of these tests, we apply multidimensional scaling (MDS) to reduce the high dimensional feature space into a 3-dimensional visualization, from which it is easier to discover turbine working information. This approach gains a qualitative understanding of turbine performance status to detect faults, and also provides explanations on what has happened for detailed diagnostics. The state-of-the-art SCADA (Supervisory Control And Data Acquisition) system in industry can only answer the question whether there are abnormal working states, and our evaluation of multiple states in multiple tests is also promising for diagnostics. In the future, these tests can be readily incorporated in a Bayesian network for intelligent analysis and decision support.

  10. Using Hierarchical Modeling to Examine Course Work and ACT Score Relationships across High Schools. ACT Research Report Series, 2007-2

    ERIC Educational Resources Information Center

    Noble, Julie P.; Schnelker, Diane

    2007-01-01

    This study used hierarchical linear regression to examine the effects of taking specific high school courses on students' ACT performance in English, Mathematics, and Science. In addition, hierarchical logistic regression was used to examine the benefits of taking specific courses on students' likelihood of meeting or exceeding the ACT College…

  11. Guided-Inquiry Lessons Raise Scores on the Sixth Grade Georgia Science Test

    NASA Astrophysics Data System (ADS)

    Page, Purlie M.

    At the local level, G Middle School has the highest district-wide percentage of 6th grade science students who are not meeting standards. It is imperative that G middle school take corrective action to reduce the number of students failing to meet state science standards. Dewey's theory of conceptual framework, which involves knowledge constructed on a person's personal experience and mind activity through active forms of learning, guided this study. The goal of the study was to determine whether inquiry-based science modules produce greater 6th grade science achievement, as measured by an equivalent instrument of the science section of the Georgia Criterion-Referenced Competency Test, when compared to traditional instruction among eastern Georgia 6th graders. The sample consisted of 230 students in the nonintervention group and 119 students in the intervention group. All students were from intact classes. At the end of the intervention, an independent t test was conducted to analyze the scores. According to the study t test, (t = 12.33, df = 304.56, p < 0.05), the difference between the means was statistically significant. This project's potential impact on social change includes increasing student motivation towards, comprehension of, and interest in science concepts. At the local level, these inquiry lessons can be shared with science teachers across grade levels and within the district to improve county-wide science scores. An increase in student interest and comprehension of science concepts could ultimately lead to the United States producing more students in the fields of science, technology, engineering, and mathematics (STEM) education.

  12. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    SciTech Connect

    Zain, Zakiyah Ahmad, Yuhaniz; Azwan, Zairul E-mail: farhanaraduan@gmail.com Raduan, Farhana E-mail: farhanaraduan@gmail.com Sagap, Ismail E-mail: farhanaraduan@gmail.com; Aziz, Nazrina

    2014-12-04

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  13. Sensitivity of the Balance Error Scoring System and the Sensory Organization Test in the Combat Environment.

    PubMed

    Haran, F Jay; Slaboda, Jill C; King, Laurie A; Wright, W Geoff; Houlihan, Daniel; Norris, Jacob N

    2016-04-01

    This study evaluated the utility of the Balance Error Scoring System (BESS) and the Sensory Organization Test (SOT) as tools for the screening and monitoring of Service members (SMs) with mild traumatic brain injury (mTBI) in a deployed setting during the acute and subacute phases of recovery. Patient records (N = 699) were reviewed for a cohort of SMs who sustained a blast-related mTBI while deployed to Afghanistan and were treated at the Concussion Restoration Care Center (CRCC) at Camp Leatherneck. On initial intake into the CRCC, participants completed two assessments of postural control, the BESS, and SOT. SMs with mTBI performed significantly worse on the BESS and SOT when compared with comparative samples. When the SOT data were further examined using sensory ratios, the results indicated that postural instability was primarily a result of vestibular and visual integration dysfunction (r > 0.62). The main finding of this study was that the sensitivity of the SOT composite score (50-58%) during the acute phase was higher than previous sensitivities found in the sports medicine literature for impact-related trauma. PMID:26560740

  14. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    NASA Astrophysics Data System (ADS)

    Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz; Azwan, Zairul; Raduan, Farhana; Sagap, Ismail

    2014-12-01

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  15. The advanced computational testing and simulation toolkit (ACTS)

    SciTech Connect

    Drummond, L.A.; Marques, O.

    2002-05-21

    During the past decades there has been a continuous growth in the number of physical and societal problems that have been successfully studied and solved by means of computational modeling and simulation. Distinctively, a number of these are important scientific problems ranging in scale from the atomic to the cosmic. For example, ionization is a phenomenon as ubiquitous in modern society as the glow of fluorescent lights and the etching on silicon computer chips; but it was not until 1999 that researchers finally achieved a complete numerical solution to the simplest example of ionization, the collision of a hydrogen atom with an electron. On the opposite scale, cosmologists have long wondered whether the expansion of the Universe, which began with the Big Bang, would ever reverse itself, ending the Universe in a Big Crunch. In 2000, analysis of new measurements of the cosmic microwave background radiation showed that the geometry of the Universe is flat, and thus the Universe will continue expanding forever. Both of these discoveries depended on high performance computer simulations that utilized computational tools included in the Advanced Computational Testing and Simulation (ACTS) Toolkit. The ACTS Toolkit is an umbrella project that brought together a number of general purpose computational tool development projects funded and supported by the U.S. Department of Energy (DOE). These tools, which have been developed independently, mainly at DOE laboratories, make it easier for scientific code developers to write high performance applications for parallel computers. They tackle a number of computational issues that are common to a large number of scientific applications, mainly implementation of numerical algorithms, and support for code development, execution and optimization. The ACTS Toolkit Project enables the use of these tools by a much wider community of computational scientists, and promotes code portability, reusability, reduction of duplicate efforts

  16. Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

    ERIC Educational Resources Information Center

    Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam

    2010-01-01

    This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

  17. Should We Stop Looking for a Better Scoring Algorithm for Handling Implicit Association Test Data? Test of the Role of Errors, Extreme Latencies Treatment, Scoring Formula, and Practice Trials on Reliability and Validity

    PubMed Central

    Perugini, Marco; Schönbrodt, Felix

    2015-01-01

    Since the development of D scores for the Implicit Association Test, few studies have examined whether there is a better scoring method. In this contribution, we tested the effect of four relevant parameters for IAT data that are the treatment of extreme latencies, the error treatment, the method for computing the IAT difference, and the distinction between practice and test critical trials. For some options of these different parameters, we included robust statistic methods that can provide viable alternative metrics to existing scoring algorithms, especially given the specificity of reaction time data. We thus elaborated 420 algorithms that result from the combination of all the different options and test the main effect of the four parameters with robust statistical analyses as well as their interaction with the type of IAT (i.e., with or without built-in penalty included in the IAT procedure). From the results, we can elaborate some recommendations. A treatment of extreme latencies is preferable but only if it consists in replacing rather than eliminating them. Errors contain important information and should not be discarded. The D score seems to be still a good way to compute the difference although the G score could be a good alternative, and finally it seems better to not compute the IAT difference separately for practice and test critical trials. From this recommendation, we propose to improve the traditional D scores with small yet effective modifications. PMID:26107176

  18. The TSCA interagency testing committee`s approaches to screening and scoring chemicals and chemical groups: 1977-1983

    SciTech Connect

    Walker, J.D.

    1990-12-31

    This paper describes the TSCA interagency testing committee`s (ITC) approaches to screening and scoring chemicals and chemical groups between 1977 and 1983. During this time the ITC conducted five scoring exercises to select chemicals and chemical groups for detailed review and to determine which of these chemicals and chemical groups should be added to the TSCA Section 4(e) Priority Testing List. 29 refs., 1 fig., 2 tabs.

  19. Development of new risk score for pre-test probability of obstructive coronary artery disease based on coronary CT angiography.

    PubMed

    Fujimoto, Shinichiro; Kondo, Takeshi; Yamamoto, Hideya; Yokoyama, Naoyuki; Tarutani, Yasuhiro; Takamura, Kazuhisa; Urabe, Yoji; Konno, Kumiko; Nishizaki, Yuji; Shinozaki, Tomohiro; Kihara, Yasuki; Daida, Hiroyuki; Isshiki, Takaaki; Takase, Shinichi

    2015-09-01

    Existing methods to calculate pre-test probability of obstructive coronary artery disease (CAD) have been established using selected high-risk patients who were referred to conventional coronary angiography. The purpose of this study is to develop and validate our new method for pre-test probability of obstructive CAD using patients who underwent coronary CT angiography (CTA), which could be applicable to a wider range of patient population. Using consecutive 4137 patients with suspected CAD who underwent coronary CTA at our institution, a multivariate logistic regression model including clinical factors as covariates calculated the pre-test probability (K-score) of obstructive CAD determined by coronary CTA. The K-score was compared with the Duke clinical score using the area under the curve (AUC) for the receiver-operating characteristic curve. External validation was performed by an independent sample of 319 patients. The final model included eight significant predictors: age, gender, coronary risk factor (hypertension, diabetes mellitus, dyslipidemia, smoking), history of cerebral infarction, and chest symptom. The AUC of the K-score was significantly greater than that of the Duke clinical score for both derivation (0.736 vs. 0.699) and validation (0.714 vs. 0.688) data sets. Among patients who underwent coronary CTA, newly developed K-score had better pre-test prediction ability of obstructive CAD compared to Duke clinical score in Japanese population. PMID:24770610

  20. 76 FR 38169 - Toxic Substances Control Act Chemical Testing; Receipt of Test Data

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-06-29

    ... FR 22402) (FRL-7312-2), and later amended by a final rule titled ``Revocation of TSCA Section 4... 12, 2006 (71 FR 18650) (FRL-7751-7). Section 4(d) of TSCA (15 U.S.C. 2603(d)) requires EPA to publish... AGENCY Toxic Substances Control Act Chemical Testing; Receipt of Test Data AGENCY:...

  1. REPRODUCIBILITY OF THE MODIFIED STAR EXCURSION BALANCE TEST COMPOSITE AND SPECIFIC REACH DIRECTION SCORES

    PubMed Central

    van Lieshout, Remko; Reijneveld, Elja A.E.; van den Berg, Sandra M.; Haerkens, Gijs M.; Koenders, Niek H.; de Leeuw, Arina J.; van Oorsouw, Roel G.; Paap, Davy; Scheffer, Else; Weterings, Stijn

    2016-01-01

    ABSTRACT Background The mSEBT is a screening tool used to evaluate dynamic balance. Most research investigating measurement properties focused on intrarater reliability and was done in small samples. To know whether the mSEBT is useful to discriminate dynamic balance between persons and to evaluate changes in dynamic balance, more research into intra- and interrater reliability and smallest detectable change (synonymous with minimal detectable change) is needed. Purpose To estimate intra- and interrater reliability and smallest detectable change of the mSEBT in adults at risk for ankle sprain. Study Design Cross-sectional, test-retest design Methods Fifty-five healthy young adults participating in sports at risk for ankle sprain participated (mean ± SD age, 24.0 ± 2.9 years). Each participant performed three test sessions within one hour and was rated by two physical therapists (session 1, rater 1; session 2, rater 2; session 3, rater 1). Participants and raters were blinded for previous measurements. Normalized composite and reach direction scores for the right and left leg were collected. Analysis of variance was used to calculate intraclass correlation coefficient values for intra- and interrater reliability. Smallest detectable change values were calculated based on the standard error of measurement. Results Intra- and interrater reliability for both legs was good to excellent (intraclass correlation coefficient ranging from 0.87 to 0.94). The intrarater smallest detectable change for the composite score of the right leg was 7.2% and for the left 6.2%. The interrater smallest detectable change for the composite score of the right leg was 6.9% and for the left 5.0%. Conclusion The mSEBT is a reliable measurement instrument to discriminate dynamic balance between persons. Most smallest detectable change values of the mSEBT appear to be large. More research is needed to investigate if the mSEBT is usable for evaluative purposes. Level of Evidence Level 2

  2. Using the EZ-Diffusion Model to Score a Single-Category Implicit Association Test of Physical Activity

    PubMed Central

    Rebar, Amanda L.; Ram, Nilam; Conroy, David E.

    2014-01-01

    Objective The Single-Category Implicit Association Test (SC-IAT) has been used as a method for assessing automatic evaluations of physical activity, but measurement artifact or consciously-held attitudes could be confounding the outcome scores of these measures. The objective of these two studies was to address these measurement concerns by testing the validity of a novel SC-IAT scoring technique. Design Study 1 was a cross-sectional study, and study 2 was a prospective study. Method In study 1, undergraduate students (N = 104) completed SC-IATs for physical activity, flowers, and sedentary behavior. In study 2, undergraduate students (N = 91) completed a SC-IAT for physical activity, self-reported affective and instrumental attitudes toward physical activity, physical activity intentions, and wore an accelerometer for two weeks. The EZ-diffusion model was used to decompose the SC-IAT into three process component scores including the information processing efficiency score. Results In study 1, a series of structural equation model comparisons revealed that the information processing score did not share variability across distinct SC-IATs, suggesting it does not represent systematic measurement artifact. In study 2, the information processing efficiency score was shown to be unrelated to self-reported affective and instrumental attitudes toward physical activity, and positively related to physical activity behavior, above and beyond the traditional D-score of the SC-IAT. Conclusions The information processing efficiency score is a valid measure of automatic evaluations of physical activity. PMID:25484621

  3. Evaluation of Factors Affecting Continuous Performance Test Identical Pairs Version Score of Schizophrenic Patients in a Japanese Clinical Sample

    PubMed Central

    Koide, Takayoshi; Aleksic, Branko; Kikuchi, Tsutomu; Banno, Masahiro; Kohmura, Kunihiro; Adachi, Yasunori; Kawano, Naoko; Iidaka, Tetsuya; Ozaki, Norio

    2012-01-01

    Aim. Cognitive impairment in schizophrenia strongly relates to social outcome and is a good candidate for endophenotypes. When we accurately measure drug efficacy or effects of genes or variants relevant to schizophrenia on cognitive impairment, clinical factors that can affect scores on cognitive tests, such as age and severity of symptoms, should be considered. To elucidate the effect of clinical factors, we conducted multiple regression analysis using scores of the Continuous Performance Test Identical Pairs Version (CPT-IP), which is often used to measure attention/vigilance in schizophrenia. Methods. We conducted the CPT-IP (4-4 digit) and examined clinical information (sex, age, education years, onset age, duration of illness, chlorpromazine-equivalent dose, and Positive and Negative Symptom Scale (PANSS) scores) in 126 schizophrenia patients in Japanese population. Multiple regression analysis was used to evaluate the effect of clinical factors. Results. Age, chlorpromazine-equivalent dose, and PANSS-negative symptom score were associated with mean d′ score in patients. These three clinical factors explained about 28% of the variance in mean d′ score. Conclusions. As conclusion, CPT-IP score in schizophrenia patients is influenced by age, chlorpromazine-equivalent dose and PANSS negative symptom score. PMID:22966454

  4. Accuracy of Self-Reported SAT and ACT Test Scores: Implications for Research

    ERIC Educational Resources Information Center

    Cole, James S.; Gonyea, Robert M.

    2010-01-01

    Because it is often impractical or impossible to obtain school transcripts or records on subjects, many researchers rely on college students to accurately self-report their academic record as part of their data collection procedures. The purpose of this study is to investigate the validity and reliability of student self-reported academic…

  5. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    PubMed

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. PMID:27346734

  6. The achievement impact of the inclusion model on the standardized test scores of general education students

    NASA Astrophysics Data System (ADS)

    Garrett-Rainey, Syrena

    The purpose of this study was to compare the achievement of general education students within regular education classes to the achievement of general education students in inclusion/co-teach classes to determine whether there was a significant difference in the achievement between the two groups. The school district's inclusion/co-teach model included ongoing professional development support for teachers and administrators. General education teachers, special education teachers, and teacher assistants collaborated to develop instructional strategies to provide additional remediation to help students to acquire the skills needed to master course content. This quantitative study reviewed the end-of course test (EoCT) scores of Grade 10 physical science and math students within an urban school district. It is not known whether general education students in an inclusive/co-teach science or math course will demonstrate a higher achievement on the EoCT in math or science than students not in an inclusive/co-teach classroom setting. In addition, this study sought to determine if students classified as low socioeconomic status benefited from participating in co-teaching classrooms as evidenced by standardized tests. Inferential statistics were used to determine whether there was a significant difference between the achievements of the treatment group (inclusion/co-teach) and the control group (non-inclusion/co-teach). The findings can be used to provide school districts with optional instructional strategies to implement in the diverse classroom setting in the modern classroom to increase academic performance on state standardized tests.

  7. Examination of Substance Use, Risk Factors, and Protective Factors on Student Academic Test Score Performance

    PubMed Central

    Arthur, Michael W.; Brown, Eric C.; Briney, John S.; Hawkins, J. David; Abbott, Robert D.; Catalano, Richard F.; Becker, Linda; Langer, Michael; Mueller, Martin T.

    2016-01-01

    BACKGROUND School administrators and teachers face difficult decisions about how best to use school resources in order to meet academic achievement goals. Many are hesitant to adopt prevention curricula that are not focused directly on academic achievement. Yet, some have hypothesized that prevention curricula can remove barriers to learning and, thus, promote achievement. This study examined relationships between school levels of student substance use and risk and protective factors that predict adolescent problem behaviors and achievement test performance in Washington State. METHODS Hierarchical Generalized Linear Models were used to examine predictive associations between school-averaged levels of substance use and risk and protective factors and Washington State students’ likelihood of meeting achievement test standards on the Washington Assessment of Student Learning, statistically controlling for demographic and economic factors known to be associated with achievement. RESULTS Results indicate that levels of substance use and risk/protective factors predicted the academic test score performance of students. Many of these effects remained significant even after controlling for model covariates. CONCLUSIONS The findings suggest that implementing prevention programs that target empirically identified risk and protective factors have the potential to positively affect students’ academic achievement. PMID:26149305

  8. Investigating the Comparability of School Scores across Test Forms that Are Not Parallel. Technical Guidelines for Performance Assessment.

    ERIC Educational Resources Information Center

    Fitzpatrick, Anne R.

    This study, one of a series designed to answer practical questions about performance based assessment, examined the comparability of school scores on short, nonparallel test forms. The data were obtained from mathematics tests with both multiple choice (MC) and performance assessment (PA) items. The tests were administered in a statewide testing…

  9. Meta-Analyses of the Relationship of Creative Achievement to both IQ and Divergent Thinking Test Scores

    ERIC Educational Resources Information Center

    Kim, Kyung Hee

    2008-01-01

    There is disagreement among researchers about whether IQ tests or divergent thinking (DT) tests are better predictors of creative achievement. Resolving this dispute is complicated by the fact that some research has shown a relationship between IQ and DT test scores (e.g., Runco & Albert, 1986; Wallach, 1970). The present study conducted…

  10. Getting the Message Out: An Evaluation of NAEP Score Reporting Practices with Implications for Disseminating Test Results

    ERIC Educational Resources Information Center

    Zenisky, April L.; Hambleton, Ronald K.; Sireci, Stephen G.

    2009-01-01

    How a testing agency approaches score reporting can have a significant impact on the perception of that assessment and the usefulness of the information among intended users and stakeholders. Too often, important decisions about reporting test data are left to the end of the test development cycle, but by considering the audience(s) and the kinds…

  11. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    ERIC Educational Resources Information Center

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…

  12. A Comparative Evaluation of Score Results from Computerized and Paper & Pencil Mathematics Testing in a Large Scale State Assessment Program

    ERIC Educational Resources Information Center

    Poggio, John; Glasnapp, Douglas R.; Yang, Xiangdong; Poggio, Andrew J.

    2005-01-01

    The present study reports results from a quasi-controlled empirical investigation addressing the impact on student test scores when using fixed form computer based testing (CBT) versus paper and pencil (P&P) testing as the delivery mode to assess student mathematics achievement in a state's large scale assessment program. Grade 7 students served…

  13. Reliability and Validity of the New Tanaka B Intelligence Scale Scores: A Group Intelligence Test

    PubMed Central

    Uno, Yota; Mizukami, Hitomi; Ando, Masahiko; Yukihiro, Ryoji; Iwasaki, Yoko; Ozaki, Norio

    2014-01-01

    Objective The present study evaluated the reliability and concurrent validity of the new Tanaka B Intelligence Scale, which is an intelligence test that can be administered on groups within a short period of time. Methods The new Tanaka B Intelligence Scale and Wechsler Intelligence Scale for Children-Third Edition were administered to 81 subjects (mean age ± SD 15.2±0.7 years) residing in a juvenile detention home; reliability was assessed using Cronbach’s alpha coefficient, and concurrent validity was assessed using the one-way analysis of variance intraclass correlation coefficient. Moreover, receiver operating characteristic analysis for screening for individuals who have a deficit in intellectual function (an FIQ<70) was performed. In addition, stratum-specific likelihood ratios for detection of intellectual disability were calculated. Results The Cronbach’s alpha for the new Tanaka B Intelligence Scale IQ (BIQ) was 0.86, and the intraclass correlation coefficient with FIQ was 0.83. Receiver operating characteristic analysis demonstrated an area under the curve of 0.89 (95% CI: 0.85–0.96). In addition, the stratum-specific likelihood ratio for the BIQ≤65 stratum was 13.8 (95% CI: 3.9–48.9), and the stratum-specific likelihood ratio for the BIQ≥76 stratum was 0.1 (95% CI: 0.03–0.4). Thus, intellectual disability could be ruled out or determined. Conclusion The present results demonstrated that the new Tanaka B Intelligence Scale score had high reliability and concurrent validity with the Wechsler Intelligence Scale for Children-Third Edition score. Moreover, the post-test probability for the BIQ could be calculated when screening for individuals who have a deficit in intellectual function. The new Tanaka B Intelligence Test is convenient and can be administered within a variety of settings. This enables evaluation of intellectual development even in settings where performing intelligence tests have previously been difficult. PMID:24940880

  14. ACTS 118x: High Speed TCP Interoperability Testing

    NASA Technical Reports Server (NTRS)

    Brooks, David E.; Buffinton, Craig; Beering, Dave R.; Welch, Arun; Ivancic, William D.; Zernic, Mike; Hoder, Douglas J.

    1999-01-01

    With the recent explosion of the Internet and the enormous business opportunities available to communication system providers, great interest has developed in improving the efficiency of data transfer over satellite links using the Transmission Control Protocol (TCP) of the Internet Protocol (IP) suite. The NASA's ACTS experiments program initiated a series of TCP experiments to demonstrate scalability of TCP/IP and determine to what extent the protocol can be optimized over a 622 Mbps satellite link. Through partnerships with the government technology oriented labs, computer, telecommunication, and satellite industries NASA Glenn was able to: (1) promote the development of interoperable, high-performance TCP/IP implementations across multiple computing / operating platforms; (2) work with the satellite industry to answer outstanding questions regarding the use of standard protocols (TCP/IP and ATM) for the delivery of advanced data services, and for use in spacecraft architectures; and (3) conduct a series of TCP/IP interoperability tests over OC12 ATM over a satellite network in a multi-vendor environment using ACTS. The experiments' various network configurations and the results are presented.

  15. Visual scoring of non-cavitated caries lesions and clinical trial efficiency, testing xylitol in caries active adults

    PubMed Central

    Brown, JP; Amaechi, BT; Bader, JD; Gilbert, GH; Makhija, SK; Lozano-Pineda, J; Leo, MC; Chuhe, C; Vollmer, WM

    2013-01-01

    Objectives To better understand the effectiveness of xylitol in caries prevention in adults, and to attempt improved clinical trial efficiency. Methods As part of the Xylitol for Adult Caries Trial (X-ACT), non-cavitated and cavitated caries lesions were assessed in subjects who were experiencing the disease. The trial was a test of the effectiveness of 5 grams/day of xylitol, consumed by dissolving in the mouth five 1 gram lozenges spaced across each day, compared with a sucralose placebo. For this analysis, seeking trial efficiency, 538 subjects aged 21–80, with complete data for four dental examinations were selected from the 691 randomized into the three year trial, conducted at three sites. Acceptable inter and intra examiner reliability before and during the trial was quantified using the kappa statistic. Results The mean annualized non-cavitated plus cavitated lesion transition scores in coronal and root surfaces, from sound to carious favoured xylitol over placebo, during the three cumulative periods of 12, 24, and 33 months, but these clinically and statistically non-significant differences declined in magnitude over time. Restricting the present assessment to those subjects with a higher baseline lifetime caries experience showed possible but inconsistent benefit. Conclusions There was no clear and clinically relevant preventive effect of xylitol on caries in adults with adequate fluoride exposure when non-cavitated plus cavitated lesions were assessed. This conformed to the X-ACT trial result assessing cavitated lesions. Including non-cavitated lesion assessment in this full scale, placebo controlled, multi site, randomized, double blinded clinical trial in adults experiencing dental caries, did not achieve added trial efficiency or demonstrate practical benefit of xylitol. Trial Registration ClinicalTrials.Gov NCT00393055 PMID:24205951

  16. The increasing impact of socioeconomics and race on standardized academic test scores across elementary, middle, and high school.

    PubMed

    White, Gwyne W; Stepney, Cesalie T; Hatchimonji, Danielle Ryan; Moceri, Dominic C; Linsky, Arielle V; Reyes-Portillo, Jazmin A; Elias, Maurice J

    2016-01-01

    For students and schools, the current policy is to measure success via standardized testing. Yet the immutable factors of socioeconomic status (SES) and race have, consistently, been implicated in fostering an achievement gap. The current study explores, at the school-level, the impact of these factors on test scores. Percentage of students proficient for Language and Math was analyzed from 452 schools across the state of New Jersey. By high school, 52% of the variance in Language and 59% in Math test scores can be accounted for by SES and racial factors. At this level, a 1% increase in school minority population corresponds to a 0.19 decrease in percent Language proficient and 0.33 decrease for Math. These results have significant implications as they suggest that school-level interventions to improve academic achievement scores will be stymied by socioeconomic and racial factors and efforts to improve the achievement gap via testing have largely measured it. PMID:26752444

  17. Statistical Properties of School Value-Added Scores Based on Assessments of College Readiness. ACT Research Report Series. 2015 (5)

    ERIC Educational Resources Information Center

    Bassiri, Dina

    2015-01-01

    The 2001 reauthorization of the Elementary and Secondary Education Act (ESEA) known as No Child Left Behind and more recent federal initiatives such as Race to the Top and the ESEA flexibility waiver have brought student growth to the forefront of education reform for assessing school and teacher effectiveness. This study examined growth…

  18. Effects on Score Distributions of Deleting an Unkeyable Item from a Test.

    ERIC Educational Resources Information Center

    Dorans, Neil J.

    A formal analysis is presented of the effects of item deletion on equating/scaling functions and reported score distributions. The phrase "item deletion" refers to the process of changing the original key of a flawed item to either all options correct, including omits, or to no options correct, i.e., not scoring the flawed item. There are two…

  19. Efforts to Produce Relevant Score Reports to School, District, and State Officials on National Tests

    ERIC Educational Resources Information Center

    Patelis, Thanos; Matos-Elefonte, Haifa

    2009-01-01

    Presented at the Annual National Council on Measurement in Education (NCME) in San Diego in April 2009. This presentation explores how the College Board strives to ensure the relevance and utility of score reporting practices and methods for the PSAT/NMSQT and SAT scores. The new reporting methods allow for greater interaction and intervention at…

  20. Number Right and Elimination Score as a Function of Item Arrangement, Knowledge of Arrangement, and Test Anxiety.

    ERIC Educational Resources Information Center

    Plake, Barbara S.; And Others

    Number right and elimination scores were analyzed on a 48-item college level mathematics test that was assembled from pretest data in three forms by varying the item orderings: easy-hard, uniform, or random. Half of the forms contained information explaining the item arrangement and suggesting strategies for taking the test. Several anxiety…

  1. The Impact of Cooperative Learning on Critical Thinking Test Scores of Associate's Degree Graduates in Southwest Virginia

    ERIC Educational Resources Information Center

    Hodges, James Gregory

    2013-01-01

    This study examined the impact that the teaching technique known as cooperative learning had on the changes between pre- and post-test scores on all sub-categories ("induction, deduction, analysis, evaluation, inference", and "total composite") associated with the "California Critical Thinking Skills Test" (CCTST) for…

  2. Aptitude Test Scores of Prospective Graduate Students in Science Remained Essentially the Same from 1970 to 1975.

    ERIC Educational Resources Information Center

    National Science Foundation, Washington, DC. Div. of Science Resources Studies.

    Presented is a summary of an Educational Testing Service (ETS) review of mean scores on the Graduate Record Examination (GRE) of candidates for graduate study in science and engineering fields for the period 1970-1975. Test results were found to have remained essentially stable over the period within each particular field. Significant differences…

  3. Error Rates in Measuring Teacher and School Performance Based on Student Test Score Gains. NCEE 2010-4004

    ERIC Educational Resources Information Center

    Schochet, Peter Z.; Chiang, Hanley S.

    2010-01-01

    This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…

  4. Language Learner Strategies and Linguistic Competence as Factors Affecting Achievement Test Scores in English for Specific Purposes

    ERIC Educational Resources Information Center

    Jurkovic, Violeta

    2010-01-01

    The article examines the effect of two factors on achievement test scores in English as a foreign language for specific purposes in higher education: preexisting linguistic competence and frequency of use of language learner strategies. The rationale for the analysis of language learner strategies as a factor affecting achievement test outcomes is…

  5. On the Question of Secular Trends in the Heritability of Intelligence Test Scores: A Study of Norwegian Twins.

    ERIC Educational Resources Information Center

    Sundet, Jon Martin; And Others

    1988-01-01

    Intelligence test data collected in 1931 through 1960 on 757 identical and 1,093 fraternal male twins, from the files of the Norwegian Armed Forces, were examined for secular trends in the heritability of intelligence test scores. Only ambiguous evidence of such trends was found. (SLD)

  6. Effects of Mayfield's Four Questions (M4Q) on Nursing Students' Self-Efficacy and Multiple-Choice Test Scores

    ERIC Educational Resources Information Center

    Mayfield, Linda Riggs

    2010-01-01

    This study examined the effects of being taught the Mayfield's Four Questions multiple-choice test-taking strategy on the perceived self-efficacy and multiple-choice test scores of nursing students in a two-year associate degree program. Experimental and control groups were chosen by stratified random sampling. Subjects completed the 10-statement…

  7. Childhood Fitness and Academic Performance: An Investigation into the Effect of Aerobic Capacity on Academic Test Scores

    ERIC Educational Resources Information Center

    Hobbs, Mark

    2014-01-01

    The purpose of this quantitate ve study was to determine whether or not students in fifth grade who meet the healthy fitness zone (HFZ) for aerobic capacity on the fall 2013 FITNESSGRAM® Test scored higher on the math portion of the 2013 fall Measures of Academic Progress (MAP) test, than students that failed to reach the HFZ for aerobic capacity…

  8. Assessing Follow Through: Changes in Intelligence Test Scores over Two and Three Years of Experience in the Responsive Program.

    ERIC Educational Resources Information Center

    Rayder, Nicolas; And Others

    1978-01-01

    Four Wechsler subscales were administered in a longitudinal design to children from the Responsive Model Follow Through Program. On the first testing, subjects' average intelligence scores were significantly lower, but on subsequent tests equivalent to or higher than national norms, calling into question Deutsch's cumulative-deficit hypothesis.…

  9. Differentiation between Acting-Out and Non-Acting-Out Alcoholics with the Rorschach and Hand Test.

    ERIC Educational Resources Information Center

    Haramis, Samuel L.; Wagner, Edwin E.

    1980-01-01

    Hand Test and Rorschach variables significantly differentiated two subgroups of aggressive and nonaggressive alcoholics. The aggressive group was characterized as hostile and impulsive. The predictor variables that emerged have practical value for recognizing the potential acting-out alcoholic. (Author)

  10. Genetic parameters for test day somatic cell score in Brazilian Holstein cattle.

    PubMed

    Costa, C N; Santos, G G; Cobuci, J A; Thompson, G; Carvalheira, J G V

    2015-01-01

    Selection for lower somatic cell count has been included in the breeding objectives of several countries in order to increase resistance to mastitis. Genetic parameters of somatic cell scores (SCS) were estimated from the first lactation test day records of Brazilian Holstein cows using random-regression models with Legendre polynomials (LP) of the order 3-5. Data consisted of 87,711 TD produced by 10,084 cows, sired by 619 bulls calved from 1993 to 2007. Heritability estimates varied from 0.06 to 0.14 and decreased from the beginning of the lactation up to 60 days in milk (DIM) and increased thereafter to the end of lactation. Genetic correlations between adjacent DIM were very high (>0.83) but decreased to negative values, obtained with LP of order four, between DIM in the extremes of lactation. Despite the favorable trend, genetic changes in SCS were not significant and did not differ among LP. There was little benefit of fitting an LP of an order >3 to model animal genetic and permanent environment effects for SCS. Estimates of variance components found in this study may be used for breeding value estimation for SCS and selection for mastitis resistance in Holstein cattle in Brazil. PMID:26782564

  11. The Relationship among Student Achievement Scores on the Math and Science End-of-Course-Tests and Scores on the High School Graduation Test

    ERIC Educational Resources Information Center

    Turner, Sherry L.

    2011-01-01

    Thirteen percent of the 2008-2009 senior class in one southeastern state did not pass the science portion of the state's high school graduation test. Another 5% failed to pass the math portion of the graduation test, leaving these students unable to obtain a high school diploma. The purpose of this nonexperimental quantitative research study was…

  12. Testing hypotheses in case-control studies--equivalence of Mantel-Haenszel statistics and logit score tests.

    PubMed

    Day, N E; Byar, D P

    1979-09-01

    The two approaches in common use for the analysis of case-control studies are cross-classification by confounding variables, and modeling the logarithm of the odds ratio as a function of exposure and confounding variables. We show here that score statistics derived from the likelihood function in the latter approach are identical to the Mantel-Haenszel test statistics appropriate for the former approach. This identity holds in the most general situation considered, testing for marginal homogeneity in mK tables. This equivalence is demonstrated by a permutational argument which leads to a general likelihood expression in which the exposure variable may be a vector of discrete and/or continuous variables and in which more than two comparison groups may be considered. This likelihood can be used in analyzing studies in which there are multiple controls for each case or in which several disease categories are being compared. The possibility of including continuous variables makes this likelihood useful in situations that cannot be treated using the Mantel-Haenszel cross-classification approach. PMID:497345

  13. An Investigation of the Relationship Between Readiness Test Scores for Kindergarten Children and Achievement Scores Obtained at the End of Grades One and Two. S.S.T.A. Research Centre Report No. 62.

    ERIC Educational Resources Information Center

    Warkentin, Lena

    The primary purpose of this study was to investigate the relationship between Metropolitan Readiness Test (MRT) scores in kindergarten (MRTK) and grade one (MRT1) with the reading scores of the Canadian Tests of Basic Skills (CTBS) at the end of grades one (CTBSR1) and two (CTBSR2). A secondary purpose of the study was to determine whether the…

  14. A physics-based scoring function for protein structural decoys: Dynamic testing on targets of CASP-ROLL

    NASA Astrophysics Data System (ADS)

    Ruiz-Blanco, Yasser B.; Marrero-Ponce, Yovani; García, Yamila; Puris, Amilkar; Bello, Rafael; Green, James; Sotomayor-Torres, Clivia M.

    2014-08-01

    Most successful structure prediction strategies use knowledge-based functions for global optimization, in spite of their intrinsic limited potential to create new folds, while physics-based approaches are often employed only during structure refinement steps. We here propose a physics-based scoring potential intended to perform global searches of the conformational space. We introduce a dynamic test to evaluate the discrimination power of our function, and compare it with predictions of targets from the CASP-ROLL competition. Results demonstrate that this dynamic test is able to generate 3D models which outrank 59% (according GDT_TS score) of models generated with ab initio structure prediction servers.

  15. Intelligibility, subjective ratings and completion time scores using the FAAF test with hearing-impaired subjects and noisy reverberant environments.

    PubMed

    Shields, P W; Campbell, D R

    2001-08-01

    A series of experiments have been performed with the primary aim of assessing the performance of a signal-processing algorithm for a possible future hearing aid application. As part of this work the four alternative auditory feature (FAAF) test was used to obtain a quantitative assessment of speech intelligibility and a subjective assessment of speech quality. This paper reports results of experiments using normal hearing (NH) subjects that provide partial verification of the FAAF test originators' prior work. Also reported are intelligibility score, mean opinion score (MOS) and completion time data obtained by use of the PMID:11694098

  16. The Effects of Teaching Descriptive Geometry in General Engineering 103 on Spatial Relations Tests Scores.

    ERIC Educational Resources Information Center

    Stallings, William M.

    It was hypothesized that instruction in descriptive geometry produces an increase in SRT scores. The resultant data do not firmly support this hypothesis. It is suggested that this study be replicated with the use of randomly selected control groups. (MS)

  17. Science course sequences: The alignment of written, enacted, and tested curricula and their impact on grade 11 HSPA science scores

    NASA Astrophysics Data System (ADS)

    Lentz, Christine A.

    The purpose of this mixed method study was to examine the alignment of the written, enacted, and tested curricula of the Ocean City High School science course sequencing and its impact on student achievement. This study also examined the school's ability to predict student scores on the science portion of the High School Proficiency Assessment (HSPA). Data collected for science achievement included the science portion of the Grade Eight Proficiency Assessment (GEPA) as a pretest and the scores for the science portion of the HSPA as a posttest. Data collected for curriculum alignment included an examination of teacher generated course curriculum maps to determine the alignment with the New Jersey Core Curriculum Content Standards and the HSPA Test Specifications Directory. The quantitative data were treated through a series of paired samples t-tests, Pearson product moment correlation was used to examine relationships between variables, an ANCOVA analysis and a stepwise regression analysis were also completed. Based on the findings of the data analysis of this research effort, the following conclusions were drawn: (1) the alignment of the enacted curriculum with the tested and written curricula affected science achievement. (2) GEPA scores are significantly tied to HSPA scores and (3) GEPA scores and enrollment in the science sequence whose curriculum was aligned with the written and tested curricula, met the requirements of a predictor of scores on the HSPA exam. It is expected that educational leadership will use the results of this research to inform practice and drive decision-making in respect to student placement in to course sequences. It is hoped that the results will not only increase support for the district's curricula development plan but also add to the overall body of knowledge surrounding science program effectiveness in relation to the No Child Left Behind standards.

  18. The Token and Reporter's Tests--Using Two Scoring Conventions: A Normative Study with 286 Grade and Junior High Students and Use with 123 Language-Disordered Students.

    ERIC Educational Resources Information Center

    Jordan, Linda S.; Hall, Penelope K.

    1985-01-01

    Performance of 286 normal children (grades K-9) on the De Renzi and Faglioni form of the Token Test and the De Renzi and Ferrari Reporter's Test were analyzed. Two different scoring conventions were compared: number correct versus weighted scores. Normative data are presented by grade level and age. Specific administration and scoring procedures…

  19. Longitudinal analysis of standardized test scores of students in the Science Writing Heuristic approach

    NASA Astrophysics Data System (ADS)

    Chanlen, Niphon

    The purpose of this study was to examine the longitudinal impacts of the Science Writing Heuristic (SWH) approach on student science achievement measured by the Iowa Test of Basic Skills (ITBS). A number of studies have reported positive impact of an inquiry-based instruction on student achievement, critical thinking skills, reasoning skills, attitude toward science, etc. So far, studies have focused on exploring how an intervention affects student achievement using teacher/researcher-generated measurement. Only a few studies have attempted to explore the long-term impacts of an intervention on student science achievement measured by standardized tests. The students' science and reading ITBS data was collected from 2000 to 2011 from a school district which had adopted the SWH approach as the main approach in science classrooms since 2002. The data consisted of 12,350 data points from 3,039 students. The multilevel model for change with discontinuity in elevation and slope technique was used to analyze changes in student science achievement growth trajectories prior and after adopting the SWH approach. The results showed that the SWH approach positively impacted students by initially raising science achievement scores. The initial impact was maintained and gradually increased when students were continuously exposed to the SWH approach. Disadvantaged students who were at risk of having low science achievement had bigger benefits from experience with the SWH approach. As a result, existing problematic achievement gaps were narrowed down. Moreover, students who started experience with the SWH approach as early as elementary school seemed to have better science achievement growth compared to students who started experiencing with the SWH approach only in high school. The results found in this study not only confirmed the positive impacts of the SWH approach on student achievement, but also demonstrated additive impacts found when students had longitudinal experiences

  20. Likelihood ratio and score tests to test the non-inferiority (or equivalence) of the odds ratio in a crossover study with binary outcomes.

    PubMed

    Li, Xiaochun; Li, Huilin; Jin, Man; D Goldberg, Judith

    2016-09-10

    We consider the non-inferiority (or equivalence) test of the odds ratio (OR) in a crossover study with binary outcomes to evaluate the treatment effects of two drugs. To solve this problem, Lui and Chang (2011) proposed both an asymptotic method and a conditional method based on a random effects logit model. Kenward and Jones (1987) proposed a likelihood ratio test (LRTM ) based on a log linear model. These existing methods are all subject to model misspecification. In this paper, we propose a likelihood ratio test (LRT) and a score test that are independent of model specification. Monte Carlo simulation studies show that, in scenarios considered in this paper, both the LRT and the score test have higher power than the asymptotic and conditional methods for the non-inferiority test; the LRT, score, and asymptotic methods have similar power, and they all have higher power than the conditional method for the equivalence test. When data can be well described by a log linear model, the LRTM has the highest power among all the five methods (LRTM , LRT, score, asymptotic, and conditional) for both non-inferiority and equivalence tests. However, in scenarios for which a log linear model does not describe the data well, the LRTM has the lowest power for the non-inferiority test and has inflated type I error rates for the equivalence test. We provide an example from a clinical trial that illustrates our methods. Copyright © 2016 John Wiley & Sons, Ltd. PMID:27095359

  1. The Score Reliability of Draw-a-Person Intellectual Ability Test (DAP: IQ) for Rural Malawi Students

    ERIC Educational Resources Information Center

    Khasu, Denis S.; Williams, Thomas O., Jr.

    2016-01-01

    In this brief article, the reliability of scores for the Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults (DAP: IQ; Reynolds & Hickman, 2004) was examined through several analyses with a sample of 147 children from rural Malawi, Africa using a Chichewa translation of instructions. Cronbach alpha coefficients for…

  2. Improving Secondary Practical Computer Skills: Logo Test Scores through Graphically Designed Computer Programs and Utilization of Multimedia and Technology.

    ERIC Educational Resources Information Center

    Miller, Douglas S.

    The intent of this project was to improve test and programming scores of 9th through 12th grade students enrolled in the Practical Computer Skills: Logo course in a north central Florida high school. An implementation program that demonstrated teacher-designed graphical computer language Logo programs, utilized multimedia techniques, and used…

  3. Assessing the Effect of School Days and Absences on Test Score Performance. CEP Discussion Paper No. 1302

    ERIC Educational Resources Information Center

    Aucejo, Esteban M.; Romano, Teresa Foy

    2014-01-01

    While instructional time is viewed as crucial to learning, little is known about the effectiveness of reducing absences relative to increasing the number of school days. In this regard, this paper jointly estimates the effect of absences and length of the school calendar on test score performance. Using administrative data from North Carolina…

  4. The Impact of Developmental Education on the Graduation Rate of Students with Low Combined Differential Aptitude Test Scores.

    ERIC Educational Resources Information Center

    Marcotte, James

    Since 1974, Cincinnati Technical College (CTC) has offered a Developmental Education (DE) program designed to upgrade students' skills in mathematics, reading, English, and basic sciences, as well as study and interpersonal skills. The DE program offers students, particularly those with low combined Differential Aptitude Test (DAT) scores, a…

  5. The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

    ERIC Educational Resources Information Center

    Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward

    2009-01-01

    This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic error (bias), random…

  6. Incomplete Psychometric Equivalence of Scores Obtained on the Manual and the Computer Version of the Wisconsin Card Sorting Test?

    ERIC Educational Resources Information Center

    Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude

    2010-01-01

    The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…

  7. Critical Factors to Consider in Evaluating Standard-Setting Studies to Map Language Test Scores to Frameworks of Language Proficiency

    ERIC Educational Resources Information Center

    Tannenbaum, Richard J.; Cho, Yeonsuk

    2014-01-01

    In this article, we consolidate and present in one place what is known about quality indicators for setting standards so that stakeholders may be able to recognize the signs of standard-setting quality. We use the context of setting standards to associate English language test scores with language proficiency descriptions such as those presented…

  8. Differential Predictive Validity of High School GPA and College Entrance Test Scores for University Students in Yemen

    ERIC Educational Resources Information Center

    Al-Hattami, Abdulghani Ali Dawod

    2012-01-01

    High school grade point average and college entrance test scores are two admission criteria that are currently used by most colleges in Yemen to select their prospective students. Given their widespread use, it is important to investigate their predictive validity to ensure the accuracy of the admission decisions in these institutions. This study…

  9. Validation of a Simplified Method for Determining Passing Scores for Criterion-Referenced, Multiple-Choice Tests.

    ERIC Educational Resources Information Center

    Meredith, John B., Jr.

    The complexity of defining accurate passing scores with a minimum classification error when evaluating criterion-referenced, multiple-choice tests has been a major problem for classroom teachers. Therefore, a practical procedure in which the instructor determines the plausibility of each item option for the minimally acceptable examinee is…

  10. The Effects of Georgia's Choice Curricular Reform Model on Third Grade Science Scores on the Georgia Criterion Referenced Competency Test

    ERIC Educational Resources Information Center

    Phemister, Art W.

    2010-01-01

    The purpose of this study was to evaluate the effectiveness of the Georgia's Choice reading curriculum on third grade science scores on the Georgia Criterion Referenced Competency Test from 2002 to 2008. In assessing the effectiveness of the Georgia's Choice curriculum model this causal comparative study examined the 105 elementary schools that…

  11. Legal Issues in the Use of Student Test Scores and Value-Added Models (VAM) to Determine Educational Quality

    ERIC Educational Resources Information Center

    Pullin, Diana

    2013-01-01

    A growing number of states and local schools across the country have adopted educator evaluation and accountability programs based on the use of student test scores and value-added models (VAM). A wide array of potential legal issues could arise from the implementation of these programs. This article uses legal analysis and social science evidence…

  12. Test Score Measurement Error, Short-Term Knowledge, and Lagged Dependent Variables in Models of the Education Production Function

    ERIC Educational Resources Information Center

    Stacy, Brian; Lockwood, J. R.; McCaffrey, Daniel

    2012-01-01

    Researchers and policymakers are interested in the causal effects of educational inputs on student achievement. Unfortunately, it is not possible to directly observe student learning, so test score data is often used as an approximate measure. To measure their achievement at a given point in time (e.g., in the spring of the school year) students…

  13. Comparing Standardized Test Scores among Arts-Integrated and Non-Arts Integrated Schools in Central Mississippi

    ERIC Educational Resources Information Center

    Dean, Darlene

    2014-01-01

    The topic of arts integration creates continuing dialog among educators and arts advocates. This study examined the degree to which student achievement was affected when arts education is limited or eliminated from schools to meet the mandates of NCLB (2001) legislation. Standardized test scores from 12 schools in Central Mississippi were used to…

  14. "Does Charter School Attendance Improve Test Scores?" Comments and Reactions on the Arizona Achievement Study. Upjohn Institute Staff Working Paper.

    ERIC Educational Resources Information Center

    Nelson, Christopher; Hollenbeck, Kevin

    In a recent report, Lewis Solmon, Kern Paark, and David Garcia (2001) seek to identify the impact of attending charter schools on student achievement using data from Arizona. Based on a sophisticated statistical analysis, these authors report that charter school attendance increases test score gains of students. This note raises some questions…

  15. The Role of Fluid, Crystallized, and Creative Abilities in the Prediction of Scores on Essay and Objective Tests.

    ERIC Educational Resources Information Center

    Legg, Sue M.; Ware, William B.

    Student and test characteristics were examined by multiple regression analysis and discriminant function analysis to explain why 171 political science undergraduates scored differently on essay versus objective final examinations. Student characteristics included: (1) patterns of creative, crystallized, and fluid abilities as measured by the…

  16. Examination of the Psychometric Properties of the Test Anxiety Scale for Elementary Students (TAS-E) Scores

    ERIC Educational Resources Information Center

    Lowe, Patricia A.; Grumbein, Matthew J.; Raad, Jennifer M.

    2011-01-01

    The psychometric properties of the Test Anxiety Scale for Elementary Students (TAS-E) scores were examined. In Study 1, an exploratory factor analysis (EFA) was performed on the responses of 997 students in Grades 2 to 6 on the TAS-E. The results of the EFA produced a four-factor solution: Physiological Hyperarousal, Social Concerns, Task…

  17. Is the Test Score Decline Responsible for the Productivity Growth Decline? Working Paper No. 87-05.

    ERIC Educational Resources Information Center

    Bishop, John

    This paper presents evidence that recent aptitude test score decline is signaling a significant deterioration in the quality of entering cohorts of workers. The impact of general intellectual achievement (GIA) on productivity; trends in the GIA of the adult populations, students, and working adults; accounting for the labor quality growth when…

  18. Classifying and scoring of molecules with the NGN: new datasets, significance tests, and generalization

    PubMed Central

    2010-01-01

    This paper demonstrates how a Neural Grammar Network learns to classify and score molecules for a variety of tasks in chemistry and toxicology. In addition to a more detailed analysis on datasets previously studied, we introduce three new datasets (BBB, FXa, and toxicology) to show the generality of the approach. A new experimental methodology is developed and applied to both the new datasets as well as previously studied datasets. This methodology is rigorous and statistically grounded, and ultimately culminates in a Wilcoxon significance test that proves the effectiveness of the system. We further include a complete generalization of the specific technique to arbitrary grammars and datasets using a mathematical abstraction that allows researchers in different domains to apply the method to their own work. Background Our work can be viewed as an alternative to existing methods to solve the quantitative structure-activity relationship (QSAR) problem. To this end, we review a number approaches both from a methodological and also a performance perspective. In addition to these approaches, we also examined a number of chemical properties that can be used by generic classifier systems, such as feed-forward artificial neural networks. In studying these approaches, we identified a set of interesting benchmark problem sets to which many of the above approaches had been applied. These included: ACE, AChE, AR, BBB, BZR, Cox2, DHFR, ER, FXa, GPB, Therm, and Thr. Finally, we developed our own benchmark set by collecting data on toxicology. Results Our results show that our system performs better than, or comparatively to, the existing methods over a broad range of problem types. Our method does not require the expert knowledge that is necessary to apply the other methods to novel problems. Conclusions We conclude that our success is due to the ability of our system to: 1) encode molecules losslessly before presentation to the learning system, and 2) leverage the design of

  19. Will Teacher Value-Added Scores Change When Accountability Tests Change? What We Know Series: Value-Added Methods and Applications. Knowledge Brief 8

    ERIC Educational Resources Information Center

    McCaffrey, Daniel F.

    2013-01-01

    Value-added evaluations use student test scores to assess teacher effectiveness. How student achievement is judged can depend on which test is used to measure it. Thus it is reasonable to ask whether a teacher's value-added score depends on which test is used to calculate it. Would it change if a different test was used? Specifically, might a…

  20. What Causes the Test-Score Gap in Higher Education? Perspectives on the Office for Civil Rights Resource Guide on High-Stakes Testing.

    ERIC Educational Resources Information Center

    Zwick, Rebecca

    2001-01-01

    Considers the guide on high-stakes testing issued by the federal Office for Civil Rights, including the controversy which ensued upon release of the first draft, changes in the subsequent version, and the issue of differences in educational achievement among ethnic and racial groups of which differences in standardized test scores may be…

  1. How Close Is Close Enough? Testing Nonexperimental Estimates of Impact against Experimental Estimates of Impact with Education Test Scores as Outcomes. Discussion Paper.

    ERIC Educational Resources Information Center

    Wilde, Elizabeth Ty; Hollister, Robinson

    This study tested the performance of nonexperimental estimators of impacts applied to a class size reduction intervention with achievement test scores as the outcome. Nonexperimental estimates of impacts were compared to "true impact" estimates provided by a random-assignment design that assessed intervention effects. Data came from Project STAR,…

  2. How Close Is Close Enough? Testing Nonexperimental Estimates of Impact against Experimental Estimates of Impact with Education Test Scores as Outcomes. Discussion Paper No. 1242-02

    ERIC Educational Resources Information Center

    Wilde, Elizabeth Ty; Hollister, Robinson

    2002-01-01

    In this study we test the performance of some nonexperimental estimators of impacts applied to an educational intervention--reduction in class size--where achievement test scores were the outcome. We compare the nonexperimental estimates of the impacts to "true impact" estimates provided by a random-assignment design used to assess the…

  3. An Investigation of Calculator Use on Employment Tests of Mathematical Ability: Effects on Reliability, Validity, Test Scores, and Speed of Completion

    ERIC Educational Resources Information Center

    Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl

    2009-01-01

    Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…

  4. Using NAEP to Confirm State Test Results in the No Child Left Behind Act

    ERIC Educational Resources Information Center

    Stoneberg, Bert D.

    2007-01-01

    The U.S. Department of Education has not yet published an official guidance document for using NAEP achievement level scores to confirm state testing results. A review of the literature, however, identified four principles that inform the valid use of NAEP scores in a confirming analysis. The principles address the appropriate NAEP statistics to…

  5. Contributions of Selected Perinatal Variables to Seven-Year Psychological and Achievement Test Scores.

    ERIC Educational Resources Information Center

    Henderson, N. B.; And Others

    Perinatal variables were used to predict 7-year outcome for 538 children, 32% Negro and 68% white. Mother's age, birthplace, education, occupation, marital status, neuropsychiatric status, family income, number supported, birth weight, one- and five-minute Apgar scores were regressed on 7-year Verbal, Performance and Full Scale IQ, Bender, Wide…

  6. Using Propensity Score Matching to Test the Community College Penalty Assumption

    ERIC Educational Resources Information Center

    Dietrich, Cecile C.; Lichtenberger, Eric J.

    2015-01-01

    Research studies have been ambivalent about whether enrolling in community college makes completing a bachelor's degree less likely than directly enrolling in a four-year institution. This study uses propensity score matching with a posttreatment adjustment to determine the treatment effect associated with taking the community college to…

  7. Out-of-School Time Program Test Score Impact for Black Children of Single-Parents

    ERIC Educational Resources Information Center

    Nagle, Barry T.

    2013-01-01

    Out-of-School Time programs and their impact on standardized college entrance exam scores for black or African-American children of single parents who have applied for a competitive college scholarship program is the study focus. Study importance is supported by the large percentage of black children raised by single parents, the large percentage…

  8. Examining the Achievement Test Score Gap between Urban and Suburban Students

    ERIC Educational Resources Information Center

    Sandy, Jonathan; Duncan, Kevin

    2010-01-01

    Data from the National Longitudinal Survey of Labor Market Experience for Youth (1997 cohort) are used to examine the urban school achievement gap. Specifically, we use the Blinder-Oaxaca technique to decompose differences in Armed Services Vocational Aptitude Battery scores for students who attended urban and suburban schools. We find that…

  9. The Effect of Linguistic Simplification of Science Test Items on Score Comparability

    ERIC Educational Resources Information Center

    Rivera, Charlene; Stansfield, Charles W.

    2004-01-01

    The use of accommodations has been widely proposed as a means of including English language learners (ELLs) or limited English proficient (LEP) students in state and districtwide assessments. However, very little experimental research has been done on specific accommodations to determine whether these pose a threat to score comparability. This…

  10. Effects of Public Preschool Expenditures on the Test Scores of Fourth Graders: Evidence from TIMSS

    ERIC Educational Resources Information Center

    Waldfogel, Jane; Zhai, Fuhua

    2008-01-01

    This study examines the effects of public preschool expenditures on the math and science scores of 4th graders, holding constant child, family, and school characteristics, other relevant social expenditures, and country and year effects, in 7 Organisation for Economic Co-operation and Development (OECD) countries--Australia, Japan, the…

  11. The Relationship between Kindergarten Students' Home Block Play and Their Spatial Ability Test Scores

    ERIC Educational Resources Information Center

    Jones, Tracy Anne

    2010-01-01

    Researchers are increasingly aware of the role of spatial skills in preparing children for future mathematics achievement (National Mathematics Advisory Panel, 2008). In addition, sex differences have been consistently documented showing boys score higher than girls in assessments of spatial ability, particularly mental rotation (Linn & Peterson,…

  12. Comparative assessment of scoring functions on an updated benchmark: 1. Compilation of the test set.

    PubMed

    Li, Yan; Liu, Zhihai; Li, Jie; Han, Li; Liu, Jie; Zhao, Zhixiong; Wang, Renxiao

    2014-06-23

    Scoring functions are often applied in combination with molecular docking methods to predict ligand binding poses and ligand binding affinities or to identify active compounds through virtual screening. An objective benchmark for assessing the performance of current scoring functions is expected to provide practical guidance for the users to make smart choices among available methods. It can also elucidate the common weakness in current methods for future improvements. The primary goal of our comparative assessment of scoring functions (CASF) project is to provide a high-standard, publicly accessible benchmark of this type. Our latest study, i.e., CASF-2013, evaluated 20 popular scoring functions on an updated set of protein-ligand complexes. This data set was selected out of 8302 protein-ligand complexes recorded in the PDBbind database (version 2013) through a fairly complicated process. Sample selection was made by considering the quality of complex structures as well as binding data. Finally, qualified complexes were clustered by 90% similarity in protein sequences. Three representative complexes were chosen from each cluster to control sample redundancy. The final outcome, namely, the PDBbind core set (version 2013), consists of 195 protein-ligand complexes in 65 clusters with binding constants spanning nearly 10 orders of magnitude. In this data set, 82% of the ligand molecules are "druglike" and 78% of the protein molecules are validated or potential drug targets. Correlation between binding constants and several key properties of ligands are discussed. Methods and results of the scoring function evaluation will be described in a companion work in this issue (doi: 10.1021/ci500081m ). PMID:24716849

  13. State Test Score Trends through 2008-09, Part 5: Progress Lags in High School, Especially for Advanced Achievers

    ERIC Educational Resources Information Center

    McMurrer, Jennifer; Kober, Nancy

    2011-01-01

    This report by the Center on Education Policy (CEP), an independent nonprofit organization, examines trends in the achievement of high school students on the state reading/English language arts (ELA) and mathematics tests used for accountability under the No Child Left Behind Act (NCLB). In most states, these tests are first administered in grade…

  14. Identifying the Value of the ACT Score as a Predictor of Student Success in Respiratory Care, Radiography, and Nursing at Southeast Kentucky Community and Technical College

    ERIC Educational Resources Information Center

    Parrott-Robbins, Rebecca Jon

    2010-01-01

    The purpose of this study was to investigate--by utilizing data obtained from the Kentucky Community and Technical College System (KCTCS) PeopleSoft database-- whether the American College Testing (ACT) assessment was a predictor of student success for students who had graduated from respiratory, radiography, and nursing programs at Southeast…

  15. Evolution of the COPD Assessment Test score during chronic obstructive pulmonary disease exacerbations: Determinants and prognostic value

    PubMed Central

    Feliz-Rodriguez, Darwin; Zudaire, Santiago; Carpio, Carlos; Martínez, Elizabet; Gómez-Mendieta, Antonia; Santiago, Ana; Alvarez-Sala, Rodolfo; García-Río, Francisco

    2013-01-01

    BACKGROUND: An adequate evaluation of exacerbations is a primary objective in managing patients with chronic obstructive pulmonary disease (COPD). OBJECTIVES: To define the profile of health status recovery during severe exacerbations of COPD using the COPD Assessment Test (CAT) questionnaire and to evaluate its prognostic value. METHODS: Forty-five patients with previous COPD diagnoses who were hospitalized due to severe exacerbation(s) were included in the study. These patients were treated by their respective physicians following current recommendations; health status was assessed daily using the CAT questionnaire. The CAT score, spirometry and recurrent hospitalizations were recorded one and three months after hospital discharge. RESULTS: Global initiative for chronic Obstructive Lung Disease (GOLD) stage was an independent determinant for increased CAT score during the first days of exacerbation with respect to postexacerbation values. From hospitalization day 5, the CAT score was similar to that obtained in the stable phase. Body mass index, GOLD stage and education level were related to health status recovery pattern. CAT score increase and the area under the curve of CAT recovery were inversely related to the forced expiratory volume in 1 s achieved three months after discharge (r= −0.606; P<0.001 and r= −0.532; P<0.001, respectively). Patients with recurrent hospitalizations showed higher CAT score increases and slower recovery. CONCLUSIONS: The CAT detects early health status improvement during severe COPD exacerbations. Its initial worsening and recovery pattern are related to lung function and recurrent hospitalizations. PMID:24093119

  16. Interpreting the g loadings of intelligence test composite scores in light of Spearman's law of diminishing returns.

    PubMed

    Reynolds, Matthew R

    2013-03-01

    The linear loadings of intelligence test composite scores on a general factor (g) have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the g loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of this study was to (a) investigate whether the g loadings of composite scores from the Differential Ability Scales (2nd ed.) (DAS-II, C. D. Elliott, 2007a, Differential Ability Scales (2nd ed.). San Antonio, TX: Pearson) were nonlinear and (b) if they were nonlinear, to compare them with linear g loadings to demonstrate how SLODR alters the interpretation of these loadings. Linear and nonlinear confirmatory factor analysis (CFA) models were used to model Nonverbal Reasoning, Verbal Ability, Visual Spatial Ability, Working Memory, and Processing Speed composite scores in four age groups (5-6, 7-8, 9-13, and 14-17) from the DAS-II norming sample. The nonlinear CFA models provided better fit to the data than did the linear models. In support of SLODR, estimates obtained from the nonlinear CFAs indicated that g loadings decreased as g level increased. The nonlinear portion for the nonverbal reasoning loading, however, was not statistically significant across the age groups. Knowledge of general ability level informs composite score interpretation because g is less likely to produce differences, or is measured less, in those scores at higher g levels. One implication is that it may be more important to examine the pattern of specific abilities at higher general ability levels. PMID:23506024

  17. A quantitative examination of school configurations in Tennessee using sixth grade math, reading, science, and social studies standardized test scores

    NASA Astrophysics Data System (ADS)

    Ramsey, Whitney J.

    The purpose of this study was to determine if there were differences in standardized test scores, expressed as percentage passing, in math, reading-language arts, science, and social studies by comparing 6th grade students in K--8 schools with those in 6--8 schools. The data were gathered from an analysis of 6th grade students' scores on the 2006--2007 TCAP standardized assessment test in the state of Tennessee. The relationship between grade configuration (6--8 or K--8) and percent of 6th grade students scoring at the below proficient, proficient, or advanced level in each subject area was examined. The analysis was based on 5 research questions. A t-test for independent samples was used to identify the relationships between the independent variables, configuration of the school (K--8 or 6--8), and the dependent variables, the percent of students scoring below proficient, proficient, or advanced. A chi square analysis was used to identify the relationship between the proportion of K--8 schools meeting AYP versus the proportion of 6--8 schools meeting AYP. The study showed no relationship between grade configuration (6--8 or K--8) and percent of 6th grade students scoring at the below proficient level in math, reading-language arts, and social studies. Similarly, there was not a significant difference between grade configuration (6--8 or K--8) and percent of 6th grade students scoring at the proficient level in math and reading-language arts and the advanced level in math, reading-language arts, and science. However, there was a significant relationship between grade configuration (6--8 or K--8) and percent of 6th grade students scoring at the below proficient level and the proficient level in science and the percent of 6th grade students scoring at the proficient level and advanced level in social studies. In science, a lower percentage of 6th grade students in K--8 schools scored below proficient than did 6th grade students in 6--8 schools. In science, a higher

  18. Use of e-rater[R] in Scoring of the TOEFL iBT[R] Writing Test. Research Report. ETS RR-11-25

    ERIC Educational Resources Information Center

    Haberman, Shelby J.

    2011-01-01

    Alternative approaches are discussed for use of e-rater[R] to score the TOEFL iBT[R] Writing test. These approaches involve alternate criteria. In the 1st approach, the predicted variable is the expected rater score of the examinee's 2 essays. In the 2nd approach, the predicted variable is the expected rater score of 2 essay responses by the…

  19. Correlates of Children's Eating Attitude Test scores among primary school children.

    PubMed

    Shariff, Zalilah Mohd; Yasin, Zaidah Mohamed

    2005-04-01

    A total of 107 Malay primary school girls (8-9 yr. old) completed a set of measurements on eating behavior (ChEAT, food neophobia scales, and dieting experience), the Rosenberg Self-Esteem Scale, body shape satisfaction, dietary intake, weight, and height. About 38% of the girls scored 20 and more on the ChEAT, and 46% of them reported dieting by reducing sugar and sweets (73%), skipping meals (67%), reducing fat foods (60%) and snacks (53%) as the most frequent methods practiced. In general, those girls with higher ChEAT scores tended to have lower self-esteem (r=.39), indicating they were more unwilling to try new foods (food neophobic) (r=.29), chose a smaller figure for desired body size (r=-.25), and were more dissatisfied with their body size (r=.31). PMID:15974357

  20. Evaluation of the Generalizability of the Number of Abnormal Scores and the Overall Test Battery Mean as Measures of Performance Validity to a Different Test Battery.

    PubMed

    Silk-Eglit, Graham M; Miele, Andrea S; Stenclik, Jessica H; Lynch, Julie K; McCaffrey, Robert J

    2015-01-01

    Davis, Axelrod, McHugh, Hanks, and Millis (2013) documented that in a battery of 25 tests, producing 15, 10, and 5 abnormal scores at 1, 1.5, and 2 standard deviations below the norm-referenced mean, respectively, and an overall test battery mean (OTBM) of T ≤ 38 accurately identifies performance invalidity. However, generalizability of these findings to other samples and test batteries remains unclear. This study evaluated the use of abnormal scores and the OTBM as performance validity measures in a different sample that was administered a 25-test battery that minimally overlapped with Davis et al.'s test battery. Archival analysis of 48 examinees with mild traumatic brain injury seen for medico-legal purposes was conducted. Producing 18 or more, 7 or more, and 5 or more abnormal scores at 1, 1.5, and 2 standard deviations below the norm-referenced mean, respectively, and an OTBM of T ≤ 40 most accurately classified examinees; however, using Davis et al.'s proposed cutoffs in the current sample maintained specificity at or near acceptable levels. Due to convergence across studies, producing ≥5 abnormal scores at 2 standard deviations below the norm-referenced mean is the most appropriate cutoff for clinical implementation; however, for batteries consisting of a different quantity of tests than 25, an OTBM of T ≤ 38 is more appropriate. PMID:25785544

  1. Impact of structured verbal feedback module in medical education: A questionnaire- and test score-based analysis

    PubMed Central

    Aggarwal, Meenakshi; Singh, Sonia; Sharma, Anu; Singh, Poonam; Bansal, Priya

    2016-01-01

    Introduction: Feedback is a divalent bond between the supplier (teacher) and the recipient (student). The strength of the bond depends on the instructional design of the feedback. Feedback is central to medical education in promoting self-directed learning in students. In the present study, a structured verbal feedback module was prepared, implemented, and evaluated. Methods: The study was done on 280 students from four consecutive batches (2011 to 2014) of the 1st year MBBS students exposed to different types and modes of feedback. Analysis was done using student feedback questionnaire for the perception of students to verbal feedback. Quantitative analysis using post hoc test and ANOVA for the impact of type of feedback (verbal or written) and effect of modes (individual or group) of verbal feedback on test score performance were done. Result: In this study, ≥95% of the students preferred verbal feedback of both positive and negative attributes in student questionnaires. It was observed that verbal feedback sessions made a difference of up to 2–2.4 grade points in the mean score of batch when compared to the written feedback. The initial mean test score (T1) of 2011 + 2012 and 2013 + 2014 was not statistically significant (P = 0.113). But, in all subsequent tests (T2, T3, and T4), there was a statistically significant difference in the mean test scores (P = 0.000). Conclusion: (1) Students prefer verbal one-to-one feedback over written feedback. (2) Verbal feedback changes learning process and causes sustained improvement in learning strategies. PMID:27563592

  2. Do classroom ventilation rates in California elementary schools influence standardized test scores? Results from a prospective study.

    PubMed

    Mendell, M J; Eliseeva, E A; Davies, M M; Lobscheid, A

    2016-08-01

    Limited evidence has associated lower ventilation rates (VRs) in schools with reduced student learning or achievement. We analyzed longitudinal data collected over two school years from 150 classrooms in 28 schools within three California school districts. We estimated daily classroom VRs from real-time indoor carbon dioxide measured by web-connected sensors. School districts provided individual-level scores on standard tests in Math and English, and classroom-level demographic data. Analyses assessing learning effects used two VR metrics: average VRs for 30 days prior to tests, and proportion of prior daily VRs above specified thresholds during the year. We estimated relationships between scores and VR metrics in multivariate models with generalized estimating equations. All school districts had median school-year VRs below the California VR standard. Most models showed some positive associations of VRs with test scores; however, estimates varied in magnitude and few 95% confidence intervals excluded the null. Combined-district models estimated statistically significant increases of 0.6 points (P = 0.01) on English tests for each 10% increase in prior 30-day VRs. Estimated increases in Math were of similar magnitude but not statistically significant. Findings suggest potential small positive associations between classroom VRs and learning. PMID:26283474

  3. Co-Educational Tutorial Classes and Their Significance on Gendered Test Scores of Wollo University Students: A Before-After Analyses

    ERIC Educational Resources Information Center

    Gidey, Mu'uz

    2015-01-01

    This action research is carried out in a practical class room setting to devise an innovative way of administering tutorial classes to improve students' learning competence with particular reference to gendered test scores. A before-after test score analyses of mean and standard deviations along with t-statistical tests of hypotheses of second…

  4. The Unintended Consequences of Narrowing Secondary Curriculum in Response to Low Standardized Test Scores

    ERIC Educational Resources Information Center

    Reid, LaShonda Nicole

    2012-01-01

    The No Child Left Behind Act of 2001 placed high demands on public schools by requiring that all students become proficient in math and reading by the year 2014. Many public schools responded to the demand by limiting curriculum to the core curriculum, particularly in the lengthening of the amount of time spent studying the subjects reading and…

  5. Scoring Best on All Life's Tests--By Using Your "Whole Brain."

    ERIC Educational Resources Information Center

    Rubenzer, Ronald L.

    This collection of documents contains advice on doing well on tests by preparing and using the whole brain. The first sheet contains general tips for helping students become fact-smart, how to improve test-smartness, and how to build test-smartness. The combination of these three approaches assures that students will do their best on tests. The…

  6. Development of a standardized scoring method for the Graphic Sequence Test suitable for use in psychiatric populations.

    PubMed

    Fountoulakis, Konstantinos N; Panagiotidis, Panagiotis T; Siamouli, Melina; Magiria, Stamatia; Sokolaki, Stavroula; Kantartzis, Sotiris; Rova, Klairi; Papastergiou, Natalia; Shoretstanitis, George; Oral, Timucin; Mavridis, Theoharis; Iacovides, Apostolos; Kaprinis, George

    2008-03-01

    Although the graphic version of the Alternating Sequences Test which was introduced by Luria exists for years little has been done to standardize it. The aim of the current study was to develop a novel and detailed standardized method of administration and scoring. The study sample included 93 normal control subjects (53 women and 40 men) aged 35.87+/-12.62 and 127 patients suffering from schizophrenia (54 women and 73 men) aged 34.07+/-9.83. The psychometric assessment included the Positive and Negative Symptoms Scale the Young Mania Rating Scale, and the Montgomery-Asberg Depression Rating Scale. A scoring method was developed and was based on the frequencies of responses of healthy controls. Cronbach alpha and test-retest and interrater reliability were very good. Two indices and 6 subscales of the Standardized Graphic Sequence Test were eventually developed. The Standardized Graphic Sequence Test seems to be a reliable, valid, and sensitive to change instrument based on Luria's graphic sequence test. The great advantage of this instrument is the fact that it is paper and pencil, easily administered and little time consuming. Further research is necessary to test its usefulness as a neuropsychologic test. PMID:18327019

  7. Birth order and its relationship to depression, anxiety, and self-concept test scores in children.

    PubMed

    Gates, L; Lineberger, M R; Crockett, J; Hubbard, J

    1988-03-01

    Children (N = 404), 7 to 12 years old, were given the Children's Depression Inventory, the State-Trait Anxiety Inventory for Children, and the Piers-Harris Self-Concept Scale. First-born children scored significantly lower on depression than second-, third-, fourth-born, and youngest children. First borns showed significantly less trait anxiety than third-born children. First-born children also showed significantly higher levels of self-esteem than second-born and youngest children. Girls in this study showed significantly more trait anxiety than boys. PMID:3373186

  8. A quantitative assessment of alkaptonuria: testing the reliability of two disease severity scoring systems.

    PubMed

    Cox, Trevor F; Ranganath, Lakshminarayan

    2011-12-01

    Alkaptonuria (AKU) is due to excessive homogentisic acid accumulation in body fluids due to lack of enzyme homogentisate dioxygenase leading in turn to varied clinical manifestations mainly by a process of conversion of HGA to a polymeric melanin-like pigment known as ochronosis. A potential treatment, a drug called nitisinone, to decrease formation of HGA is available. However, successful demonstration of its efficacy in modifying the natural history of AKU requires an effective quantitative assessment tool. We have described two potential tools that could be used to quantitate disease burden in AKU. One tool describes scoring the clinical features that includes clinical assessments, investigations and questionnaires in 15 patients with AKU. The second tool describes a scoring system that only includes items obtained from questionnaires used in 44 people with AKU. Statistical analyses were carried out on the two patient datasets to assess the AKU tools; these included the calculation of Chronbach's alpha, multidimensional scaling and simple linear regression analysis. The conclusion was that there was good evidence that the tools could be adopted as AKU assessment tools, but perhaps with further refinement before being used in the practical setting of a clinical trial. PMID:21744089

  9. An Adaptive Algebra Test: A Testlet-Based, Hierarchically-Structured Test with Validity-Based Scoring. Technical Report No. 90-92.

    ERIC Educational Resources Information Center

    Wainer, Howard; And Others

    The initial development of a testlet-based algebra test was previously reported (Wainer and Lewis, 1990). This account provides the details of this excursion into the use of hierarchical testlets and validity-based scoring. A pretest of two 15-item hierarchical testlets was carried out in which examinees' performance on a 4-item subset of each…

  10. Does It Matter if You "Kill" the Patient or Order Too Many Tests? Scoring Alternatives for a Test of Clinical Reasoning Skill

    ERIC Educational Resources Information Center

    Childs, Ruth A.; Dunn, Jennifer L.; van Barneveld, Christina; Jaciw, Andrew P.

    2007-01-01

    This study compares five scoring approaches for a test of clinical reasoning skills. All of the approaches incorporate information about the correct item responses selected and the errors, such as selecting too many responses or selecting a response that is inappropriate and/or harmful to the patient. The approaches are combinations of theoretical…

  11. Effects of Multidimensionality on IRT Item Characteristics and True Score Estimates: Implications for Computerized Test Assembly. Computerized Testing Report. LSAC Research Report Series.

    ERIC Educational Resources Information Center

    Wang, Xiang-Bo; Harris, Vincent; Roussos, Louis

    Multidimensionality is known to affect the accuracy of item parameter and ability estimations, which subsequently influences the computation of item characteristic curves (ICCs) and true scores. By judiciously combining sections of a Law School Admission Test (LSAT), 11 sections of varying degrees of uni- and multidimensional structures are used…

  12. A Primer-Test Centered Equating Method for Setting Cut-Off Scores

    ERIC Educational Resources Information Center

    Zhu, Weimo; Plowman, Sharon Ann; Park, Youngsik

    2010-01-01

    This study evaluated the use of a new primary field test method based on test equating to address inconsistent classification among field tests. We analyzed students' information on the Progressive Aerobic Cardiovascular Endurance Run (PACER), mile run (MR), and VO[subscript 2]max from three data sets (college: n = 94; middle school: n = 39;…

  13. Pass-Fail Reliability for Tests with Cut Scores: A Simplified Method.

    ERIC Educational Resources Information Center

    Breyer, F. Jay; Lewis, Charles

    A single-administration classification reliability index is described that estimates the probability of consistently classifying examinees to mastery or nonmastery states as if those examinees had been tested with two alternate forms. The procedure is applicable to any test used for classification purposes, subdividing that test into two…

  14. The Search for the Holy Grail: Content-Referenced Score Interpretations from Large-Scale Tests

    ERIC Educational Resources Information Center

    Marion, Scott F.

    2015-01-01

    The measurement industry is in crisis. The public outcry against "over testing" and the opt-out movement are symptoms of a larger sociopolitical battle being fought over Common Core, teacher evaluation, federal intrusion, and a host of other issues, but much of the vitriol is directed at the tests and the testing industry. If we, as…

  15. Comparison of Physical Therapy Anatomy Performance and Anxiety Scores in Timed and Untimed Practical Tests

    ERIC Educational Resources Information Center

    Schwartz, Sarah M.; Evans, Cathy; Agur, Anne M.R.

    2015-01-01

    Students in health care professional programs face many stressful tests that determine successful completion of their program. Test anxiety during these high stakes examinations can affect working memory and lead to poor outcomes. Methods of decreasing test anxiety include lengthening the time available to complete examinations or evaluating…

  16. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Pennsylvania

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Pennsylvania's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 1390 for non-Title I students and 1220 for Title I students. In 2009, the mean scale score in 4th grade reading was 1420 for non-Title I students and 1270 for Title I students. Between 2006 and 2009, the mean…

  17. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Tennessee

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Tennessee's test score trends through 2008-09. In 2004, the mean scale score on the state 4th grade reading test was 501 for non-Title I students and 486 for Title I students. In 2009, the mean scale score in 4th grade reading was 512 for non-Title I students and 495 for Title I students. Between 2004 and 2009, the mean scale…

  18. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Kentucky

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Kentucky's test score trends through 2008-09. In 2007, the mean scale score on the state 4th grade reading test was 455 for non-Title I students and 451 for Title I students. In 2009, the mean scale score in 4th grade reading was 455 for non-Title I students and 451 for Title I students. Between 2007 and 2009, the mean scale…

  19. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Maryland

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Maryland's test score trends through 2008-09. In 2004, 82% of non-Title I 4th graders and 61% of Title I 4th graders scored at the proficient level on the state reading test. In 2009, 90% of non-Title I 4th graders and 78% of Title I 4th graders scored at the proficient level in reading. Between 2004 and 2009, the percentage…

  20. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Idaho

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Idaho's test score trends through 2008-09. In 2007, the mean scale score on the state 4th grade reading test was 209 for non-Title I students and 205 for Title I students. In 2007, the mean scale score in 4th grade reading was 211 for non-Title I students and 208 for Title I students. Between 2007 and 2009, the mean scale score…

  1. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? California

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles California's test score trends through 2008-09. In 2004, the mean scale score on the state 4th grade reading test was 341 for non-Title I students and 315 for Title I students. In 2008, the mean scale score in 4th grade reading was 379 for non-Title I students and 340 for Title I students. Between 2004 and 2008, the mean scale…

  2. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Delaware

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Delaware's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 474 for non-Title I students and 464 for Title I students. In 2009, the mean scale score in 4th grade reading was 478 for non-Title I students and 467 for Title I students. Between 2006 and 2009, the mean scale…

  3. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Utah

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Utah's test score trends through 2008-09. In 2004, the mean scale score on the state 4th grade reading test was 167 for non-Title I students and 164 for Title I students. In 2009 the mean scale score in 4th grade reading was 168 for non-Title I students and 164 for Title I students. Between 2004 and 2009, the mean scale score…

  4. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Washington

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Washington's test score trends through 2008-09. Three years of comparable mean scale score data were not available from the state. In 2004, 77% of non-Title I 4th graders and 60% of Title I 4th graders scored at the proficient level on the state reading test. In 2009, 75% of non-Title I 4th graders and 61% of Title I 4th…

  5. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Massachusetts

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Massachusetts's test score trends through 2008-09. In 2006, 59% of non-Title I 4th graders and 29% of Title I 4th graders scored at the proficient level on the state reading test. In 2009, 64% of non-Title I 4th graders and 31% of Title I 4th graders scored at the proficient level in reading. Between 2006 and 2009, the…

  6. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Missouri

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Missouri's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 661 for non-Title I students and 642 for Title I students. In 2009, the mean scale score in 4th grade reading was 661 for non-Title I students and 648 for Title I students. Between 2006 and 2009, there was no…

  7. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? New Hampshire

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles New Hampshire's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 445 for non-Title I students and 438 for Title I students. In 2009, the mean scale score in 4th grade reading was 448 for non-Title I students and 441 for Title I students. Between 2006 and 2009, the mean…

  8. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Arizona

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Arizona's test score trends through 2008-09. In 2005, the mean scale score on the state 4th grade reading test was 478 for non-Title I students and 445 for Title I students. In 2008, the mean scale score in 4th grade reading was 477 for non-title I students and 450 for title I students. Between 2005 and 2008, the mean scale…

  9. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Rhode Island

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Rhode Island's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 445 for non-Title I students and 435 for Title I students. In 2009, the mean scale score in 4th grade reading was 448 for non-Title I students and 440 for Title I students. Between 2006 and 2009, the mean…

  10. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? North Carolina

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles North Carolina's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade math test was 351 for non-Title I students and 347 for Title I students. In 2009, the mean scale score in 4th grade math was 354 for non-Title I students and 350 for Title I students. Between 2006 and 2009, the mean scale…

  11. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Kansas

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Kansas' test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 80 for non-Title I students and 73 for Title I students. In 2009, the mean scale score in 4th grade reading was 84 for non-Title I students and 78 for Title I students. Between 2006 and 2009, the mean scale score…

  12. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Maine

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Maine's test score trends through 2008-09. In 2006, the mean scale score on the state 4th grade reading test was 445 for non-Title I students and 438 for Title I students. In 2009, the mean scale score in 4th grade reading was 477 for non-Title I students and 441 for Title I students. Between 2006 and 2009, the mean scale score…

  13. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Texas

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Texas's test score trends through 2008-09. In 2005, the mean scale score on the state 4th grade reading test was 2297 for non-Title I students and 2207 for Title I students. In 2009, the mean scale score in 4th grade reading was 2334 for non-Title I students and 2235 for Title I students. Between 2005 and 2009, the mean scale…

  14. State Test Score Trends through 2008-09, Part 4: Is Achievement Improving and Are Gaps Narrowing for Title I Students? Colorado

    ERIC Educational Resources Information Center

    Center on Education Policy, 2011

    2011-01-01

    This paper profiles Colorado's test score trends through 2008-09. In 2003, the mean scale score on the state 4th grade reading test was 598 for non-Title I students and 558 for Title I students. In 2009, the mean scale score in 4th grade reading was 599 for non-Title I students and 556 for Title I students. Between 2003 and 2009, the mean scale…

  15. Score Equity Assessment:Development of a Prototype Analysis Using SAT[R] Mathematics Test Data Across Several Administrations. Research Report. ETS RR-09-08

    ERIC Educational Resources Information Center

    Dorans, Neil J.; Liu, Jinghua

    2009-01-01

    The equating process links scores from different editions of the same test. For testing programs that build nearly parallel forms to the same explicit content and statistical specifications and administer forms under the same conditions, the linkings between the forms are expected to be equatings. Score equity assessment (SEA) provides a useful…

  16. The Impact of Scholastic Instrumental Music and Scholastic Chess Study on the Standardized Test Scores of Students in Grades Three, Four, and Five

    ERIC Educational Resources Information Center

    Martinez, Edwin E.

    2012-01-01

    This study examines the impact of instrumental music study and group chess lessons on the standardized test scores of suburban elementary public school students (grades three through five) in Levittown, New York. The study divides the students into the following groups and compares the standardized test scores of each: a) instrumental music…

  17. Explaining the Gap in Black-White Scores on IQ and College Admission Tests.

    ERIC Educational Resources Information Center

    Cross, Theodore, Ed.

    1998-01-01

    Argues that differences in black performance and white performance on standardized tests likely comes from deeply rooted environmental forces such as expectations of one's life being restricted to a small and poorly rewarded set of social roles. Issues of test bias, the influence of caste-like minorities, the conflict between African American…

  18. Health Behaviors and Standardized Test Scores: The Impact of School Health Climate on Performance

    ERIC Educational Resources Information Center

    Gunter, Whitney D.; Daly, Kevin

    2013-01-01

    Research has found that many characteristics are related to performance on standardized tests. Many of these are not necessarily "academic" attributes. One area of this research is on the connection between physical health or lifestyles and test performance. The research that exists in this area is often disconnected with each other and…

  19. Predicting Success in Graduate School Using GRE and PAEG Aptitude Test Scores.

    ERIC Educational Resources Information Center

    Bornheimer, Deane G.

    1984-01-01

    Performance of limited-English speaking graduate school applicants on the Prueba de Admision para Estudios Graduados aptitude test is compared with Graduate Record Examination results, and the validity of the two tests as predictors of academic success for bilingual doctoral students in the New York University Puerto Rico program is examined. (MSE)

  20. The Impact of Linking Distinct Achievement Test Scores on the Interpretation of Student Growth in Achievement

    ERIC Educational Resources Information Center

    Airola, Denise Tobin

    2011-01-01

    Changes to state tests impact the ability of State Education Agencies (SEAs) to monitor change in performance over time. The purpose of this study was to evaluate the Standardized Performance Growth Index (PGIz), a proposed statistical model for measuring change in student and school performance, across transitions in tests. The PGIz is a…