Science.gov

Sample records for admission test scores

  1. Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

    ERIC Educational Resources Information Center

    Sawyer, Richard

    2013-01-01

    Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…

  2. Test Scores and the Graduate Admission of Older Students.

    ERIC Educational Resources Information Center

    Clark, Mary Jo

    This paper examined test scores and information about test takers collected from the Graduate Record Examination (GRE) national test administration files, GRE validity study data provided by institutional users, and survey questionnaires sent to test repeaters. Data were collected from 200,000 men and women who took the GRE General Test in June…

  3. The Validity of Law School Admission Test Scores for Repeaters: A Replication. LSAC Research Report Series.

    ERIC Educational Resources Information Center

    Wightman, Linda F.

    This study, a partial replication of an earlier study by B. Pitcher (1977), examined the validity of using initial, most recent, highest, and average scores in decisions about repeat takers of the Law School Admission Test (LSAT). The study included only schools that enrolled 50 or more first-year students who had taken the LSAT on more than 1…

  4. The Validity of Law School Admission Test Scores for Repeaters: A Replication. LSAT Technical Report. LSAC Research Report Series.

    ERIC Educational Resources Information Center

    Dalessandro, Susan P.; McLeod, Lori D.

    The fair and accurate treatment of multiple test scores for law school applicants who take the Law School Admission Test (LSAT) more than one time is the focus of this study. The study reexamines the differential validity and predictive accuracy of the different test scores that are presented by repeat test takers. The study includes U.S. law…

  5. Predicting First-Quarter Test Scores from the New Medical College Admission Test.

    ERIC Educational Resources Information Center

    Cullen, Thomas J.; And Others

    1980-01-01

    The predictive validity of the new Medical College Admission Test as it relates to end-of-quarter examinations in anatomy, histology, physiology, biochemistry, and "ages of man" is presented. Results indicate that the Science Knowledge assessment areas of chemistry and physics and the Science Problems subtest were most useful in predicting student…

  6. Comparability of Scores on Word-Processed and Handwritten Essays on the Graduate Management Admissions Test.

    ERIC Educational Resources Information Center

    Bridgeman, Brent; Cooper, Peter

    Essays for the Graduate Management Admissions Test must be written with a word processor (except in some foreign countries). The test sponsors, the Graduate Management Admissions Council, believed that this is fair because some word processing skill is a prerequisite for advanced management education. Furthermore, it might also be unfair to…

  7. Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

    ERIC Educational Resources Information Center

    Flug, Susanna

    2010-01-01

    In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take corrective action as soon as…

  8. Science standardized achievement tests: The relationship between publishers, textbook completion, admission standards and science test scores of seventh through ninth grade students in FACCS schools

    NASA Astrophysics Data System (ADS)

    Nix, Sharon J.

    Scaled scores from the Stanford Achievement Test Series, Tenth Edition were examined in this causal-comparative study to determine if science publishers in Florida Association of Christian Colleges and Schools (FACCS), textbook completion rates, and admission standards affect standardized test scores. Administrators from 34 schools in FACCS participated in the study by returning an original eleven-question survey instrument to help ascertain what differences or relationships affect standardized test scores. Nine Mann-Whitney tests, one for each grade level in seventh through ninth, did not reveal a significant difference on hypotheses 1a-3c. Publishers (BJ U Press, A.C.E., Glencoe, Prentice Hall), standardized tests, entrance exams, GPA, and ability index factors were reviewed in the study. The results of this study might prompt administrators to consider factors other than publisher usage, textbook completion, and admission standards when attempting to close achievement gaps.

  9. Admissions Testing & Institutional Admissions Processes

    ERIC Educational Resources Information Center

    Hossler, Don; Kalsbeek, David

    2009-01-01

    The array of admissions models and the underlying, and sometimes conflicting goals people have for college admissions, create the dynamics and the tensions that define the contemporary context for enrollment management. The senior enrollment officer must ask, for example, how does an institution try to assure transparency, equality of access,…

  10. Explaining the Gap in Black-White Scores on IQ and College Admission Tests.

    ERIC Educational Resources Information Center

    Cross, Theodore, Ed.

    1998-01-01

    Argues that differences in black performance and white performance on standardized tests likely comes from deeply rooted environmental forces such as expectations of one's life being restricted to a small and poorly rewarded set of social roles. Issues of test bias, the influence of caste-like minorities, the conflict between African American…

  11. Predictive value of grade point average (GPA), Medical College Admission Test (MCAT), internal examinations (Block) and National Board of Medical Examiners (NBME) scores on Medical Council of Canada qualifying examination part I (MCCQE-1) scores

    PubMed Central

    Roy, Banibrata; Ripstein, Ira; Perry, Kyle; Cohen, Barry

    2016-01-01

    Background To determine whether the pre-medical Grade Point Average (GPA), Medical College Admission Test (MCAT), Internal examinations (Block) and National Board of Medical Examiners (NBME) scores are correlated with and predict the Medical Council of Canada Qualifying Examination Part I (MCCQE-1) scores. Methods Data from 392 admitted students in the graduating classes of 2010–2013 at University of Manitoba (UofM), College of Medicine was considered. Pearson’s correlation to assess the strength of the relationship, multiple linear regression to estimate MCCQE-1 score and stepwise linear regression to investigate the amount of variance were employed. Results Complete data from 367 (94%) students were studied. The MCCQE-1 had a moderate-to-large positive correlation with NBME scores and Block scores but a low correlation with GPA and MCAT scores. The multiple linear regression model gives a good estimate of the MCCQE-1 (R2 =0.604). Stepwise regression analysis demonstrated that 59.2% of the variation in the MCCQE-1 was accounted for by the NBME, but only 1.9% by the Block exams, and negligible variation came from the GPA and the MCAT. Conclusions Amongst all the examinations used at UofM, the NBME is most closely correlated with MCCQE-1. PMID:27103953

  12. Assessing Practical Intelligence in Business School Admissions: A Supplement to the Graduate Management Admissions Test

    ERIC Educational Resources Information Center

    Hedlund, Jennifer; Wilt, Jeanne M.; Nebel, Kristina L.; Ashford, Susan J.; Sternberg, Robert J.

    2006-01-01

    The Graduate Management Admission Test (GMAT) is the most widely used measure of managerial potential in MBA admissions. GMAT scores, although predictive of grades in business school, leave much of the variance in graduate school performance unexplained. The GMAT also produces disparities in test scores between groups, generating the potential for…

  13. Using Minimum Acceptable GRE Scores for Graduate Admissions Suppresses Diversity

    NASA Astrophysics Data System (ADS)

    Miller, Casey

    2014-01-01

    I will present data showing that significant performance disparities on the GRE general test exist based on the test taker's race and gender [1]. Because of the belief that high GRE scores qualify one for graduate studies, the diversity issues faced by STEM fields may originate, at least in part, in misuse of the GRE scores by graduate admissions committees. I will quantitatively demonstrate this by showing that the combination of a hard cut-off and the different score distributions leads to the systematic underrepresentation of certain groups. I will present data from USF’s PhD program that shows a lack of correlation between GRE scores and research ability; similar null results are emerging from numerous other programs. I will then discuss how assessing non-cognitive competencies in the selection process may lead to a more enlightened search for the next generation of scientists. [1] C. W. Miller, "Admissions Criteria and Diversity in Graduate School", APS News Vol 22, Issue 2, The Back Page (2013) http://www.aps.org/publications/apsnews/201302/backpage.cfm

  14. Differential Prediction Generalization in College Admissions Testing

    ERIC Educational Resources Information Center

    Aguinis, Herman; Culpepper, Steven A.; Pierce, Charles A.

    2016-01-01

    We introduce the concept of "differential prediction generalization" in the context of college admissions testing. Specifically, we assess the extent to which predicted first-year college grade point average (GPA) based on high-school grade point average (HSGPA) and SAT scores depends on a student's ethnicity and gender and whether this…

  15. The Roles of Testing and Diversity in College Admissions.

    ERIC Educational Resources Information Center

    Clarke, Marguerite; Shore, Arnold

    In order to understand the roles of test scores and diversity characteristics (including race and ethnicity) in the admission process, National Board researchers interviewed admissions directors who worked at selective public and private institutions are well as admissions consultants in the summer and fall of 1999. This report presents an…

  16. College Admissions: Beyond Conventional Testing

    ERIC Educational Resources Information Center

    Sternberg, Robert J.

    2012-01-01

    Standardized admissions tests such as the SAT (originally stood for "Scholastic Aptitude Test") and the ACT measure only a narrow segment of the skills needed to become an active citizen and possibly a leader who makes a positive, meaningful, and enduring difference to the world. The problem with these tests is that they promised, under what have…

  17. 24-Hour ICH Score Is a Better Predictor of Outcome than Admission ICH Score

    PubMed Central

    Aysenne, Aimee M.; Albright, Karen C.; Mathias, Tiffany; Chang, Tiffany R.; Boehme, Amelia K.; Beasley, T. Mark; Martin-Schild, Sheryl

    2016-01-01

    Background The ICH score is a validated tool for predicting 30-day morbidity and mortality in patients with intracerebral hemorrhage. Aims and/or Hypothesis The aim of this study is to determine if the ICH score calculated 24 hours after admission is a better predictor of mortality than the ICH score calculated on admission. Methods Patients presenting to our center with ICH from 7/08-12/10 were retrospectively identified from our prospective stroke registry. ICH scores were calculated based on initial Glasgow coma scale (GCS) and emergent head computed tomography (CT) on initial presentation and were recalculated after 24 hours. Results A total of 91 patients out of 121 had complete data for admission and 24-hour ICH score. The ICH score changed in 38% from baseline to 24 hours. After adjusting for age, NIHSS on admission, and glucose, ICH score at 24 hours was a significant, independent predictor of mortality (OR = 2.71, 95% CI 1–19–6.20, and P = 0.018), but ICH score on admission was not (OR = 2.14, 95% CI 0.88-5.24, and P = 0.095). Conclusion Early determination of the ICH score may incorrectly estimate the severity and expected outcome after ICH. Calculations of the ICH score 24 hours after admission will better predict early outcomes.

  18. TOEFL Scores and Admissions: Using TOEFL Scores Ethically.

    ERIC Educational Resources Information Center

    Tasker, Raymond S.

    2001-01-01

    Looks at the use of the Test of English as a Foreign Language (TOEFL), the most widely used test to assess the English language proficiency of students applying to colleges and universities in the United States or Canada. Examines the TOEFL in relation to backwash, reliability, administration, validity, and ethics. (Author/VWL)

  19. An Evaluation of the Pharmacy College Admissions Test as a Tool for Pharmacy College Admissions Committees.

    ERIC Educational Resources Information Center

    Kelley, Katherine A.; Secnik, Kristina; Boye, Mark E.

    2001-01-01

    Investigated the capacity of the Pharmacy College Admissions Test (PCAT) to predict success in pharmacy school. Found demographic differences in PCAT scores, and that the PCAT used in combination with pre-pharmacy grade point average is meaningful in assessing applicants to pharmacy school; applicants with PCAT composite percentile scores below 40…

  20. Why Do Students Repeat Admissions Tests?

    ERIC Educational Resources Information Center

    Jones, Martha S.

    Attitudes and beliefs about the admissions process, especially the role of standardized testing in admissions, were examined for students who took a standardized admissions test more than once. Their attitudes were compared with those of students who did not repeat the test. About 200 preveterinary students who had taken the Veterinary Aptitude…

  1. Lexical Profiles of Thailand University Admission Tests

    ERIC Educational Resources Information Center

    Cherngchawano, Wirun; Jaturapitakkul, Natjiree

    2014-01-01

    University Admission Tests in Thailand are important documents which reflect Thailand's education system. To study at a higher education level, all students generally need to take the University Admission Tests designed by the National Institute of Educational Testing Service (NIETS). For the English test, vocabulary and reading comprehension is…

  2. The Impact of Score Differences on the Admission of Minority Students: An Illustration. NBETPP Statements, Volume 1, Number 5.

    ERIC Educational Resources Information Center

    Koretz, Daniel

    This paper discusses one of the arguments that has been advanced against the use of standardized college admissions tests: the notion that their use leads admissions officers to reject non-Asian minority students on the basis of small and insignificant differences in scores. In the aggregate, the disadvantage minority students face as a result of…

  3. The National Center Test for University Admissions

    ERIC Educational Resources Information Center

    Watanabe, Yoshinori

    2013-01-01

    This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…

  4. Validity of the Optometry Admission Test in Predicting Performance in Schools and Colleges of Optometry.

    ERIC Educational Resources Information Center

    Kramer, Gene A.; Johnston, JoElle

    1997-01-01

    A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…

  5. What To Look for in ESL Admission Tests: Cambridge Certificate Exams, IELTS, and TOEFL.

    ERIC Educational Resources Information Center

    Chalhoub-Deville, Micheline; Turner, Carolyn E.

    2000-01-01

    Familiarizes test users with issues to consider when employing assessments for screening and admission purposes. Examines the purpose, content, and scoring methods of three English-as-a-Second-Language admissions tests--the Cambridge certificate exams, International English Language Teaching System, and Test of English as a Foreign…

  6. Providing Transparency and Credibility: The Selection of International Students for Australian Universities. An Examination of the Relationship between Scores in the International Student Admissions Test (ISAT), Final Year Academic Programs and an Australian University's Foundation Program

    ERIC Educational Resources Information Center

    Lai, Kelvin; Nankervis, Susan; Story, Margot; Hodgson, Wayne; Lewenberg, Michael; Ball, Marita MacMahon

    2008-01-01

    Throughout 2003-04 five cohorts of students in their final year of school studies in various Malaysian colleges and a group of students completing an Australian university foundation year in Malaysia sat the International Student Admissions Test (ISAT). The ISAT is a multiple-choice test of general academic abilities developed for students whose…

  7. Setting Local Cut Scores on the Sat Reasoning Test™ Writing Section: For Use in College Placement and Admissions Decisions

    ERIC Educational Resources Information Center

    Morgan, Deanna L.

    2006-01-01

    The introduction of the SAT Reasoning Test™ with a writing section in March 2005 and the concomitant elimination of the SAT® Subject Test in Writing after January 2005 have led many colleges and institutions to ask for guidance in using the new SAT Reasoning Test writing section scores for college placement and admissions. Standard-setting…

  8. Repeater Effects on Score Equating for a Graduate Admissions Exam. Research Report. ETS RR-11-17

    ERIC Educational Resources Information Center

    Yang, Wen-Ling; Bontya, Andrea M.; Moses, Tim P.

    2011-01-01

    Using self-reported but empirically verified repeater groups, we analyzed vast amounts of real test data across a wide range of administrations from a graduate admissions examination that was administered in a non-English language to investigate repeater effects on score equating using the nonequivalent groups with anchor test (NEAT) design. Both…

  9. Z-Score Demystified: A Critical Analysis of the Sri Lankan University Admission Policy

    ERIC Educational Resources Information Center

    Warnapala, Yajni; Silva, Karishma

    2011-01-01

    In the year 2001, the University Grants Commission of Sri Lanka successfully appealed to change the method of determining the cut-off scores for university admissions from raw scores to standardized z-scores. This standardization allegedly eliminated the discrepancy caused due to the assumption of equal difficulty levels across all subjects. This…

  10. Standardized Tests and Other Criteria in Admissions Decisions: A Classroom Activity

    ERIC Educational Resources Information Center

    Pawlow, Laura A.

    2010-01-01

    This exercise aims to provide a hands-on, role-playing activity that requires students to evaluate the strengths and limitations of standardized tests in making admission decisions. Small groups pretend to be an admissions committee and review fictitious student applications containing both standardized test scores and other information admissions…

  11. Beyond the Test Scores.

    ERIC Educational Resources Information Center

    Thibodeau, Janice J.

    1985-01-01

    A diagnostic-prescriptive scheme is illustrated using subtests of the Slingerland Screening Tests for Identifying Children with Specific Language Disability and the Detroit Tests of Learning Aptitude. The scheme is intended to focus on the child's learning style by examining the task and the strategies employed. (CL)

  12. How Does the Use of Concordant Scores Impact Admission Decisions? Research Report 2016-3

    ERIC Educational Resources Information Center

    Marini, Jessica P.; Shaw, Emily J.; Young, Linda; Walker, Michael

    2016-01-01

    College admission offices receive many different pieces of information on each applicant to their institution. The job of considering this information becomes further complicated when the information is not consistent across applicants. An example of one such complication is receiving SAT scores for some applicants and receiving ACT scores for…

  13. Interpreting Test Scores: More Complicated than You Think

    ERIC Educational Resources Information Center

    Tully, Susannah

    2008-01-01

    As more colleges move to "test optional" admissions policies, the debate over the utility and interpretation of standardized-test scores continues. In this article, the author interviews Daniel Koretz, a professor of education at Harvard University and author of "Measuring Up: What Educational Testing Really Tells Us". Koretz shares his thoughts…

  14. Summary of Validity Data from the Admissions Testing Program Validity Study Service.

    ERIC Educational Resources Information Center

    Ford, Susan F.; Campos, Sandy

    Validity data (prediction of first-year grade point average) for colleges participating in the Admissions Testing Program Validity Study Service (VSS) and based on students entering college in 1964 through 1974 are summarized for the following predictors: Scholastic Aptitude Test (SAT)--verbal score, SAT--mathematical score, high school record,…

  15. EDUCATION AND PSYCHOLOGICAL TEST SCORES

    PubMed Central

    Pershad, Dwarka; Verma, S. K.

    1980-01-01

    Education, a long neglected variable affecting psychological test score, is in search of reemphasis. Some evidence for this has accumulated on the psychological tests constructed and standardized here at the department of Psychiatry, P.G.I., Chandigarh. Tentative norms prepared education wise on WAIS-Verbal section, PGI-Memory Scale, Proverb and Similarity Tests, Psychoticism Questionnaire, and PGI MQN 2, for adults, in the age range of 16-50, are reported. The results showed marked difference in the mean scores of different educational categories and thus stressed the need for reporting norms separately for different educational levels. PMID:22064617

  16. Education and psychological test scores.

    PubMed

    Pershad, D; Verma, S K

    1980-04-01

    Education, a long neglected variable affecting psychological test score, is in search of reemphasis. Some evidence for this has accumulated on the psychological tests constructed and standardized here at the department of Psychiatry, P.G.I., Chandigarh. Tentative norms prepared education wise on WAIS-Verbal section, PGI-Memory Scale, Proverb and Similarity Tests, Psychoticism Questionnaire, and PGI MQN 2, for adults, in the age range of 16-50, are reported. The results showed marked difference in the mean scores of different educational categories and thus stressed the need for reporting norms separately for different educational levels. PMID:22064617

  17. The Effects of Differently Weighting Interview Scores on the Admission of Underrepresented Minority Medical Students.

    ERIC Educational Resources Information Center

    Edwards, Janine C.; Maldonado, Filomeno G. Jr.; Calvin, James A.

    1999-01-01

    Analysis of admission records for 439 applicants to Texas A&M University's medical school compared applicants actually admitted under a formula that equally weighted academic and interview scores, with applicants who would have been admitted under two different weighting formulas. Results indicate that readjusting criteria weights may help meet…

  18. Redundancy, Discrimination and Corruption in the Multibillion-Dollar Business of College Admissions Testing

    ERIC Educational Resources Information Center

    Rizzo, Monica Ellen

    2012-01-01

    Most American colleges and universities require standardized entrance exams when making admissions decisions. Scores on these exams help determine if, when and where students will be allowed to pursue higher education. These scores are also used to determine eligibility for merit based financial aid. This testing persists even though half of the…

  19. Handedness and sex differences in intelligence: evidence from the medical college admission test.

    PubMed

    Halpern, D F; Haviland, M G; Killian, C D

    1998-10-01

    Our analysis of Medical College Admission Test subtest scores by writing hand preference and sex suggests that (a) right hemispheric dominance is associated with intellectual giftedness in verbal reasoning (left-handers obtained higher scores on the verbal reasoning test and were overrepresented in the upper tail of the distribution), (b) different patterns of brain lateralization are associated with different subcomponents of cognition (right-handers scored higher, on average, on the writing test and were overrepresented in the upper tail of the distribution), and (c) men generally score higher than women on tests of scientific knowledge (the most striking differences between men and women were on the biological and physical science tests).

  20. Test-Optional Admission at a Liberal Arts College: A Founding Mission Affirmed

    ERIC Educational Resources Information Center

    Shanley, Brian J.

    2007-01-01

    In this essay, Father Brian J. Shanley discusses Providence College's pilot program to eliminate standardized test scores from the required components of an admission application. Building on the college's ninety-year history of opening the doors of higher education to underrepresented populations, Providence College's test-optional policy is…

  1. Admission Risk Score to Predict Inpatient Pediatric Mortality at Four Public Hospitals in Uganda.

    PubMed

    Mpimbaza, Arthur; Sears, David; Sserwanga, Asadu; Kigozi, Ruth; Rubahika, Denis; Nadler, Adam; Yeka, Adoke; Dorsey, Grant

    2015-01-01

    Mortality rates among hospitalized children in many government hospitals in sub-Saharan Africa are high. Pediatric emergency services in these hospitals are often sub-optimal. Timely recognition of critically ill children on arrival is key to improving service delivery. We present a simple risk score to predict inpatient mortality among hospitalized children. Between April 2010 and June 2011, the Uganda Malaria Surveillance Project (UMSP), in collaboration with the National Malaria Control Program (NMCP), set up an enhanced sentinel site malaria surveillance program for children hospitalized at four public hospitals in different districts: Tororo, Apac, Jinja and Mubende. Clinical data collected through March 2013, representing 50249 admissions were used to develop a mortality risk score (derivation data set). One year of data collected subsequently from the same hospitals, representing 20406 admissions, were used to prospectively validate the performance of the risk score (validation data set). Using a backward selection approach, 13 out of 25 clinical parameters recognizable on initial presentation, were selected for inclusion in a final logistic regression prediction model. The presence of individual parameters was awarded a score of either 1 or 2 based on regression coefficients. For each individual patient, a composite risk score was generated. The risk score was further categorized into three categories; low, medium, and high. Patient characteristics were comparable in both data sets. Measures of performance for the risk score included the receiver operating characteristics curves and the area under the curve (AUC), both demonstrating good and comparable ability to predict deathusing both the derivation (AUC =0.76) and validation dataset (AUC =0.74). Using the derivation and validation datasets, the mortality rates in each risk category were as follows: low risk (0.8% vs. 0.7%), moderate risk (3.5% vs. 3.2%), and high risk (16.5% vs. 12.6%), respectively. Our

  2. Admission Risk Score to Predict Inpatient Pediatric Mortality at Four Public Hospitals in Uganda

    PubMed Central

    Mpimbaza, Arthur; Sears, David; Sserwanga, Asadu; Kigozi, Ruth; Rubahika, Denis; Nadler, Adam; Yeka, Adoke; Dorsey, Grant

    2015-01-01

    Mortality rates among hospitalized children in many government hospitals in sub-Saharan Africa are high. Pediatric emergency services in these hospitals are often sub-optimal. Timely recognition of critically ill children on arrival is key to improving service delivery. We present a simple risk score to predict inpatient mortality among hospitalized children. Between April 2010 and June 2011, the Uganda Malaria Surveillance Project (UMSP), in collaboration with the National Malaria Control Program (NMCP), set up an enhanced sentinel site malaria surveillance program for children hospitalized at four public hospitals in different districts: Tororo, Apac, Jinja and Mubende. Clinical data collected through March 2013, representing 50249 admissions were used to develop a mortality risk score (derivation data set). One year of data collected subsequently from the same hospitals, representing 20406 admissions, were used to prospectively validate the performance of the risk score (validation data set). Using a backward selection approach, 13 out of 25 clinical parameters recognizable on initial presentation, were selected for inclusion in a final logistic regression prediction model. The presence of individual parameters was awarded a score of either 1 or 2 based on regression coefficients. For each individual patient, a composite risk score was generated. The risk score was further categorized into three categories; low, medium, and high. Patient characteristics were comparable in both data sets. Measures of performance for the risk score included the receiver operating characteristics curves and the area under the curve (AUC), both demonstrating good and comparable ability to predict deathusing both the derivation (AUC =0.76) and validation dataset (AUC =0.74). Using the derivation and validation datasets, the mortality rates in each risk category were as follows: low risk (0.8% vs. 0.7%), moderate risk (3.5% vs. 3.2%), and high risk (16.5% vs. 12.6%), respectively. Our

  3. Admission Risk Score to Predict Inpatient Pediatric Mortality at Four Public Hospitals in Uganda.

    PubMed

    Mpimbaza, Arthur; Sears, David; Sserwanga, Asadu; Kigozi, Ruth; Rubahika, Denis; Nadler, Adam; Yeka, Adoke; Dorsey, Grant

    2015-01-01

    Mortality rates among hospitalized children in many government hospitals in sub-Saharan Africa are high. Pediatric emergency services in these hospitals are often sub-optimal. Timely recognition of critically ill children on arrival is key to improving service delivery. We present a simple risk score to predict inpatient mortality among hospitalized children. Between April 2010 and June 2011, the Uganda Malaria Surveillance Project (UMSP), in collaboration with the National Malaria Control Program (NMCP), set up an enhanced sentinel site malaria surveillance program for children hospitalized at four public hospitals in different districts: Tororo, Apac, Jinja and Mubende. Clinical data collected through March 2013, representing 50249 admissions were used to develop a mortality risk score (derivation data set). One year of data collected subsequently from the same hospitals, representing 20406 admissions, were used to prospectively validate the performance of the risk score (validation data set). Using a backward selection approach, 13 out of 25 clinical parameters recognizable on initial presentation, were selected for inclusion in a final logistic regression prediction model. The presence of individual parameters was awarded a score of either 1 or 2 based on regression coefficients. For each individual patient, a composite risk score was generated. The risk score was further categorized into three categories; low, medium, and high. Patient characteristics were comparable in both data sets. Measures of performance for the risk score included the receiver operating characteristics curves and the area under the curve (AUC), both demonstrating good and comparable ability to predict deathusing both the derivation (AUC =0.76) and validation dataset (AUC =0.74). Using the derivation and validation datasets, the mortality rates in each risk category were as follows: low risk (0.8% vs. 0.7%), moderate risk (3.5% vs. 3.2%), and high risk (16.5% vs. 12.6%), respectively. Our

  4. SAT Wars: The Case for Test-Optional College Admissions

    ERIC Educational Resources Information Center

    Soares, Joseph A., Ed.

    2011-01-01

    What can a college admissions officer safely predict about the future of a 17-year-old? Are the best and the brightest students the ones who can check off the most correct boxes on a multiple-choice exam? Or are there better ways of measuring ability and promise? In this penetrating and revealing look at high-stakes standardized admissions tests,…

  5. Perspective: Medical school admissions and noncognitive testing: some open questions.

    PubMed

    Bardes, Charles L; Best, Peggy Carey; Kremer, Sara J; Dienstag, Jules L

    2009-10-01

    Medical schools use a variety of criteria in selecting applicants for admission, attempting to assess both the academic preparation and the personal characteristics suitable for a career in medicine. While assessing academic preparation is fairly straightforward, assessing applicants' personal characteristics is difficult and controversial. The possibility of implementing standardized testing of personal characteristics, so-called "noncognitive testing," has been proposed as part of the admissions process. Such a proposal, however, raises numerous questions about the validity, fairness, and cost of such testing and the impact of commercial test-preparation services on test performance and reliability. Therefore, before noncognitive testing is adopted for screening applicants to medical school, open discussion among all stakeholders in the admissions process is critically important.

  6. More than Just Test Scores

    ERIC Educational Resources Information Center

    Levin, Henry M.

    2012-01-01

    Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…

  7. A Study of the Long-Term Stability of GRE General Test Scores.

    ERIC Educational Resources Information Center

    Wilson, Kenneth M.

    1988-01-01

    A study of the long-term stability of Graduate Record Examination scores for graduate program admission purposes investigated the changes in scores among test repeaters. Results suggest that long-term test repeaters have greater gains on verbal than quantitative tests. Explanations and implications are considered. (MSE)

  8. Computing the University of Washington Freshman Admission's Index Using the Scholastic Aptitude Test or American College Test.

    ERIC Educational Resources Information Center

    de Wolf, Virginia A.

    The development of the University of Washington freshman Admission Index (AI) is reviewed. The AI is a weighted linear combination of a student's overall high school grade point average and test scores from a standard college entrance examination of the verbal-quantitative type. The existing equation for computation of the AI using the Washington…

  9. Smoothing Methods for Estimating Test Score Distributions.

    ERIC Educational Resources Information Center

    Kolen, Michael J.

    1991-01-01

    Estimation/smoothing methods that are flexible enough to fit a wide variety of test score distributions are reviewed: kernel method, strong true-score model-based method, and method that uses polynomial log-linear models. Applications of these methods include describing/comparing test score distributions, estimating norms, and estimating…

  10. Theory-Based University Admissions Testing for a New Millennium

    ERIC Educational Resources Information Center

    Sternberg, Robert J.

    2004-01-01

    This article describes two projects based on Robert J. Sternberg's theory of successful intelligence and designed to provide theory-based testing for university admissions. The first, Rainbow Project, provided a supplementary test of analytical, practical, and creative skills to augment the SAT in predicting college performance. The Rainbow…

  11. The association of pre-operative STOP-BANG scores with postoperative critical care admission.

    PubMed

    Chia, P; Seet, E; Macachor, J D; Iyer, U S; Wu, D

    2013-09-01

    The STOP-BANG questionnaire screens for obstructive sleep apnoea. We retrospectively analysed the independent association of pre-operative variables with postoperative critical care admission using multivariable logistic regression for patients undergoing elective surgery from January to December 2011. Of 5432 patients, 338 (6.2%) were admitted postoperatively to the critical care unit. In multivariate analysis, the odds ratios (95% CI) for critical care admission were: 2.2 (1.1-4.6), p = 0.037; 3.2 (1.2-8.1), p = 0.017; and 5.1 (1.8-14.9), p = 0.002, for STOP-BANG scores of 4, 5 and ≥ 6, respectively. The odds ratio was also independently increased for: each year of age, 1.015 (1.004-1.026), p = 0.019; asthma, 1.6 (1.1-2.4), p = 0.016; obstructive sleep apnoea, 3.2 (1.9-5.6), p < 0.001; and for ASA physical status 2, 3 and ≥ 4, 2.1 (1.4-3.3), 6.5 (3.9-11.0), 6.3 (2.9-13.8), respectively, p < 0.001 for all. PMID:23848465

  12. The Relationship of Performance on the Dental Admission Test and Performance on Part I of the National Board Dental Examinations.

    ERIC Educational Resources Information Center

    De Ball, Suzanne; Sullivan, Kathleen; Horine, Julie; Duncan, William K.; Replogle, William

    2002-01-01

    Comapred University of Mississippi dental student scores on the Dental Admission Test (DAT) and Part I of the National Board Dental Examinations (NBDE) and found that DAT reading comprehension was a statistically significant predictor of all four subtests of the NBDE. Also found that DAT biology and organic chemistry scores were predictors of NBDE…

  13. Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  14. Linking Scores From Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2005-01-01

    This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…

  15. Validating use of a critical thinking test for the dental admission test.

    PubMed

    Tsai, Tsung-Hsun

    2014-04-01

    The purpose of this study was to validate the use of a test to assess dental school applicants' critical thinking abilities. The intent was to include this test on the Dental Admission Test (DAT) if it was shown to enhance the DAT's validity. Correlation and regression analyses of undergraduate and dental school performance with scores on each of the tests on the DAT battery and the California Critical Thinking Skills Test (CCTST) were performed. Data were collected from 439 third- and fourth-year dental students who consented to participate and were enrolled at one of the ten accredited dental schools included in the study. These ten dental schools were from most regions of the United States. This study concluded that including the CCTST on the DAT did not significantly enhance the DAT's validity.

  16. The Law School Admission Council: A Hundred-Million-Dollar Investment Fund That Does Law School Testing on the Side.

    ERIC Educational Resources Information Center

    Journal of Blacks in Higher Education, 2000

    2000-01-01

    The nonprofit corporation that performs standardized testing for law school admission has in recent years produced $100 million in profits. For minority groups who tend to score poorly on the test, the question is why the corporation refuses to dedicate some of its wealth to commissioning an independent study of whether test results correlate with…

  17. The Use of the Addiction Severity Index Psychiatric Composite Scores to Predict Psychiatric Inpatient Admissions.

    PubMed

    Drymalski, Walter M; Nunley, Michael R

    2016-01-01

    The high prevalence of co-occurring mental health and substance use disorders indicates a need for integrated behavioral health treatment that addresses both types of disorder simultaneously. One component of this integrated treatment is the use of an assessment process that can concurrently identify the presence of each class of disorder. The Addiction Severity Index (ASI) has been extensively used and researched in the field of substance use disorders for over 30 years. The ASI has seven sections, including a section on substance use disorders and a section on psychiatric symptoms, making it a potential candidate for a co-occurring screen during intake. The following study utilized a receiver operating characteristic curve analysis to determine an optimal cutoff score on the ASI psychiatric composite score to identify which individuals seeking substance use disorder treatment were admitted to the Milwaukee County Behavioral Health Division's psychiatric hospital in the year subsequent to their ASI administration. Of the 19,320 individuals who completed an initial ASI in our system, 343 had an inpatient admission. The receiver operating characteristic curve was statistically significant, with an area under the curve of 0.75. A cutoff of 0.27 had a sensitivity of 0.77 and a specificity of 0.61, such that over 60% (11,963/19,320) of the sample was excluded. These results suggest that the ASI psychiatric composite score may be a useful initial screen to identify those with potential mental health problems/needs in a behavioral health system attempting to integrate addiction and mental health services. PMID:27580192

  18. The Use of the Addiction Severity Index Psychiatric Composite Scores to Predict Psychiatric Inpatient Admissions.

    PubMed

    Drymalski, Walter M; Nunley, Michael R

    2016-01-01

    The high prevalence of co-occurring mental health and substance use disorders indicates a need for integrated behavioral health treatment that addresses both types of disorder simultaneously. One component of this integrated treatment is the use of an assessment process that can concurrently identify the presence of each class of disorder. The Addiction Severity Index (ASI) has been extensively used and researched in the field of substance use disorders for over 30 years. The ASI has seven sections, including a section on substance use disorders and a section on psychiatric symptoms, making it a potential candidate for a co-occurring screen during intake. The following study utilized a receiver operating characteristic curve analysis to determine an optimal cutoff score on the ASI psychiatric composite score to identify which individuals seeking substance use disorder treatment were admitted to the Milwaukee County Behavioral Health Division's psychiatric hospital in the year subsequent to their ASI administration. Of the 19,320 individuals who completed an initial ASI in our system, 343 had an inpatient admission. The receiver operating characteristic curve was statistically significant, with an area under the curve of 0.75. A cutoff of 0.27 had a sensitivity of 0.77 and a specificity of 0.61, such that over 60% (11,963/19,320) of the sample was excluded. These results suggest that the ASI psychiatric composite score may be a useful initial screen to identify those with potential mental health problems/needs in a behavioral health system attempting to integrate addiction and mental health services.

  19. 10 Tips for Higher Test Scores.

    ERIC Educational Resources Information Center

    Priestley, Michael

    2000-01-01

    Ten suggestions to help students increase standardized test scores include: read directions carefully; peek at the questions before reading stories or articles; note key words; use parts of questions to help plan answers; look back at the text; think before writing; write clearly and legibly; pay attention to how the test is scored; manage time…

  20. Interpreting Standardized Test Scores: Some Fine Points.

    ERIC Educational Resources Information Center

    Hunter, William J.

    1980-01-01

    An essential function of the school guidance worker is the translation of test results into plain language and/or concrete recommendations. To do so requires a thorough understanding of the various test scores publishers provide. (Author)

  1. [Medical school admission test at the University of Goettingen - which applicants will benefit?].

    PubMed

    Simmenroth-Nayda, Anne; Meskauskas, Erik; Burckhardt, Gerhard; Görlich, Yvonne

    2014-01-01

    Medical schools in Germany may select 60% of the student applicants through their own admission tests. The influence of the school-leaving examination grades (EGs) in each of the procedural steps is controversial. At Goettingen Medical School, we combine a structured interview and a communicative skills assessment. We analysed how many applicants succeeded in our admission test, compared to a model which only takes EGs into account. Admission scores were transferred into SPSS-21. Sociodemographic data were submitted by the Stiftung Hochschulstart. Besides descriptive statistics, we used Pearson-correlation and means comparisons (t-test, analysis of variance). 221 applicants (EGs 1.0-1.9) were invited in the winter semester 2013/14 and 222 applicants (EGs 1.1-1.8) in the summer semester 2014. The proportion of women was 68% (winter) and 74% (summer). Sixteen and 37 applicants had a medical vocational training and performed slightly better. The analysis showed that our test was gender neutral. EGs did not correlate with interviews or skills assessment. Despite a two-fold impact of EGs, 26 (winter) and 44 (summer) of the overall 181 applicants had EGs of 1.4 -1.9, which would have been too low for admission otherwise. If EGs were only considered once, 40 (winter) and 59 (summer) applicants would have succeeded. PMID:25499117

  2. [Medical school admission test at the University of Goettingen - which applicants will benefit?].

    PubMed

    Simmenroth-Nayda, Anne; Meskauskas, Erik; Burckhardt, Gerhard; Görlich, Yvonne

    2014-01-01

    Medical schools in Germany may select 60% of the student applicants through their own admission tests. The influence of the school-leaving examination grades (EGs) in each of the procedural steps is controversial. At Goettingen Medical School, we combine a structured interview and a communicative skills assessment. We analysed how many applicants succeeded in our admission test, compared to a model which only takes EGs into account. Admission scores were transferred into SPSS-21. Sociodemographic data were submitted by the Stiftung Hochschulstart. Besides descriptive statistics, we used Pearson-correlation and means comparisons (t-test, analysis of variance). 221 applicants (EGs 1.0-1.9) were invited in the winter semester 2013/14 and 222 applicants (EGs 1.1-1.8) in the summer semester 2014. The proportion of women was 68% (winter) and 74% (summer). Sixteen and 37 applicants had a medical vocational training and performed slightly better. The analysis showed that our test was gender neutral. EGs did not correlate with interviews or skills assessment. Despite a two-fold impact of EGs, 26 (winter) and 44 (summer) of the overall 181 applicants had EGs of 1.4 -1.9, which would have been too low for admission otherwise. If EGs were only considered once, 40 (winter) and 59 (summer) applicants would have succeeded.

  3. Who Benefits Most From Preparing for a "Coachable" Admissions Test?

    ERIC Educational Resources Information Center

    Powers, Donald E.

    1987-01-01

    Data on test familiarity effects on Graduate Record Examinations (GRE) analytical ability scores (EJ 304 980) were reanalyzed. Little difference was found among subgroups: age, ethnic groups, degree aspiration, English language dominance, and GRE scores. Test preparation included practice, knowledge of results, and strategies for each item type.…

  4. Profile of Graduate Management Admission Test[R] Candidates 2007-08 to 2011-12: Five-Year Summary

    ERIC Educational Resources Information Center

    Graduate Management Admission Council, 2012

    2012-01-01

    The 2011-2012 testing year held a number of significant milestones for the Graduate Management Admission Test exam. A total of 286,529 GMAT exams were administered, with 831,337 score reports sent to more than 5,200 graduate-level management programs around the world--all record numbers. This record volume reflects the increase in graduate-level…

  5. Equating Scores from Adaptive to Linear Tests

    ERIC Educational Resources Information Center

    van der Linden, Wim J.

    2006-01-01

    Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…

  6. Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?

    ERIC Educational Resources Information Center

    Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.

    2011-01-01

    This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…

  7. Differential Predictive Validity of High School GPA and College Entrance Test Scores for University Students in Yemen

    ERIC Educational Resources Information Center

    Al-Hattami, Abdulghani Ali Dawod

    2012-01-01

    High school grade point average and college entrance test scores are two admission criteria that are currently used by most colleges in Yemen to select their prospective students. Given their widespread use, it is important to investigate their predictive validity to ensure the accuracy of the admission decisions in these institutions. This study…

  8. Teacher Greetings Increase College Students' Test Scores

    ERIC Educational Resources Information Center

    Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

    2009-01-01

    The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…

  9. Physics and the revised Medical College Admission Test

    NASA Astrophysics Data System (ADS)

    Hilborn, Robert C.

    2014-05-01

    Physics has played an important role in the preparation of future physicians and other health professionals for more than 100 years. Almost all pre-health students take a year of college-level physics as part of their preparation for medical, dental, and pharmacy school. In particular, the widely-used Medical College Admission Test (MCAT) contains a significant number of questions that require physics knowledge and skills. This paper describes the changes in the MCAT to be implemented in 2015, the role of physics in the revised MCAT, and implications for introductory physics courses for the life sciences.

  10. Validity of the Medical College Admission Test for predicting MD-PhD student outcomes.

    PubMed

    Bills, James L; VanHouten, Jacob; Grundy, Michelle M; Chalkley, Roger; Dermody, Terence S

    2016-03-01

    The Medical College Admission Test (MCAT) is a quantitative metric used by MD and MD-PhD programs to evaluate applicants for admission. This study assessed the validity of the MCAT in predicting training performance measures and career outcomes for MD-PhD students at a single institution. The study population consisted of 153 graduates of the Vanderbilt Medical Scientist Training Program (combined MD-PhD program) who matriculated between 1963 and 2003 and completed dual-degree training. This population was divided into three cohorts corresponding to the version of the MCAT taken at the time of application. Multivariable regression (logistic for binary outcomes and linear for continuous outcomes) was used to analyze factors associated with outcome measures. The MCAT score and undergraduate GPA (uGPA) were treated as independent variables; medical and graduate school grades, time-to-PhD defense, USMLE scores, publication number, and career outcome were dependent variables. For cohort 1 (1963-1977), MCAT score was not associated with any assessed outcome, although uGPA was associated with medical school preclinical GPA and graduate school GPA (gsGPA). For cohort 2 (1978-1991), MCAT score was associated with USMLE Step II score and inversely correlated with publication number, and uGPA was associated with preclinical GPA (mspGPA) and clinical GPA (mscGPA). For cohort 3 (1992-2003), the MCAT score was associated with mscGPA, and uGPA was associated with gsGPA. Overall, MCAT score and uGPA were inconsistent or weak predictors of training metrics and career outcomes for this population of MD-PhD students.

  11. What Do Test Score Really Mean? A Latent Class Analysis of Danish Test Score Performance

    ERIC Educational Resources Information Center

    McIntosh, James; Munk, Martin D.

    2014-01-01

    Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…

  12. Measuring What Matters: Robert Sternberg's Enlightened Approach to Admissions Testing

    ERIC Educational Resources Information Center

    Grace, Catherine O'Neill

    2011-01-01

    Psychologist Robert J. Sternberg's conviction that American standardized testing does not accurately reflect a child's intelligence or potential is far from theoretical. As an elementary school student in the 1950s, he scored poorly on the ubiquitous IQ test of the time, freezing up when the school psychologist entered the room. Thankfully for…

  13. Critical Thinking: More than Test Scores

    ERIC Educational Resources Information Center

    Smith, Vernon G.; Szymanski, Antonia

    2013-01-01

    This article is for practicing or aspiring school administrators. The demand for excellence in public education has lead to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…

  14. Misidentifying Factors Underlying Singapore's High Test Scores

    ERIC Educational Resources Information Center

    Usiskin, Zalman

    2012-01-01

    Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…

  15. Teacher Use of Achievement Test Score Data

    ERIC Educational Resources Information Center

    Miller, Steven C.

    2012-01-01

    The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which…

  16. Leveraging Gender Differences to Boost Test Scores

    ERIC Educational Resources Information Center

    Costello, Bill

    2008-01-01

    According to the 2004 National Assessment of Educational Progress, males who have made it through 12 years of school have significantly poorer reading skills than their female peers. In every age group, boys have been scoring lower than girls annually for more than three decades on U.S. Department of Education reading tests. The longer boys are in…

  17. At Admissions Conference, Talk of Standardized Tests, Early Decisions, and Swag

    ERIC Educational Resources Information Center

    Farrell, Elizabeth F.; Hoover, Eric

    2007-01-01

    At the annual conference of the National Association for College Admission Counseling (Nacac), admissions deans and high-school counselors gathered in September 2007 to grapple with questions such as: (1) Rethinking the role of standardized tests in admissions (many attendees predict that psychometric giants ACT and SAT, will not always dominate…

  18. Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

    ERIC Educational Resources Information Center

    Cizek, Gregory J.

    2016-01-01

    Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

  19. The new Medical College Admission Test: Implications for teaching psychology.

    PubMed

    Mitchell, Karen; Lewis, Richard S; Satterfield, Jason; Hong, Barry A

    2016-01-01

    This year's applicants to medical school took a newly revised version of the Medical College Admission Test. Unlike applicants in the past, they were asked to demonstrate their knowledge and use of concepts commonly taught in introductory psychology courses. The new Psychological, Social, and Biological Foundations of Behavior Test asked applicants to demonstrate the ways in which psychological, social, and biological factors influence perceptions and reactions to the world; behavior and behavior change; what people think about themselves and others; the cultural and social differences that influence well-being; and the relationships among social stratification, access to resources, and well-being. Building from the classic biopsychosocial model, this article provides the rationale for testing psychology concepts in application to medical school. It describes the concepts and skills that the new exam tests and shows how they lay the foundation for learning in medical school about the behavioral and sociocultural determinants of health. This article discusses the implications of these changes for undergraduate psychology faculty and psychology curricula as well as their importance to the profession of psychology at large. PMID:26866988

  20. Teachers' Use of Background Knowledge to Interpret Test Scores

    ERIC Educational Resources Information Center

    Leiter, Kenneth C. W.

    1976-01-01

    An examination of how teachers interpret standardized test scores reveals that in using the score the teacher embeds it in the subjective kinds of knowledge the test scores are supposed to replace. (Author/DE)

  1. ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2014-01-01

    The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…

  2. [Correlation between results of the residency admission test and of pediatric certification test in Buenos Aires, Argentina].

    PubMed

    Ferreira, Juan Pablo; Hamui, Magali; Paganini, Agustina; Torres, Fernando A; Ossorio, María Fabiana; Eiguchi, Kumiko; Ferrero, Fernando

    2016-01-01

    In the city of Buenos Aires (CABA), pediatric residents enter the residency program after taking a unified admission test. After completion of the program and passing a final test, the Universidad de Buenos Aires (UBA) provides a professional certification. The objective of this study is to determine if the results obtained in the residency admission test (RAT) and those of the professional certification test (PCT) correlated. This is a cross-sectional study, that included all subjects who passed the pediatrics RAT in CABA in 2004-2009, and that attended the pediatric PCT of the UBA. The score for each subject in both tests was obtained and the corresponding correlation was calculated. Results were divided in quintiles, and the proportion of subjects who improved their position in the PCT with respect to the RAT was calculated. Data from 303 subjects was obtained. The RAT showed a median of 45.0 (over 60 maximum) (IC-range: 43.0-48.7), and the PCT showed a median of 6 points (over 10 max.)(IC-range: 6-8). A significative correlation between results in RAT and PCT was observed (r = 0.37, p < 0.001). Based on their position in the RAT, 43.8% of subjects improved their position in the PCT, without differences between residents attending pediatric and general hospitals (45.6 vs. 31.5%; p = 0.1). In the case of pediatric residents, results of the residency admission test correlate with those obtained in the professional certification test. PMID:27295703

  3. Validating the Interpretations and Uses of Test Scores

    ERIC Educational Resources Information Center

    Kane, Michael T.

    2013-01-01

    To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…

  4. Aligned Expectations? A Closer Look at College Admissions and Placement Tests

    ERIC Educational Resources Information Center

    Achieve, Inc., 2007

    2007-01-01

    Achieve launched this study to help inform the decisions states are making about high school assessments by providing greater insights into the world of college admissions and placement testing. Achieve analyzed more than 2,000 questions from college admissions and placement exams to determine how these tests compare to one another and how well…

  5. Report of the Commission on the Use of Standardized Tests in Undergraduate Admission

    ERIC Educational Resources Information Center

    National Association for College Admission Counseling, 2008

    2008-01-01

    Standardized admission tests have become an increasingly important factor in undergraduate admission. At the same time, reforms in elementary and secondary education at both the state and federal level have elevated the importance of standardized tests as a tool to measure educational outcomes. Nevertheless, long-standing concerns with…

  6. Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success

    PubMed Central

    Niu, Sunny X.; Tienda, Marta

    2012-01-01

    Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success—high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe. PMID:23788828

  7. Assessing readiness for medical education: evolution of the medical college admission test.

    PubMed

    McGaghie, William C

    2002-09-01

    The attrition rate of 5% to 50% from US medical schools in the 1920s propelled the development of a test that would measure aptitude for medical studies. Since its development in 1928, the Medical College Admission Test (MCAT) has undergone 5 revisions. The first version was divided into 6 to 8 subtests that focused on memory, knowledge of scientific terminology, reading and comprehension, and logic. The second, which appeared in 1946, was reduced to 4 categories: verbal and quantitative skills, science knowledge, and added a category called understanding modern society. The major difference in the third version, launched in 1962, expanded the test's understanding modern society section to a broader test of general information. In 1977, the MCAT underwent its fourth change: its science section, reading and quantitative skill assessment sections were expanded; its general liberal arts knowledge section was eliminated; its scoring report structure and scoring range were altered; and its cultural and social bias was minimized. The current version, beginning in 1991, has undergone another significant change. Although it does not contain independent measures of either liberal arts or numeracy as separate categories, quantitative skills are needed to solve some of the problems in biological and physical sciences. However, its principal innovation is the writing sample section. Through its 74-year history, the various renditions of the MCAT demonstrate that the definition of aptitude for medical education reflects the professional and social mores and values of the time.

  8. To what extent does the Health Professions Admission Test-Ireland predict performance in early undergraduate tests of communication and clinical skills? – An observational cohort study

    PubMed Central

    2013-01-01

    Background Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity. A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Method Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Results Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks. In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the

  9. Validity of the Medical College Admission Test for Predicting MD-PhD Student Outcomes

    ERIC Educational Resources Information Center

    Bills, James L.; VanHouten, Jacob; Grundy, Michelle M.; Chalkley, Roger; Dermody, Terence S.

    2016-01-01

    The Medical College Admission Test (MCAT) is a quantitative metric used by MD and MD-PhD programs to evaluate applicants for admission. This study assessed the validity of the MCAT in predicting training performance measures and career outcomes for MD-PhD students at a single institution. The study population consisted of 153 graduates of the…

  10. Impact of a novel dental school admission test on student performance at Innsbruck Medical University, Austria.

    PubMed

    Beier, Ulrike Stephanie; Kapferer, Ines; Ostermann, Herwig; Staudinger, Roland; Dumfahrt, Herbert

    2010-05-01

    Since the year 2000, prospective dental students at Innsbruck Medical University, Innsbruck, Austria, have undergone both theoretical and practical preadmission exams, called the Dental Admission Test (DAT). The aim of this investigation was to assess the suitability and outcome of this selection practice. Five classes from 2001 to 2005 (N=97; forty-three female, fifty-four male) were retrospectively reviewed. DAT results were compared with student performance, gender, ability to graduate on time, and dropout rates. Furthermore, the influence of a previous medical degree was evaluated. The t-test was used to analyze correlations between the results of the DAT and the following: gender, students who graduated on time, and students who had previously completed a medical degree. Pearson's correlation coefficient (r) was applied to analyze correlations among test scores, age, and students' performance during the first clinical year. Students graduating on time were noted to have significantly better DAT results; students with a previous medical degree showed significantly better grades during their first clinical year. The difference between the performance of male and female applicants on the DAT was not significant. Correlation was found between DAT results and dental school performance (r=-0.462). We conclude that the DAT may reduce dropout rates by excluding applicants unlikely to be successful in practical courses and that DAT scores are a reliable tool to predict student performance during the first clinical year of dental school in Innsbruck. PMID:20442431

  11. Testing Peer Effects among College Students: Evidence from an Unusual Admission Policy Change in China

    ERIC Educational Resources Information Center

    Lu, Fangwen

    2014-01-01

    This paper studies a natural experiment due to an unusual change in the college admission policy at a Chinese university, which brought a large number of low-score students into several academic departments in the university. Exploiting large variations in peer characteristics and strong interactions among peer groups, the analysis finds that…

  12. Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    1980-01-01

    The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)

  13. Verbal Reasoning Test Scores and Their Stability over Time.

    ERIC Educational Resources Information Center

    Primrose, Alison F.; Fuller, Mary; Littledyke, Michael

    2000-01-01

    Stability of verbal reasoning test scores was measured for 146 students aged 8-13. Results suggest that reasoning test scores are not constant and vary considerably over time. Scores are not finite measures of intellectual capacity but of current verbal functioning, reflecting education and experiences to that point. (SK)

  14. Testing Intelligently Includes Double-Checking Wechsler IQ Scores

    ERIC Educational Resources Information Center

    Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

    2011-01-01

    The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…

  15. The GRE Analytical Score as a Predictor for Admission to Ph.D. Candidacy: A Two Year Study.

    ERIC Educational Resources Information Center

    Mowsesian, Richard; Hays, William L.

    The purpose of this study was to test the predictive efficiency of the Graduate Record Examination Analytical (GRE-A) score when used in combination with other predictors for doctoral candidacy decisions in an educational psychology department. It was hypothesized that the GRE-A would be a more efficient predictor than faculty ratings on a…

  16. Opening a Side-Gate: Engaging the Excluded in Chilean Higher Education through Test-Blind Admission

    ERIC Educational Resources Information Center

    Koljatic, Mladen; Silva, Monica

    2013-01-01

    The article describes a test-blind admission initiative in a Chilean research university aimed at expanding the inclusion of talented, albeit educationally and socially disadvantaged, students. The outcomes of the test-blind admission cohort were compared with those of students admitted via the regular admission procedure to the same academic…

  17. Does Test Preparation Work? Implications for Score Validity

    ERIC Educational Resources Information Center

    Xie, Qin

    2013-01-01

    This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…

  18. Reliability of Total Test Scores When Considered as Ordinal Measurements

    ERIC Educational Resources Information Center

    Biswas, Ajoy Kumar

    2006-01-01

    This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…

  19. A Review of Scoring Algorithms for Ability and Aptitude Tests.

    ERIC Educational Resources Information Center

    Chevalier, Shirley A.

    In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…

  20. Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

    ERIC Educational Resources Information Center

    Sachar, Jane; Suppes, Patrick

    It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the ll0-item Stanford Mental…

  1. Results of College Admissions Testing in Texas for 1992-93 Graduating Seniors.

    ERIC Educational Resources Information Center

    Texas Education Agency, Austin. Office of Policy Analysis and Evaluation.

    This annual report presents results from the Scholastic Aptitude Test (SAT) of the College Board's Admissions Testing Program and the American College Testing (ACT) Program's original and enhanced ACT assessment for prior year graduating seniors. The last decade had brought some recovery over the decline of the previous 10 years in both Texas and…

  2. Evaluation of Admission and Placement Testing at Florence-Darlington Technical College.

    ERIC Educational Resources Information Center

    Grulick, Lawrence E.

    A study was conducted at Florence-Darlington Technical College (FDTC) in South Carolina to assess the college's admissions testing program through a review of the current literature on placement testing and a statistical study comparing currently employed instruments, i.e., the Career Planning Program (CPP) test and the Scholastic Aptitude Test…

  3. Estimating the Reliability of a Test Battery Composite or a Test Score Based on Weighted Item Scoring

    ERIC Educational Resources Information Center

    Feldt, Leonard S.

    2004-01-01

    In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.

  4. Reliability of a science admission test (HAM-Nat) at Hamburg medical school

    PubMed Central

    Hissbach, Johanna; Klusmann, Dietrich; Hampe, Wolfgang

    2011-01-01

    Objective: The University Hospital in Hamburg (UKE) started to develop a test of knowledge in natural sciences for admission to medical school in 2005 (Hamburger Auswahlverfahren für Medizinische Studiengänge, Naturwissenschaftsteil, HAM-Nat). This study is a step towards establishing the HAM-Nat. We are investigating parallel forms reliability, the effect of a crash course in chemistry on test results, and correlations of HAM-Nat test results with a test of scientific reasoning (similar to a subtest of the "Test for Medical Studies", TMS). Methods: 316 first-year students participated in the study in 2007. They completed different versions of the HAM-Nat test which consisted of items that had already been used (HN2006) and new items (HN2007). Four weeks later half of the participants were tested on the HN2007 version of the HAM-Nat again, while the other half completed the test of scientific reasoning. Within this four week interval students were offered a five day chemistry course. Results: Parallel forms reliability for four different test versions ranged from rtt=.53 to rtt=.67. The retest reliabilities of the HN2007 halves were rtt=.54 and rtt =.61. Correlations of the two HAM-Nat versions with the test of scientific reasoning were r=.34 und r=.21. The crash course in chemistry had no effect on HAM-Nat scores. Conclusions: The results suggest that further versions of the test of natural sciences will not easily conform to the standards of internal consistency, parallel-forms reliability and retest reliability. Much care has to be taken in order to assemble items which could be used interchangeably for the construction of new test versions. The test of scientific reasoning and the HAM-Nat are tapping different constructs. Participation in a chemistry course did not improve students’ achievement, probably because the content of the course was not coordinated with the test and many students lacked of motivation to do well in the second test. PMID:21866246

  5. An Investigation into the Possible Speededness of the Medical College Admission Test. MCAT Monograph 3.

    ERIC Educational Resources Information Center

    Neustel, Sandra

    As a continuing part of its validity studies, the Association of American Medical Colleges commissioned a study of the speediness of the Medical College Admission Test (MCAT). If speed is a hidden part of the test, it is a threat to its construct validity. As a general rule, the criterion used to indicate lack of speediness is that 80% of the…

  6. Staff Report on the Federal Trade Commission Investigation of Coaching for Standardized Admission Tests.

    ERIC Educational Resources Information Center

    Federal Trade Commission, Boston, MA.

    Whether coaching for standardized admission tests such as the Scholastic Aptitude Test (SAT) is effective was investigated. The Bureau of Consumer Protection of the Federal Trade Commission evaluated the effectiveness of two commercial coaching schools that offered preparation courses for the SAT. The researchers concluded that one of the two…

  7. Reflections on a Century of College Admissions Tests. Research & Occasional Paper Series: CSHE.4.09

    ERIC Educational Resources Information Center

    Atkinson, Richard C.; Geiser, Saul

    2009-01-01

    Standardized testing for college admissions has grown exponentially since the first administration of the old "College Boards" in 1901. This paper surveys major developments since then: the introduction of the "Scholastic Aptitude Test" in 1926, designed to tap students' general analytic ability; E.F. Lindquist's creation of the ACT in 1959 as a…

  8. Development and Validation of an Admission Test Designed to Assess Samples of Performance on Academic Tasks

    ERIC Educational Resources Information Center

    Tanilon, Jenny; Segers, Mien; Vedder, Paul; Tillema, Harm

    2009-01-01

    This study illustrates the development and validation of an admission test, labeled as Performance Samples on Academic Tasks in Educational Sciences (PSAT-Ed), designed to assess samples of performance on academic tasks characteristic of those that would eventually be encountered by examinees in an Educational Sciences program. The test was based…

  9. Improving Scores on the IELTS Speaking Test

    ERIC Educational Resources Information Center

    Issitt, Steve

    2008-01-01

    This article presents three strategies for teaching students who are taking the IELTS speaking test. The first strategy is aimed at improving confidence and uses a variety of self-help materials from the field of popular psychology. The second encourages students to think critically and invokes a range of academic perspectives. The third strategy…

  10. Equating Test Scores (without IRT). Second Edition

    ERIC Educational Resources Information Center

    Livingston, Samuel A.

    2014-01-01

    This booklet grew out of a half-day class on equating that author Samuel Livingston teaches for new statistical staff at Educational Testing Service (ETS). The class is a nonmathematical introduction to the topic, emphasizing conceptual understanding and practical applications. The class consists of illustrated lectures, interspersed with…

  11. Fuzzy Math: A Meditation on Test Scoring

    ERIC Educational Resources Information Center

    Jacks, Meredith

    2011-01-01

    As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…

  12. Accountability Is More than a Test Score

    ERIC Educational Resources Information Center

    Turnipseed, Stephan; Darling-Hammond, Linda

    2015-01-01

    The number one quality business leaders look for in employees is creativity and yet the U.S. education system undermines the development of the higher-order skills that promote creativity by its dogged focus on multiple-choice tests. Stephan Turnipseed and Linda DarlingHammond discuss the kind of rich accountability system that will help students…

  13. SAT® II: Subject Tests in Foreign Languages--Using the Tests for Admission and Placement. Research Summary RS-07

    ERIC Educational Resources Information Center

    College Entrance Examination Board, 2002

    2002-01-01

    As an admissions test, the SAT II: Subject Tests in foreign languages allow students to demonstrate academic competence in the selected language area. As a placement tool, the SAT II: Subject Tests in foreign languages serve the same function as the SAT II: Subject Tests do in other academic areas such as world history, chemistry, or math. SAT II:…

  14. Predictive validity of the Medical College Admissions Test Writing Sample for the United States medical licensing examination steps 1 and 2.

    PubMed

    Gilbert, Gregory E; Basco, William T; Blue, Amy V; O'Sullivan, Patricia S

    2002-01-01

    Despite the amount of published research on the predictive validity of the Medical College Admissions Test (MCAT) taken as a whole, few published reports separate the individual predictive validity of the Writing Sample. The purpose of this study is to provide data on the predictive validity of the Writing Sample for the national licensing exam used in the United States. Subjects consisted of 1992-1995 matriculants from a publicly owned medical school in the Southeastern United States. Independent variables were undergraduate grade point average, and four section scores on a required admissions test (Biological Sciences, Physical Sciences, Verbal Reasoning, and Writing Sample). The dependent variables were Steps 1 and 2 on the three step licensing examination. Steps 1 and 2 of the examination are taken during medical school. Multiple regression models calculated additional variance accounted for by the addition of the Writing Sample to a model containing grade point average and the other admissions test section scores. In multivariate analyses, when grade point average and all admissions test scores were considered as predictors of licensing exam scores, the Writing Sample variable did not add to the ability to predicting the Step 1 or Step 2 scores. The results of this study suggest that the Writing Sample has limited predictive validity for assessing success on a national licensing exam. However, as others suggest, the value of the Writing Sample and other surrogates of communication probably lie in predicting performance in the clinical years of medical school and beyond. Additional work should include evaluating the predictive validity of the Writing Sample and other pre-medical school measures of communication with widely acceptable measures of performance in clinical settings, including physician-patient communication.

  15. Grade Inflation and Law School Admissions

    ERIC Educational Resources Information Center

    Wongsurawat, Winai

    2008-01-01

    Purpose: The purpose of this paper is to evaluate the evidence on whether grade inflation has led to an increasing emphasis on standardized test scores as a criterion for law school admissions. Design/methodology/approach: Fit probabilistic models to admissions data for American law schools during the mid to late 1990s, a period during which…

  16. Test Score Decline Among High Achievers: Policy Implications.

    ERIC Educational Resources Information Center

    Goldman, Jerrold; Hsia, Jayjia

    Since 1967, the mean Scholastic Aptitude Test (SAT) score has declined. Likewise, the numbers of candidates receiving high SAT scores have been decreasing steadily. The same downward trend in student achievement can be seen among student groups from grade 4 through post graduate studies. In recent years, policy has been directed towards making…

  17. Missing the Mark: What Test Scores Really Tell Us

    ERIC Educational Resources Information Center

    Tanner, John R.

    2011-01-01

    State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…

  18. This Test Is Unfair: African American and Latino High School Students' Perceptions of Standardized College Admission Tests.

    ERIC Educational Resources Information Center

    Walpole, MaryBeth; McDonough, Patricia M.; Bauer, Constance J.; Gibson, Carolyn; Kanyi, Kamau T.; Toliver, Rita

    This qualitative study focused on African American and Latino high school students perceptions of standardized admission tests, including the Scholastic Assessment Tests (I and II) and the ACT Assessment. Students enrolled in college preparatory classes were interviewed about these tests individually and in focus groups in fall 1998 in their…

  19. TOEFL Test and Score Manual, 1990-91.

    ERIC Educational Resources Information Center

    Educational Testing Service, Princeton, NJ.

    This manual has been prepared for those responsible for interpreting scores on the Test of English as a Foreign Language (TOEFL). In addition to test interpretation information, the manual describes the test, explains the TOEFL program, and discusses program research activities. The TOEFL was developed in 1963 to test the English-language…

  20. Test Scores and What They Mean. Sixth Edition.

    ERIC Educational Resources Information Center

    Lyman, Howard B.

    The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…

  1. Predictive validity of the Biomedical Admissions Test: an evaluation and case study.

    PubMed

    McManus, I C; Ferguson, Eamonn; Wakeford, Richard; Powis, David; James, David

    2011-01-01

    There has been an increase in the use of pre-admission selection tests for medicine. Such tests need to show good psychometric properties. Here, we use a paper by Emery and Bell [2009. The predictive validity of the Biomedical Admissions Test for pre-clinical examination performance. Med Educ 43:557-564] as a case study to evaluate and comment on the reporting of psychometric data in the field of medical student selection (and the comments apply to many papers in the field). We highlight pitfalls when reliability data are not presented, how simple zero-order associations can lead to inaccurate conclusions about the predictive validity of a test, and how biases need to be explored and reported. We show with BMAT that it is the knowledge part of the test which does all the predictive work. We show that without evidence of incremental validity it is difficult to assess the value of any selection tests for medicine.

  2. 40 CFR 85.1509 - Final admission of modification and test vehicles.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 40 Protection of Environment 18 2010-07-01 2010-07-01 false Final admission of modification and test vehicles. 85.1509 Section 85.1509 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM MOBILE SOURCES Importation of...

  3. 40 CFR 85.1509 - Final admission of modification and test vehicles.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 40 Protection of Environment 19 2014-07-01 2014-07-01 false Final admission of modification and test vehicles. 85.1509 Section 85.1509 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM MOBILE SOURCES Importation of...

  4. 40 CFR 85.1509 - Final admission of modification and test vehicles.

    Code of Federal Regulations, 2012 CFR

    2012-07-01

    ... 40 Protection of Environment 19 2012-07-01 2012-07-01 false Final admission of modification and test vehicles. 85.1509 Section 85.1509 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM MOBILE SOURCES Importation of...

  5. 40 CFR 85.1509 - Final admission of modification and test vehicles.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 40 Protection of Environment 19 2013-07-01 2013-07-01 false Final admission of modification and test vehicles. 85.1509 Section 85.1509 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM MOBILE SOURCES Importation of...

  6. 40 CFR 85.1509 - Final admission of modification and test vehicles.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 40 Protection of Environment 18 2011-07-01 2011-07-01 false Final admission of modification and test vehicles. 85.1509 Section 85.1509 Protection of Environment ENVIRONMENTAL PROTECTION AGENCY (CONTINUED) AIR PROGRAMS (CONTINUED) CONTROL OF AIR POLLUTION FROM MOBILE SOURCES Importation of...

  7. Testing Race-Neutral Admissions Models: Lessons from California State University-Long Beach

    ERIC Educational Resources Information Center

    Rendon, Laura I.; Novack, Vincent; Dowell, David

    2005-01-01

    This policy analysis article examines how California State University-Long Beach, an institution where upward of 22,000 student applied for roughly 3,400 freshman slots and where the transfer class had to be reduced because of mandatory enrollment reductions, tested race-neutral admissions models in accordance with Proposition 209, which prohibits…

  8. Quality of Education Outcomes: The Role of the Graduate Management Admission Test

    ERIC Educational Resources Information Center

    Wamala, Robert; Buyinza, Mukadasi

    2013-01-01

    Although the Graduate Management Admission Test (GMAT) is considered by leading business institutions worldwide as a predictor of success in graduate programs, an issue of contention is whether the introduction of the examination enhances the quality of education outcomes. This study sought to obtain an understanding of this issue, focusing on…

  9. Predicting Performance during Clinical Years from the New Medical College Admission Test.

    ERIC Educational Resources Information Center

    Caroline, Jan D.; And Others

    1983-01-01

    The results of a predictive validity study of the new Medical College Admission Test (MCAT) using criteria from the clinical years of undergraduate medical education are discussed. The criteria included course grades and faculty ratings of clerks in internal medicine, surgery, obstetrics and gynecology, pediatrics, and psychiatry. (Author/MLW)

  10. Sex-Specific Differential Prediction of College Admission Tests: A Meta-Analysis

    ERIC Educational Resources Information Center

    Fischer, Franziska T.; Schult, Johannes; Hell, Benedikt

    2013-01-01

    This is the first meta-analysis that investigates the differential prediction of undergraduate and graduate college admission tests for women and men. Findings on 130 independent samples representing 493,048 students are summarized. The underprediction of women's academic performance (d = 0.14) and the overprediction of men's academic performance…

  11. Exploring Population Sensitivity of Linking Functions across Three Law School Admission Test Administrations

    ERIC Educational Resources Information Center

    Liu, Mei; Holland, Paul W.

    2008-01-01

    The simplified version of the Dorans and Holland (2000) measure of population invariance, the root mean square difference (RMSD), is used to explore the degree of dependence of linking functions on the Law School Admission Test (LSAT) subpopulations defined by examinees' gender, ethnic background, geographic region, law school application status,…

  12. School and Individual Factors That Contribute to the Achievement Gap in College Admissions Tests in Chile

    ERIC Educational Resources Information Center

    Perez Mejias, Paulina

    2012-01-01

    In Chile, reports and research papers have shown that there is an achievement gap in college admissions tests mostly associated to students' gender, socioeconomic status and type of school attended. This gap represents a barrier for low-income and female students to access higher education, as well as for graduates of public schools. Prior…

  13. Appraising the Dimensionality of the Medical College Admission Test. MCAT Monograph.

    ERIC Educational Resources Information Center

    Meara, Kevin; Sireci, Stephen G.

    To provide a better understanding of the structure of the Medical College Admission Test (MCAT) and to determine if there are structural differences across selected groups of MCAT examinees, several dimensionality analyses were conducted on data from recent administrations of the MCAT. The first set of analyses focused on the global structure of…

  14. Prediction of true test scores from observed item scores and ancillary data.

    PubMed

    Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

    2015-05-01

    In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability.

  15. Wage and Test Score Dispersion: Some International Evidence.

    ERIC Educational Resources Information Center

    Bedard, Kelly; Ferrall, Christopher

    2003-01-01

    Compares the distribution of test scores at age 13 in 1964 and 1982 and wages later in life across 11 countries. Finds that wage dispersion later in life is never greater than test-score dispersion. For three countries (U.S., UK, and Japan), finds evidence of skill-biased changes in wage dispersion between the early 1970s and the late 1980s.…

  16. Making Sense of Test Scores. Assessment Brief. Number 10

    ERIC Educational Resources Information Center

    Bergman, Lincoln

    2004-01-01

    It is challenging for parents and the general public to make sense of the reports on test scores that appear in the mass media. This article offers some things for readers to consider as they bring a critical eye to what is read in the papers. Usually reports on test scores in the media are quite short and focus on one or two aspects of test…

  17. The Revised SAT Score and Its Potential Benefits for the Admission of Minority Students to Higher Education

    ERIC Educational Resources Information Center

    Santelices, Maria Veronica; Wilson, Mark

    2015-01-01

    This paper investigates the predictive validity of the Revised SAT (R-SAT) score, proposed by Freedle (2003) as an alternative to compensate minority students for the potential harm caused by the relationship between item difficulty and ethnic DIF observed in the SAT. The R-SAT score is the score minority students would have received if only the…

  18. Dearborn 1981-82 Achievement Test Scores (Fifth Annual Report).

    ERIC Educational Resources Information Center

    Dearborn Public Schools, MI.

    The purpose of the fifth annual Dearborn Achievement Test Score report is to summarize and to help interpret the test results so that Dearborn citizens and educators will have a better understanding of the educational achievements of Dearborn students. The District-wide Testing Program assesses reading readiness, scholastic aptitude, academic…

  19. Test Takers and the Validity of Score Interpretations

    ERIC Educational Resources Information Center

    Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

    2016-01-01

    This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…

  20. Two Language Screening Tests Compared with Developmental Sentence Scoring.

    ERIC Educational Resources Information Center

    Blaxley, Lynn; And Others

    1983-01-01

    The performance of 90 children between the ages of four and six years on two language screening tests was compared with their performance on Developmental Sentence Scoring (DSS) to determine the accuracy of these screening tests in identifying language impairments. The Bankson Language Screening Test was generally accurate in the identification of…

  1. College Admissions in Twenty-First-Century America: The Role of Grades, Tests, and Games of Chance

    ERIC Educational Resources Information Center

    Zwick, Rebecca

    2007-01-01

    In this essay, Rebecca Zwick confronts the controversy surrounding the use of standardized tests in college admissions. She examines the degree to which the SAT and its lesser known cousin, the ACT, limit access to college, particularly for racial and ethnic minorities, and considers two alternative admissions policies that do not involve tests:…

  2. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Pennsylvania

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Pennsylvania's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in…

  3. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Maryland

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Maryland's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased at grades 4 and 8 in both reading and math. Average annual gains were larger on the state test than…

  4. Accountancy, teaching methods, sex, and American College Test scores.

    PubMed

    Heritage, J; Harper, B S; Harper, J P

    1990-10-01

    This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.

  5. Effect of self-assessment on test scores: student perceptions.

    PubMed

    Ramirez, Beatriz U

    2010-09-01

    After a sudden increase in most of the individual grades in a multiple-choice test, students were asked to rank the three most relevant factors responsible for this outcome. Among eight others, the availability of a test for self-assessment before the final test was by far the most frequently mentioned (82.4% of the students). Questions applied during different course activities did not have the same effect on student scores as the "online" self-assessment test.

  6. Motivating High School Students to Score Proficient on State Tests

    ERIC Educational Resources Information Center

    Brown, Sarah Lee

    2015-01-01

    The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…

  7. High Test Scores: The Wrong Road to National Economic Success

    ERIC Educational Resources Information Center

    Baker, Keith

    2011-01-01

    A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

  8. The Uses and Misuses of Test Scores: Technical Assistance Perspective.

    ERIC Educational Resources Information Center

    Echternacht, Gary

    The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…

  9. Effort Analysis: Individual Score Validation of Achievement Test Data

    ERIC Educational Resources Information Center

    Wise, Steven L.

    2015-01-01

    Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…

  10. A prognostic scoring system for arm exercise stress testing

    PubMed Central

    Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

    2016-01-01

    Objective Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. Methods In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Results Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs−10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77–0.79 before and 0.82–0.86 after adjustment for significant covariates versus 0.64–0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Conclusions Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise. PMID:26835142

  11. Does weight affect children's test scores and teacher assessments differently?

    PubMed

    Zavodny, Madeline

    2013-06-01

    The prevalence of childhood overweight and obesity increased dramatically in the United States during the past three decades. This increase has adverse public health implications, but its implication for children's academic outcomes is less clear. This paper uses data from five waves of the Early Childhood Longitudinal Study-Kindergarten to examine how children's weight is related to their scores on standardized tests and to their teachers' assessments of their academic ability. The results indicate that children's weight is more negatively related to teacher assessments of their academic performance than to test scores.

  12. State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Massachusetts

    ERIC Educational Resources Information Center

    Center on Education Policy, 2010

    2010-01-01

    This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…

  13. A Latent Class Approach to Estimating Test-Score Reliability

    ERIC Educational Resources Information Center

    van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

    2011-01-01

    This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…

  14. Commentary on "Validating the Interpretations and Uses of Test Scores"

    ERIC Educational Resources Information Center

    Brennan, Robert L.

    2013-01-01

    Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…

  15. Between-District Test Score Variation, 2009-2012

    ERIC Educational Resources Information Center

    Fahle, Erin; Reardon, Sean

    2016-01-01

    Describing the variation in test scores between and within school districts is critical for: (1) for policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) for methodological work planning future experiments or interventions. Intraclass…

  16. Univariate and Bivariate Loglinear Models for Discrete Test Score Distributions.

    ERIC Educational Resources Information Center

    Holland, Paul W.; Thayer, Dorothy T.

    2000-01-01

    Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…

  17. Benefits of Coaching on Test Scores Seen as Negligible.

    ERIC Educational Resources Information Center

    Report on Education Research, 1983

    1983-01-01

    THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write Rebecca…

  18. Source Country Differences in Test Score Gaps: Evidence from Denmark

    ERIC Educational Resources Information Center

    Rangvid, Beatrice Schindler

    2010-01-01

    We combine data from three studies for Denmark in the PISA 2000 framework to investigate differences in the native-immigrant test score gap by country of origin. In addition to the controls available from PISA data sources, we use student-level data on home background and individual migration histories linked from administrative registers. We find…

  19. America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

    ERIC Educational Resources Information Center

    Petrilli, Michael J.; Wright, Brandon L.

    2016-01-01

    At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…

  20. Student Laptop Use and Scores on Standardized Tests

    ERIC Educational Resources Information Center

    Kposowa, Augustine J.; Valdez, Amanda D.

    2013-01-01

    Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…

  1. What We Lose in Winning the Test Score Race

    ERIC Educational Resources Information Center

    Jorgenson, Olaf

    2012-01-01

    To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools across California for…

  2. Scoring Rod-and-Frame Tests: Quantitative and Qualitative Considerations.

    ERIC Educational Resources Information Center

    Haller, Otto; Edgington, Eugene S.

    1982-01-01

    Current scoring procedures depend on unrealistic assumptions about subjects' performance on the rod-and-frame test. A procedure is presented which corrects for constant error, is sensitive to response strategy and consistency, and examines qualitative and quantitative aspects of performance and individual differences in laterality bias as defined…

  3. Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

    ERIC Educational Resources Information Center

    Minor, Elizabeth Covay

    2016-01-01

    Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…

  4. EAP Study Recommendations and Score Gains on the IELTS Academic Writing Test

    ERIC Educational Resources Information Center

    Green, Anthony

    2005-01-01

    The IELTS test is widely accepted by university admissions offices as evidence of English language ability. The test is also used to guide decisions about the amount of language study required for students to satisfy admissions requirements. Guidelines currently published by the British Association of Lecturers in English for Academic Purposes…

  5. Flow and diffusion of high-stakes test scores

    NASA Astrophysics Data System (ADS)

    Marder, M.; Bansal, D.

    2009-10-01

    We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades.

  6. Simplifying multivariate survival analysis using global score test methodology

    NASA Astrophysics Data System (ADS)

    Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz

    2015-12-01

    In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve multiple endpoints, and this situation further complicates the analysis of survival data. In the case of tumor patients, endpoints concerning survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For each patient, these endpoints are correlated, and the estimation of the correlation between two score statistics is fundamental in derivation of overall treatment advantage. In this paper, the bivariate survival analysis method using the global score test methodology is extended to multivariate setting.

  7. Which Test? Whose Scores? Comparing Standardized Critical Thinking Tests

    ERIC Educational Resources Information Center

    Hatcher, Donald L.

    2011-01-01

    In this article, after describing one approach for teaching critical thinking (CT) that was in place at Baker University from 1990 to 2008, the author describes their experience assessing CT using three standardized exams and shows why the choice of a standardized CT test can be problematic and the results misleading. These results can be…

  8. Score to identify the severity of adult patients with influenza A (H1N1) 2009 virus infection at hospital admission.

    PubMed

    Capelastegui, A; Quintana, J M; Bilbao, A; España, P P; Garin, O; Alonso, J; Astray, J; Cantón, R; Castilla, J; Castro, A; Delgado-Rodríguez, M; Godoy, P; Gónzález-Candelas, F; Martín, V; Mayoral, J M; Pumarola, T; Tamames, S; Soldevila, N; Baricot, M; Domínguez, A

    2012-10-01

    The objective of this paper was to develop a prognostic index for severe complications among hospitalized patients with influenza A (H1N1) 2009 virus infection. We conducted a prospective observational cohort study of 618 inpatients with 2009 H1N1 virus infection admitted to 36 Spanish hospitals between July 2009 and February 2010. Risk factors evaluated included host-related factors and clinical data at admission. We developed a composite index of severe in-hospital complications (SIHC), which included: mortality, mechanical ventilation, septic shock, acute respiratory distress syndrome, and requirement for resuscitation maneuvers. Six factors were independently associated with SIHC: age >45 years, male sex, number of comorbidities, pneumonia, dyspnea, and confusion. From the β parameter obtained in the multivariate model, a weight was assigned to each factor to compute the individual influenza risk score. The score shows an area under the receiver operating characteristic (ROC) curve of 0.77. The SIHC rate was 1.9 % in the low-risk group, 10.3 % in the intermediate-risk group, and 29.6 % in the high-risk group. The odds ratio for complications was 21.8 for the high-risk group compared with the low-risk group. This easy-to-score influenza A (H1N1) 2009 virus infection risk index accurately stratifies patients hospitalized for H1N1 virus infection into low-, intermediate-, and high-risk groups for SIHC.

  9. High Scores but Low Skills

    ERIC Educational Resources Information Center

    Liu, Liqun; Neilson, William S.

    2011-01-01

    In this paper college admissions are based on test scores and students can exert two types of effort: real learning and exam preparation. The former improves skills but the latter is more effective in raising test scores. In this setting the students with the lowest skills are no longer the ones with the lowest aptitude, but instead are the ones…

  10. College Admissions and the Transition to Postsecondary Education: Standards and Practices.

    ERIC Educational Resources Information Center

    Sjogren, Cliff

    In this review of college admission practices, four areas are analyzed: (1) changing patterns of admission standards and practices and factors that have influenced those changes during the past 25 years; (2) importance assigned to high school achievement, test scores, and other criteria in arriving at admission decisions; (3) influence of college…

  11. Validating Standardized Testing: The Role of the SAT and ACT in Undergraduate Admissions. [Hearing before the California Legislature, Senate Select Committee on Higher Education Admissions and Outreach, Sacramento, California, February 5, 1998].

    ERIC Educational Resources Information Center

    California State Legislature, Sacramento. Senate Select Committee on Higher Education Admissions and Outreach.

    This report contains testimony and materials presented at the California Senate Select Committee on Higher Education Admissions and Outreach hearing, February 1998, held to examine the role of the Scholastic Assessment Test (SAT) and the American College Test (ACT) in undergraduate admissions to the public universities of California. In her…

  12. Correlation of the Scores on Barron's Ego Strength Scale with the Scores on the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1979-01-01

    The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)

  13. The Impact of Participation in the Advanced Placement Program on Students' College Admissions Test Scores

    ERIC Educational Resources Information Center

    Warne, Russell T.; Larsen, Ross; Anderson, Braydon; Odasso, Alyce J.

    2015-01-01

    The Advanced Placement (AP) program is an educational program that permits high school students to take introductory college-level courses and receive college credit by passing a standardized end-of-course exam. Data were obtained from a statewide database of 2 high school graduating cohorts (N = 90,044). We used a series of propensity score…

  14. "Score Choice": A Tempest in a Teapot?

    ERIC Educational Resources Information Center

    Hoover, Eric

    2009-01-01

    A new option that allows students to choose which of their test scores to send to colleges has generated renewed criticism of the College Board. College Board officials tout the option, called Score Choice, as a way to ease test taker anxiety. Some prominent admissions officials have publicly described Score Choice as a sales tactic that will…

  15. Primary and Secondary Selection Tools in an Optometry Admission Process.

    ERIC Educational Resources Information Center

    Spafford, Marlee M.

    2000-01-01

    A five-year evaluation of the admissions decision process at the University of Waterloo (Ontario) School of Optometry found that when primary tools (i.e., university grades, Optometry Admission Test scores) did not differentiate candidates, there was an increased emphasis on secondary tools (i.e., interview, autobiographic sketch, prerequisite…

  16. Relationship between Praxis 1 Scores and SAT/ACT Scores: A Correlational Study

    ERIC Educational Resources Information Center

    Saravanabhavan, Sheila; Jones, Enid B.; Wilson, Carolyn H.

    2005-01-01

    The purpose of this paper is to test the premise that there is a significant correlation between Praxis1 scores and SAT scores among African American students who are applying for admission into the teacher education program. Data for the study included the Praxis 1 (reading, writing and math) scores and SAT (reading, writing and math) scores of…

  17. Test-Optional Admission Policies and Their Effect on the Composition of the Student Body at King's College

    ERIC Educational Resources Information Center

    Schmude, Michelle

    2011-01-01

    This study examined test-optional admission policies and their effect on the composition of the student body at King's College. According to the College Board (2009), students from underrepresented populations do not perform as well as White students on standardized tests. Therefore, higher education institutions have established test-optional…

  18. Digit symbol substitution test score and hyperhomocysteinemia in older adults.

    PubMed

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-08-01

    Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions.Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments.This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = -1.6, 95% confidence interval [CI] = -2.8∼-0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = -1.1, 95% CI = -2.1∼-0.1, P = 0.04). We did not find an association between hyperhomocysteinemia and other

  19. Digit symbol substitution test score and hyperhomocysteinemia in older adults

    PubMed Central

    Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun

    2016-01-01

    Abstract Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions. Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments. This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = −1.6, 95% confidence interval [CI] = −2.8∼−0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = −1.1, 95% CI = −2.1∼−0.1, P = 0.04). We did not find an association between

  20. School accountability and the black-white test score gap.

    PubMed

    Gaddis, S Michael; Lauen, Douglas Lee

    2014-03-01

    Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality.

  1. Assessing the Usefulness of SAT and ACT Tests in Minority Admissions

    ERIC Educational Resources Information Center

    Micceri, Theodore

    2010-01-01

    This study sought to determine whether the use of standardized test scores contributes any useful information regarding First Time in College (FTIC) students' probable success at USF, using more detailed analysis of underrepresented minorities and women, who Micceri (2009) shows, experience substantial negative bias relative to males and whites on…

  2. An Investigation into the Relationships Between Cloze Test Scores and Informal Reading Inventory Scores of Fifth Grade Pupils.

    ERIC Educational Resources Information Center

    Walter, Richard Barry

    This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…

  3. GMAT Scores of Undergraduate Economics Majors

    ERIC Educational Resources Information Center

    Nelson, Paul A.; Monson, Terry D.

    2008-01-01

    The average score of economics majors on the Graduate Management Admission Test (GMAT) exceeds those of nearly all humanities and arts, social sciences, and business undergraduate majors but not those of most science, engineering, and mathematics majors. (Contains 1 table.)

  4. The Relationship of Scores on Elizur's Hostility System on the Rorschach to the Acting-Out Score on the Hand Test.

    ERIC Educational Resources Information Center

    Martin, John D.; And Others

    1978-01-01

    The relationship between Elizur's Hostility Scoring on the Rorschach Test and the Acting-Out Score on the Hand Test was examined. Correlations between the two measures (using several scoring procedures) ranged from .40 to .64. (JKS)

  5. Assessment of TOEFL Scores and ESL Classes as Criteria for Admission to Career & Technical Education and Other Selected Marshall University Graduate Programs.

    ERIC Educational Resources Information Center

    Person, Nancy E.

    The academic success of international students who are admitted to Marshall University's career and technical education (CTE) and other selected graduate degree programs was examined. The academic success of students who were admitted to Marshall based on their Test of English as a Foreign Language (TOEFL) scores was compared to that students who…

  6. The Visual Aural Digit Span Test and Bender Gestalt Test as Predictors of Wide Range Achievement Test-Revised Scores.

    ERIC Educational Resources Information Center

    Smith, Teresa C.; Smith, Billy L.

    1988-01-01

    Examined Visual Aural Digit Span Test (VADS) and Bender-Gestalt (BG) scores as predictors of Wide Range Achievement Test-Revised (WRAT-R) scores among 115 elementary school students referred for low academic achievement. Divided children into three age groups. Results suggest BG and VADS Test can be effective screening devices for young children…

  7. Score Gains on "g"-Loaded Tests: No "g"

    ERIC Educational Resources Information Center

    te Nijenhuis, Jan; van Vianen, Annelies E. M.; van der Flier, Henk

    2007-01-01

    IQ scores provide the best general predictor of success in education, job training, and work. However, there are many ways in which IQ scores can be increased, for instance by means of retesting or participation in learning potential training programs. What is the nature of these score gains? Jensen [Jensen, A. R. (1998a). "The g factor: The…

  8. Use of Practice Tests in the Prediction of GED Test Scores.

    ERIC Educational Resources Information Center

    Herring, Warren

    1999-01-01

    Analyzes the relationship between success on the two new practice-test forms (EE and FF) developed by Steck-Vaughn and success on the General Educational Development (GED) test. Success with practice-form EE correlated with GED test scores; form FF failed to correlate. (JOW)

  9. Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

    ERIC Educational Resources Information Center

    Meijer, Rob R.

    2004-01-01

    Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…

  10. Four-Stage Audit Demonstrating Increased Uptake of HIV Testing in Acute Neurology Admissions Using Staged Practical Interventions

    PubMed Central

    Sokhi, Dilraj Singh; Oxenham, Chantal; Coates, Rebecca; Forbes, Mhairi; Gupta, Nadi K.; Blackburn, Daniel J.

    2015-01-01

    Background UK National Guidelines (UKNG) advise HIV testing in clinically indicated neurological presentations. We audited the impact of our practical strategies to increase uptake of HIV testing at a regional acute neurology admissions unit. Methods We audited HIV testing in 4 periods over 2 years: before we designed a UKNG-based “HIV testing in Neurology” protocol (“pre-protocol”); after dissemination of the protocol alone (“post-protocol”); post-protocol dissemination combined with both a tailored departmental admissions clerking proforma to prompt for HIV testing & consenting, and regular focussed tutorials to doctors on HIV testing in neurological patients (“post-proforma”); and finally one year after the post-proforma period (“+1 year”). We also looked at the total number of HIV tests sent from the unit during the two-year period. We assessed significance using Fisher’s exact test. Results 47.8% of all acute neurology non-stroke admissions were eligible for HIV testing during all the audit periods. Testing rates were as follows: pre-protocol 21.9%; post-protocol 36.6%; post-proforma 83.3%; and at +1 year 65.4% (p<0.05 for both post-protocol and +1 year when compared to pre-protocol). Documentation of consent for HIV testing improved from 25% to 67.6% with the HIV-tailored clerking proforma. The total number of HIV tests requested from the unit doubled in the post-proforma period compared to pre-protocol (p<0.05). Conclusion In conclusion: the combination of an HIV testing protocol, a tailored departmental clerking proforma and regular focussed teaching to doctors on indications for HIV testing led to a sustained increase in HIV testing uptake in our regional acute neurology admissions unit. PMID:26335351

  11. Linking Scores from Tests of Similar Content Given in Different Languages: Spanish Language PAA™ and English Language SAT® I. Research Report No. 2003-5. ETS RR-03-20

    ERIC Educational Resources Information Center

    Cascallar, Alicia S.; Dorans, Neil J.

    2003-01-01

    Score linkages between the Verbal and Math sections of the SAT® I: Reasoning Test and the corresponding sections of the new version of a Spanish-language admissions test, the Prueba de Aptitud Academica® (PAA™) were investigated. A bilingual group design was employed. A language proficiency measure (ESLAT) was used to define the bilingual group…

  12. The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

    ERIC Educational Resources Information Center

    Retnawati, Heri

    2015-01-01

    This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…

  13. Test Scores Count! A Handbook for Teaching Test-Taking Skills.

    ERIC Educational Resources Information Center

    Koenigs, Sharon

    This handbook provides guidelines for teaching test-taking skills to students of all grade levels to help the students raise their standardized test scores. Topics covered include: understanding instructions and following directions, efficient use of time, intelligent guessing, and application of special strategies for multiple-choice and…

  14. Test/score/report: Simulation techniques for automating the test process

    NASA Technical Reports Server (NTRS)

    Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

    1994-01-01

    A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary

  15. Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

    ERIC Educational Resources Information Center

    Zenisky, April L.; Hambleton, Ronald K.

    2012-01-01

    Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…

  16. Evidence-Based Decision about Test Scoring Rules in Clinical Anatomy Multiple-Choice Examinations

    ERIC Educational Resources Information Center

    Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia

    2015-01-01

    In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…

  17. The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

    ERIC Educational Resources Information Center

    Molenaar, Dylan; Borsboom, Denny

    2013-01-01

    Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…

  18. A Diet Score Assessing Norwegian Adolescents’ Adherence to Dietary Recommendations—Development and Test-Retest Reproducibility of the Score

    PubMed Central

    Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

    2016-01-01

    Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312

  19. A Diet Score Assessing Norwegian Adolescents' Adherence to Dietary Recommendations-Development and Test-Retest Reproducibility of the Score.

    PubMed

    Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

    2016-01-01

    Assessment of adolescents' dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents' adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen's Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14-15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312

  20. Scoring and Testing Procedures Devoted to Probabilistic Seismic Hazard Assessment

    NASA Astrophysics Data System (ADS)

    Albarello, Dario; D'Amico, Vera

    2015-03-01

    This review addresses long-term (tens of years) seismic ground-motion forecasting (seismic hazard assessment) in the presence of alternative computational models (the so-called epistemic uncertainty affecting hazard estimates). We review the different approaches that have been proposed to manage epistemic uncertainty in the context of probabilistic seismic hazard assessment (PSHA). Ex- ante procedures (based on the combination of expert judgments about inherent characteristics of the PSHA model) and ex- post approaches (based on empirical comparison of model outcomes and observations) should not be considered as mutually exclusive alternatives but can be combined in a coherent Bayesian view. Therefore, we propose a procedure that allows a better exploitation of available PSHA models to obtain comprehensive estimates, which account for both epistemic and aleatory uncertainty. We also discuss the respective roles of empirical ex-post scoring and testing of alternative models concurring in the development of comprehensive hazard maps. In order to show how the proposed procedure may work, we also present a tentative application to the Italian area. In particular, four PSHA models are evaluated ex-post against macroseismic effects actually observed in a large set of Italian municipalities during the time span 1957-2006. This analysis shows that, when the whole Italian area is considered, all the models provide estimates that do not agree with the observations. However, two of them provide results that are compatible with observations, when a subregion of Italy (Apulia Region) is considered. By focusing on this area, we computed a comprehensive hazard curve for a single locality in order to show the feasibility of the proposed procedure.

  1. Construct Validity and Test Re-Test Reliability of the Forgotten Joint Score.

    PubMed

    Thompson, Simon M; Salmon, Lucy J; Webb, Justin M; Pinczewski, Leo A; Roe, Justin P

    2015-11-01

    Consecutive patients undergoing knee arthroplasty completed questionnaires: FJS, Knee Injury and Osteoarthritis Outcome Score (KOOS) and WOMAC Score (mean 39 months after surgery), and were mailed a repeat questionnaire after 4 to 6 weeks. The test-retest reliability was almost perfect for the FJS (ICC = 0.97), and the FJS subdomains (ICC > 0.8). Convergent construct validity of the FJS was correlated with the KOOS Subscores of Quality of Life (0.63, P = 0.001), Symptom (0.33, P = 0.001), Pain (0.68, P = 0.001) and ADL (0.66, P = 0.001) and the Total WOMAC (0.70, P = 0.001). The FJS demonstrates high test-retest reliability and construct validity compared to the Normalised WOMAC and KOOS Subscales. The FJS does not demonstrate the ceiling effect of the WOMAC or KOOS pain scores so may have greater discriminatory ability following TKR.

  2. Relationships between Gender and Alberta Achievement Test Scores during a Four-Year Period

    ERIC Educational Resources Information Center

    Pope, Gregory A.; Wentzel, Carolyn; Braden, Brigitta; Anderson, Jordan

    2006-01-01

    The purpose of this study was to investigate statistical relationships between gender and Alberta Achievement Testing Program scores. Achievement test scores from grades 3, 6, and 9 in all subject areas were investigated during a four-year period. Results showed statistically significant positive correlations between gender and scores in most…

  3. Difficulty and Discriminating Indices of Three-Multiple Choice Tests Using the Confidence Scoring Procedure

    ERIC Educational Resources Information Center

    Omirin, M. S.

    2007-01-01

    The study investigated the comparison of the difficulty and discrimination incides of three multiple choice tests using the confidence scoring procedure (CSP). The study was also set to determine whether or not the difficulty and discrimination indices would be improved, if the tests were scored by the confidence scoring procedure. Two null…

  4. Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

    ERIC Educational Resources Information Center

    Matton, Nadine; Vautier, Stephane; Raufaste, Eric

    2009-01-01

    Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…

  5. How Parents Can Help Kids Improve Test Scores: Taking the Stakes out of Literacy Testing

    ERIC Educational Resources Information Center

    Schneider, Steven

    2006-01-01

    In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…

  6. Individual Organ Failure and Concomitant Risk of Mortality Differs According to the Type of Admission to ICU – A Retrospective Study of SOFA Score of 23,795 Patients

    PubMed Central

    Bingold, Tobias M.; Lefering, Rolf; Zacharowski, Kai; Meybohm, Patrick; Waydhas, Christian; Rosenberger, Peter; Scheller, Bertram

    2015-01-01

    Introduction Organ dysfunction or failure after the first days of ICU treatment and subsequent mortality with respect to the type of intensive care unit (ICU) admission is poorly elucidated. Therefore we analyzed the association of ICU mortality and admission for medical (M), scheduled surgery (ScS) or unscheduled surgery (US) patients mirrored by the occurrence of organ dysfunction/failure (OD/OF) after the first 72h of ICU stay. Methods For this retrospective cohort study (23,795 patients; DIVI registry; German Interdisciplinary Association for Intensive Care Medicine (DIVI)) organ dysfunction or failure were derived from the Sequential Organ Failure Assessment (SOFA) score (excluding the Glasgow Coma Scale). SOFA scores were collected on admission to ICU and 72h later. For patients with a length of stay of at least five days, a multivariate analysis was performed for individual OD/OF on day three. Results M patients had the lowest prevalence of cardiovascular failure (M 31%; ScS 35%; US 38%), and the highest prevalence of respiratory (M 24%; ScS 13%; US 17%) and renal failure (M 10%; ScS 6%; US 7%). Risk of death was highest for M- and ScS-patients in those with respiratory failure (OR; M 2.4; ScS 2.4; US 1.4) and for surgical patients with renal failure (OR; M 1.7; ScS 2.7; US 2.4). Conclusion The dynamic evolution of OD/OF within 72h after ICU admission and mortality differed between patients depending on their types of admission. This has to be considered to exclude a systematic bias during multi-center trials. PMID:26241475

  7. Improving Student Achievement: What State NAEP Test Scores Tell Us.

    ERIC Educational Resources Information Center

    Grissmer, David; Flanagan, Ann; Kawata, Jennifer; Williamson, Stephanie

    This report uses data from the National Assessment of Educational Progress (NAEP) from 1990 through 1996 to estimate score gains nationally and by state. It also uses these data to estimate the effects of varying levels and uses of per-pupil expenditures. The report also estimates the cost-effectiveness of the major alternatives for using…

  8. Transiency, Test Scores, and the Public: One School District's Story

    ERIC Educational Resources Information Center

    Sanderson, D.R.

    2004-01-01

    Research documents that transient students who change schools frequently oftensuffer from low academic achievement. This article investigates standardized group measures by disentangling elementary achievement scores. Located in a highly transient area outside of Philadelphia, Pennsylvania, Main Street School had their fifth grade Pennsylvania…

  9. Why African American College Students Miss the Perfect Test Score

    ERIC Educational Resources Information Center

    Gentry, Ruben; Stokes, Dorothy

    2016-01-01

    Many African Americans were imbued with the cliché that they must work twice as hard as others to be a success in life. Entering college, students with this belief put extensive effort into earning top grades to ensure quality preparation for their chosen career; yet, some fail to earn top scores. Why? This is the million dollar question, but the…

  10. A Study of Methods for Estimating Distributions of Test Scores.

    ERIC Educational Resources Information Center

    Cope, Ronald T.; Kolen, Michael J.

    This study compared five density estimation techniques applied to samples from a population of 272,244 examinees' ACT English Usage and Mathematics Usage raw scores. Unsmoothed frequencies, kernel method, negative hypergeometric, four-parameter beta compound binomial, and Cureton-Tukey methods were applied to 500 replications of random samples of…

  11. Demystifying the GMAT: Where Do Scale Scores Comes from?

    ERIC Educational Resources Information Center

    Rudner, Lawrence M.

    2012-01-01

    GMAT (Graduate Management Admission Test) scaled scores convey the same level of ability over time, and GMAT percentiles convey the competitiveness of scores relative to today's GMAT test takers. In an earlier column, the author discussed the role of the GMAT scaled scores and percentiles. Here, he gets more technical and discusses how GMAT scaled…

  12. The Effects of Changing Answers on Scores of Non-Test-Sophisticated Examinees

    ERIC Educational Resources Information Center

    Smith, Ann; And Others

    1976-01-01

    The effects of answer changes on test scores of relatively non-test-sophisticated examinees was studied. Findings showed that a significantly greater number of changes were from wrong to right, thus increasing total test scores. Answer-changing behavior may reflect more than learned test-taking skill. (Author)

  13. Understanding the Gender Gap in Science and Engineering: Evidence from the Chilean College Admissions Tests

    ERIC Educational Resources Information Center

    Gándara, Fernanda; Silva, Monica

    2016-01-01

    This study seeks to develop a better understanding of the underrepresentation of women in science and engineering by analyzing the gender gaps (a) in the interest in pursuing a science degree and (b) on science achievement. We use national-level college admissions data to examine gender differences and to explore the association between these…

  14. Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

    PubMed

    Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

    2001-01-01

    Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.

  15. Can Machine Scoring Deal with Broad and Open Writing Tests as Well as Human Readers?

    ERIC Educational Resources Information Center

    McCurry, Doug

    2010-01-01

    This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…

  16. Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

    2014-01-01

    Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…

  17. D.C. Student Test Scores Show Uneven Progress. Data Snapshot

    ERIC Educational Resources Information Center

    DuPre, Mary

    2011-01-01

    Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…

  18. A Comparison of Three Methods of Scoring True-False Tests.

    ERIC Educational Resources Information Center

    Hsu, Louis M.

    1979-01-01

    Though the Paired-Item-Score (Eakin and Long) (EJ 174 780) method of scoring true-false tests has certain advantages over the traditional scoring methods (percentage right and right minus wrong), these advantages are attained at the cost of a larger risk of misranking the examinees. (Author/BW)

  19. The value of Bayes' theorem for interpreting abnormal test scores in cognitively healthy and clinical samples.

    PubMed

    Gavett, Brandon E

    2015-03-01

    The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed.

  20. Further Validation of the Qualitative Scoring System for the Modified Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Brannigan, Gary G.; And Others

    1995-01-01

    Compares the Qualitative Scoring System and the Developmental Scoring Systems, both Bender-Gestalt tests, in predicting achievement on the Metropolitan Achievement Test (MAT). In this study, first through fourth graders (n=409) from regular elementary schools were subjected to both tests; both systems correlated significantly with school…

  1. Are Score Comparisons across Language Proficiency Test Batteries Justified?: An IELTS-TOEFL Comparability Study.

    ERIC Educational Resources Information Center

    Geranpayeh, Ardeshir

    1994-01-01

    This paper reports on a study conducted to determine if comparisons between scores on the Test of English as a Foreign Language (TOEFL) and the International English Language Testing Service (IELTS) are justifiable. The test scores of 216 Iranian graduate students who took the TOEFL and IELTS, as well as the Iranian Ministry of Culture and Higher…

  2. Noncognitive Skills and the Gender Disparities in Test Scores and Teacher Assessments: Evidence from Primary School

    ERIC Educational Resources Information Center

    Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica

    2013-01-01

    Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…

  3. Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

    ERIC Educational Resources Information Center

    Klesch, Heather S.

    2010-01-01

    The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…

  4. School Inputs, Household Substitution, and Test Scores. NBER Working Paper No. 16830

    ERIC Educational Resources Information Center

    Das, Jishnu; Dercon, Stefan; Habyarimana, James; Krishnan, Pramila; Muralidharan, Karthik; Sundararaman, Venkatesh

    2011-01-01

    Empirical studies of the relationship between school inputs and test scores typically do not account for the fact that households will respond to changes in school inputs. We present a dynamic household optimization model relating test scores to school and household inputs, and test its predictions in two very different low-income country…

  5. Note on the Scoring of Foreign Language Speaking and Writing Fluency Tests.

    ERIC Educational Resources Information Center

    Carroll, John B.

    The problem of determining relative weights for quantity and quality in scoring foreign language speaking and writing fluency tests is studied. French speaking and writing fluency tests were administered to students of French in several schools in England. Data from these tests was analyzed to support the suggestion that scoring formulas should…

  6. Clinical Importance of the Heel Drop Test and a New Clinical Score for Adult Appendicitis

    PubMed Central

    Ahn, Shin; Lee, Hyeji; Choi, Wookjin; Ahn, Ryeok; Hong, Jung-Suk; Sohn, Chang Hwan; Seo, Dong Woo; Lee, Yoon-Seon; Lim, Kyung Soo; Kim, Won Young

    2016-01-01

    Objective We tried to evaluate the accuracy of the heel drop test in patients with suspected appendicitis and tried to develop a new clinical score, which incorporates the heel drop test and other parameters, for the diagnosis of this condition. Methods We performed a prospective observational study on adult patients with suspected appendicitis at two academic urban emergency departments between January and August 2015. The predictive characteristics of each parameter, along with heel drop test results were calculated. A composite score was generated by logistic regression analysis. The performance of the generated score was compared to that of the Alvarado score. Results Of the 292 enrolled patients, 165 (56.5%) had acute appendicitis. The heel drop test had a higher predictive value than rebound tenderness. Variables and their points included in the new (MESH) score were pain migration (2), elevated white blood cell (WBC) >10,000/μL (3), shift to left (2), and positive heel drop test (3). The MESH score had a higher AUC than the Alvarado score (0.805 vs. 0.701). Scores of 5 and 11 were chosen as cut-off values; a MESH score ≥5 compared to an Alvarado score ≥5, and a MESH score ≥8 compared to an Alvarado score ≥7 showed better performance in diagnosing appendicitis. Conclusion MESH (migration, elevated WBC, shift to left, and heel drop test) is a simple clinical scoring system for assessing patients with suspected appendicitis and is more accurate than the Alvarado score. Further validation studies are needed. PMID:27723842

  7. Practice effects in medical school entrance testing with the undergraduate medicine and health sciences admission test (UMAT)

    PubMed Central

    2014-01-01

    Background The UMAT is widely used for selection into undergraduate medical and dental courses in Australia and New Zealand (NZ). It tests aptitudes thought to be especially relevant to medical studies and consists of 3 sections – logical reasoning and problem solving (UMAT-1), understanding people (UMAT-2) and non-verbal reasoning (UMAT-3). A substantial proportion of all candidates re-sit the UMAT. Re-sitting raises the issue as to what might be the precise magnitude and determinants of any practice effects on the UMAT and their implications for equity in subsequent selection processes. Methods Between 2000 and 2012, 158,909 UMAT assessments were completed. From these, 135,833 cases were identified where a candidate had sat once or more during that period with 117,505 cases (86.5%) having sat once, 14,739 having sat twice (10.9%), 2,752 thrice (2%) and 837, 4 or more times (0.6%). Subsequent analyses determined predictors of multiple re-sits as well as the magnitude and socio-demographic determinants of any practice effects. Results Increased likelihood of re-sitting the UMAT twice or more was predicted by being male, of younger age, being from a non-English language speaking background and being from NZ and for Australian candidates, being urban rather than rurally based. For those who sat at least twice, the total UMAT score between a first and second attempt improved by 10.7 ± 0.2 percentiles, UMAT-1 by 8.3 ± 0.2 percentiles, UMAT-2 by 8.3 ± 0.2 percentiles and UMAT-3 by 7.7 ± 0.2 percentiles. An increase in total UMAT percentile score on re-testing was predicted by a lower initial score and being a candidate from NZ rather than from Australia while a decrease was related to increased length of time since initially sitting the test, older age and non-English language background. Conclusions Re-sitting the UMAT augments performance in each of its components together with the total UMAT percentile score. Whether this increase represents just an improvement

  8. Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

    ERIC Educational Resources Information Center

    Zou, Xiao-Ling; Chen, Yan-Min

    2016-01-01

    The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…

  9. The Scoring of Matching Questions Tests: A Closer Look

    ERIC Educational Resources Information Center

    Jancarík, Antonín; Kostelecká, Yvona

    2015-01-01

    Electronic testing has become a regular part of online courses. Most learning management systems offer a wide range of tools that can be used in electronic tests. With respect to time demands, the most efficient tools are those that allow automatic assessment. The presented paper focuses on one of these tools: matching questions in which one…

  10. Increases in Test Scores as a Function of Material Rewards.

    ERIC Educational Resources Information Center

    Tuinman, J. Jaap; And Others

    From the entire population (N=341) of grades 7 and 8 in a rural Indiana junior high school, 160 subjects were randomly selected and assigned to the experimental and the control groups. Form A of the Nelson Reading Test was administered twice with a 4-week interval. While the control group was told only that the post-test was given to measure how…

  11. Comparing Scores on Individual Intelligence Tests for Disabled Readers.

    ERIC Educational Resources Information Center

    Busch, Robert F.

    1981-01-01

    Concludes that the Slosson Intelligence Test and the Leiter International Performance Scale measure only limited aspects of reading disabled students' abilities as measured by the Weschsler Intelligence Scale for Children--Revised. (FL)

  12. Comparison of Two Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Schachter, Steven; And Others

    1991-01-01

    Examined relative utility of two scoring systems for Modified Version of Bender-Gestalt Test in predicting performance on Developmental Test of Visual-Motor Integration. Findings from 53 kindergarten and 47 first grade students indicated that Qualitative Scoring System was significantly better predictor of visual-motor integration skills than…

  13. Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

    ERIC Educational Resources Information Center

    Pellicer-Sanchez, Ana; Schmitt, Norbert

    2012-01-01

    Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…

  14. Using Raters from India to Score a Large-Scale Speaking Test

    ERIC Educational Resources Information Center

    Xi, Xiaoming; Mollaun, Pam

    2011-01-01

    We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…

  15. The Influence of an NCLB Accountability Plan on the Distribution of Student Test Score Gains

    ERIC Educational Resources Information Center

    Springer, Matthew G.

    2008-01-01

    Previous research on the effect of accountability programs on the distribution of student test score gains is decidedly mixed. This study examines the issue by estimating an educational production function in which test score gains are a function of the incentives schools have to focus instruction on below-proficient students. NCLB's threat of…

  16. AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

    ERIC Educational Resources Information Center

    Cech, Scott J.

    2008-01-01

    More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…

  17. Correcting for Test Score Measurement Error in ANCOVA Models for Estimating Treatment Effects

    ERIC Educational Resources Information Center

    Lockwood, J. R.; McCaffrey, Daniel F.

    2014-01-01

    A common strategy for estimating treatment effects in observational studies using individual student-level data is analysis of covariance (ANCOVA) or hierarchical variants of it, in which outcomes (often standardized test scores) are regressed on pretreatment test scores, other student characteristics, and treatment group indicators. Measurement…

  18. Kindergarten Black-White Test Score Gaps: Replicating and Updating Previous Findings with New National Data

    ERIC Educational Resources Information Center

    Quinn, David

    2014-01-01

    A substantial body of evidence has shown large academic test score gaps between black and white students in early childhood. These gaps remain, and probably grow, as students progress through school. Many researchers have sought to explain these persistent test score gaps, and particularly, to understand the role of students' socio-economic status…

  19. Effects of Scoring by Section and Independent Scorers' Patterns on Scorer Reliability in Biology Essay Tests

    ERIC Educational Resources Information Center

    Ebuoh, Casmir N.; Ezeudu, S. A.

    2015-01-01

    The study investigated the effects of scoring by section, use of independent scorers and conventional patterns on scorer reliability in Biology essay tests. It was revealed from literature review that conventional pattern of scoring all items at a time in essay tests had been criticized for not being reliable. The study was true experimental study…

  20. An Investigation of Methods for Improving Estimation of Test Score Distributions.

    ERIC Educational Resources Information Center

    Hanson, Bradley A.

    Three methods of estimating test score distributions that may improve on using the observed frequencies (OBFs) as estimates of a population test score distribution are considered: the kernel method (KM); the polynomial method (PM); and the four-parameter beta binomial method (FPBBM). The assumption each method makes about the smoothness of the…

  1. The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

    ERIC Educational Resources Information Center

    Shaw, Tommetta

    2010-01-01

    Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…

  2. The Effect of Misinformation, Partial Information, and Guessing on Expected Multiple-Choice Test Item Scores.

    ERIC Educational Resources Information Center

    Frary, Robert B.

    1980-01-01

    Six scoring methods for assigning weights to right or wrong responses according to various instructions given to test takers are analyzed with respect to expected change scores and the effect of various levels of information and misinformation. Three of the methods provide feedback to the test taker. (Author/CTM)

  3. The Dynamics of the Evolution of the Black-White Test Score Gap

    ERIC Educational Resources Information Center

    Sohn, Kitae

    2012-01-01

    We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…

  4. The Effects of Accountability System Design on Teachers' Use of Test Score Data

    ERIC Educational Resources Information Center

    Jennings, Jennifer L.

    2012-01-01

    Background/Context: Many studies have concluded that educational accountability policies increase data use, but we know little about how to design accountability systems to encourage productive versus distortive uses of test score data. Purpose: I propose that five features of accountability systems affect how test score data are used and examine…

  5. The Effects of Developmental Placement and Early Retention on Children's Later Scores on Standardized Tests.

    ERIC Educational Resources Information Center

    May, Deborah C.; Welch, Edward L.

    1984-01-01

    Examined the relationship between early school retention as a result of preschool and kindergarten developmental testing and children's later academic achievement (N=223). Results showed children who scored as immature on the Gesell Screening Test and who were retained a year had the lowest scores on all measures. (JAC)

  6. Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

    ERIC Educational Resources Information Center

    Sakellariou, Chris

    2008-01-01

    This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…

  7. Analysis of Score Change Patterns of Examinees Repeating the Graduate Record Examinations General Test.

    ERIC Educational Resources Information Center

    Kingston, Neal; Turner, Nancy

    This investigation examines the impact the l98l Graduate Record Examination (GRE) General Test Format Revision had on the stability over time of the verbal, quantitative, and analytical scores. Scores were used from the self-selected group of repeaters who took the GRE General Test twice between October 1980 and June 1982. Examinees were divided…

  8. Many Children Left Behind? Textbooks and Test Scores in Kenya. NBER Working Paper No. 13300

    ERIC Educational Resources Information Center

    Glewwe, Paul; Kremer, Michael; Moulin, Sylvie

    2007-01-01

    A randomized evaluation suggests that a program which provided official textbooks to randomly selected rural Kenyan primary schools did not increase test scores for the average student. In contrast, the previous literature suggests that textbook provision has a large impact on test scores. Disaggregating the results by students' initial academic…

  9. Are Mathematics and Science Test Scores Good Indicators of Labor-Force Quality?

    ERIC Educational Resources Information Center

    Chen, Shiu-Sheng; Luoh, Ming-Ching

    2010-01-01

    Using data from the Programme for International Student Assessment (PISA) and the Trends in International Mathematics and Science Study (TIMSS), we investigate the link between test scores (mathematics and science) and cross-country income differences. We would like to know whether test scores are good indicators of labor-force quality. The…

  10. Alignment, High Stakes, and the Inflation of Test Scores. CSE Report 655

    ERIC Educational Resources Information Center

    Koretz, Daniel

    2005-01-01

    There are many reasons to align tests with curricular standards, but this alignment is not sufficient to protect against score inflation. This report explains the relationship between alignment and score inflation by clarifying what is meant by inappropriate test preparation. It provides a concrete, hypothetical example that illustrates a process…

  11. Improving behavior observation audiometry testing and scoring procedures.

    PubMed

    Gans, D P

    1987-04-01

    The need exists for improving behavioral testing of low-functioning children, particularly profoundly involved multiply handicapped children. In this study, behavior observation audiometry (BOA) was modified for systematic evaluation of childrens' responses to sound without observer bias. The data were analyzed statistically and a minimum response level was computed. This BOA technique was used to test the hearing of 82 profoundly involved handicapped children. Results suggested that BOA can be helpful in evaluating the hearing of low-functioning children.

  12. Medical School Application Interview Score Has Limited Predictive Validity for Performance on a Fourth Year Clinical Practice Examination

    ERIC Educational Resources Information Center

    Basco, William T., Jr.; Lancaster, Carol J.; Gilbert, Gregory E.; Carey, Maura E.; Blue, Amy V.

    2008-01-01

    Background and purpose: Data supporting the predictive validity of the medical school admission interview are mixed. This study tested the hypothesis that the admission interview is predictive of interpersonal interactions between medical students and standardized patients. Method: We determined correlations between admission interview scores and…

  13. An Item Analysis and Validity Investigation of Bender Visual Motor Gestalt Test Score Items

    ERIC Educational Resources Information Center

    Lambert, Nadine M.

    1971-01-01

    This investigation attempted to demonstrate the utility of standard item analysis procedures for selecting the most reliable and valid items for scoring Bender Visual Motor Gestalt Test test records. (Author)

  14. Shocking Admission

    ERIC Educational Resources Information Center

    Hoover, Eric; Millman, Sierra

    2007-01-01

    Marilee Jones's career had been a remarkable success. She joined Massachusetts Institute of Technology's (MIT's) admissions office in 1979, landing a job in Cambridge at a time when boys ruled the sandbox of the admissions profession. Her job was to help MIT recruit more women, who then made up less than one-fifth of the institute's students. She…

  15. Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

    ERIC Educational Resources Information Center

    Livingston, Samuel A.; Chen, Haiwen H.

    2015-01-01

    Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…

  16. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    ERIC Educational Resources Information Center

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  17. Test Scores and the Rural School and Community Trust.

    ERIC Educational Resources Information Center

    Rural School and Community Trust, Washington, DC.

    A number of studies suggests that the small size of many rural schools gives their students, especially the poorest, a leg up on academic achievement. This notion is supported by the standardized test results presented in this report, from a sample of the primarily small schools participating in the Rural School and Community Trust, a national…

  18. Allometric Scaling of Wingate Anaerobic Power Test Scores in Women

    ERIC Educational Resources Information Center

    Hetzler, Ronald K.; Stickley, Christopher D.; Kimura, Iris F.

    2011-01-01

    In this study, we developed allometric exponents for scaling Wingate anaerobic test (WAnT) power data that are reflective in controlling for body mass (BM) and lean body mass (LBM) and established a normative WAnT data set for college-age women. One hundred women completed a standard WAnT. Allometric exponents and percentile ranks for peak (PP)…

  19. Prevalence of low scores in children and adolescents on the test of verbal conceptualization and fluency.

    PubMed

    Brooks, Brian L; Iverson, Grant L; Koushik, Nikhil S; Mazur-Mosiewicz, Anya; Horton, Arthur MacNeill; Reynolds, Cecil R

    2013-01-01

    It is important to consider the prevalence of low scores when administering a battery of psychological tests. Understanding the prevalence of low scores is important for minimizing false-positive diagnoses of cognitive deficits in clinical practice. The purpose of this study was to expand the literature on base rates for use in children and adolescents. Participants were 408 healthy children and adolescents (M(age) = 13.1 years, SD = 3.7) and 139 children and adolescents (M(age) = 12.4 years, SD = 3.1) diagnosed with a medical, neurological, or learning condition. All participants were administered the Test of Verbal Conceptualization and Fluency (TVCF; Reynolds & Horton, 2006 ). The clinical sample performed significantly lower compared with the healthy control participants on three of the five TVCF scores. When all scores were considered simultaneously, 38% of healthy children obtained one or more scores below the 16th percentile and 15% had one or more scores in the 5th percentile or lower. By comparison, significantly higher proportions of children in the clinical sample had low scores below each of the five cutoffs (i.e., 63% had one or more test scores below the 16th percentile and 37% had one or more scores in the 5th percentile or lower). Our findings illustrate the importance of considering the prevalence of low TVCF scores in everyday clinical practice with children and adolescents.

  20. Differential Freshman Admission by Sex

    ERIC Educational Resources Information Center

    Suddick, David E.; McBee, M. Louise

    1974-01-01

    The authors report on a study whose purpose was to determine if, after adjusting for initial differences in high school averages and SAT scores via separate regression equations, differential admissions criterion by sex is justifiable. No justification is found. (RP)

  1. The Effects of Coaching on Standardized Admission Examinations. Staff Memorandum of the Boston Regional Office of the Federal Trade Commission.

    ERIC Educational Resources Information Center

    Federal Trade Commission, Washington, DC. Bureau of Consumer Protection.

    A non-experimental design was used to determine if scores of students enrolled in specified major coaching schools were significantly higher than scores of comparable uncoached groups. Score increases at two Scholastic Aptitude Test (SAT) coaching schools and Law School Admission Test (LSAT) schools were compared. Over 1,400 SAT examinees and…

  2. Personnel Test Battery and Scoring Procedures. Memorandum No. L.S. 15.

    ERIC Educational Resources Information Center

    Berson, Barry L.

    The purpose of this memo is to present tests that comprise the test battery used to select Navy personnel to train marine mammals, and to describe the scoring procedures of the tests. The test battery consists of: Biosystems General Information Test (BGIT), Personnel History Questionnaire (PHQ), Gordon Personal Inventory, Gordon Personal Profile,…

  3. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm.

    PubMed

    Beketayev, Kenes; Runco, Mark A

    2016-05-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the "Many Uses Test" of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered. PMID:27298632

  4. Wisconsin card sorting test: a new global score, with Italian norms, and its relationship with the Weigl sorting test.

    PubMed

    Laiacona, M; Inzaghi, M G; De Tanti, A; Capitani, E

    2000-10-01

    The Wisconsin card sorting test and the Weigl test are two neuropsychological tools widely used in clinical practice to assess frontal lobe functions. In this study we present norms useful for Italian subjects aged from 15 to 85 years, with 5-17 years of education. Concerning the Wisconsin card sorting test, a new measure of global efficiency (global score) is proposed as well as norms for some well known qualitative aspects of the performance, i.e. perseverative responses, failure to maintain the set and non-perseverative errors. In setting normative values, we followed a statistical methodology (equivalent scores) employed in Italy for other neuropsychological tests, in order to favour the possibility of comparison among these tests. A correlation study between the global score of the Wisconsin card sorting test and the score on the Weigl test was carried out and it emerges that some cognitive aspects are not overlapping in these two measures.

  5. Comparison of the Koppitz and Watkins Scoring Systems for the Bender Gestalt Test.

    ERIC Educational Resources Information Center

    Johnston, Cris W.; Lanak, Brenda

    1985-01-01

    The Bender Gestalt Test was administered to 25 children (7-10 years old) referred for neuropsychological assessment and scored using the Koppitz system and the Watkins system. Although the scores obtained using the two different sets of criteria were highly correlated, the Watkins rules produced generally better performance. (Author/CL)

  6. Using Test Scores from Students with Disabilities in Teacher Effectiveness Indicators

    ERIC Educational Resources Information Center

    Buzick, Heather M.; Jones, Nathan D.

    2015-01-01

    The increased emphasis on using student growth measures in teacher evaluation has raised questions about how to treat test scores from students with disabilities. This study explores the consequences of three common approaches for treating scores from students with disabilities in statistical approaches to estimating teacher effectiveness: (1)…

  7. TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

    ERIC Educational Resources Information Center

    Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

    2012-01-01

    Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…

  8. See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

    ERIC Educational Resources Information Center

    Blecher-Sass, Hope Sara; Moffitt, Maryellen

    2010-01-01

    Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…

  9. Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

    ERIC Educational Resources Information Center

    Kolen, Michael J.; Lee, Won-Chan

    2011-01-01

    This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…

  10. The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Baggerly, Jennifer; Ferretti, Larissa K.

    2008-01-01

    What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…

  11. Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

    ERIC Educational Resources Information Center

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-01-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…

  12. Language Variation and Score Variation in the Testing of English Language Learners, Native Spanish Speakers

    ERIC Educational Resources Information Center

    Solano-Flores, Guillermo; Li, Min

    2009-01-01

    We investigated language variation and score variation in the testing of English language learners, native Spanish speakers. We gave students the same set of National Assessment of Educational Progress mathematics items in both their first language and their second language. We examined the amount of score variation due to the main and interaction…

  13. Test Score or Student Progress? A Value-Added Evaluation of School Effectiveness in Urban China

    ERIC Educational Resources Information Center

    Peng, Pai; Hochweber, Jan; Klieme, Eckhard

    2013-01-01

    Outcome-oriented evaluation of school effectiveness is often based on student test scores in certain critical examinations. This study provides another method of evaluation--value-added--which is based on student achievement progress. This paper introduces the method of estimating the value-added score of schools in multi-level models. Based on…

  14. Use of Standardized Test Scores to Predict Success in a Computer Applications Course

    ERIC Educational Resources Information Center

    Harris, Robert V.; King, Stephanie B.

    2016-01-01

    The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…

  15. Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

    ERIC Educational Resources Information Center

    Farnsworth, Tim

    2013-01-01

    This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…

  16. Attacking the African American-White Achievement Gap on College Admissions Tests

    ERIC Educational Resources Information Center

    Nettles, Michael T.; Millett, Catherine M.; Ready, Douglas D.

    2003-01-01

    The African American-white achievement gap exists even among the youngest children; African American students arrive at kindergarten considerably behind their white peers in measurable cognitive skills. Although the gap has narrowed somewhat over the past several decades, the average African American still scores below 75 percent of white students…

  17. Loanwords and Vocabulary Size Test Scores: A Case of Different Estimates for Different L1 Learners

    ERIC Educational Resources Information Center

    Laufer, Batia; McLean, Stuart

    2016-01-01

    The article investigated how the inclusion of loanwords in vocabulary size tests affected the test scores of two L1 groups of EFL learners: Hebrew and Japanese. New BNC- and COCA-based vocabulary size tests were constructed in three modalities: word form recall, word form recognition, and word meaning recall. Depending on the test modality, the…

  18. An Attempt to Revise Scoring Conventions for the Token and Reporter's Tests.

    ERIC Educational Resources Information Center

    Hall, Penelope K.; Jordan, Linda S.

    1988-01-01

    Revised methods of scoring the Token Test and Reporter's Test were developed to accommodate specific types of errors committed by language-disordered children during a previous standardization study. Test modifications are explained as are the results of administering the revised tests to both normal and language-disordered school-aged children.…

  19. 76 FR 16350 - Medical Devices; Ovarian Adnexal Mass Assessment Score Test System; Labeling; Black Box Restrictions

    Federal Register 2010, 2011, 2012, 2013, 2014

    2011-03-23

    ...-alone diagnostic test, or as a test to determine whether or not to proceed with surgery. Elsewhere in... assessment score test system is intended for use in those patients for whom surgery is planned, and should not be used to decide whether or not a patient should receive surgery. The test is used in...

  20. Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

    ERIC Educational Resources Information Center

    Ho, Andrew D.; Reardon, Sean F.

    2012-01-01

    Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This article introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…

  1. Scoring Divergent Thinking Tests by Computer With a Semantics-Based Algorithm

    PubMed Central

    Beketayev, Kenes; Runco, Mark A.

    2016-01-01

    Divergent thinking (DT) tests are useful for the assessment of creative potentials. This article reports the semantics-based algorithmic (SBA) method for assessing DT. This algorithm is fully automated: Examinees receive DT questions on a computer or mobile device and their ideas are immediately compared with norms and semantic networks. This investigation compared the scores generated by the SBA method with the traditional methods of scoring DT (i.e., fluency, originality, and flexibility). Data were collected from 250 examinees using the “Many Uses Test” of DT. The most important finding involved the flexibility scores from both scoring methods. This was critical because semantic networks are based on conceptual structures, and thus a high SBA score should be highly correlated with the traditional flexibility score from DT tests. Results confirmed this correlation (r = .74). This supports the use of algorithmic scoring of DT. The nearly-immediate computation time required by SBA method may make it the method of choice, especially when it comes to moderate- and large-scale DT assessment investigations. Correlations between SBA scores and GPA were insignificant, providing evidence of the discriminant and construct validity of SBA scores. Limitations of the present study and directions for future research are offered. PMID:27298632

  2. A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

    PubMed

    Kosinski, Andrzej S

    2013-03-15

    Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting.

  3. HIV testing and clinical status upon admission to a specialized health care unit in Pará, Brazil

    PubMed Central

    Abati, Paulo Afonso Martins; Segurado, Aluisio Cotrim

    2015-01-01

    OBJECTIVE To analyze the clinical and laboratory characteristics of HIV-infected individuals upon admission to a reference health care center. METHODS This cross-sectional study was conducted between 1999 and 2010 on 527 individuals with confirmed serological diagnosis of HIV infection who were enrolled in an outpatient health care service in Santarém, PA, Northern Brazil. Data were collected from medical records and included the reason for HIV testing, clinical status, and count of peripheral CD4+ T lymphocytes upon enrollment. The data were divided into three groups, according to the patient’s year of admission – P1 (1999-2002), P2 (2003-2006), and P3 (2007-2010) – for comparative analysis of the variables of interest. RESULTS In the study group, 62.0% of the patients were assigned to the P3 group. The reason for undergoing HIV testing differed between genders. In the male population, most tests were conducted because of the presence of symptoms suggesting infection. Among women, tests were the result of knowledge of the partner’s seropositive status in groups P1 and P2. Higher proportion of women undergoing testing because of symptoms of HIV/AIDS infection abolished the difference between genders in the most recent period. A higher percentage of patients enrolling at a more advanced stage of the disease was observed in P3. CONCLUSIONS Despite the increased awareness of the number of HIV/AIDS cases, these patients have identified their serological status late and were admitted to health care units with active disease. The HIV/AIDS epidemic in Pará presents specificities in its progression that indicate the complex characteristics of the epidemic in the Northern region of Brazil and across the country. PMID:25741647

  4. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    ERIC Educational Resources Information Center

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  5. Psychometric Evaluation of the Lower Extremity Computerized Adaptive Test, the Modified Harris Hip Score, and the Hip Outcome Score

    PubMed Central

    Hung, Man; Hon, Shirley D.; Cheng, Christine; Franklin, Jeremy D.; Aoki, Stephen K.; Anderson, Mike B.; Kapron, Ashley L.; Peters, Christopher L.; Pelt, Christopher E.

    2014-01-01

    Background: The applicability and validity of many patient-reported outcome measures in the high-functioning population are not well understood. Purpose: To compare the psychometric properties of the modified Harris Hip Score (mHHS), the Hip Outcome Score activities of daily living subscale (HOS-ADL) and sports (HOS-sports), and the Lower Extremity Computerized Adaptive Test (LE CAT). The hypotheses was that all instruments would perform well but that the LE CAT would show superiority psychometrically because a combination of CAT and a large item bank allows for a high degree of measurement precision. Study Design: Cohort study (diagnosis); Level of evidence, 2. Methods: Data were collected from 472 advanced-age, active participants from the Huntsman World Senior Games in 2012. Validity evidences were examined through item fit, dimensionality, monotonicity, local independence, differential item functioning, person raw score to measure correlation, and instrument coverage (ie, ceiling and floor effects), and reliability evidences were examined through Cronbach alpha and person separation index. Results: All instruments demonstrated good item fit, unidimensionality, monotonicity, local independence, and person raw score to measure correlations. The HOS-ADL had high ceiling effects of 36.02%, and the mHHS had ceiling effects of 27.54%. The LE CAT had ceiling effects of 8.47%, and the HOS-sports had no ceiling effects. None of the instruments had any floor effects. The mHHS had a very low Cronbach alpha of 0.41 and an extremely low person separation index of 0.08. Reliabilities for the LE CAT were excellent and for the HOS-ADL and HOS-sports were good. Conclusion: The LE CAT showed better psychometric properties overall than the HOS-ADL, HOS-sports, and mHHS for the senior population. The mHHS demonstrated pronounced ceiling effects and poor reliabilities that should be of concern. The high ceiling effects for the HOS-ADL were also of concern. The LE CAT was superior

  6. Optimal scoring methods of hand-strength tests in patients with stroke.

    PubMed

    Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

    2011-06-01

    The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods were used to present the strength scores. The smallest real difference was used to provide information on the measurement error. The smallest real difference percentage was used to compare the effect on minimizing the error. Using mean score of tests for nonspastic patients carried out at least twice was found to be advisable to minimize measurement errors in the grip, palmar pinch, and lateral pinch tests. However, the use of hand-strength tests for patients with spasticity is limited because of the relatively high measurement errors.

  7. Reviews of the Tests Approved by the Secretary of Education for Ability To Benefit Admissions.

    ERIC Educational Resources Information Center

    Rudner, Lawrence M.

    To comply with the new U.S. Department of Education Ability-To-Benefit policy, schools need to select tests on the Secretary's approved list. The pertinent aspects of 22 approved tests are individually summarized. The test reviews are based on examinations of the test publishers' technical documentation and the tests. Information provided in the…

  8. Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

    ERIC Educational Resources Information Center

    Kim, Seonghoon

    2013-01-01

    With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…

  9. Determining When Single Scoring for Constructed-Response Items Is as Effective as Double Scoring in Mixed-Format Licensure Tests

    ERIC Educational Resources Information Center

    Kim, Sooyeon; Moses, Tim

    2013-01-01

    The major purpose of this study is to assess the conditions under which single scoring for constructed-response (CR) items is as effective as double scoring in the licensure testing context. We used both empirical datasets of five mixed-format licensure tests collected in actual operational settings and simulated datasets that allowed for the…

  10. Test Score Stability and the Relationship of Adult Manifest Anxiety Scale-College Version Scores to External Variables among Graduate Students

    ERIC Educational Resources Information Center

    Lowe, Patricia A.; Peyton, Vicki; Reynolds, Cecil R.

    2007-01-01

    A sample of 79 individuals participated in the present study to evaluate the test score stability (8-week test-retest interval) and construct validity of the scores of the Adult Manifest Anxiety Scale-College Version, a new measure used to assess anxiety in college students, for application to graduate-level students. Results of the study…

  11. Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

    ERIC Educational Resources Information Center

    Looney, Marilyn A.; Gilbert, Jennie

    2012-01-01

    The purpose of the study was to determine if currently used FITNESSGRAM[R] cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…

  12. Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

    PubMed

    Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

    2015-11-01

    In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. PMID:25867168

  13. A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

    ERIC Educational Resources Information Center

    Kamens, David H.

    2015-01-01

    This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…

  14. A Cross-sectional Study of Current Doctors’ Performance in a Modified Version of a Medical School Admission Aptitude Test

    PubMed Central

    Blackmur, James P.; Lone, Nazir I.; Stone, Oliver D.; Webb, David J.; Dhaun, Neeraj

    2016-01-01

    Abstract The 2-hour long United Kingdom Clinical Aptitude Test (UKCAT) is used by many universities in the United Kingdom as part of their selection process for undergraduate medical and dentistry degrees. We aimed to compare the performance of senior doctors in primary and secondary care and across a range of specialties, in a modified version of the medical school entrance examination—the mUKCAT. Lay people were also included in the study. Despite its widespread use, this is the first study that examines the performance of senior clinicians in the UKCAT. The study used a prospective cross-sectional design. It used mock questions from the UKCAT website to generate an mUKCAT that was anticipated to take 15 minutes to complete. In all, 167 doctors at consultant, general practitioner (GP), or specialty trainee grade and 26 lay people took part. The overall mean mUKCAT score of all participants was 2486 (69.1%). Of the total cohort, 126 (65.3%) scored above our designated threshold of 2368 and were deemed to have passed the mUKCAT. Excluding lay people, 113 (67.7%) of the 167 doctors scored above that threshold. Medical specialty was associated with overall score (P = 0.003), with anesthetists/intensive care physicians scoring highest (n = 20, mean score 2660) and GPs scoring lowest (n = 38, mean score 2302). Academics outperformed nonacademics (mean score of academics, n = 44 vs nonacademics, n = 123: 2750 vs 2406; P < 0.001). Those clinicians in senior management positions scored lower than those in “standard” roles (mean score of senior management, n = 31 vs standard roles, n = 136: 2332 vs 2534, mean difference 202, 95% confidence interval 67–337, P = 0.004). In the situational judgement section, there was no evidence that specialty was associated with score (P = 0.15). Academics exhibited greater situational judgement than their nonacademic colleagues (academics vs nonacademics: 69.8 vs 63.6%; P = 0.01). The

  15. Instructions for additional qualitative scoring of the initial-letter Word-association Test.

    PubMed

    Zivković, M

    1994-04-01

    An additional scoring method is based on grouping test-words according to whether the same sign is given by subjects to the test-words. In this way five test-word categories are formed, Eros (test-words with double plus signs), demi-Eros (single plus sign), demi-Thanatos (single minus), Thanatos (double minus), and Deviant (+/- and theta signs). The next step in scoring is to count the number of test-words in a given scoring category whose meanings do not conform. The greater the discrepancy between the test-word category and its meaning, the less well adapted is the subject. Several illustrative protocols are discussed. PMID:8022674

  16. Test Scores in New Castle County, DE.--Before and After Busing.

    ERIC Educational Resources Information Center

    D'Onofrio, William D., Comp.

    This analysis compares student test scores before and after school busing in New Castle County, Delaware, in an attempt to see if busing to achieve racial balance reduces the achievement gap between black and white students. School authorities pre-tested students with the California Achievement Test (CAT) in 1978-79, the first year of busing, and…

  17. The 5-Step Way to Raise Test Scores: Using the Data to Drive Your Instruction

    ERIC Educational Resources Information Center

    East, Pam C.

    2005-01-01

    Many teachers look at standardized tests as something to be dreaded. This author and teacher looks at standardized-test scores and sees a tool to bring students learning to new heights. This is a way for teachers to target instruction exactly where it's needed. A way to get students looking forward to end-of-the-year tests (really!) as a way to…

  18. Effects of Student Self-Corrective Measures on Learning and Standardized Test Scores

    ERIC Educational Resources Information Center

    Poplin, Beth D.

    2010-01-01

    This study examined whether students who graded and corrected their own test papers improved their learning and standardized test scores on the North Carolina end-of-course test in United States History. Four preexisting, intact classrooms of 11th grade United States History students in two different high schools formed the basis of this…

  19. The Relationship between Career Maturity Test Scores and Appropriateness of Career Choices: A Replication.

    ERIC Educational Resources Information Center

    Westbrook, Bert W.; And Others

    1990-01-01

    Attempted to replicate study determining relationship between appropriateness of career choices and career maturity test scores in rural ninth grade students (N=112) using Goal Selection scale of Career Maturity Inventory Competence Test and American College Testing Program Career Planning Program. Found two career maturity measures correlated…

  20. Demands on Users for Interpretation of Achievement Test Scores: Implications for the Evaluation Profession

    ERIC Educational Resources Information Center

    Della-Piana, Gabriel Mario; Gardner, Michael

    2011-01-01

    Background: Professional standards for validity of achievement tests have long reflected a consensus that validity is the degree to which evidence and theory support interpretations of test scores entailed by the intended uses of tests. Yet there are convincing lines of evidence that the standards are not adequately followed in practice, that…

  1. Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

    ERIC Educational Resources Information Center

    La Marca, Paul M.

    2006-01-01

    Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…

  2. How Does Emergency Department Crowding Affect Medical Student Test Scores and Clerkship Evaluations?

    PubMed Central

    Wei, Grant; Arya, Rajiv; Ritz, Z. Trevor; He, Albert S.; Ohman-Strickland, Pamela A.; McCoy, Jonathan V.

    2015-01-01

    Introduction The effect of emergency department (ED) crowding has been recognized as a concern for more than 20 years; its effect on productivity, medical errors, and patient satisfaction has been studied extensively. Little research has reviewed the effect of ED crowding on medical education. Prior studies that have considered this effect have shown no correlation between ED crowding and resident perception of quality of medical education. Objective To determine whether ED crowding, as measured by the National ED Overcrowding Scale (NEDOCS) score, has a quantifiable effect on medical student objective and subjective experiences during emergency medicine (EM) clerkship rotations. Methods We collected end-of-rotation examinations and medical student evaluations for 21 EM rotation blocks between July 2010 and May 2012, with a total of 211 students. NEDOCS scores were calculated for each corresponding period. Weighted regression analyses examined the correlation between components of the medical student evaluation, student test scores, and the NEDOCS score for each period. Results When all 21 rotations are included in the analysis, NEDOCS scores showed a negative correlation with medical student tests scores (regression coefficient= −0.16, p=0.04) and three elements of the rotation evaluation (attending teaching, communication, and systems-based practice; p<0.05). We excluded an outlying NEDOCS score from the analysis and obtained similar results. When the data were controlled for effect of month of the year, only student test score remained significantly correlated with NEDOCS score (p=0.011). No part of the medical student rotation evaluation attained significant correlation with the NEDOCS score (p≥0.34 in all cases). Conclusion ED overcrowding does demonstrate a small but negative association with medical student performance on end-of-rotation examinations. Additional studies are recommended to further evaluate this effect. PMID:26594289

  3. "Noncognitive" Measures: The Next Frontier in College Admissions

    ERIC Educational Resources Information Center

    Hoover, Eric

    2013-01-01

    The handyman has a tool for everything, but the admissions dean is not so lucky: He must make do with just a few. Every year, presidents and professors expect freshmen who are curious, determined, and hungry for challenges. The traditional metrics of merit, however, can't reveal such qualities. Standardized-test scores may or may not predict a…

  4. Plotting Admissions Trends in a Private, Independent University.

    ERIC Educational Resources Information Center

    Yost, Michael

    Recruiting and admissions trends at Trinity University during 1981-1984 are discussed, with attention to freshman applicants, tuition and room deposits, and Scholastic Aptitude Test (SAT) scores. Fluctuations in the rate of receipt of applications within a year seemed to be influenced most by deadlines. The rates at which applications were…

  5. Results of College Admissions Testing in Texas for 1994-95 Graduating Seniors.

    ERIC Educational Resources Information Center

    Texas Education Agency, Austin. Div. of Research and Evaluation.

    This annual report presents results from the College Board's new SAT I: Reasoning Test (Formerly the Scholastic Aptitude Test, or SAT) and the American College Testing Program's Enhanced ACT Assessment for 1994-1995 graduating seniors. The last decade showed some recovery over the previous decade's decline in both Texas and national SAT/SAT I…

  6. Equivalence of three score tests for association mapping of quantitative trait loci under selective genotyping.

    PubMed

    Tang, Yongqiang

    2010-07-01

    Huang and Lin ([2007] Am J Hum Genet 80:567-572) proposed a conditional-likelihood approach for mapping quantitative trait loci (QTL) under selective genotyping, and demonstrated via simulation that their model tends to be more powerful than the prospective linear regression. However, we show that the three score tests based on the conditional, prospective and retrospective likelihoods are numerically identical in testing association between a quantitative trait and a candidate locus. Two approximations are derived for calculating power and sample size for the score test. Compared to the random sampling, a single-tail selection generally reduces the power of the score test in mapping small effect QTLs. A two-tail selection generally enhances the QTL heritability; however, in small samples, the power of the test may actually decrease if the sample sizes are highly unbalanced in the upper and lower tails of the trait distribution. PMID:20552655

  7. Neighborhood Social Context and Individual Polycyclic Aromatic Hydrocarbon Exposures Associated with Child Cognitive Test Scores

    PubMed Central

    Eldred-Skemp, Nicolia; Quinn, James W.; Chang, Hsin-wen; Rauh, Virginia A.; Rundle, Andrew; Orjuela, Manuela A.; Perera, Frederica P.

    2013-01-01

    Childhood cognitive and test-taking abilities have long-term implications for educational achievement and health, and may be influenced by household environmental exposures and neighborhood contexts. This study evaluates whether age 5 scores on the Wechsler Preschool and Primary Scale of Intelligence-Revised (WPPSI-R, administered in English) are associated with polycyclic aromatic hydrocarbon (PAH) exposure and neighborhood context variables including poverty, low educational attainment, low English language proficiency, and inadequate plumbing. The Columbia Center for Children’s Environmental Health enrolled African-American and Dominican-American New York City women during pregnancy, and conducted follow-up for subsequent childhood health outcomes including cognitive test scores. Individual outcomes were linked to data characterizing 1-km network buffers around prenatal addresses, home observations, interviews, and prenatal PAH exposure data from personal air monitors. Prenatal PAH exposure above the median predicted 3.5 point lower total WPPSI-R scores and 3.9 point lower verbal scores; the association was similar in magnitude across models with adjustments for neighborhood characteristics. Neighborhood-level low English proficiency was independently associated with 2.3 point lower mean total WPPSI-R score, 1.2 point lower verbal score, and 2.7 point lower performance score per standard deviation. Low neighborhood-level educational attainment was also associated with 2.0 point lower performance scores. In models examining effect modification, neighborhood associations were similar or diminished among the high PAH exposure group, as compared with the low PAH exposure group. Early life exposure to personal PAH exposure or selected neighborhood-level social contexts may predict lower cognitive test scores. However, these results may reflect limited geographic exposure variation and limited generalizability. PMID:24994947

  8. Assessment of Communications-related Admissions Criteria in a Three-year Pharmacy Program

    PubMed Central

    Tejada, Frederick R.; Lang, Lynn A.; Purnell, Miriam; Acedera, Lisa; Ngonga, Ferdinand

    2015-01-01

    Objective. To determine if there is a correlation between TOEFL and other admissions criteria that assess communications skills (ie, PCAT variables: verbal, reading, essay, and composite), interview, and observational scores and to evaluate TOEFL and these admissions criteria as predictors of academic performance. Methods. Statistical analyses included two sample t tests, multiple regression and Pearson’s correlations for parametric variables, and Mann-Whitney U for nonparametric variables, which were conducted on the retrospective data of 162 students, 57 of whom were foreign-born. Results. The multiple regression model of the other admissions criteria on TOEFL was significant. There was no significant correlation between TOEFL scores and academic performance. However, significant correlations were found between the other admissions criteria and academic performance. Conclusion. Since TOEFL is not a significant predictor of either communication skills or academic success of foreign-born PharmD students in the program, it may be eliminated as an admissions criterion. PMID:26430273

  9. An electrophysiological correlate of Eating Attitudes Test scores in female college students.

    PubMed

    Wilson, J F; Mercer, J C

    1990-11-01

    Eating Attitudes Test (EAT) scores of forty female college students were compared to their electrodermal activity (EDA) responses when offered a plate of chocolate chip cookies. A significant positive correlation was detected between the EAT scores and the skin conductivity measures associated with the presentation of food. Women with the highest EAT scores also exhibited the greatest sympathetic nervous system responses to a plate of cookies. This finding supports the conclusion that the EAT is capable of identifying individuals who are preoccupied with food or anxious about eating.

  10. Use and interpretation of test scores from limited cognitive test batteries: how g + Gc can equal g.

    PubMed

    Annell, Stefan; Sjöberg, Anders; Sverke, Magnus

    2014-10-01

    Single scores from limited and unbalanced test batteries of cognitive ability can be ambiguous to interpret theoretically. In this study, a limited verbally and knowledge-loaded cognitive test battery, from applicants to the Swedish police academies (N = 1,344), was examined to provide foundations for the use and interpretation of test scores. Three measurement models were compared: one single factor model and two bifactor models, which decomposed the variance of the battery into orthogonal components. The models were evaluated by fit indices and omega coefficients, and then applied to the prediction of academic performance. The overall prediction of all models was similar, although specific abilities also were found to provide substantial predictive validity over and above general intelligence (g). The findings provide support for the use of single scores in applied settings (selection), but suggest that it may be more appropriate to interpret such scores as composites of substantive components, and not just as measures of g. PMID:25040205

  11. Grades--Scores--Predictions: A Study of the Efficiency of High School Grades and American College Test Scores in Predicting Academic Achievement at Montgomery College.

    ERIC Educational Resources Information Center

    Gell, Robert L.; Bleil, David F.

    This report analyzes the relationship between high school grades, American College Test (ACT) scores, and first-semester college grades. Based on the Standard Research Service of the ACT program, 1,379 students in the fall 1969 freshman class of Montgomery College (Maryland) were studied. Measures of academic background used ACT scores in English,…

  12. Stochastic Processes as True-Score Models for Highly Speeded Mental Tests.

    ERIC Educational Resources Information Center

    Moore, William E.

    The previous theoretical development of the Poisson process as a strong model for the true-score theory of mental tests is discussed, and additional theoretical properties of the model from the standpoint of individual examinees are developed. The paper introduces the Erlang process as a family of test theory models and shows in the context of…

  13. An Investigation of the Effectiveness of Vocabulary Learning Strategies on Iranian EFL Learners' Vocabulary Test Score

    ERIC Educational Resources Information Center

    Rahimy, Ramin; Shams, Kiana

    2012-01-01

    This study aims to investigate the effectiveness of vocabulary learning strategies on Iranian EFL learners' vocabulary test score. To achieve this aim, fifty Intermediate level students from Kish English Institute were randomly selected from among fifteen classes after administering the Oxford Placement Test (OPT). Then, an intermediate level…

  14. Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

    ERIC Educational Resources Information Center

    Green, Susan

    2016-01-01

    This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…

  15. The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

    ERIC Educational Resources Information Center

    Walstad, William B.; Wagner, Jamie

    2016-01-01

    This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…

  16. Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

    ERIC Educational Resources Information Center

    Han, Chao

    2016-01-01

    As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…

  17. Predicting Teacher Performance with Test Scores and Grade Point Average: A Meta-Analysis

    ERIC Educational Resources Information Center

    D'Agostino, Jerome V.; Powers, Sonya J.

    2009-01-01

    A meta-analysis was conducted to examine the degree to which teachers' test scores and their performance in preparation programs as measured by their collegiate grade point average (GPA) predicted their teaching competence. Results from 123 studies that yielded 715 effect sizes were analyzed, and the mediating effects of test and GPA type,…

  18. Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

    ERIC Educational Resources Information Center

    Lee, Guemin; Lee, Won-Chan

    2016-01-01

    The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

  19. Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

    ERIC Educational Resources Information Center

    Livingston, Samuel A.

    2014-01-01

    In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…

  20. A Defensible Model for Determining a Minimal Cut-Off Score for Criterion Referenced Tests.

    ERIC Educational Resources Information Center

    Bernknopf, Stan; And Others

    The effectiveness of a model for determining a minimal cut-off score for criterion-referenced tests was examined. The model, based upon techniques presented originally by Nedelsky and by Angoff, was first used in conjunction with a multiple choice test developed for use in certifying school counselors in Georgia. A "knowledge estimation panel" was…

  1. Investigation and Treatment of Missing Item Scores in Test and Questionnaire Data

    ERIC Educational Resources Information Center

    Sijtsma, Klaas; van der Ark, L. Andries

    2003-01-01

    This article first discusses a statistical test for investigating whether or not the pattern of missing scores in a respondent-by-item data matrix is random. Since this is an asymptotic test, we investigate whether it is useful in small but realistic sample sizes. Then, we discuss two known simple imputation methods, person mean (PM) and two-way…

  2. 40 CFR 89.609 - Final admission of modification nonroad engines and test nonroad engines.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... valid certificate of conformity for the same nonroad engine class and fuel type as the nonroad engine... section may be tested or inspected by EPA at any time during the recall period specified in §...

  3. Likelihood ratio and score burden tests for detecting disease-associated rare variants.

    PubMed

    Lee, Woojoo; Lee, Donghwan; Pawitan, Yudi

    2015-11-01

    This paper presents two simple rare variant (RV) burden tests based on the likelihood ratio test (LRT) and score statistics. LRT is one of the commonly used tests in practical data analysis, and we show here that there is no reason to ignore it in testing RV associations. With the Bartlett correction, we have numerically shown that the LRT-based test can have a reliable distribution. Our simulation study indicates that if the non-null variants are as common as the null variants, then the LRT and score statistics have comparable performance to the C-alpha test, and if the former is rarer than the null variants, then they outperform the C-alpha test. PMID:26426897

  4. Table of SAT and Achievement Test Scores for Samples of Candidates Taking Achievement Tests 1966-67 to 1975-76.

    ERIC Educational Resources Information Center

    Stern, June

    This table provides the Scholastic Aptitude Test (SAT) and the Achievement Test means and standard deviations for samples of candidates taking each of the 15 achievement tests offered in the Admissions Testing Program battery during the period 1966-67 to 1975-76. The samples, drawn annually through 1971-72 and biennially thereafter, provide…

  5. A Score Based on Screening Tests to Differentiate Mild Cognitive Impairment from Subjective Memory Complaints

    PubMed Central

    de Gobbi Porto, Fábio Henrique; Spíndola, Lívia; de Oliveira, Maira Okada; Figuerêdo do Vale, Patrícia Helena; Orsini, Marco; Nitrini, Ricardo; Dozzi Brucki, Sonia Maria

    2013-01-01

    It is not easy to differentiate patients with mild cognitive impairment (MCI) from subjective memory complainers (SMC). Assessments with screening cognitive tools are essential, particularly in primary care where most patients are seen. The objective of this study was to evaluate the diagnostic accuracy of screening cognitive tests and to propose a score derived from screening tests. Elderly subjects with memory complaints were evaluated using the Mini Mental State Examination (MMSE) and the Brief Cognitive Battery (BCB). We added two delayed recalls in the MMSE (a delayed recall and a late-delayed recall, LDR), and also a phonemic fluency test of letter P fluency (LPF). A score was created based on these tests. The diagnoses were made on the basis of clinical consensus and neuropsychological testing. Receiver operating characteristic curve analyses were used to determine area under the curve (AUC), the sensitivity and specificity for each test separately and for the final proposed score. MMSE, LDR, LPF and delayed recall of BCB scores reach statistically significant differences between groups (P=0.000, 0.03, 0.001 and 0.01, respectively). Sensitivity, specificity and AUC were MMSE: 64%, 79% and 0.75 (cut off <29); LDR: 56%, 62% and 0.62 (cut off <3); LPF: 71%, 71% and 0.71 (cut off <14); delayed recall of BCB: 56%, 82% and 0.68 (cut off <9). The proposed score reached a sensitivity of 88% and 76% and specificity of 62% and 75% for cut off over 1 and over 2, respectively. AUC were 0.81. In conclusion, a score created from screening tests is capable of discriminating MCI from SMC with moderate to good accurancy. PMID:24147213

  6. Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

    PubMed Central

    Hayes, Taylor R.; Petrov, Alexander A.; Sederberg, Per B.

    2014-01-01

    Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training—now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven’s Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future. PMID:25395695

  7. Predictive effects of teachers and schools on test scores, college attendance, and earnings.

    PubMed

    Chamberlain, Gary E

    2013-10-22

    I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference.

  8. Individual differences in left parietal white matter predict math scores on the Preliminary Scholastic Aptitude Test.

    PubMed

    Matejko, Anna A; Price, Gavin R; Mazzocco, Michèle M M; Ansari, Daniel

    2013-02-01

    Mathematical skills are of critical importance, both academically and in everyday life. Neuroimaging research has primarily focused on the relationship between mathematical skills and functional brain activity. Comparatively few studies have examined which white matter regions support mathematical abilities. The current study uses diffusion tensor imaging (DTI) to test whether individual differences in white matter predict performance on the math subtest of the Preliminary Scholastic Aptitude Test (PSAT). Grades 10 and 11 PSAT scores were obtained from 30 young adults (ages 17-18) with wide-ranging math achievement levels. Tract based spatial statistics was used to examine the correlation between PSAT math scores, fractional anisotropy (FA), radial diffusivity (RD) and axial diffusivity (AD). FA in left parietal white matter was positively correlated with math PSAT scores (specifically in the left superior longitudinal fasciculus, left superior corona radiata, and left corticospinal tract) after controlling for chronological age and same grade PSAT critical reading scores. Furthermore, RD, but not AD, was correlated with PSAT math scores in these white matter microstructures. The negative correlation with RD further suggests that participants with higher PSAT math scores have greater white matter integrity in this region. Individual differences in FA and RD may reflect variability in experience dependent plasticity over the course of learning and development. These results are the first to demonstrate that individual differences in white matter are associated with mathematical abilities on a nationally administered scholastic aptitude measure.

  9. Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study

    PubMed Central

    Strom, Suzanne L.; Anderson, Craig L.; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C. Eric; Langdorf, Mark I.

    2015-01-01

    Introduction Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. Objective To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. Methods We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. Results The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6–14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Conclusion Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation. PMID:26594288

  10. Tests of executive functioning predict scores on the MacAndrew Alcoholism Scale.

    PubMed

    Deckel, A W

    1999-02-01

    1. Previous work reported that tests of executive functioning (EF) predict the risk of alcoholism in subject populations selected for a "high density" of a family history of alcoholism and/or the presence of sociopathic traits. The current experiment examined the ability of EF tests to predict the risk of alcoholism, as measured by the MacAndrew Alcoholism Scale (MAC), in outpatient subjects referred to a general neuropsychological testing service. 2. Sixty-eight male and female subjects referred for neuropsychological testing were assessed for their past drinking histories and administered the Wisconsin Card Sorting Test, the Wechsler Adult Intelligence Scale-Revised, the Trails (Part B) Test, and the MAC. Principal Components analysis (PCA) reduced the number of EF tests to two measures, including one that loaded on the WCST, and one that loaded on the Similarities, Picture Arrangement, and Trails tests. Multiple hierarchical regression first removed the variance from demographic variables, alcohol consumption, and verbal (i.e., Vocabulary) and non-verbal (i.e., Block Design) IQ, and then entered the executive functioning factors into the prediction of the MAC. 3. Seventy-six percent of the subjects were classified as either light, infrequent, or non-drinkers on the Quantity-Frequency-Variability scale. The factor derived from the WCST on PCA significantly added to the prediction of risk on the MAC (p = .0063), as did scores on Block Design (p = .033). Relatively more impaired scores on the WCST factor and Block Design were predictive of higher scores on the MAC. The other factors were not associated with MAC scores. 4. These results support the hypothesis that decrements in EF are associated with risk factors for alcoholism, even in populations where the density of alcoholic behaviors are not unusually high. When taken in conjunction with other findings, these results implicate EF test scores, and prefrontal brain functioning, in the neurobiology of the risk for

  11. Does Emotional Intelligence at Medical School Admission Predict Future Academic Performance?

    PubMed Central

    Leddy, John J.; Wood, Timothy J.; Puddester, Derek; Moineau, Geneviève

    2014-01-01

    Purpose Medical school admissions committees are increasingly considering noncognitive measures like emotional intelligence (EI) in evaluating potential applicants. This study explored whether scores on an EI abilities test at admissions predicted future academic performance in medical school to determine whether EI could be used in making admissions decisions. Method The authors invited all University of Ottawa medical school applicants offered an interview in 2006 and 2007 to complete the Mayer–Salovey–Caruso EI Test (MSCEIT) at the time of their interview (105 and 101, respectively), then again at matriculation (120 and 106, respectively). To determine predictive validity, they correlated MSCEIT scores to scores on written examinations and objective structured clinical examinations (OSCEs) administered during the four-year program. They also correlated MSCEIT scores to the number of nominations for excellence in clinical performance and failures recorded over the four years. Results The authors found no significant correlations between MSCEIT scores and written examination scores or number of failures. The correlations between MSCEIT scores and total OSCE scores ranged from 0.01 to 0.35; only MSCEIT scores at matriculation and OSCE year 4 scores for the 2007 cohort were significantly correlated. Correlations between MSCEIT scores and clinical nominations were low (range 0.12–0.28); only the correlation between MSCEIT scores at matriculation and number of clinical nominations for the 2007 cohort were statistically significant. Conclusions EI, as measured by an abilities test at admissions, does not appear to reliably predict future academic performance. Future studies should define the role of EI in admissions decisions. PMID:24556771

  12. Impact of a standardized test package on exit examination scores and NCLEX-RN outcomes.

    PubMed

    Homard, Catherine M

    2013-03-01

    The purpose of this ex post facto correlational study was to compare exit examination scores and NCLEX-RN(®) pass rates of baccalaureate nursing students who differed in level of participation in a standardized test package. Three cohort groups emerged as a standardized test package was introduced: (a) students who did not participate in a standardized test package; (b) students with two semesters of a standardized test package; and (c) students with four semesters of a standardized test package. Benner's novice-to-expert theory framed the study in the belief that students best acquire knowledge and skills through practice and reflection. Students participating in four semesters of a standardized test package demonstrated higher exit examination scores and NCLEX-RN pass rates compared with students who did not participate in this package. This study's results could inform nurse educators about strategies to facilitate nursing student success on exit examinations and the NCLEX-RN.

  13. Potential Utility of Non-Cognitive Constructs in Graduate Admissions

    NASA Astrophysics Data System (ADS)

    Miller, Casey

    2015-03-01

    It is becoming clear that the methods employed by many graduate admissions committees need updating. Regarding outcomes, we cannot select students that will actually graduate much better than would a coin toss. Further, the GRE is often misused. For example, the most recent GRE general test data (2006-2007) shows that for US citizens in the physical sciences, a cut-off score of ~64th percentile (700/155 on old/new test) would eliminate from eligibility: 63% of women vs 42% of men; 76% of all under-represented minorities vs 38% of Asian and 47% of White applicants. Fortunately, Organizational Psychologists have identified and validated several ``non-cognitive constructs'' for admissions: aspects of personality (conscientiousness); and self-management factors. Some intriguing facts about these parameters: they are measurable with the help of social scientists; they do not show race/ethnicity/gender performance differences; they are orthogonal to cognitive metrics measured by GPA and tests scores. These are proven to enhance both validity and diversity in admissions. My goals for this talk are to overview the non-cognitive constructs with the most potential for being used in physics graduate admissions, and to suggest example admissions protocols. Supported by the National Science Foundation.

  14. The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

    ERIC Educational Resources Information Center

    Ockey, Gary J.

    2009-01-01

    The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

  15. The Effect of School Poverty on Racial Gaps in Tests Scores: The Case of the Minnesota Basic Standards Tests

    ERIC Educational Resources Information Center

    Myers, Samuel L.; Kim, Hyeoneui; Mandala, Cheryl

    2004-01-01

    A data from 1996,1998 and 1999 Minnesota comprehensive statewide testing on eight graders is used to analyze whether African American students perform worse than the white students who attend the poverty schools. The analyses conclude that African American-White test score gap is attributed more to the racial discriminations and racial treatments…

  16. Graduate Management Admission Test Outcomes and the Academic Achievement: A Study on Masters of Business Administration Students at Makerere University, Uganda

    ERIC Educational Resources Information Center

    Wamala, Robert; Kizito, Saint Omala; Kakumba, Umar

    2012-01-01

    This study investigates whether the outcomes of the Graduate Management Admission Test (GMAT) can predict the academic achievement of enrollees in masters programs. The study is based on administrative data of 516 Masters of Business Administration (MBA) enrollees at the College of Business and Management Science, Makerere University in the 2011…

  17. The Black-White Scoring Gap on SAT II Achievement Tests: Some of the News Is Cheering.

    ERIC Educational Resources Information Center

    Journal of Blacks in Higher Education, 2003

    2003-01-01

    Academically accomplished applicants to the nation's top colleges usually take SAT II Achievement Tests. While scoring gaps between college-bound Blacks and Whites on these tests tend to be smaller than gaps on the basic SAT, a racial scoring gap persists. However, black students appear to be making progress in closing the racial scoring gap on…

  18. Relationships between Scores of Gifted Children on the Stanford-Binet IV and Woodcock-Johnson Tests of Achievement.

    ERIC Educational Resources Information Center

    Carvajal, Howard; And Others

    1989-01-01

    Forty-five gifted children, ages 11-17, were tested with the Stanford-Binet Intelligence Scale and the Woodcock-Johnson Tests of Achievement. Results indicated 18 of 20 correlations between the area and composite scores were significant. The Stanford-Binet Short-Term Memory standard age score mean was lower than other scores' means. (Author/JDD)

  19. Linear score tests for variance components in linear mixed models and applications to genetic association studies.

    PubMed

    Qu, Long; Guennel, Tobias; Marshall, Scott L

    2013-12-01

    Following the rapid development of genome-scale genotyping technologies, genetic association mapping has become a popular tool to detect genomic regions responsible for certain (disease) phenotypes, especially in early-phase pharmacogenomic studies with limited sample size. In response to such applications, a good association test needs to be (1) applicable to a wide range of possible genetic models, including, but not limited to, the presence of gene-by-environment or gene-by-gene interactions and non-linearity of a group of marker effects, (2) accurate in small samples, fast to compute on the genomic scale, and amenable to large scale multiple testing corrections, and (3) reasonably powerful to locate causal genomic regions. The kernel machine method represented in linear mixed models provides a viable solution by transforming the problem into testing the nullity of variance components. In this study, we consider score-based tests by choosing a statistic linear in the score function. When the model under the null hypothesis has only one error variance parameter, our test is exact in finite samples. When the null model has more than one variance parameter, we develop a new moment-based approximation that performs well in simulations. Through simulations and analysis of real data, we demonstrate that the new test possesses most of the aforementioned characteristics, especially when compared to existing quadratic score tests or restricted likelihood ratio tests. PMID:24328714

  20. California and the SAT: A Reanalysis of University of California Admissions Data. Research & Occasional Paper Series: CSHE.8.04

    ERIC Educational Resources Information Center

    Zwick, Rebecca; Brown, Terran; Sklar, Jeffrey C.

    2004-01-01

    As part of the University of California's recent reconsideration of the role of the SAT in admissions, the UC Office of the President published an extensive report, "UC and the SAT" (2001), which examined the value of SAT I Reasoning Test scores, SAT II Subject Test scores, and high school grades in predicting the grade-point averages of UC…

  1. Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

    PubMed Central

    2015-01-01

    Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points per each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643

  2. The Relationship between Computer Use and Standardized Test Scores: Does Gender Play a Role?

    ERIC Educational Resources Information Center

    Kay, Rachel E.

    2010-01-01

    Over the past few decades, and especially in the past ten years, computer use in schools has increased dramatically; however there has been little research examining the effects of technology use on student achievement, specifically defined by standardized test scores. There is also concern as to how technology use differs by gender and if that…

  3. Relationship of Friends, Physical Education, and State Test Scores: Implications for School Counselors

    ERIC Educational Resources Information Center

    Hollingsworth, Mary Ann

    2010-01-01

    This study examined the relationship between dimensions of wellness and academic performance for 634 third through fifth grade students in Title One schools in rural Mississippi, using composites of the Five Factor Wellness Inventory for Elementary Children and Reading, Language, and Math Scores of the Mississippi Curriculum Test (a state level…

  4. Florida Defeats the Skeptics: Test Scores Show Genuine Progress in the Sunshine State

    ERIC Educational Resources Information Center

    Winters, Marcus

    2012-01-01

    Among the 50 states, Florida's gains on the National Assessment of Educational Progress (NAEP) between 1992 and 2011 ranked second only to Maryland's. Florida's progress has been particularly impressive in the early grades. In 1998, Florida scored about one grade level below the national average on the 4th-grade NAEP reading test, but it was…

  5. Comparing State and District Test Results to National Norms: Interpretations of Scoring "Above the National Average."

    ERIC Educational Resources Information Center

    Linn, Robert L.; And Others

    Norm-referenced test results reported by states and school districts and factors related to those scores were studied through mail and telephone surveys of 35 states and a nationally representative sample of 153 school districts to determine the degree to which "above average" results were being reported. Part of the stimulus for this study came…

  6. End of Course Grades and Standardized Test Scores: Are Grades Predictive of Student Achievement?

    ERIC Educational Resources Information Center

    Ricketts, Christine R.

    2010-01-01

    This study examined the extent to which end-of-course grades are predictive of Virginia Standards of Learning test scores in nine high school content areas. It also analyzed the impact of the variables school cluster attended, gender, ethnicity, disability status, Limited English Proficiency status, and socioeconomic status on the relationship…

  7. Response to "What Do Klein et al. Tell Us about Test Scores in Texas?"

    ERIC Educational Resources Information Center

    Klein, Stephen P.; Hamilton, Laura S.; McCaffrey, Daniel F.; Stecher, Brian M.

    2005-01-01

    The authors reviewed the article "What Do Klein et al. Tell Us About Test Scores in Texas?" by Toenjes. A summary of their responses is presented. First, Toenjes incorrectly describes the focus of the authors' study. Second, Toenjes appears to have misunderstood the purpose of their 20-schools analysis. Third, Toenjes misunderstands the…

  8. A Comparative Analysis of Career Development Test Scores of Eighth Grade Boys and Girls.

    ERIC Educational Resources Information Center

    McTeer, J. Hugh; And Others

    Eighth grade students' knowledge of the principles involved in career choice was investigated to determine whether sex stereotyped attitudes about jobs were present. Scores from the career development section of the Georgia Criterion-Referenced Tests were analyzed for 734 (379 females; 355 males) eighth grade students. The career development…

  9. Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

    ERIC Educational Resources Information Center

    McEnroe, James D.

    2010-01-01

    The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…

  10. Fitting the Normal-Ogive Factor Analytic Model to Scores on Tests.

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Lorenzo-Seva, Urbano

    2001-01-01

    Describes how the nonlinear factor analytic approach of R. McDonald to the normal ogive curve can be used to factor analyze test scores. Discusses the conditions in which this model is more appropriate than the linear model and illustrates the applicability of both models using an empirical example based on data from 1,769 adolescents who took the…

  11. Defending the Quality of Links between Scores from Different Tests and Exams

    ERIC Educational Resources Information Center

    Cresswell, Mike

    2010-01-01

    Paul Newton (2010), with his characteristic concern about theory, has set out two different ways of thinking about the basis upon which equivalences of one sort or another are established between test score scales. His reason for doing this is a desire to establish "the defensibility of linkages lower on the continuum than concordance." His…

  12. A Confirmatory Factor Analysis of Cross Racial Identity Scale (CRIS) Scores: Testing the Expanded Nigrescence Model

    ERIC Educational Resources Information Center

    Worrell, Frank C.; Watson, Stevie

    2008-01-01

    In this study, the authors tested the viability of the expanded nigrescence (NT-E) model as operationalized by Cross Racial Identity Scale (CRIS) scores using confirmatory factor analyses. Participants were 594 Black college students from the Southeastern United States. Results indicated a good fit for NT-E's proposed six-factor structure.…

  13. Intelligence Test Scores and Birth Order among Young Norwegian Men (Conscripts) Analyzed within and between Families

    ERIC Educational Resources Information Center

    Bjerkedal, Tor; Kristensen, Petter; Skjeret, Geir A.; Brevik, John I.

    2007-01-01

    The present paper reports the results of a within and between family analysis of the relation between birth order and intelligence. The material comprises more than a quarter of a million test scores for intellectual performance of Norwegian male conscripts recorded during 1984-2004. Conscripts, mostly 18-19 years of age, were born to women for…

  14. Impact of School Reform on Dropout Rates and Test Scores in an Urban High School

    ERIC Educational Resources Information Center

    Bartlett, Lydia; Kupczynski, Lori; Holland, Glenda

    2011-01-01

    The primary focus of the study was to explore the effect of School Within a School reform on two components in the Adequate Yearly Progress Report by which Texas schools are measured--dropout rates and scores on the Texas Assessment of Knowledge and Skills test. The demographics of the particular study sample included 11th grade students…

  15. Permanent Income and the Black-White Test Score Gap. NBER Working Paper No. 17610

    ERIC Educational Resources Information Center

    Rothstein, Jesse; Wozny, Nathan

    2011-01-01

    Analysts often examine the black-white test score gap conditional on family income. Typically only a current income measure is available. We argue that the gap conditional on permanent income is of greater interest, and we describe a method for identifying this gap using an auxiliary data set to estimate the relationship between current and…

  16. Changes in Student Populations and Average Test Scores of Dutch Primary Schools

    ERIC Educational Resources Information Center

    Luyten, Hans; de Wolf, Inge

    2011-01-01

    This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…

  17. Segregation and the Black-White Test Score Gap. NBER Working Paper No. 12988

    ERIC Educational Resources Information Center

    Vigdor, Jacob; Ludwig, Jens

    2007-01-01

    The mid-1980s witnessed breaks in two important trends related to race and schooling. School segregation, which had been declining, began a period of relative stasis. Black-white test score gaps, which had also been declining, also stagnated. The notion that these two phenomena may be related is also supported by basic cross-sectional evidence. We…

  18. Detecting Dissimulation in Personality Test Scores: A Comparison between Person-Fit Indices and Detection Scales.

    ERIC Educational Resources Information Center

    Ferrando, Pere J.; Chico, Eliseo

    2001-01-01

    Examined whether a procedure based on item response theory (IRT) for assessing the scalability of response patterns could detect deliberate dissimulation (faking good) on scores from three tests of the Eysenck Personality Questionnaire Revised. Results for 489 and 140 undergraduates show that IRT measures were not powerful enough to detect…

  19. The Effect of Four Intervention Programs on Standardized Test Scores by Gender

    ERIC Educational Resources Information Center

    Cryder, Rebecca E.

    2012-01-01

    This quantitative correlational study involved the analysis, by gender, of the effect of four intervention programs at an Arizona middle school as seen on Arizona's Instrument to Measure Standards (AIMS) test scores. These four intervention programs included: Advancement Via Individual Determination (AVID), a planner stamping system, a World…

  20. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

    ERIC Educational Resources Information Center

    Almond, Russell G.

    2014-01-01

    Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…

  1. Estimating Teacher Effectiveness from Two-Year Changes in Students' Test Scores

    ERIC Educational Resources Information Center

    Leigh, Andrew

    2010-01-01

    Using a dataset covering over 10,000 Australian school teachers and over 90,000 pupils, I estimate how effective teachers are in raising students' test scores. Since the exams are biennial, it is necessary to take account of the teacher's work in the intervening year. Even adjusting for measurement error, the teacher fixed effects are widely…

  2. Integrating GIS in the Middle School Curriculum: Impacts on Diverse Students' Standardized Test Scores

    ERIC Educational Resources Information Center

    Goldstein, Donna; Alibrandi, Marsha

    2013-01-01

    This case study conducted with 1,425 middle school students in Palm Beach County, Florida, included a treatment group receiving GIS instruction (256) and a control group without GIS instruction (1,169). Quantitative analyses on standardized test scores indicated that inclusion of GIS in middle school curriculum had a significant effect on student…

  3. Supplemental Educational Services and Student Test Score Gains: Evidence from a Large, Urban School District

    ERIC Educational Resources Information Center

    Springer, Matthew G.; Pepper, Matthew J.; Ghosh-Dastidar, Bonnie

    2014-01-01

    This study examines the effect of supplemental education services (SES) on student test score gains and whether particular subgroups of students benefit more from NCLB tutoring services. Our sample includes information on students enrolled in third through eighth grades nested in 121 elementary and middle schools over a five-year period comprising…

  4. Secular Declines in Cognitive Test Scores: A Reversal of the Flynn Effect

    ERIC Educational Resources Information Center

    Teasdale, Thomas W.; Owen, David R.

    2008-01-01

    Scores on cognitive tests have been very widely reported to have increased through the decades of the last century, a generational phenomenon termed the "Flynn Effect" since it was most comprehensively documented by James Flynn in the 1980's. There has, however, been very little evidence concerning any continuity of the effect specifically into…

  5. Estimated Effect of the Teacher Advancement Program on Student Test Score Gains

    ERIC Educational Resources Information Center

    Springer, Matthew G.; Ballou, Dale; Peng, Art

    2014-01-01

    This article presents findings from the first independent, third-party appraisal of the impact of the Teacher Advancement Program (TAP) on student test score gains in mathematics. TAP is a comprehensive school reform model designed to attract highly effective teachers, improve instructional effectiveness, and elevate student achievement. We use a…

  6. California Standards Test Scores and Attendance Rates in an Afterschool Program

    ERIC Educational Resources Information Center

    Diamond, Sandra M.

    2013-01-01

    The Problem: The purpose of this study was to investigate whether or not there were any statistically significant differences in the Mathematics California Standard Test scores and attendance rates for African American and Latina high school girls who participated in an afterschool program. Method: A quasi-experimental design was conducted with…

  7. Identifying Local Dependence with a Score Test Statistic Based on the Bifactor Logistic Model

    ERIC Educational Resources Information Center

    Liu, Yang; Thissen, David

    2012-01-01

    Local dependence (LD) refers to the violation of the local independence assumption of most item response models. Statistics that indicate LD between a pair of items on a test or questionnaire that is being fitted with an item response model can play a useful diagnostic role in applications of item response theory. In this article, a new score test…

  8. Recent Trends in Mean Scores and Characteristics of Test-Takers on "Praxis II" Licensure Tests

    ERIC Educational Resources Information Center

    Gitomer, Drew H.; Qi, Yi

    2010-01-01

    This study concerns the "highly qualified teacher" provisions of the "Elementary and Secondary Education Act" ("ESEA," 2002), as reauthorized, and other policies at the federal, state and local levels, which have aimed to elevate the content knowledge of teachers. This examination of "Praxis II" score trends was not meant to serve as an evaluation…

  9. An NCME Instructional Module on Quality Control Procedures in the Scoring, Equating, and Reporting of Test Scores

    ERIC Educational Resources Information Center

    Allalouf, Avi

    2007-01-01

    There is significant potential for error in long production processes that consist of sequential stages, each of which is heavily dependent on the previous stage, such as the SER (Scoring, Equating, and Reporting) process. Quality control procedures are required in order to monitor this process and to reduce the number of mistakes to a minimum. In…

  10. Medical devices; ovarian adnexal mass assessment score test system; labeling; black box restrictions. Final rule.

    PubMed

    2011-12-30

    The Food and Drug Administration (FDA) is amending the regulation classifying ovarian adnexal mass assessment score test systems to restrict these devices so that a prescribed warning statement that addresses a risk identified in the special controls guidance document must be in a black box and must appear in all labeling, advertising, and promotional material. The black box warning mitigates the risk to health associated with off-label use as a screening test, stand-alone diagnostic test, or as a test to determine whether or not to proceed with surgery.

  11. The comparison question polygraph test: a contrast of methods and scoring.

    PubMed

    Honts, Charles R; Reavy, Racheal

    2015-05-01

    We conducted a mock crime experiment with 250 paid participants (126 females, Mdn age = 30 years) contrasting the validity of the probable-lie and the directed-lie variants of the comparison question test (CQT) for the detection of deception. Subjects were assigned at random to one of eight conditions in a Guilt (Guilty/Innocent) × Test Type (Probable-Lie/Directed-Lie) × Stimulation (Between Repetition Stimulation/No Stimulation) factorial design. The data were scored by an experienced polygraph examiner who was unaware of subject assignment to conditions and with a computer algorithm known as the Objective Scoring System Version 2 (OSS2). There were substantial main effects of guilt in both the OSS2 computer scores F(1, 241) = 143.82, p < .001, η(p)(2) = 0.371, and in the human scoring, F(1, 242) = 98.92, p<.001, η(p)(2) = .29. There were no differences between the test types in the number of spontaneous countermeasure attempts made against them. Although under the controlled conditions of an experiment the probable-lie and the directed-lie variants of the CQT produced equivocal results in terms of detection accuracy, the directed-lie variant has much to recommend it as it is inherently more standardized in its administration and construction. PMID:25703188

  12. The comparison question polygraph test: a contrast of methods and scoring.

    PubMed

    Honts, Charles R; Reavy, Racheal

    2015-05-01

    We conducted a mock crime experiment with 250 paid participants (126 females, Mdn age = 30 years) contrasting the validity of the probable-lie and the directed-lie variants of the comparison question test (CQT) for the detection of deception. Subjects were assigned at random to one of eight conditions in a Guilt (Guilty/Innocent) × Test Type (Probable-Lie/Directed-Lie) × Stimulation (Between Repetition Stimulation/No Stimulation) factorial design. The data were scored by an experienced polygraph examiner who was unaware of subject assignment to conditions and with a computer algorithm known as the Objective Scoring System Version 2 (OSS2). There were substantial main effects of guilt in both the OSS2 computer scores F(1, 241) = 143.82, p < .001, η(p)(2) = 0.371, and in the human scoring, F(1, 242) = 98.92, p<.001, η(p)(2) = .29. There were no differences between the test types in the number of spontaneous countermeasure attempts made against them. Although under the controlled conditions of an experiment the probable-lie and the directed-lie variants of the CQT produced equivocal results in terms of detection accuracy, the directed-lie variant has much to recommend it as it is inherently more standardized in its administration and construction.

  13. Talent Search Qualifying: Comparisons between Talent Search Students Qualifying via Scores on Standardized Tests and via Parent Nomination

    ERIC Educational Resources Information Center

    Lee, Seon-Young; Olszewski-Kubilius, Paula

    2006-01-01

    This study examined differences between students who qualified for talent search testing via scores on standardized tests and via parent nomination in their performances on the SAT or ACT and some demographic characteristics. Overall, the standardized testing group earned higher scores on the off-level tests than the parent nominated group. Asian…

  14. Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

    ERIC Educational Resources Information Center

    Jacob, Brian A.

    2016-01-01

    Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…

  15. Agreement in the Scoring of Respiratory Events Among International Sleep Centers for Home Sleep Testing

    PubMed Central

    Magalang, Ulysses J.; Arnardottir, Erna S.; Chen, Ning-Hung; Cistulli, Peter A.; Gíslason, Thorarinn; Lim, Diane; Penzel, Thomas; Schwab, Richard; Tufik, Sergio; Pack, Allan I.

    2016-01-01

    Study Objectives: Home sleep testing (HST) is used worldwide to confirm the presence of obstructive sleep apnea (OSA). We sought to determine the agreement of HST scoring among international sleep centers. Methods: Fifteen HSTs, previously recorded using a type 3 monitor, were deidentified and saved in European Data Format. The studies were scored by nine technologists from the sleep centers of the Sleep Apnea Global Interdisciplinary Consortium (SAGIC) using the locally available software. Each study was scored separately using one of three different airflow signals: nasal pressure (NP), transformed (square root) nasal pressure signal (transformed NP), and uncalibrated respiratory inductive plethysmography (RIP) flow. Only one of the three airflow signals was visible to the scorer at each scoring session. The scoring procedure was repeated to determine the intrarater reliability. Results: The intraclass correlation coefficients (ICCs) using the NP were: apnea-hypopnea index (AHI) = 0.96 (95% confidence interval [CI]: 0.93–0.99); apnea index = 0.91 (0.83–0.96); and hypopnea index = 0.75 (0.59–0.89). The ICCs using the transformed NP were: AHI = 0.98 (0.96–0.99); apnea index = 0.95 (0.90–0.98); and hypopnea index = 0.90 (0.82–0.96). The ICCs using the RIP flow were: AH I = 0.98 (0.96–0.99); apnea index = 0.66 (0.48–0.84); and hypopnea index = 0.78 (0.63–0.90). The mean difference of first and second scoring sessions of the same respiratory variables ranged from −1.02 to 0.75/h. Conclusion: There is a strong agreement in the scoring of the respiratory events for HST among international sleep centers. Our results suggest that centralized scoring of HSTs may not be necessary in future research collaboration among international sites. Commentary: A commentary on this article appears in this issue on page 7. Citation: Magalang UJ, Arnardottir ES, Chen NH, Cistulli PA, Gíslason T, Lim D, Penzel T, Schwab R, Tufik S, Pack AI, SAGIC Investigators

  16. The Relationship between Scores on the GED Writing Skills Test and on Direct Measures of Writing. GED Testing Service Research Studies, Number 6.

    ERIC Educational Resources Information Center

    Swartz, Richard; Whitney, Douglas R.

    The primary purpose of this study was to examine the relationship between scores on the multiple-choice General Educational Development (GED) Writing Skills test and scores on holistically graded essays. Secondary purposes included the following: (1) examining the relationship of essay scores to scores on the multiple-choice GED Reading Skills…

  17. Pose prediction and virtual screening performance of GOLD scoring functions in a standardized test.

    PubMed

    Liebeschuetz, John W; Cole, Jason C; Korb, Oliver

    2012-06-01

    The performance of all four GOLD scoring functions has been evaluated for pose prediction and virtual screening under the standardized conditions of the comparative docking and scoring experiment reported in this Edition. Excellent pose prediction and good virtual screening performance was demonstrated using unmodified protein models and default parameter settings. The best performing scoring function for both pose prediction and virtual screening was demonstrated to be the recently introduced scoring function ChemPLP. We conclude that existing docking programs already perform close to optimally in the cognate pose prediction experiments currently carried out and that more stringent pose prediction tests should be used in the future. These should employ cross-docking sets. Evaluation of virtual screening performance remains problematic and much remains to be done to improve the usefulness of publically available active and decoy sets for virtual screening. Finally we suggest that, for certain target/scoring function combinations, good enrichment may sometimes be a consequence of 2D property recognition rather than a modelling of the correct 3D interactions.

  18. Pose prediction and virtual screening performance of GOLD scoring functions in a standardized test.

    PubMed

    Liebeschuetz, John W; Cole, Jason C; Korb, Oliver

    2012-06-01

    The performance of all four GOLD scoring functions has been evaluated for pose prediction and virtual screening under the standardized conditions of the comparative docking and scoring experiment reported in this Edition. Excellent pose prediction and good virtual screening performance was demonstrated using unmodified protein models and default parameter settings. The best performing scoring function for both pose prediction and virtual screening was demonstrated to be the recently introduced scoring function ChemPLP. We conclude that existing docking programs already perform close to optimally in the cognate pose prediction experiments currently carried out and that more stringent pose prediction tests should be used in the future. These should employ cross-docking sets. Evaluation of virtual screening performance remains problematic and much remains to be done to improve the usefulness of publically available active and decoy sets for virtual screening. Finally we suggest that, for certain target/scoring function combinations, good enrichment may sometimes be a consequence of 2D property recognition rather than a modelling of the correct 3D interactions. PMID:22371207

  19. The Use of One-, Two-, and Three-Parameter and Nominal Item Response Scoring in Place of Number-Right Scoring in the Presence of Test-Wiseness

    ERIC Educational Resources Information Center

    Tomkowicz, Joanna; Rogers, W. Todd

    2005-01-01

    Ability estimates yielded by the one- (1PL), two- (2PL), and three-parameter (3PL) models and the nominal response model (NRM) were compared with the number-right (NR) scoring model using items not susceptible to test-wiseness (NTW) and items susceptible to the ID1 test-wiseness strategy. These items were contained in grade 12 diploma examinations…

  20. Relation between perceived driving disability and scores of vision screening tests

    PubMed Central

    van Rijn, L J; Wilhelm, H; Emesz, M; Kaper, R; Heine, S; Nitsch, S; Grabner, G; Völker-Dieben, H J

    2002-01-01

    Aim: To determine the relation between perceived driving disability and vision screening tests. Methods: 93 subjects, aged 50 years and over, with binocular visual acuity of at least 20/80. Perceived driving disability (PDD) was assessed by a questionnaire. Subtracting daytime from night-time driving question scores revealed PDD at night (PDDN), subtracting scores of questions for driving in familiar places from those in unfamiliar places revealed PDD at unfamiliar places (PDDU). Results: PDD was strongly related to visual acuity, contrast sensitivity and useful field of view (UFOV). Specific relations existed between PDDN and Nyktotests and Mesotests and between PDDU and UFOV. These associations were enhanced in a subset of subjects with better visual acuities. Conclusions: Vision screening tests correlate well with perceived driving disabilities, especially when a subtraction method is used in the questionnaire to reveal condition dependent disabilities. Additional tests for visual acuity are useful, especially in subjects with better visual acuity. PMID:12386085

  1. Spatial and verbal memory test scores following yoga and fine arts camps for school children.

    PubMed

    Manjunath, N K; Telles, Shirley

    2004-07-01

    The performance scores of children (aged 11 to 16 years) in verbal and spatial memory tests were compared for two groups (n = 30, each), one attending a yoga camp and the other a fine arts camp. Both groups were assessed on the memory tasks initially and after ten days of their respective interventions. A control group (n = 30) was similarly studied to assess the test-retest effect. At the final assessment the yoga group showed a significant increase of 43% in spatial memory scores (Multivariate analysis, Tukey test), while the fine arts and control groups showed no change. The results suggest that yoga practice, including physical postures, yoga breathing, meditation and guided relaxation improved delayed recall of spatial information.

  2. Correcting Two-Sample "z" and "t" Tests for Correlation: An Alternative to One-Sample Tests on Difference Scores

    ERIC Educational Resources Information Center

    Zimmerman, Donald W.

    2012-01-01

    In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student "t" test on difference scores. That procedure entails some loss of power, because it employs N - 1 degrees of freedom instead of the 2N - 2 degrees of freedom of the…

  3. Interrater Reliability of the Original and a Revised Scoring System for the Developmental Test of Visual-Motor Integration.

    ERIC Educational Resources Information Center

    Lepkin, Sheila Ratsch; Pryzwansky, Walter B.

    1983-01-01

    Investigated the interrater reliability of teachers' and school psychology externs' scoring of protocols for the Developmental Test of Visual-Motor Integration (VMI), using a revised scoring system. Results showed high reliability coefficients for all raters, regardless of the scoring system employed. The influence of rater training is discussed.…

  4. The Effect of Luck and Misinformation on the Discrepancy between Multiple-Choice Test Scores and True Ability.

    ERIC Educational Resources Information Center

    Lowry, Stephen R.

    The effects of luck and misinformation on ability of multiple-choice test scores to estimate examinee ability were investigated. Two measures of examinee ability were defined. Misinformation was shown to have little effect on ability of raw scores and a substantial effect on ability of corrected-for-guessing scores to estimate examinee ability.…

  5. New scores for the Category Test: measures of interference for subtests 5 and 6.

    PubMed

    Webster, Jeffrey S; Lopez, Michael N

    2006-12-01

    The Category Test is a well-known neuropsychological instrument used to assess concept formation and higher executive abilities. The present study investigated the utility of additional scores for the Category Test. We used principles developed in cognitive psychology to create several new measures for subtests 5 and 6 of this test. These scores were primarily designed to be sensitive to interference effects of learning decision rules from subtest 2, subtest 3, and subtest 4. The new scores as well as the total error scores from subtests 5 and 6 were used to discriminate subjects with documented brain injury from subjects who were neurologically normal based on neuroimaging and neurologic evaluation. The Category Test was given following Reitan's (1979) instructions, with the exception that no additional prompting was given to participants who struggled early with the test in order to reduce the "executive" guidance of the examiner. Because any "interference" from earlier subtests on performance of subtest 5 and subtest 6 should be related to mastery of these earlier subtests, the normal group was matched to the brain-impaired group on which subtest(s) they learned. This resulted in four learning groups: (a) learned subtests 3 and 4; (b) learned subtest 4 but not 3; (c) learned subtest 3 but not 4; and (d) failed to learn either subtest. ANOVA analyses revealed that the three measures of interference were significantly greater in the brain-damaged group than in the normal controls. Also, specific interference measures were related to specific prior subtest mastery, thus providing support for a proactive interference effect. In addition, we have evidence that our new measures may be selectively sensitive to frontal system dysfunction.

  6. The effect of instructional methodology on high school students natural sciences standardized tests scores

    NASA Astrophysics Data System (ADS)

    Powell, P. E.

    Educators have recently come to consider inquiry based instruction as a more effective method of instruction than didactic instruction. Experience based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness on preparing students to perform well on standardized tests. The purpose of the study to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi experimental quantitative study was comprised of two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.

  7. Effect of Mindfulness Meditation on Perceived Stress Scores and Autonomic Function Tests of Pregnant Indian Women

    PubMed Central

    Jain, Reena; Kohli, Sangeeta; Batra, Swaraj

    2016-01-01

    Introduction Various pregnancy complications like hypertension, preeclampsia have been strongly correlated with maternal stress. One of the connecting links between pregnancy complications and maternal stress is mind-body intervention which can be part of Complementary and Alternative Medicine (CAM). Biologic measures of stress during pregnancy may get reduced by such interventions. Aim To evaluate the effect of Mindfulness meditation on perceived stress scores and autonomic function tests of pregnant Indian women. Materials and Methods Pregnant Indian women of 12 weeks gestation were randomised to two treatment groups: Test group with Mindfulness meditation and control group with their usual obstetric care. The effect of Mindfulness meditation on perceived stress scores and cardiac sympathetic functions and parasympathetic functions (Heart rate variation with respiration, lying to standing ratio, standing to lying ratio and respiratory rate) were evaluated on pregnant Indian women. Results There was a significant decrease in perceived stress scores, a significant decrease of blood pressure response to cold pressor test and a significant increase in heart rate variability in the test group (p< 0.05, significant) which indicates that mindfulness meditation is a powerful modulator of the sympathetic nervous system and can thereby reduce the day-to-day perceived stress in pregnant women. Conclusion The results of this study suggest that mindfulness meditation improves parasympathetic functions in pregnant women and is a powerful modulator of the sympathetic nervous system during pregnancy. PMID:27190795

  8. The Relationship between Academic Averages of Primary School Science and Technology Class and Test Sub-Test Scores of Placement Test of Science

    ERIC Educational Resources Information Center

    Guzeller, Cem Oktay

    2012-01-01

    In this research, the relationship between written exam scores of science and technology class of 6th, 7th, and 8th grades, project, participation in class activities and performance work, year-end academic success point averages and sub-test raw scores of LDT science of 6th, 7th and 8th grades. Academic success point averages were used as…

  9. Comparison of the Qualitative and Developmental Scoring Systems for the Modified Version of the Bender-Gestalt Test.

    ERIC Educational Resources Information Center

    Brannigan, Gary G.; Brunner, Nancy A.

    1993-01-01

    Examined two scoring systems for Modified Version of the Bender-Gestalt Test. Administered Bender-Gestalt and Otis-Lennon School Ability Test to 75 first-grade and 84 second-grade students. Both systems were significantly correlated with school ability. Results of tests for differences between correlations indicated that Qualitative Scoring System…

  10. Examining the Validity of GED[R] Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

    ERIC Educational Resources Information Center

    George-Ezzelle, Carol E.; Skaggs, Gary

    2004-01-01

    Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…

  11. The effects of calculator-based laboratories on standardized test scores

    NASA Astrophysics Data System (ADS)

    Stevens, Charlotte Bethany Rains

    Nationwide, the goal of providing a productive science and math education to our youth in today's educational institutions is centering itself around the technology being utilized in these classrooms. In this age of digital technology, educational software and calculator-based laboratories (CBL) have become significant devices in the teaching of science and math for many states across the United States. Among the technology, the Texas Instruments graphing calculator and Vernier Labpro interface, are among some of the calculator-based laboratories becoming increasingly popular among middle and high school science and math teachers in many school districts across this country. In Tennessee, however, it is reported that this type of technology is not regularly utilized at the student level in most high school science classrooms, especially in the area of Physical Science (Vernier, 2006). This research explored the effect of calculator based laboratory instruction on standardized test scores. The purpose of this study was to determine the effect of traditional teaching methods versus graphing calculator teaching methods on the state mandated End-of-Course (EOC) Physical Science exam based on ability, gender, and ethnicity. The sample included 187 total tenth and eleventh grade physical science students, 101 of which belonged to a control group and 87 of which belonged to the experimental group. Physical Science End-of-Course scores obtained from the Tennessee Department of Education during the spring of 2005 and the spring of 2006 were used to examine the hypotheses. The findings of this research study suggested the type of teaching method, traditional or calculator based, did not have an effect on standardized test scores. However, the students' ability level, as demonstrated on the End-of-Course test, had a significant effect on End-of-Course test scores. This study focused on a limited population of high school physical science students in the middle Tennessee

  12. The Frog Pond Revisited: High School Academic Context, Class Rank, and Elite College Admission

    ERIC Educational Resources Information Center

    Espenshade, Thomas J.; Hale, Lauren E.; Chung, Chang Y.

    2005-01-01

    In this article, the authors test a "frog-pond" model of elite college admission proposed by Attewell, operationalizing high school academic context as the secondary school-average SAT score and number of Advanced Placement tests per high school senior. Data on more than 45,000 applications to three elite universities show that a high school's…

  13. The Childhood Asperger Syndrome Test (CAST): Test-Retest Reliability in a High Scoring Sample

    ERIC Educational Resources Information Center

    Allison, Carrie; Williams, Jo; Scott, Fiona; Stott, Carol; Bolton, Patrick; Baron-Cohen, Simon; Brayne, Carol

    2007-01-01

    The Childhood Asperger Syndrome Test (CAST) is a 37-item parental self-completion questionnaire designed to screen for high-functioning autism spectrum conditions in epidemiological research. The CAST has previously demonstrated good accuracy for use as a screening test, with high sensitivity in studies with primary school aged children in…

  14. The Role of Noncognitive Assessment in Admissions

    ERIC Educational Resources Information Center

    Hoerle, Heather

    2014-01-01

    Confident that understanding and employing new approaches to assessment is a top priority for admissions professionals, the Secondary School Admission Test Board (SSATB) recently launched a Think Tank on the Future of Admission Assessment, with a two-year timeline and a charge to educate its membership and inspire greater innovation in admissions…

  15. Testing Students with Special Educational Needs in Large-Scale Assessments - Psychometric Properties of Test Scores and Associations with Test Taking Behavior.

    PubMed

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are-at the same time-measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L.

  16. Testing Students with Special Educational Needs in Large-Scale Assessments - Psychometric Properties of Test Scores and Associations with Test Taking Behavior.

    PubMed

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are-at the same time-measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665

  17. Testing Students with Special Educational Needs in Large-Scale Assessments – Psychometric Properties of Test Scores and Associations with Test Taking Behavior

    PubMed Central

    Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H.; Weinert, Sabine

    2016-01-01

    Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are—at the same time—measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665

  18. A score-type test for heterogeneity in zero-inflated models in a stratified population.

    PubMed

    Cao, Guanqun; Hsu, Wei-Wen; Todem, David

    2014-05-30

    We propose a score-type statistic to evaluate heterogeneity in zero-inflated models for count data in a stratified population, where heterogeneity is defined as instances in which the zero counts are generated from two sources. Evaluating heterogeneity in this class of models has attracted considerable attention in the literature, but existing testing procedures have primarily relied on the constancy assumption under the alternative hypothesis. In this paper, we extend the literature by describing a score-type test to evaluate homogeneity against general alternatives that do not neglect the stratification information under the alternative hypothesis. The limiting null distribution of the proposed test statistic is a mixture of chi-squared distributions that can be well approximated by a simple parametric bootstrap procedure. Our numerical simulation studies show that the proposed test can greatly improve efficiency over tests of heterogeneity that ignore the stratification information. An empirical application to dental caries data in early childhood further shows the importance and practical utility of the methodology in using the stratification profile to detect heterogeneity in the population. PMID:24488881

  19. The black-white test score gap and early home environment.

    PubMed

    Yeung, Wei-Jun Jean; Pfeiffer, Kathryn M

    2009-06-01

    Based on panel data for three age cohorts of children from the Panel Study of Income Dynamics, we examine how early home environment contributes to black-white achievement gaps at different developmental stages and the extent to which early gaps contribute to later racial achievement gaps. We find large black-white test score differences among children of all ages even before children start formal schooling. Except for the oldest cohort, the gaps for all tests widened when children's cognitive skills were assessed six years later. Racial achievement gaps in applied problem scores by grade three and letter-word scores by grade six, can be accounted for by child's characteristics, family socioeconomic background, and mother's cognitive skills. However, these covariates explain an increasingly smaller proportion of the black-white achievement gap as children advance to higher grades. Gaps in early cognitive skills are highly predictive of gaps at later ages, setting off a trajectory of cumulative disadvantage for black children over time. Our results underscore the key role of early home environment and the intergenerational roots of the persistent black-white achievement gap.

  20. Classroom attributes and achievement test scores for deaf and hard of hearing students.

    PubMed

    Holt, J

    1994-10-01

    This study examined reading comprehension and mathematics computation achievement of deaf and hard-of-hearing students in a variety of school settings. Data were collected by Gallaudet University's Center for Assessment and Demographic Studies during its 1990 standardization of the 8th Edition Stanford Achievement Test. Descriptive and inferential statistical methods were used to analyze the relationships among achievement scores, classroom attributes, and demographic factors associated with achievement. Based on the results of this study, inclusion with hearing students in regular classrooms is related to a variety of demographic factors. When reading comprehension and mathematics computation scores are adjusted for these factors, they are higher for the deaf and hard-of-hearing students in regular classrooms. However, it is not known whether the higher achievement is due to inclusion or whether students were selected for inclusion due to their higher achievement levels.

  1. The relationship between subjective knee scores, isokinetic testing, and functional testing in the ACL-reconstructed knee.

    PubMed

    Wilk, K E; Romaniello, W T; Soscia, S M; Arrigo, C A; Andrews, J R

    1994-08-01

    It is important to examine the functional relationships between commonly performed clinical tests and to resolve inconsistencies in previous investigative results. The purpose of this study was to determine if a correlation exists between three commonly performed clinical tests: isokinetic isolated knee concentric muscular testing, the single-leg hop test, and the subjective knee score in anterior cruciate ligament reconstructed knees. To determine if a relationship exists would be beneficial to clinicians in determining patient progression, treatment modification, and return-to-sport objective parameters. Several investigators have analyzed two of these parameters, but no one has investigated three parameters to date. Additionally, this study explored the concept of limb acceleration and deceleration during high-speed isokinetics and its relationship to function. Fifty patients were randomly selected (29 males) with a mean age of 23.7 years (range 15-52). The subjects completed a subjective knee score questionnaire that rated symptoms (pain, swelling, giving way) and specific sport function and completed an overall knee score assessment. The patients were then evaluated performing three one-legged functional tests: 1) hop for distance, 2) timed hop, and 3) cross-over triple hop. Isokinetic testing was performed on a Biodex dynamometer at 180, 300, and 450 degrees/sec for knee extension/flexion. The patients' mean value of the self-assessed knee rating was 86 points. Sixty-four percent of the patients exhibited normal limb symmetry (within 85%) on all three single-leg hop tests. Sixteen percent exhibited quadriceps strength at least 90% of the contralateral limb isokinetically. A positive correlation was noted between isokinetic knee extension peak torque (180, 300 degrees/sec) and subjective knee scores, and the three hop tests (p < 0.001). A statistical trend was noted between knee extension acceleration and deceleration range at 180 and 300 degrees/sec for the

  2. An exposure-weighted score test for genetic associations integrating environmental risk factors.

    PubMed

    Han, Summer S; Rosenberg, Philip S; Ghosh, Arpita; Landi, Maria Teresa; Caporaso, Neil E; Chatterjee, Nilanjan

    2015-09-01

    Current methods for detecting genetic associations lack full consideration of the background effects of environmental exposures. Recently proposed methods to account for environmental exposures have focused on logistic regressions with gene-environment interactions. In this report, we developed a test for genetic association, encompassing a broad range of risk models, including linear, logistic and probit, for specifying joint effects of genetic and environmental exposures. We obtained the test statistics by maximizing over a class of score tests, each of which involves modified standard tests of genetic association through a weight function. This weight function reflects the potential heterogeneity of the genetic effects by levels of environmental exposures under a particular model. Simulation studies demonstrate the robust power of these methods for detecting genetic associations under a wide range of scenarios. Applications of these methods are further illustrated using data from genome-wide association studies of type 2 diabetes with body mass index and of lung cancer risk with smoking. PMID:26134142

  3. An exposure-weighted score test for genetic associations integrating environmental risk factors.

    PubMed

    Han, Summer S; Rosenberg, Philip S; Ghosh, Arpita; Landi, Maria Teresa; Caporaso, Neil E; Chatterjee, Nilanjan

    2015-09-01

    Current methods for detecting genetic associations lack full consideration of the background effects of environmental exposures. Recently proposed methods to account for environmental exposures have focused on logistic regressions with gene-environment interactions. In this report, we developed a test for genetic association, encompassing a broad range of risk models, including linear, logistic and probit, for specifying joint effects of genetic and environmental exposures. We obtained the test statistics by maximizing over a class of score tests, each of which involves modified standard tests of genetic association through a weight function. This weight function reflects the potential heterogeneity of the genetic effects by levels of environmental exposures under a particular model. Simulation studies demonstrate the robust power of these methods for detecting genetic associations under a wide range of scenarios. Applications of these methods are further illustrated using data from genome-wide association studies of type 2 diabetes with body mass index and of lung cancer risk with smoking.

  4. Providing Subscale Scores for Diagnostic Information: A Case Study when the Test Is Essentially Unidimensional

    ERIC Educational Resources Information Center

    Stone, Clement A.; Ye, Feifei; Zhu, Xiaowen; Lane, Suzanne

    2010-01-01

    Although reliability of subscale scores may be suspect, subscale scores are the most common type of diagnostic information included in student score reports. This research compared methods for augmenting the reliability of subscale scores for an 8th-grade mathematics assessment. Yen's Objective Performance Index, Wainer et al.'s augmented scores,…

  5. Comparison of the Bender Gestalt Test for Both Black and White Brain-Damaged Patients Using Two Scoring Systems

    ERIC Educational Resources Information Center

    Butler, Oliver T.; And Others

    1976-01-01

    This study tested for cultural bias in the Bender Visual Motor Gestalt Test. Subjects were 72 black and white patients diagnosed as either brain damaged or psychiatric. Bender protocols were scored by Pascal-Suttell and Hain systems. No race effect appeared except for the Pascal-Suttell system for which blacks scored significantly better. (Author)

  6. Interpreting the "g" Loadings of Intelligence Test Composite Scores in Light of Spearman's Law of Diminishing Returns

    ERIC Educational Resources Information Center

    Reynolds, Matthew R.

    2013-01-01

    The linear loadings of intelligence test composite scores on a general factor ("g") have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the "g" loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of this study was to (a)…

  7. Effects of Knowledge of Cognitive-Moral Development and Request to Fake on Defining Issues Test P-Scores.

    ERIC Educational Resources Information Center

    Napier, John D.

    1979-01-01

    Support claims that the "Defining Issues Test" of cognitive-moral development cannot be faked higher. Finds that instruction about cognitive-moral development affected the scores of the teacher trainees who were tested. (RL)

  8. Relationship of Students' Prior Knowledge and Order of Questions on Tests to Students' Test Scores.

    ERIC Educational Resources Information Center

    Papp, Klara K.; And Others

    1987-01-01

    A study examined whether students beginning a cell biology course with prior knowledge of its three areas (genetics, histology, and biochemistry) would retain that advantage throughout the course and whether achievement was influenced by the order of questions in a test. (MSE)

  9. The relationship between selected standardized test scores and performance in advanced placement math and science exams: Analyzing the differential effectiveness of scores for course identification and placement

    NASA Astrophysics Data System (ADS)

    Urbina, Josue N.

    There is a national need to increase the STEM-related workforce. Among factors leading towards STEM careers include the number of advanced high school mathematics and science courses students complete. Florida's enrollment patterns in STEM-related Advanced Placement (AP) courses, however, reveal that only a small percentage of students enroll into these classes. Therefore, screening tools are needed to find more students for these courses, who are academically ready, yet have not been identified. The purpose of this study was to investigate the extent to which scores from a national standardized test, Preliminary Scholastic Assessment Test/ National Merit Qualifying Test (PSAT/NMSQT), in conjunction with and compared to a state-mandated standardized test, Florida Comprehensive Assessment Test (FCAT), are related to selected AP exam performance in Seminole County Public Schools. An ex post facto correlational study was conducted using 6,189 student records from the 2010 - 2012 academic years. Multiple regression analyses using simultaneous Full Model testing showed differential moderate to strong relationships between scores in eight of the nine AP courses (i.e., Biology, Environmental Science, Chemistry, Physics B, Physics C Electrical, Physics C Mechanical, Statistics, Calculus AB and BC) examined. For example, the significant unique contribution to overall variance in AP scores was a linear combination of PSAT Math (M), Critical Reading (CR) and FCAT Reading (R) for Biology and Environmental Science. Moderate relationships for Chemistry included a linear combination of PSAT M, W (Writing) and FCAT M; a combination of FCAT M and PSAT M was most significantly associated with Calculus AB performance. These findings have implications for both research and practice. FCAT scores, in conjunction with PSAT scores, can potentially be used for specific STEM-related AP courses, as part of a systematic approach towards AP course identification and placement. For courses with

  10. Scoring in genetically modified organism proficiency tests based on log-transformed results.

    PubMed

    Thompson, Michael; Ellison, Stephen L R; Owen, Linda; Mathieson, Kenneth; Powell, Joanne; Key, Pauline; Wood, Roger; Damant, Andrew P

    2006-01-01

    The study considers data from 2 UK-based proficiency schemes and includes data from a total of 29 rounds and 43 test materials over a period of 3 years. The results from the 2 schemes are similar and reinforce each other. The amplification process used in quantitative polymerase chain reaction determinations predicts a mixture of normal, binomial, and lognormal distributions dominated by the latter 2. As predicted, the study results consistently follow a positively skewed distribution. Log-transformation prior to calculating z-scores is effective in establishing near-symmetric distributions that are sufficiently close to normal to justify interpretation on the basis of the normal distribution.

  11. The Bender Gestalt Test with the Human Figure Drawing Test for Young School Children. A Manual for Use with the Koppitz Scoring System.

    ERIC Educational Resources Information Center

    Koppitz, Elizabeth Munsterberg

    Presented is a manual for scoring the Bender Gestalt Test and the Human Figure Drawing Test for screening and diagnostic uses with emotionally disturbed, brain damaged, or perceptually handicapped 5- to 11-year-old children. Given are suggestions for administering and scoring the Bender test which examines distortion of shape, rotation,…

  12. A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

    ERIC Educational Resources Information Center

    Lee, Guemin; Park, In-Yong

    2012-01-01

    Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…

  13. A Bayesian Method for the Detection of Item Preknowledge in CAT. Law School Admission Council Computerized Testing Report. LSAC Research Report Series.

    ERIC Educational Resources Information Center

    McLeod, Lori D.; Lewis, Charles; Thissen, David.

    With the increased use of computerized adaptive testing, which allows for continuous testing, new concerns about test security have evolved, one being the assurance that items in an item pool are safeguarded from theft. In this paper, the risk of score inflation and procedures to detect test takers using item preknowledge are explored. When test…

  14. Improving personality facet scores with multidimensional computer adaptive testing: an illustration with the NEO PI-R.

    PubMed

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A W

    2013-02-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated facets. This article investigates the possibility of increasing the precision of the NEO PI-R facet scores by scoring items with multidimensional item response theory and by efficiently administering and scoring items with multidimensional computer adaptive testing (MCAT). The increase in the precision of personality facet scores is obtained from exploiting the correlations between the facets. Results indicate that the NEO PI-R could be substantially shorter without attenuating precision when the MCAT methodology is used. Furthermore, the study shows that the MCAT methodology is particularly appropriate for constructs that have many highly correlated facets.

  15. Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

    ERIC Educational Resources Information Center

    Educational Testing Service, 2008

    2008-01-01

    The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…

  16. CT densitovolumetry in children with obliterative bronchiolitis: correlation with clinical scores and pulmonary function test results*,**

    PubMed Central

    Mocelin, Helena; Bueno, Gilberto; Irion, Klaus; Marchiori, Edson; Sarria, Edgar; Watte, Guilherme; Hochhegger, Bruno

    2013-01-01

    OBJECTIVE: To determine whether air trapping (expressed as the percentage of air trapping relative to total lung volume [AT%]) correlates with clinical and functional parameters in children with obliterative bronchiolitis (OB). METHODS: CT scans of 19 children with OB were post-processed for AT% quantification with the use of a fixed threshold of −950 HU (AT%950) and of thresholds selected with the aid of density masks (AT%DM). Patients were divided into three groups by AT% severity. We examined AT% correlations with oxygen saturation (SO2) at rest, six-minute walk distance (6MWD), minimum SO2 during the six-minute walk test (6MWT_SO2), FVC, FEV1, FEV1/FVC, and clinical parameters. RESULTS: The 6MWD was longer in the patients with larger normal lung volumes (r = 0.53). We found that AT%950 showed significant correlations (before and after the exclusion of outliers, respectively) with the clinical score (r = 0.72; 0.80), FVC (r = 0.24; 0.59), FEV1 (r = −0.58; −0.67), and FEV1/FVC (r = −0.53; r = −0.62), as did AT%DM with the clinical score (r = 0.58; r = 0.63), SO2 at rest (r = −0.40; r = −0.61), 6MWT_SO2 (r = −0.24; r = −0.55), FVC (r = −0.44; r = −0.80), FEV1 (r = −0.65; r = −0.71), and FEV1/FVC (r = −0.41; r = −0.52). CONCLUSIONS: Our results show that AT% correlates significantly with clinical scores and pulmonary function test results in children with OB. PMID:24473764

  17. [Development and clinical testing of the Russian version of the Acute Cystitis Symptom Score - ACSS].

    PubMed

    Alidjanov, J F; Abdufattaev, U A; Makhmudov, D Kh; Mirkhamidov, D Kh; Khadzhikhanov, F A; Azgamov, A V; Pilatz, A; Naber, K G; Wagenlehner, F M; Akilov, F A

    2014-01-01

    The Acute Cystitis Symptom Score - ACSS was originally developed in the Uzbek language and has demonstrated high reliability and validity. The study was aimed to develop a Russian version of the ACSS questionnaire and evaluate its psychometric properties. Translation and adaptation of the ACSS questionnaire containing 18 questions, 6 of them - for the typical symptoms of acute cystitis (AC), 4 - for the differential diagnosis; 3 - for the quality of life, and 5 - for the conditions that may affect the choice of treatment, were performed according to the recommendations developed by the Mapi Research Institute. Study involved 83 Russian-speaking women (mean age, 35.6 ±13.7 years); 38 (45.8%) patients were in the main group (patients with AC), and 45 (54.2%) - in the control group (without AC). Medical examination and appropriate treatment of the respondents were conducted in accordance with approved standards. After completing the course of therapy, 19 (50%) patients of the main group came for the control examination. There was statistically significant difference in the scores obtained in the two groups. Score profiles positively correlated with the results of laboratory tests (rho = 0.26-0.48). Cronbach's alpha for the Russian version of the questionnaire was 0.86 (95% CI, 0.81-0.91), area under the curve in the ROC analysis was 0.96. The results of testing the Russian version correspond to those of the original version. The Russian version of the ACSS questionnaire has high. reliability and validity, and can be recommended for clinical research and diagnosis of primary AC, and dynamic monitoring of the effectiveness of the treatment of the Russian-speaking population of patients.

  18. Expanded Koppitz Scoring System of the Bender Gestalt Visual-Motor Test for Adolescents: A Pilot Study.

    ERIC Educational Resources Information Center

    Bolen, Larry M.; And Others

    1992-01-01

    Examined use of Bender Gestalt Visual-Motor Test with school-age adolescents over age 11. Mean error scores suggest that visual-motor development is not maturationally complete by age 11 years, 11 months. Suggests additional research focusing on extending normative sample or developing new scoring system for adolescents. (Author/NB)

  19. Improving Personality Facet Scores with Multidimensional Computer Adaptive Testing: An Illustration with the Neo Pi-R

    ERIC Educational Resources Information Center

    Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A. W.

    2013-01-01

    Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated…

  20. Sorting and Supporting: Why Double-Dose Algebra Led to Better Test Scores but More Course Failures

    ERIC Educational Resources Information Center

    Nomi, Takako; Allensworth, Elaine M.

    2013-01-01

    In 2003, Chicago schools required students entering ninth grade with below-average math scores to take two periods of algebra. This led to higher test scores for students with both above- and below-average skills, yet failure rates increased for above-average students. We examine the mechanisms behind these surprising results. Sorting by incoming…

  1. Implications of Deployed and Nondeployed Fathers on Seventh Graders' California Achievement Test Scores during a Military Crisis.

    ERIC Educational Resources Information Center

    Pisano, Mark C.

    The differences in California Achievement Test (CAT) scores from 1990 to 1991 in seventh graders, currently enrolled in Albritton Junior High School in the Fort Bragg Schools, of deployed and nondeployed fathers were analyzed. CAT percentile scores from 1990 and 1991 (1991 being the year of "Desert Storm") were obtained in reading, math and…

  2. The Mote In Thy Brother's Eye, and The Beam in Thine Own: Predicting One's Own and Others' Personality Test Scores.

    ERIC Educational Resources Information Center

    Furnham, Adrian; Henderson, Monika

    1983-01-01

    Examined the similarity between subjects' (N=63) ratings of themselves and others, on various tests of personality. Results revealed that subjects correctly estimated several of their own scores, but only two scores of another person. They believed themselves to be similar to their friend, thereby showing attributional errors. (JAC)

  3. Validation of Automated Scores of TOEFL iBT Tasks against Non-Test Indicators of Writing Ability

    ERIC Educational Resources Information Center

    Weigle, Sara Cushing

    2010-01-01

    Automated scoring has the potential to dramatically reduce the time and costs associated with the assessment of complex skills such as writing, but its use must be validated against a variety of criteria for it to be accepted by test users and stakeholders. This study approaches validity by comparing human and automated scores on responses to…

  4. Priming competence diminishes the link between cognitive test anxiety and test performance. Implications for the interpretation of test scores.

    PubMed

    Lang, Jonas W B; Lang, Jessica

    2010-06-01

    Researchers disagree whether the correlation between cognitive test anxiety and test performance is causal or explainable by skill deficits, which lead to both cognitive test anxiety and lower test performance. Most causal theories of test anxiety assume that individual differences in cognitive test anxiety originate from differences in self-perceived competence. Accordingly, in the present research, we sought to temporarily heighten perceptions of competence using a priming intervention. Two studies with secondary- and vocational-school students (Ns = 219 and 232, respectively) contrasted this intervention with a no-priming control condition. Priming competence diminished the association between cognitive test anxiety and test performance by heightening the performance of cognitively test-anxious students and by lowering the performance of students with low levels of cognitive test anxiety. The findings suggest that cognitively test-anxious persons have greater abilities than they commonly show. Competency priming may offer a way to improve the situation of people with cognitive test anxiety. PMID:20435953

  5. Performance on large-scale science tests: Item attributes that may impact achievement scores

    NASA Astrophysics Data System (ADS)

    Gordon, Janet Victoria

    Significant differences in achievement among ethnic groups persist on the eighth-grade science Washington Assessment of Student Learning (WASL). The WASL measures academic performance in science using both scenario and stand-alone question types. Previous research suggests that presenting target items connected to an authentic context, like scenario question types, can increase science achievement scores especially in underrepresented groups and thus help to close the achievement gap. The purpose of this study was to identify significant differences in performance between gender and ethnic subgroups by question type on the 2005 eighth-grade science WASL. MANOVA and ANOVA were used to examine relationships between gender and ethnic subgroups as independent variables with achievement scores on scenario and stand-alone question types as dependent variables. MANOVA revealed no significant effects for gender, suggesting that the 2005 eighth-grade science WASL was gender neutral. However, there were significant effects for ethnicity. ANOVA revealed significant effects for ethnicity and ethnicity by gender interaction in both question types. Effect sizes were negligible for the ethnicity by gender interaction. Large effect sizes between ethnicities on scenario question types became moderate to small effect sizes on stand-alone question types. This indicates the score advantage the higher performing subgroups had over the lower performing subgroups was not as large on stand-alone question types compared to scenario question types. A further comparison examined performance on multiple-choice items only within both question types. Similar achievement patterns between ethnicities emerged; however, achievement patterns between genders changed in boys' favor. Scenario question types appeared to register differences between ethnic groups to a greater degree than stand-alone question types. These differences may be attributable to individual differences in cognition

  6. A Brief Look at: Test Scores and the Standard Error of Measurement. E&R Report No. 10.13

    ERIC Educational Resources Information Center

    Holdzkom, David; Sumner, Brian; McMillen, Brad

    2010-01-01

    In the context of standardized testing, the standard error of measurement (SEM) is a measure of the factors other than the student's actual knowledge of the tested material that may affect the student's test score. Such factors may include distractions in the testing environment, fatigue, hunger, or even luck. This means that a student's observed…

  7. Multiple tests for wind turbine fault detection and score fusion using two- level multidimensional scaling (MDS)

    NASA Astrophysics Data System (ADS)

    Ye, Xiang; Gao, Weihua; Yan, Yanjun; Osadciw, Lisa A.

    2010-04-01

    Wind is an important renewable energy source. The energy and economic return from building wind farms justify the expensive investments in doing so. However, without an effective monitoring system, underperforming or faulty turbines will cause a huge loss in revenue. Early detection of such failures help prevent these undesired working conditions. We develop three tests on power curve, rotor speed curve, pitch angle curve of individual turbine. In each test, multiple states are defined to distinguish different working conditions, including complete shut-downs, under-performing states, abnormally frequent default states, as well as normal working states. These three tests are combined to reach a final conclusion, which is more effective than any single test. Through extensive data mining of historical data and verification from farm operators, some state combinations are discovered to be strong indicators of spindle failures, lightning strikes, anemometer faults, etc, for fault detection. In each individual test, and in the score fusion of these tests, we apply multidimensional scaling (MDS) to reduce the high dimensional feature space into a 3-dimensional visualization, from which it is easier to discover turbine working information. This approach gains a qualitative understanding of turbine performance status to detect faults, and also provides explanations on what has happened for detailed diagnostics. The state-of-the-art SCADA (Supervisory Control And Data Acquisition) system in industry can only answer the question whether there are abnormal working states, and our evaluation of multiple states in multiple tests is also promising for diagnostics. In the future, these tests can be readily incorporated in a Bayesian network for intelligent analysis and decision support.

  8. At Wake Forest U., Admissions Has Become "More Art than Science"

    ERIC Educational Resources Information Center

    Hoover, Eric

    2009-01-01

    The admissions process is awash in numbers. Students accumulate grade-point averages and test scores. Colleges use statistical models to predict enrollment outcomes, and they tout their place in commercial rankings. In many ways, numbers simplify this complex enterprise. However, they have come to carry undue weight, says Martha Blevins Allman,…

  9. The Impact of the Revamped SAT on Admissions Policies Among Western Land Grant Institutions

    ERIC Educational Resources Information Center

    Choroszy, Melisa N.; Muehlberg, Jessica

    2006-01-01

    The newly revamped Scholastic Aptitude Test (SAT) was administered for the first time in March 2005. Administrators, faculty, and staff in colleges and universities throughout the country were asking the same questions as students: "What was this new score of 2400 and what would it mean for them in terms of admissions, scholarships, and course…

  10. A Survey of Master's-Level Psychology Programs: Admissions Criteria and Program Policies

    ERIC Educational Resources Information Center

    Briihl, Deborah S.; Wasieleski, David T.

    2004-01-01

    This study summarizes the admission and program characteristics of American master's-level graduate programs housed in psychology departments. Individual programs (N = 253) from 163 colleges and universities provided data, including the use of grade point average, Graduate Record Exam scores, and other tests (Psychology Graduate Record Exam,…

  11. Handicapped Applicants to College: An Analysis of Admissions Decisions: College Board Report No. 87-1.

    ERIC Educational Resources Information Center

    Willingham, Warren W.

    The purpose of the study was to compare college admission decisions concerning handicapped and nonhandicapped applicants with comparable Scholastic Aptitude Test (SAT) scores and high school grades (HSG). The study focused on 1,539 handicapped students who applied to 121 institutions that participated in the College Entrance Examination Board's…

  12. Use of Admissions Data to Predict Student Success in Postsecondary Freshman Science

    ERIC Educational Resources Information Center

    Anderson, Amie K.

    2014-01-01

    The purpose of this study was to determine if significant relationships exist for any of the variables, age, gender, previous GPA, test scores (ACT, Compass), number of accumulated credits, and student success in Biology. This study strived to determine what academic/admissions data can be used to determine the likelihood of student success in…

  13. Guided-Inquiry Lessons Raise Scores on the Sixth Grade Georgia Science Test

    NASA Astrophysics Data System (ADS)

    Page, Purlie M.

    At the local level, G Middle School has the highest district-wide percentage of 6th grade science students who are not meeting standards. It is imperative that G middle school take corrective action to reduce the number of students failing to meet state science standards. Dewey's theory of conceptual framework, which involves knowledge constructed on a person's personal experience and mind activity through active forms of learning, guided this study. The goal of the study was to determine whether inquiry-based science modules produce greater 6th grade science achievement, as measured by an equivalent instrument of the science section of the Georgia Criterion-Referenced Competency Test, when compared to traditional instruction among eastern Georgia 6th graders. The sample consisted of 230 students in the nonintervention group and 119 students in the intervention group. All students were from intact classes. At the end of the intervention, an independent t test was conducted to analyze the scores. According to the study t test, (t = 12.33, df = 304.56, p < 0.05), the difference between the means was statistically significant. This project's potential impact on social change includes increasing student motivation towards, comprehension of, and interest in science concepts. At the local level, these inquiry lessons can be shared with science teachers across grade levels and within the district to improve county-wide science scores. An increase in student interest and comprehension of science concepts could ultimately lead to the United States producing more students in the fields of science, technology, engineering, and mathematics (STEM) education.

  14. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    NASA Astrophysics Data System (ADS)

    Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz; Azwan, Zairul; Raduan, Farhana; Sagap, Ismail

    2014-12-01

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  15. Survival analysis of colorectal cancer patients with tumor recurrence using global score test methodology

    SciTech Connect

    Zain, Zakiyah Ahmad, Yuhaniz; Azwan, Zairul E-mail: farhanaraduan@gmail.com Raduan, Farhana E-mail: farhanaraduan@gmail.com Sagap, Ismail E-mail: farhanaraduan@gmail.com; Aziz, Nazrina

    2014-12-04

    Colorectal cancer is the third and the second most common cancer worldwide in men and women respectively, and the second in Malaysia for both genders. Surgery, chemotherapy and radiotherapy are among the options available for treatment of patients with colorectal cancer. In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve several responses or endpoints, and this situation complicates the analysis. In the case of colorectal cancer, sets of responses concerned with survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For a patient, the time to recurrence is correlated to the overall survival. In this study, global score test methodology is used in combining the univariate score statistics for comparing treatments with respect to each survival endpoint into a single statistic. The data of tumor recurrence and overall survival of colorectal cancer patients are taken from a Malaysian hospital. The results are found to be similar to those computed using the established Wei, Lin and Weissfeld method. Key factors such as ethnic, gender, age and stage at diagnose are also reported.

  16. Assessing Growth in Young Children: A Comparison of Raw, Age-Equivalent, and Standard Scores Using the Peabody Picture Vocabulary Test

    ERIC Educational Resources Information Center

    Sullivan, Jeremy R.; Winter, Suzanne M.; Sass, Daniel A.; Svenkerud, Nicole

    2014-01-01

    Many tests provide users with several different types of scores to facilitate interpretation and description of students' performance. Common examples include raw scores, age- and grade-equivalent scores, and standard scores. However, when used within the context of assessing growth among young children, these scores should not be…

  17. Beyond the Test Score: A Mixed Methods Analysis of a College Access Intervention in Chile

    ERIC Educational Resources Information Center

    Treviño, Ernesto; Scheele, Judith; Flores, Stella M.

    2014-01-01

    Using both quantitative and qualitative analyses, we examine the role of a college access intervention in the enrollment and persistence outcomes of low-income students in Chile modeled partially after a Texas admissions program. We find that, although students from the program have a mean cumulative GPA significantly lower than that of their…

  18. An Evaluation of the IntelliMetric[SM] Essay Scoring System

    ERIC Educational Resources Information Center

    Rudner, Lawrence M.; Garcia, Veronica; Welch, Catherine

    2006-01-01

    This report provides a two-part evaluation of the IntelliMetric[SM] automated essay scoring system based on its performance scoring essays from the Analytic Writing Assessment of the Graduate Management Admission Test[TM] (GMAT[TM]). The IntelliMetric system performance is first compared to that of individual human raters, a Bayesian system…

  19. Dimensionality and predictive validity of the HAM-Nat, a test of natural sciences for medical school admission

    PubMed Central

    2011-01-01

    Background Knowledge in natural sciences generally predicts study performance in the first two years of the medical curriculum. In order to reduce delay and dropout in the preclinical years, Hamburg Medical School decided to develop a natural science test (HAM-Nat) for student selection. In the present study, two different approaches to scale construction are presented: a unidimensional scale and a scale composed of three subject specific dimensions. Their psychometric properties and relations to academic success are compared. Methods 334 first year medical students of the 2006 cohort responded to 52 multiple choice items from biology, physics, and chemistry. For the construction of scales we generated two random subsamples, one for development and one for validation. In the development sample, unidimensional item sets were extracted from the item pool by means of weighted least squares (WLS) factor analysis, and subsequently fitted to the Rasch model. In the validation sample, the scales were subjected to confirmatory factor analysis and, again, Rasch modelling. The outcome measure was academic success after two years. Results Although the correlational structure within the item set is weak, a unidimensional scale could be fitted to the Rasch model. However, psychometric properties of this scale deteriorated in the validation sample. A model with three highly correlated subject specific factors performed better. All summary scales predicted academic success with an odds ratio of about 2.0. Prediction was independent of high school grades and there was a slight tendency for prediction to be better in females than in males. Conclusions A model separating biology, physics, and chemistry into different Rasch scales seems to be more suitable for item bank development than a unidimensional model, even when these scales are highly correlated and enter into a global score. When such a combination scale is used to select the upper quartile of applicants, the proportion of

  20. Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

    ERIC Educational Resources Information Center

    Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam

    2010-01-01

    This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

  1. Should We Stop Looking for a Better Scoring Algorithm for Handling Implicit Association Test Data? Test of the Role of Errors, Extreme Latencies Treatment, Scoring Formula, and Practice Trials on Reliability and Validity

    PubMed Central

    Perugini, Marco; Schönbrodt, Felix

    2015-01-01

    Since the development of D scores for the Implicit Association Test, few studies have examined whether there is a better scoring method. In this contribution, we tested the effect of four relevant parameters for IAT data that are the treatment of extreme latencies, the error treatment, the method for computing the IAT difference, and the distinction between practice and test critical trials. For some options of these different parameters, we included robust statistic methods that can provide viable alternative metrics to existing scoring algorithms, especially given the specificity of reaction time data. We thus elaborated 420 algorithms that result from the combination of all the different options and test the main effect of the four parameters with robust statistical analyses as well as their interaction with the type of IAT (i.e., with or without built-in penalty included in the IAT procedure). From the results, we can elaborate some recommendations. A treatment of extreme latencies is preferable but only if it consists in replacing rather than eliminating them. Errors contain important information and should not be discarded. The D score seems to be still a good way to compute the difference although the G score could be a good alternative, and finally it seems better to not compute the IAT difference separately for practice and test critical trials. From this recommendation, we propose to improve the traditional D scores with small yet effective modifications. PMID:26107176

  2. The TSCA interagency testing committee`s approaches to screening and scoring chemicals and chemical groups: 1977-1983

    SciTech Connect

    Walker, J.D.

    1990-12-31

    This paper describes the TSCA interagency testing committee`s (ITC) approaches to screening and scoring chemicals and chemical groups between 1977 and 1983. During this time the ITC conducted five scoring exercises to select chemicals and chemical groups for detailed review and to determine which of these chemicals and chemical groups should be added to the TSCA Section 4(e) Priority Testing List. 29 refs., 1 fig., 2 tabs.

  3. Model-based testing for space-time interaction using point processes: An application to psychiatric hospital admissions in an urban area.

    PubMed

    Meyer, Sebastian; Warnke, Ingeborg; Rössler, Wulf; Held, Leonhard

    2016-05-01

    Spatio-temporal interaction is inherent to cases of infectious diseases and occurrences of earthquakes, whereas the spread of other events, such as cancer or crime, is less evident. Statistical significance tests of space-time clustering usually assess the correlation between the spatial and temporal (transformed) distances of the events. Although appealing through simplicity, these classical tests do not adjust for the underlying population nor can they account for a distance decay of interaction. We propose to use the framework of an endemic-epidemic point process model to jointly estimate a background event rate explained by seasonal and areal characteristics, as well as a superposed epidemic component representing the hypothesis of interest. We illustrate this new model-based test for space-time interaction by analysing psychiatric inpatient admissions in Zurich, Switzerland (2007-2012). Several socio-economic factors were found to be associated with the admission rate, but there was no evidence of general clustering of the cases. PMID:27246269

  4. An investigation of a method of simplified scoring for the Kaufman Hand Movements test as a measure of limb apraxia.

    PubMed

    Neiman, M R; Duffy, R J; Belanger, S A; Coelho, C A

    1996-02-01

    A simplified, 3-category method for scoring the Kaufman Hand Movements test was devised to replace a previously used, more complex 21-category scoring method. The concurrent validity and diagnostic sensitivity of the rescored test as a measure of limb apraxia were investigated in a reanalysis of the test protocols of 23 aphasic adults. Using the Limb Apraxia Test as the criterion measure, a Pearson r of .71 and predictive validity of 100% were obtained. These results encourage further investigation of the Kaufman Hand Movements test as an efficient measure of limb apraxia.

  5. REPRODUCIBILITY OF THE MODIFIED STAR EXCURSION BALANCE TEST COMPOSITE AND SPECIFIC REACH DIRECTION SCORES

    PubMed Central

    van Lieshout, Remko; Reijneveld, Elja A.E.; van den Berg, Sandra M.; Haerkens, Gijs M.; Koenders, Niek H.; de Leeuw, Arina J.; van Oorsouw, Roel G.; Paap, Davy; Scheffer, Else; Weterings, Stijn

    2016-01-01

    ABSTRACT Background The mSEBT is a screening tool used to evaluate dynamic balance. Most research investigating measurement properties focused on intrarater reliability and was done in small samples. To know whether the mSEBT is useful to discriminate dynamic balance between persons and to evaluate changes in dynamic balance, more research into intra- and interrater reliability and smallest detectable change (synonymous with minimal detectable change) is needed. Purpose To estimate intra- and interrater reliability and smallest detectable change of the mSEBT in adults at risk for ankle sprain. Study Design Cross-sectional, test-retest design Methods Fifty-five healthy young adults participating in sports at risk for ankle sprain participated (mean ± SD age, 24.0 ± 2.9 years). Each participant performed three test sessions within one hour and was rated by two physical therapists (session 1, rater 1; session 2, rater 2; session 3, rater 1). Participants and raters were blinded for previous measurements. Normalized composite and reach direction scores for the right and left leg were collected. Analysis of variance was used to calculate intraclass correlation coefficient values for intra- and interrater reliability. Smallest detectable change values were calculated based on the standard error of measurement. Results Intra- and interrater reliability for both legs was good to excellent (intraclass correlation coefficient ranging from 0.87 to 0.94). The intrarater smallest detectable change for the composite score of the right leg was 7.2% and for the left 6.2%. The interrater smallest detectable change for the composite score of the right leg was 6.9% and for the left 5.0%. Conclusion The mSEBT is a reliable measurement instrument to discriminate dynamic balance between persons. Most smallest detectable change values of the mSEBT appear to be large. More research is needed to investigate if the mSEBT is usable for evaluative purposes. Level of Evidence Level 2

  6. Development of new risk score for pre-test probability of obstructive coronary artery disease based on coronary CT angiography.

    PubMed

    Fujimoto, Shinichiro; Kondo, Takeshi; Yamamoto, Hideya; Yokoyama, Naoyuki; Tarutani, Yasuhiro; Takamura, Kazuhisa; Urabe, Yoji; Konno, Kumiko; Nishizaki, Yuji; Shinozaki, Tomohiro; Kihara, Yasuki; Daida, Hiroyuki; Isshiki, Takaaki; Takase, Shinichi

    2015-09-01

    Existing methods to calculate pre-test probability of obstructive coronary artery disease (CAD) have been established using selected high-risk patients who were referred to conventional coronary angiography. The purpose of this study is to develop and validate our new method for pre-test probability of obstructive CAD using patients who underwent coronary CT angiography (CTA), which could be applicable to a wider range of patient population. Using consecutive 4137 patients with suspected CAD who underwent coronary CTA at our institution, a multivariate logistic regression model including clinical factors as covariates calculated the pre-test probability (K-score) of obstructive CAD determined by coronary CTA. The K-score was compared with the Duke clinical score using the area under the curve (AUC) for the receiver-operating characteristic curve. External validation was performed by an independent sample of 319 patients. The final model included eight significant predictors: age, gender, coronary risk factor (hypertension, diabetes mellitus, dyslipidemia, smoking), history of cerebral infarction, and chest symptom. The AUC of the K-score was significantly greater than that of the Duke clinical score for both derivation (0.736 vs. 0.699) and validation (0.714 vs. 0.688) data sets. Among patients who underwent coronary CTA, newly developed K-score had better pre-test prediction ability of obstructive CAD compared to Duke clinical score in Japanese population.

  7. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    PubMed

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods.

  8. Examination of Substance Use, Risk Factors, and Protective Factors on Student Academic Test Score Performance

    PubMed Central

    Arthur, Michael W.; Brown, Eric C.; Briney, John S.; Hawkins, J. David; Abbott, Robert D.; Catalano, Richard F.; Becker, Linda; Langer, Michael; Mueller, Martin T.

    2016-01-01

    BACKGROUND School administrators and teachers face difficult decisions about how best to use school resources in order to meet academic achievement goals. Many are hesitant to adopt prevention curricula that are not focused directly on academic achievement. Yet, some have hypothesized that prevention curricula can remove barriers to learning and, thus, promote achievement. This study examined relationships between school levels of student substance use and risk and protective factors that predict adolescent problem behaviors and achievement test performance in Washington State. METHODS Hierarchical Generalized Linear Models were used to examine predictive associations between school-averaged levels of substance use and risk and protective factors and Washington State students’ likelihood of meeting achievement test standards on the Washington Assessment of Student Learning, statistically controlling for demographic and economic factors known to be associated with achievement. RESULTS Results indicate that levels of substance use and risk/protective factors predicted the academic test score performance of students. Many of these effects remained significant even after controlling for model covariates. CONCLUSIONS The findings suggest that implementing prevention programs that target empirically identified risk and protective factors have the potential to positively affect students’ academic achievement. PMID:26149305

  9. The IWQOL-Kids(©): establishing minimal clinically important difference scores and test-retest reliability.

    PubMed

    Modi, Avani C; Zeller, Meg H

    2011-06-01

    This study presents additional psychometric testing of the Impact of Weight on Quality of Life-Kids (IWQOL-Kids) with aims to establish distribution-based minimal clinically important difference scores (MCIDs) and evaluate test-retest reliability. Participants (N=263) represent a pooled sample of treatment-seeking obese adolescents (aged 11-19 years) from four large studies examining health-related quality of life (HRQOL) and psychosocial outcomes (Mz(BMI)=2.6 ± 0.4; M(age)=15.1 ± 1.9; 64% female; 51% Black, 46% White). Adolescents completed the IWQOL-Kids©. Standard errors of measurement, which represent the MCID for each scale, were: Physical Comfort=8.8; Body Esteem=7.7; Social Life=8.1; Family Relations=6.2; Total QOL =4.8. Test-retest reliabilities ranged from 0.75-0.88. These data provide further support for the excellent psychometric properties of the IWQOL-Kids. In addition, preliminary MCIDs for IWQOL-Kids scales have now been established, which can be used in clinical trials.

  10. The achievement impact of the inclusion model on the standardized test scores of general education students

    NASA Astrophysics Data System (ADS)

    Garrett-Rainey, Syrena

    The purpose of this study was to compare the achievement of general education students within regular education classes to the achievement of general education students in inclusion/co-teach classes to determine whether there was a significant difference in the achievement between the two groups. The school district's inclusion/co-teach model included ongoing professional development support for teachers and administrators. General education teachers, special education teachers, and teacher assistants collaborated to develop instructional strategies to provide additional remediation to help students to acquire the skills needed to master course content. This quantitative study reviewed the end-of course test (EoCT) scores of Grade 10 physical science and math students within an urban school district. It is not known whether general education students in an inclusive/co-teach science or math course will demonstrate a higher achievement on the EoCT in math or science than students not in an inclusive/co-teach classroom setting. In addition, this study sought to determine if students classified as low socioeconomic status benefited from participating in co-teaching classrooms as evidenced by standardized tests. Inferential statistics were used to determine whether there was a significant difference between the achievements of the treatment group (inclusion/co-teach) and the control group (non-inclusion/co-teach). The findings can be used to provide school districts with optional instructional strategies to implement in the diverse classroom setting in the modern classroom to increase academic performance on state standardized tests.

  11. Poisson Approximation-Based Score Test for Detecting Association of Rare Variants.

    PubMed

    Fang, Hongyan; Zhang, Hong; Yang, Yaning

    2016-07-01

    Genome-wide association study (GWAS) has achieved great success in identifying genetic variants, but the nature of GWAS has determined its inherent limitations. Under the common disease rare variants (CDRV) hypothesis, the traditional association analysis methods commonly used in GWAS for common variants do not have enough power for detecting rare variants with a limited sample size. As a solution to this problem, pooling rare variants by their functions provides an efficient way for identifying susceptible genes. Rare variant typically have low frequencies of minor alleles, and the distribution of the total number of minor alleles of the rare variants can be approximated by a Poisson distribution. Based on this fact, we propose a new test method, the Poisson Approximation-based Score Test (PAST), for association analysis of rare variants. Two testing methods, namely, ePAST and mPAST, are proposed based on different strategies of pooling rare variants. Simulation results and application to the CRESCENDO cohort data show that our methods are more powerful than the existing methods. PMID:27346734

  12. Rugby versus Soccer in South Africa: Content Familiarity Contributes to Cross-Cultural Differences in Cognitive Test Scores

    ERIC Educational Resources Information Center

    Malda, Maike; van de Vijver, Fons J. R.; Temane, Q. Michael

    2010-01-01

    In this study, cross-cultural differences in cognitive test scores are hypothesized to depend on a test's cultural complexity (Cultural Complexity Hypothesis: CCH), here conceptualized as its content familiarity, rather than on its cognitive complexity (Spearman's Hypothesis: SH). The content familiarity of tests assessing short-term memory,…

  13. Getting the Message Out: An Evaluation of NAEP Score Reporting Practices with Implications for Disseminating Test Results

    ERIC Educational Resources Information Center

    Zenisky, April L.; Hambleton, Ronald K.; Sireci, Stephen G.

    2009-01-01

    How a testing agency approaches score reporting can have a significant impact on the perception of that assessment and the usefulness of the information among intended users and stakeholders. Too often, important decisions about reporting test data are left to the end of the test development cycle, but by considering the audience(s) and the kinds…

  14. Meta-Analyses of the Relationship of Creative Achievement to both IQ and Divergent Thinking Test Scores

    ERIC Educational Resources Information Center

    Kim, Kyung Hee

    2008-01-01

    There is disagreement among researchers about whether IQ tests or divergent thinking (DT) tests are better predictors of creative achievement. Resolving this dispute is complicated by the fact that some research has shown a relationship between IQ and DT test scores (e.g., Runco & Albert, 1986; Wallach, 1970). The present study conducted…

  15. [Comprehensive Assessment of Psychiatric Residents: An Addition to the Program Admission Process].

    PubMed

    Luis, E Jaramillo G; Elena, Martín C

    2012-01-01

    The training of medical specialists is a long and complex process. Its purpose is to guarantee the society that they are the right professionals to meet the health needs of the population. The first step to ensure this objective is the admission process. In psychiatry this process, monitoring resident students and the criteria for each one are different in each country. Admission in Colombia is a heterogeneous process, not standardized, which varies greatly from one university to another, even between private and public universities. At the National University of Colombia, the admissions process is handled by the Admissions Office and includes: a written test for which you must obtain a minimum score, a resume rating and an interview. The Teaching Committee and the Department of Psychiatry considered the admission procedure in general to be good, but in need of refinement. Due to the experience of some teachers and given the current rules, a "comprehensive assessment" for master and doctoral students was required and in 1996 it was decided that this method of assessment for admission to a specialization in Psychiatry would serve to complement the admission process. The article describes the experience of the process and its outcomes, strengths and weaknesses.

  16. Admission to Law School: New Measures

    ERIC Educational Resources Information Center

    Shultz, Marjorie M.; Zedeck, Sheldon

    2012-01-01

    Standardized tests have been increasingly controversial over recent years in high-stakes admission decisions. Their role in operationalizing definitions of merit and qualification is especially contested, but in law schools this challenge has become particularly intense. Law schools have relied on the Law School Admission Test (LSAT) and an INDEX…

  17. Improving test interpretation for detecting executive dysfunction in adults and older adults: prevalence of low scores on the test of verbal conceptualization and fluency.

    PubMed

    Brooks, Brian L; Iverson, Grant L; Lanting, Shawnda C; Horton, Arthur M; Reynolds, Cecil R

    2012-01-01

    Knowing the prevalence of low scores on a battery of executive-functioning tests supplements clinical interpretation and can reduce the likelihood of misdiagnosing deficits in executive functioning. The purpose of this study is to examine the base rates of low scores on the Test of Verbal Conceptualization and Fluency (TVCF; Reynolds & Horton, 2006 ) in healthy adults (n = 332; M (age) = 33.0 years, SD = 10.5, range = 20-59) and older adults (n = 138; M (age) = 74.9 years, SD = 7.8, range = 60-89) from the TVCF standardization sample. The TVCF consists of four tests of executive functioning (i.e., Category Fluency, Letter Naming, Classification, and Trails C) that provide five age-adjusted T-scores. The prevalence of low scores was examined in the total sample and was stratified by educational level. When the five T-scores were considered simultaneously, having one or more scores that were 1 standard deviation (SD) below the mean was found in 28% of healthy adults and 38% of older adults. Education-based differences were also present with more lenient cutoff scores (e.g., 1 SD) but not with more conservative cutoffs. Consistent with the existing literature on other test batteries, at least one low subtest score on the TVCF is common in healthy adults and older adults.

  18. Predictors of MCAT Scores for Black Americans

    PubMed Central

    Carmichael, J. W.; Bauer, Joanne; Hunter, Jacqueline R.; Labat, Deidre D.; Sevenair, John P.

    1987-01-01

    If minority students likely to score low on the Medical College Admissions Test (MCAT) can be identified in advance, they can be advised to take existing preparatory programs, or programs can be developed to meet their needs. Correlation coefficients for a number of available independent variables with MCAT scores were determined for a population of premedical students at Xavier University of Louisiana. American College Testing (ACT) and Scholastic Aptitude Test (SAT) scores were found to have similar ability to predict MCAT scores, with a correlation coefficient of 0.64 between ACT composite and MCAT total scores. Correlations of sophomore year grade point average (GPA) with MCAT scores were only slightly weaker. Use of subtest scores for the ACT and SAT, grades in science courses, and Nelson-Denny Reading Test scores did not improve prediction to any real extent, either when used alone or in multiple linear regression analysis. In contrast to some previous studies, predictions for black men were as good as those for black women. Use of only ACT composite and sophomore year GPA together gave correlations only slightly weaker than predictions using a full range of variables; data from ACT composite and sophomore year GPA can be used for calculating predictive equations on many available micro-computers. These procedures may not be applicable to minority students at majority institutions. PMID:3612830

  19. Age-Correction of Test Scores Reduces the Validity of Mild Cognitive Impairment in Predicting Progression to Dementia

    PubMed Central

    Hessler, Johannes; Tucha, Oliver; Förstl, Hans; Mösch, Edelgard; Bickel, Horst

    2014-01-01

    Objectives A phase of mild cognitive impairment (MCI) precedes most forms of neurodegenerative dementia. Many definitions of MCI recommend the use of test norms to diagnose cognitive impairment. It is, however, unclear whether the use of norms actually improves the detection of individuals at risk of dementia. Therefore, the effects of age- and education-norms on the validity of test scores in predicting progression to dementia were investigated. Methods Baseline cognitive test scores (Syndrome Short Test) of dementia-free participants aged ≥65 were used to predict progression to dementia within three years. Participants were comprehensively examined one, two, and three years after baseline. Test scores were calculated with correction for (1) age and education, (2) education only, (3) age only and (4) without correction. Predictive validity was estimated with Cox proportional hazard regressions. Areas under the curve (AUCs) were calculated for the one-, two-, and three-year intervals. Results 82 (15.3%) of initially 537 participants, developed dementia. Model coefficients, hazard ratios, and AUCs of all scores were significant (p<0.001). Predictive validity was the lowest with age-corrected scores (−2 log likelihood  = 840.90, model fit χ2 (1)  = 144.27, HR  = 1.33, AUCs between 0.73 and 0.87) and the highest with education-corrected scores (−2 log likelihood  = 815.80, model fit χ2 (1)  = 171.16, HR  = 1.34, AUCs between 0.85 and 0.88). Conclusion The predictive validity of test scores is markedly reduced by age-correction. Therefore, definitions of MCI should not recommend the use of age-norms in order to improve the detection of individuals at risk of dementia. PMID:25171483

  20. The increasing impact of socioeconomics and race on standardized academic test scores across elementary, middle, and high school.

    PubMed

    White, Gwyne W; Stepney, Cesalie T; Hatchimonji, Danielle Ryan; Moceri, Dominic C; Linsky, Arielle V; Reyes-Portillo, Jazmin A; Elias, Maurice J

    2016-01-01

    For students and schools, the current policy is to measure success via standardized testing. Yet the immutable factors of socioeconomic status (SES) and race have, consistently, been implicated in fostering an achievement gap. The current study explores, at the school-level, the impact of these factors on test scores. Percentage of students proficient for Language and Math was analyzed from 452 schools across the state of New Jersey. By high school, 52% of the variance in Language and 59% in Math test scores can be accounted for by SES and racial factors. At this level, a 1% increase in school minority population corresponds to a 0.19 decrease in percent Language proficient and 0.33 decrease for Math. These results have significant implications as they suggest that school-level interventions to improve academic achievement scores will be stymied by socioeconomic and racial factors and efforts to improve the achievement gap via testing have largely measured it.

  1. Efforts to Produce Relevant Score Reports to School, District, and State Officials on National Tests

    ERIC Educational Resources Information Center

    Patelis, Thanos; Matos-Elefonte, Haifa

    2009-01-01

    Presented at the Annual National Council on Measurement in Education (NCME) in San Diego in April 2009. This presentation explores how the College Board strives to ensure the relevance and utility of score reporting practices and methods for the PSAT/NMSQT and SAT scores. The new reporting methods allow for greater interaction and intervention at…

  2. A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

    PubMed

    Bersabé, Rosa; Rivas, Teresa

    2010-05-01

    The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.

  3. The Relationship among Student Achievement Scores on the Math and Science End-of-Course-Tests and Scores on the High School Graduation Test

    ERIC Educational Resources Information Center

    Turner, Sherry L.

    2011-01-01

    Thirteen percent of the 2008-2009 senior class in one southeastern state did not pass the science portion of the state's high school graduation test. Another 5% failed to pass the math portion of the graduation test, leaving these students unable to obtain a high school diploma. The purpose of this nonexperimental quantitative research study was…

  4. Number Right and Elimination Score as a Function of Item Arrangement, Knowledge of Arrangement, and Test Anxiety.

    ERIC Educational Resources Information Center

    Plake, Barbara S.; And Others

    Number right and elimination scores were analyzed on a 48-item college level mathematics test that was assembled from pretest data in three forms by varying the item orderings: easy-hard, uniform, or random. Half of the forms contained information explaining the item arrangement and suggesting strategies for taking the test. Several anxiety…

  5. Language Learner Strategies and Linguistic Competence as Factors Affecting Achievement Test Scores in English for Specific Purposes

    ERIC Educational Resources Information Center

    Jurkovic, Violeta

    2010-01-01

    The article examines the effect of two factors on achievement test scores in English as a foreign language for specific purposes in higher education: preexisting linguistic competence and frequency of use of language learner strategies. The rationale for the analysis of language learner strategies as a factor affecting achievement test outcomes is…

  6. Childhood Fitness and Academic Performance: An Investigation into the Effect of Aerobic Capacity on Academic Test Scores

    ERIC Educational Resources Information Center

    Hobbs, Mark

    2014-01-01

    The purpose of this quantitate ve study was to determine whether or not students in fifth grade who meet the healthy fitness zone (HFZ) for aerobic capacity on the fall 2013 FITNESSGRAM® Test scored higher on the math portion of the 2013 fall Measures of Academic Progress (MAP) test, than students that failed to reach the HFZ for aerobic capacity…

  7. The Impact of Cooperative Learning on Critical Thinking Test Scores of Associate's Degree Graduates in Southwest Virginia

    ERIC Educational Resources Information Center

    Hodges, James Gregory

    2013-01-01

    This study examined the impact that the teaching technique known as cooperative learning had on the changes between pre- and post-test scores on all sub-categories ("induction, deduction, analysis, evaluation, inference", and "total composite") associated with the "California Critical Thinking Skills Test" (CCTST) for…

  8. On the Question of Secular Trends in the Heritability of Intelligence Test Scores: A Study of Norwegian Twins.

    ERIC Educational Resources Information Center

    Sundet, Jon Martin; And Others

    1988-01-01

    Intelligence test data collected in 1931 through 1960 on 757 identical and 1,093 fraternal male twins, from the files of the Norwegian Armed Forces, were examined for secular trends in the heritability of intelligence test scores. Only ambiguous evidence of such trends was found. (SLD)

  9. Effects of Mayfield's Four Questions (M4Q) on Nursing Students' Self-Efficacy and Multiple-Choice Test Scores

    ERIC Educational Resources Information Center

    Mayfield, Linda Riggs

    2010-01-01

    This study examined the effects of being taught the Mayfield's Four Questions multiple-choice test-taking strategy on the perceived self-efficacy and multiple-choice test scores of nursing students in a two-year associate degree program. Experimental and control groups were chosen by stratified random sampling. Subjects completed the 10-statement…

  10. Error Rates in Measuring Teacher and School Performance Based on Student Test Score Gains. NCEE 2010-4004

    ERIC Educational Resources Information Center

    Schochet, Peter Z.; Chiang, Hanley S.

    2010-01-01

    This paper addresses likely error rates for measuring teacher and school performance in the upper elementary grades using value-added models applied to student test score gain data. Using realistic performance measurement system schemes based on hypothesis testing, we develop error rate formulas based on OLS and Empirical Bayes estimators.…

  11. A Cross-sectional Study of Current Doctors' Performance in a Modified Version of a Medical School Admission Aptitude Test: The UKCAT.

    PubMed

    Blackmur, James P; Lone, Nazir I; Stone, Oliver D; Webb, David J; Dhaun, Neeraj

    2016-05-01

    The 2-hour long United Kingdom Clinical Aptitude Test (UKCAT) is used by many universities in the United Kingdom as part of their selection process for undergraduate medical and dentistry degrees. We aimed to compare the performance of senior doctors in primary and secondary care and across a range of specialties, in a modified version of the medical school entrance examination-the mUKCAT. Lay people were also included in the study. Despite its widespread use, this is the first study that examines the performance of senior clinicians in the UKCAT.The study used a prospective cross-sectional design. It used mock questions from the UKCAT website to generate an mUKCAT that was anticipated to take 15 minutes to complete. In all, 167 doctors at consultant, general practitioner (GP), or specialty trainee grade and 26 lay people took part.The overall mean mUKCAT score of all participants was 2486 (69.1%). Of the total cohort, 126 (65.3%) scored above our designated threshold of 2368 and were deemed to have passed the mUKCAT. Excluding lay people, 113 (67.7%) of the 167 doctors scored above that threshold. Medical specialty was associated with overall score (P = 0.003), with anesthetists/intensive care physicians scoring highest (n = 20, mean score 2660) and GPs scoring lowest (n = 38, mean score 2302). Academics outperformed nonacademics (mean score of academics, n = 44 vs nonacademics, n = 123: 2750 vs 2406; P < 0.001). Those clinicians in senior management positions scored lower than those in "standard" roles (mean score of senior management, n = 31 vs standard roles, n = 136: 2332 vs 2534, mean difference 202, 95% confidence interval 67-337, P = 0.004).In the situational judgement section, there was no evidence that specialty was associated with score (P = 0.15). Academics exhibited greater situational judgement than their nonacademic colleagues (academics vs nonacademics: 69.8 vs 63.6%; P = 0.01).The majority of senior

  12. ETS Psychometric Contributions: Focus on Test Scores. Research Report. ETS RR-13-15. ETS R&D Scientific and Policy Contributions Series. ETS SPC-13-03

    ERIC Educational Resources Information Center

    Moses, Tim

    2013-01-01

    The purpose of this report is to review ETS psychometric contributions that focus on test scores. Two major sections review contributions based on assessing test scores' measurement characteristics and other contributions about using test scores as predictors in correlational and regression relationships. An additional section reviews additional…

  13. Effect of locomotion score on sows' performances in a feed reward collection test.

    PubMed

    Bos, E-J; Nalon, E; Maes, D; Ampe, B; Buijs, S; van Riet, M M J; Millet, S; Janssens, G P J; Tuyttens, F A M

    2015-10-01

    Sows housed in groups have to move through their pen to fulfil their behavioural and physiological needs such as feeding and resting. In addition to causing pain and discomfort, lameness may restrict the ability of sows to fulfil such needs. The aim of our study was to investigate the extent to which the mobility of sows is affected by different degrees of lameness. Mobility was measured as the sow's willingness or capability to cover distances. Feed-restricted hybrid sows with different gait scores were subjected to a feed reward collection test in which they had to walk distances to obtain subsequent rewards. In all, 29 group-housed sows at similar gestation stage (day 96.6 ± 7 s.d.) were visually recorded for gait and classified as non-lame, mildly lame, moderately lame or severely lame. All sows received 2.6 kg of standard commercial gestation feed per day. The test arena consisted of two feeding locations separated from each other by a Y-shaped middle barrier. Feed rewards were presented at the two feeders in turn, using both light and sound cues to signal the availability of a new feed reward. Sows were individually trained during 5 non-consecutive days for 10 min/day with increasing barrier length (range: 0 to 3.5 m) each day. After training, sows were individually tested once per day on 3 non-consecutive days with the maximum barrier length such that they had to cover 9.3 m to walk from one feeder to the other. The outcome variable was the number of rewards collected in a 15-min time span. Non-lame and mildly lame sows obtained more rewards than moderately lame and severely lame sows (P<0.01). However, no significant difference was found between non-lame and mildly lame sows (P=0.69), nor between moderately lame and severely lame sows (P=1.00). This feed reward collection test indicates that both moderately lame and severely lame sows are limited in their combined ability and willingness to walk, but did not reveal an effect of mild lameness on mobility

  14. Child Abuse: Its Relationship to Birthweight, Apgar Score, and Developmental Testing.

    ERIC Educational Resources Information Center

    Goldson, Edward; And Others

    1978-01-01

    The relationship of child abuse to birthweight, five-minute Apgar score, and performance on the Bayley Scales of Infant Development was studied in 75 low socioeconomic infants (ages 2-30 months). Journal availability: see EC 111 042. (Author)

  15. An Investigation of the Relationship Between Readiness Test Scores for Kindergarten Children and Achievement Scores Obtained at the End of Grades One and Two. S.S.T.A. Research Centre Report No. 62.

    ERIC Educational Resources Information Center

    Warkentin, Lena

    The primary purpose of this study was to investigate the relationship between Metropolitan Readiness Test (MRT) scores in kindergarten (MRTK) and grade one (MRT1) with the reading scores of the Canadian Tests of Basic Skills (CTBS) at the end of grades one (CTBSR1) and two (CTBSR2). A secondary purpose of the study was to determine whether the…

  16. Intelligibility, subjective ratings and completion time scores using the FAAF test with hearing-impaired subjects and noisy reverberant environments.

    PubMed

    Shields, P W; Campbell, D R

    2001-08-01

    A series of experiments have been performed with the primary aim of assessing the performance of a signal-processing algorithm for a possible future hearing aid application. As part of this work the four alternative auditory feature (FAAF) test was used to obtain a quantitative assessment of speech intelligibility and a subjective assessment of speech quality. This paper reports results of experiments using normal hearing (NH) subjects that provide partial verification of the FAAF test originators' prior work. Also reported are intelligibility score, mean opinion score (MOS) and completion time data obtained by use of the PMID:11694098

  17. Washington Pre-College Test Norms and Admission's Index Data for University of Washington Freshmen Entering Autumn 1974.

    ERIC Educational Resources Information Center

    de Wolf, Virginia A.

    The present report is one in a series which profiles the University of Washington (UW) entering freshman class in terms of performance on the several tests making up the Washington Pre-College Testing Program's battery. As in earlier years, percentile charts for male and female entrants as well as for the total tested group, comprising 79 percent…

  18. Science course sequences: The alignment of written, enacted, and tested curricula and their impact on grade 11 HSPA science scores

    NASA Astrophysics Data System (ADS)

    Lentz, Christine A.

    The purpose of this mixed method study was to examine the alignment of the written, enacted, and tested curricula of the Ocean City High School science course sequencing and its impact on student achievement. This study also examined the school's ability to predict student scores on the science portion of the High School Proficiency Assessment (HSPA). Data collected for science achievement included the science portion of the Grade Eight Proficiency Assessment (GEPA) as a pretest and the scores for the science portion of the HSPA as a posttest. Data collected for curriculum alignment included an examination of teacher generated course curriculum maps to determine the alignment with the New Jersey Core Curriculum Content Standards and the HSPA Test Specifications Directory. The quantitative data were treated through a series of paired samples t-tests, Pearson product moment correlation was used to examine relationships between variables, an ANCOVA analysis and a stepwise regression analysis were also completed. Based on the findings of the data analysis of this research effort, the following conclusions were drawn: (1) the alignment of the enacted curriculum with the tested and written curricula affected science achievement. (2) GEPA scores are significantly tied to HSPA scores and (3) GEPA scores and enrollment in the science sequence whose curriculum was aligned with the written and tested curricula, met the requirements of a predictor of scores on the HSPA exam. It is expected that educational leadership will use the results of this research to inform practice and drive decision-making in respect to student placement in to course sequences. It is hoped that the results will not only increase support for the district's curricula development plan but also add to the overall body of knowledge surrounding science program effectiveness in relation to the No Child Left Behind standards.

  19. Lead exposure and the 2010 achievement test scores of children in New York counties

    PubMed Central

    2012-01-01

    Background Lead is toxic to cognitive and behavioral functioning in children even at levels well below those producing physical symptoms. Continuing efforts in the U.S. since about the 1970s to reduce lead exposure in children have dramatically reduced the incidence of elevated blood lead levels (with elevated levels defined by the current U.S. Centers for Disease Control threshold of 10 μg/dl). The current study examines how much lead toxicity continues to impair the academic achievement of children of New York State, using 2010 test data. Methods This study relies on three sets of data published for the 57 New York counties outside New York City: school achievement data from the New York State Department of Education, data on incidence of elevated blood lead levels from the New York State Department of Health, and data on income from the U.S. Census Bureau. We studied third grade and eighth grade test scores in English Language Arts and mathematics. Using the county as the unit of analysis, we computed bivariate correlations and regression coefficients, with percent of children achieving at the lowest reported level as the dependent variable and the percent of preschoolers in the county with elevated blood lead levels as the independent variable. Then we repeated those analyses using partial correlations to control for possible confounding effects of family income, and using multiple regressions with income included. Results The bivariate correlations between incidence of elevated lead and number of children in the lowest achievement group ranged between 0.38 and 0.47. The partial correlations ranged from 0.29 to 0.40. The regression coefficients, both bivariate and partial (both estimating the increase in percent of children in the lowest achievement group for every percent increase in the children with elevated blood lead levels), ranged from 0.52 to 1.31. All regression coefficients, when rounded to the nearest integer, were approximately 1. Thus, when the

  20. Effects of a Coaching Class on the ACT Scores of Students at a Large Midwest High School

    ERIC Educational Resources Information Center

    Moss, Gary L.; Chippendale, Ene Kaja; Mershon, Clark W.; Carney, Trisha

    2012-01-01

    The role of standardized tests in general and college admission tests in particular is a hot topic for educators, administrators and others evaluating the outcomes of educational institutions. The importance of the test scores for both students and institutions is acknowledged, and the test preparation industry has spawned into a multi-billion…

  1. The Token and Reporter's Tests--Using Two Scoring Conventions: A Normative Study with 286 Grade and Junior High Students and Use with 123 Language-Disordered Students.

    ERIC Educational Resources Information Center

    Jordan, Linda S.; Hall, Penelope K.

    1985-01-01

    Performance of 286 normal children (grades K-9) on the De Renzi and Faglioni form of the Token Test and the De Renzi and Ferrari Reporter's Test were analyzed. Two different scoring conventions were compared: number correct versus weighted scores. Normative data are presented by grade level and age. Specific administration and scoring procedures…

  2. Likelihood ratio and score tests to test the non-inferiority (or equivalence) of the odds ratio in a crossover study with binary outcomes.

    PubMed

    Li, Xiaochun; Li, Huilin; Jin, Man; D Goldberg, Judith

    2016-09-10

    We consider the non-inferiority (or equivalence) test of the odds ratio (OR) in a crossover study with binary outcomes to evaluate the treatment effects of two drugs. To solve this problem, Lui and Chang (2011) proposed both an asymptotic method and a conditional method based on a random effects logit model. Kenward and Jones (1987) proposed a likelihood ratio test (LRTM ) based on a log linear model. These existing methods are all subject to model misspecification. In this paper, we propose a likelihood ratio test (LRT) and a score test that are independent of model specification. Monte Carlo simulation studies show that, in scenarios considered in this paper, both the LRT and the score test have higher power than the asymptotic and conditional methods for the non-inferiority test; the LRT, score, and asymptotic methods have similar power, and they all have higher power than the conditional method for the equivalence test. When data can be well described by a log linear model, the LRTM has the highest power among all the five methods (LRTM , LRT, score, asymptotic, and conditional) for both non-inferiority and equivalence tests. However, in scenarios for which a log linear model does not describe the data well, the LRTM has the lowest power for the non-inferiority test and has inflated type I error rates for the equivalence test. We provide an example from a clinical trial that illustrates our methods. Copyright © 2016 John Wiley & Sons, Ltd. PMID:27095359

  3. Likelihood ratio and score tests to test the non-inferiority (or equivalence) of the odds ratio in a crossover study with binary outcomes.

    PubMed

    Li, Xiaochun; Li, Huilin; Jin, Man; D Goldberg, Judith

    2016-09-10

    We consider the non-inferiority (or equivalence) test of the odds ratio (OR) in a crossover study with binary outcomes to evaluate the treatment effects of two drugs. To solve this problem, Lui and Chang (2011) proposed both an asymptotic method and a conditional method based on a random effects logit model. Kenward and Jones (1987) proposed a likelihood ratio test (LRTM ) based on a log linear model. These existing methods are all subject to model misspecification. In this paper, we propose a likelihood ratio test (LRT) and a score test that are independent of model specification. Monte Carlo simulation studies show that, in scenarios considered in this paper, both the LRT and the score test have higher power than the asymptotic and conditional methods for the non-inferiority test; the LRT, score, and asymptotic methods have similar power, and they all have higher power than the conditional method for the equivalence test. When data can be well described by a log linear model, the LRTM has the highest power among all the five methods (LRTM , LRT, score, asymptotic, and conditional) for both non-inferiority and equivalence tests. However, in scenarios for which a log linear model does not describe the data well, the LRTM has the lowest power for the non-inferiority test and has inflated type I error rates for the equivalence test. We provide an example from a clinical trial that illustrates our methods. Copyright © 2016 John Wiley & Sons, Ltd.

  4. Predictive validity of the classroom strategies scale-observer form on statewide testing scores: an initial investigation.

    PubMed

    Reddy, Linda A; Fabiano, Gregory A; Dudek, Christopher M; Hsu, Louis

    2013-12-01

    The present study examined the validity of a teacher observation measure, the Classroom Strategies Scale--Observer Form (CSS), as a predictor of student performance on statewide tests of mathematics and English language arts. The CSS is a teacher practice observational measure that assesses evidence-based instructional and behavioral management practices in elementary school. A series of two-level hierarchical generalized linear models were fitted to data of a sample of 662 third- through fifth-grade students to assess whether CSS Part 2 Instructional Strategy and Behavioral Management Strategy scale discrepancy scores (i.e., ∑ |recommended frequency--frequency ratings|) predicted statewide mathematics and English language arts proficiency scores when percentage of minority students in schools was controlled. Results indicated that the Instructional Strategy scale discrepancy scores significantly predicted mathematics and English language arts proficiency scores: Relatively larger discrepancies on observer ratings of what teachers did versus what should have been done were associated with lower proficiency scores. Results offer initial evidence of the predictive validity of the CSS Part 2 Instructional Strategy discrepancy scores on student academic outcomes. PMID:24341925

  5. The relationship of minority students' MCAT scores and grade point averages to their acceptance into medical school.

    PubMed

    Lynch, K B; Woode, M K

    1990-07-01

    The authors identified relationships between quantitative academic variables--specifically, grade-point averages (GPAs) and Medical College Admission Test (MCAT) scores--and the admission decisions of 58 students from minority groups underrepresented in medicine. These students had participated in a summer enrichment program at the University of Virginia School of Medicine and had applied to medical school. A total of 49 of the students were offered admission, and ultimately enrolled in 17 different medical schools. Results of a stepwise multiple regression analysis indicated that scores on the Skills Analysis: Quantitative Subtest of the MCAT explained the greatest percentage of the variance related to medical school admission (26%); scores on the Skills Analysis: Reading Subtest contributed an additional 7% to the explained variance, and scores on the Physics Knowledge Subtest, another 5%. The overall GPAs did not contribute significantly to the explanation of the variance in admission decisions. These results differ from published findings based on data from minorities' admissions to individual institutions. Caution must therefore be exercised when using GPAs and MCATs as medical school admission predictors for students from minority groups.

  6. Representing Response-Time Information in Item Banks. Law School Admission Council Computerized Testing Report. LSAC Research Report Series.

    ERIC Educational Resources Information Center

    Schnipke, Deborah L.; Scrams, David J.

    The availability of item response times made possible by computerized testing represents an entirely new type of information about test items. This study explores the issue of how to represent response-time information in item banks. Empirical response-time distribution functions can be fit with statistical distribution functions with known…

  7. Entrance Test Accommodations, Admission and Enrollment of Students with Learning Disabilities in Teacher Training Colleges in Israel

    ERIC Educational Resources Information Center

    Sharoni, Varda; Vogel, Gila

    2007-01-01

    The "Saf" (threshold) exam is the entry exam taken by approximately 40% of the students enrolled in teacher training colleges in Israel. Students with learning disabilities may apply and be granted testing accommodations on this exam. This study examines the percentage of students with testing accommodations among the testees and those who began…

  8. Testing Measurement Invariance of the Schizotypal Personality Questionnaire-Brief Scores across Spanish and Swiss Adolescents

    PubMed Central

    Ortuño-Sierra, Javier; Badoud, Deborah; Knecht, Francesca; Paino, Mercedes; Eliez, Stephan; Fonseca-Pedrero, Eduardo; Debbané, Martin

    2013-01-01

    Background Schizotypy is a complex construct intimately related to psychosis. Empirical evidence indicates that participants with high scores on schizotypal self-report are at a heightened risk for the later development of psychotic disorders. Schizotypal experiences represent the behavioural expression of liability for psychotic disorders. Previous factorial studies have shown that schizotypy is a multidimensional construct similar to that found in patients with schizophrenia. Specifically, using the Schizotypal Personality Questionnaire-Brief (SPQ-B), the three-dimensional model has been widely replicated. However, there has been no in-depth investigation of whether the dimensional structure underlying the SPQ-B scores is invariant across countries. Methods The main goal of this study was to examine the measurement invariance of the SPQ-B scores across Spanish and Swiss adolescents. The final sample was made up of 261 Spanish participants (51.7% men; M = 16.04 years) and 241 Swiss participants (52.3% men; M = 15.94 years). Results The results indicated that Raine et al.'s three-factor model presented adequate goodness-of-fit indices. Moreover, the results supported the measurement invariance (configural and partial strong invariance) of the SPQ-B scores across the two samples. Spanish participants scored higher on Interpersonal dimension than Swiss when latent means were compared. Discussion The study of measurement equivalence across countries provides preliminary evidence for the Raine et al.'s three-factor model and of the cross-cultural validity of the SPQ-B scores in adolescent population. Future studies should continue to examine the measurement invariance of the schizotypy and psychosis-risk syndromes across cultures. PMID:24349180

  9. Incomplete Psychometric Equivalence of Scores Obtained on the Manual and the Computer Version of the Wisconsin Card Sorting Test?

    ERIC Educational Resources Information Center

    Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude

    2010-01-01

    The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…

  10. Improving Secondary Practical Computer Skills: Logo Test Scores through Graphically Designed Computer Programs and Utilization of Multimedia and Technology.

    ERIC Educational Resources Information Center

    Miller, Douglas S.

    The intent of this project was to improve test and programming scores of 9th through 12th grade students enrolled in the Practical Computer Skills: Logo course in a north central Florida high school. An implementation program that demonstrated teacher-designed graphical computer language Logo programs, utilized multimedia techniques, and used…

  11. Automated Scoring for the "TOEFL Junior"® Comprehensive Writing and Speaking Test. Research Report. ETS RR-15-09

    ERIC Educational Resources Information Center

    Evanini, Keelan; Heilman, Michael; Wang, Xinhao; Blanchard, Daniel

    2015-01-01

    This report describes the initial automated scoring results that were obtained using the constructed responses from the Writing and Speaking sections of the pilot forms of the "TOEFL Junior"® Comprehensive test administered in late 2011. For all of the items except one (the edit item in the Writing section), existing automated scoring…

  12. The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

    ERIC Educational Resources Information Center

    Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward

    2009-01-01

    This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating were evaluated and compared with respect to systematic error (bias), random…

  13. Predicting Pre-Service Classroom Teachers' Civil Servant Recruitment Examination's Educational Sciences Test Scores Using Artificial Neural Networks

    ERIC Educational Resources Information Center

    Demir, Metin

    2015-01-01

    This study predicts the number of correct answers given by pre-service classroom teachers in Civil Servant Recruitment Examination's (CSRE) educational sciences test based on their high school grade point averages, university entrance scores, and grades (mid-term and final exams) from their undergraduate educational courses. This study was…

  14. The Score Reliability of Draw-a-Person Intellectual Ability Test (DAP: IQ) for Rural Malawi Students

    ERIC Educational Resources Information Center

    Khasu, Denis S.; Williams, Thomas O., Jr.

    2016-01-01

    In this brief article, the reliability of scores for the Draw-A-Person Intellectual Ability Test for Children, Adolescents, and Adults (DAP: IQ; Reynolds & Hickman, 2004) was examined through several analyses with a sample of 147 children from rural Malawi, Africa using a Chichewa translation of instructions. Cronbach alpha coefficients for…

  15. Changes in the Black-White Test score Gap in the Elementary School Grades. CSE Report 715

    ERIC Educational Resources Information Center

    Koretz, Daniel; Kim, Young-Suk

    2007-01-01

    In a pair of recent studies, Fryer and Levitt (2004a, 2004b) analyzed the Early Childhood Longitudinal Study--Kindergarten Cohort (ECLS-K) to explore the characteristics of the Black-White test score gap in young children. They found that the gap grew markedly between kindergarten and the third grade and that they could predict the gap from…

  16. School Policies and the Black-White Test Score Gap. Working Paper Series. SAN08-03

    ERIC Educational Resources Information Center

    Ladd, Helen F.

    2008-01-01

    This paper examines school-related policies and strategies that have been proposed or justified, at least in part, on the basis of their potential for reducing black-white test score gaps. These include strategies, one of which is greater integration, to reduce differences in the quality of teachers faced by black and white students; school and…

  17. "Does Charter School Attendance Improve Test Scores?" Comments and Reactions on the Arizona Achievement Study. Upjohn Institute Staff Working Paper.

    ERIC Educational Resources Information Center

    Nelson, Christopher; Hollenbeck, Kevin

    In a recent report, Lewis Solmon, Kern Paark, and David Garcia (2001) seek to identify the impact of attending charter schools on student achievement using data from Arizona. Based on a sophisticated statistical analysis, these authors report that charter school attendance increases test score gains of students. This note raises some questions…

  18. Assessing the Effect of School Days and Absences on Test Score Performance. CEP Discussion Paper No. 1302

    ERIC Educational Resources Information Center

    Aucejo, Esteban M.; Romano, Teresa Foy

    2014-01-01

    While instructional time is viewed as crucial to learning, little is known about the effectiveness of reducing absences relative to increasing the number of school days. In this regard, this paper jointly estimates the effect of absences and length of the school calendar on test score performance. Using administrative data from North Carolina…

  19. Predictive Validity of Pre-University Examinations Test Scores for University Science Undergraduates' Academic Achievement in South West, Nigeria

    ERIC Educational Resources Information Center

    Gbore, L. O.

    2012-01-01

    This study examined the predictive validity of pre-university examinations test scores (university matriculation examination (UME), Post-UME and pre-degree) for undergraduate academic achievement. The study is planned along the lines of correlational and ex-post-facto research design. A sample of four hundred university science based…

  20. Options in Education, Transcript for March 8, 1976: Parent Tutors, Feminization of the Teaching Profession, Test Score Controversy, and Busing.

    ERIC Educational Resources Information Center

    George Washington Univ., Washington, DC. Inst. for Educational Leadership.

    "Options in Education" is a radio news program which focuses on issues and developments in education. This transcript contains discussions of volunteer parent tutors in a junior high school, the feminization of the teaching profession, the test score controversy, busing as an issue in the political primaries, and busing and the role of the social…