Schrader, William B.
This report provides information on test development, test administration, and score interpretation for the Graduate Management Admission Test (GMAT). The GMAT, first administered in 1954, provides objective measures of an applicant's abilities for use in admissions decisions by graduate management schools. It is currently composed of five…
Journal of Blacks in Higher Education, 2003
Discusses the racial scoring gap on tests for admission to medical, business, law, and other graduate programs, noting that in the highest-scoring brackets on the Medical College Admission Test (MCAT), the racial gap is even larger. Whites are five times, twelve times, and seven times more likely, respectively, to score higher on the MCAT, Law…
Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…
This study examined the relationship between teacher education students' scores on basic skills admission tests and graduating seniors' scores on the National Teacher Examinations (NTE) at Eastern Kentucky University. The 1981-82 basic skills test scores for 262 teacher education students were compared with their NTE scores taken in 1984-85 during…
Carroll, Alexander M; Schuster, Gregory M
The aim of this study was to investigate whether there was a statistically significant positive correlation between dental students' Dental Admission Test (DAT) scores, particularly on the Perceptual Ability Test (PAT), and their performance on a dental school's competency exam. Scores from the written and clinical competency exam administered in the fall quarter of the fourth year of the curriculum at Midwestern University College of Dental Medicine-Arizona were compared to DAT scores of all 216 members of the graduating classes of 2012 and 2013. It was hypothesized that students who performed highly on one or more sections of the DAT would perform highly on the competency exam. Backward stepwise regression analyses were used to analyze the data. The results showed that the PAT scores were most strongly correlated with the competency exam scores and were a positive predictor for all three clinical sections of the exam (operative dentistry, periodontics, and endodontics). Positive predictors for the written portion of the exam were total DAT score for patient assessment and treatment planning and the DAT reading comprehension score for prosthodontics; there were no predictors for periodontics. The total variance explained by the results ranged from 4% to 15%. While statistically significant relationships were found between the students' PAT scores and clinical performance, DAT scores explained relatively little variance in the competency exam scores. According to these findings, neither the PAT nor any of the DAT components contributed to predicting these students' clinical performance. PMID:26522638
Cullen, Thomas J.; And Others
The predictive validity of the new Medical College Admission Test as it relates to end-of-quarter examinations in anatomy, histology, physiology, biochemistry, and "ages of man" is presented. Results indicate that the Science Knowledge assessment areas of chemistry and physics and the Science Problems subtest were most useful in predicting student…
Allareddy, Veerasathpurush; Howell, T Howard; Karimbux, Nadeem Y
This study examined the association between Dental Admission Test (DAT) scores and the comprehensive exams conducted at Harvard School of Dental Medicine. The authors hypothesized that students who scored high on the DAT would also perform well on the comprehensive examinations. Sixty-six students from the graduating classes of 2005 and 2006 were included. The outcome variable of interest was the final composite grade obtained by the students in the three comprehensive examinations. The main independent variable of interest was the individual component scores on the DAT. Multivariable logistic regression analysis using the maximum likelihood methods was used to examine the association between comprehensive exam grades and DAT scores. Effects of age, gender, and race/ethnicity were adjusted in the regression models. On the first comprehensive examination, seventeen students obtained an Honors grade, while thirteen did so on the second comprehensive examination and fifteen on the third comprehensive examination. None of the DAT component scores were significantly associated with Honors grades on the first comprehensive examination. On the second comprehensive examination, quantitative reasoning scores (OR=2.48, 95 percent CI=1.09-5.68, p=0.03) and total science scores (OR=14.17, 95 percent CI=1.89-106.80, p=0.01) were significantly associated with Honors grades. Reading comprehension score was associated with increased odds of obtaining Honors grade on the third comprehensive examination (OR=1.81, 95 percent CI=1.13-2.92, p=0.01). Students who scored well on the quantitative reasoning, total science, and reading comprehension sections of the DAT had higher odds of receiving an Honors grade on the second and third comprehensive examinations. These factors may be associated with the problem-solving/critical thinking components in the school's PBL curriculum. PMID:22319081
Bridgeman, Brent; Cooper, Peter
Essays for the Graduate Management Admissions Test must be written with a word processor (except in some foreign countries). The test sponsors, the Graduate Management Admissions Council, believed that this is fair because some word processing skill is a prerequisite for advanced management education. Furthermore, it might also be unfair to…
In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take corrective action as soon as…
Cross, Theodore, Ed.
Argues that differences in black performance and white performance on standardized tests likely comes from deeply rooted environmental forces such as expectations of one's life being restricted to a small and poorly rewarded set of social roles. Issues of test bias, the influence of caste-like minorities, the conflict between African American…
Hedlund, Jennifer; Wilt, Jeanne M.; Nebel, Kristina L.; Ashford, Susan J.; Sternberg, Robert J.
The Graduate Management Admission Test (GMAT) is the most widely used measure of managerial potential in MBA admissions. GMAT scores, although predictive of grades in business school, leave much of the variance in graduate school performance unexplained. The GMAT also produces disparities in test scores between groups, generating the potential for…
Predictive value of grade point average (GPA), Medical College Admission Test (MCAT), internal examinations (Block) and National Board of Medical Examiners (NBME) scores on Medical Council of Canada qualifying examination part I (MCCQE-1) scores
Roy, Banibrata; Ripstein, Ira; Perry, Kyle; Cohen, Barry
Background To determine whether the pre-medical Grade Point Average (GPA), Medical College Admission Test (MCAT), Internal examinations (Block) and National Board of Medical Examiners (NBME) scores are correlated with and predict the Medical Council of Canada Qualifying Examination Part I (MCCQE-1) scores. Methods Data from 392 admitted students in the graduating classes of 2010–2013 at University of Manitoba (UofM), College of Medicine was considered. Pearson’s correlation to assess the strength of the relationship, multiple linear regression to estimate MCCQE-1 score and stepwise linear regression to investigate the amount of variance were employed. Results Complete data from 367 (94%) students were studied. The MCCQE-1 had a moderate-to-large positive correlation with NBME scores and Block scores but a low correlation with GPA and MCAT scores. The multiple linear regression model gives a good estimate of the MCCQE-1 (R2 =0.604). Stepwise regression analysis demonstrated that 59.2% of the variation in the MCCQE-1 was accounted for by the NBME, but only 1.9% by the Block exams, and negligible variation came from the GPA and the MCAT. Conclusions Amongst all the examinations used at UofM, the NBME is most closely correlated with MCCQE-1. PMID:27103953
I will present data showing that significant performance disparities on the GRE general test exist based on the test taker's race and gender . Because of the belief that high GRE scores qualify one for graduate studies, the diversity issues faced by STEM fields may originate, at least in part, in misuse of the GRE scores by graduate admissions committees. I will quantitatively demonstrate this by showing that the combination of a hard cut-off and the different score distributions leads to the systematic underrepresentation of certain groups. I will present data from USF’s PhD program that shows a lack of correlation between GRE scores and research ability; similar null results are emerging from numerous other programs. I will then discuss how assessing non-cognitive competencies in the selection process may lead to a more enlightened search for the next generation of scientists.  C. W. Miller, "Admissions Criteria and Diversity in Graduate School", APS News Vol 22, Issue 2, The Back Page (2013) http://www.aps.org/publications/apsnews/201302/backpage.cfm
Noble, Julie P.; Camara, Wayne J.
College admissions tests provide a standardized and objective measure of student achievement and generalized skills. Unlike high school grades or rank, admission tests are a common measure for comparing students who have attended different high schools, completed different courses, received different grades in courses taught by different teachers,…
Clarke, Marguerite; Shore, Arnold
In order to understand the roles of test scores and diversity characteristics (including race and ethnicity) in the admission process, National Board researchers interviewed admissions directors who worked at selective public and private institutions are well as admissions consultants in the summer and fall of 1999. This report presents an…
Sternberg, Robert J.
Standardized admissions tests such as the SAT (originally stood for "Scholastic Aptitude Test") and the ACT measure only a narrow segment of the skills needed to become an active citizen and possibly a leader who makes a positive, meaningful, and enduring difference to the world. The problem with these tests is that they promised, under what have…
McLaughlin, Jacqueline E.; Khanova, Julia; Scolaro, Kelly; Rodgers, Philip T.
Objective. To examine the relationship between admissions, objective structured clinical examination (OSCE), and advanced pharmacy practice experience (APPE) scores. Methods. Admissions, OSCE, and APPE scores were collected for students who graduated from the doctor of pharmacy (PharmD) program in spring of 2012 and spring of 2013 (n=289). Pearson correlation was used to examine relationships between variables, and independent t test was used to compare mean scores between groups. Results. All relationships among admissions data (undergraduate grade point average, composite PCAT scores, and interview scores) and OSCE and APPE scores were weak, with the strongest association found between the final OSCE and ambulatory care APPEs. Students with low scores on the final OSCE performed lower than others on the acute care, ambulatory care, and community APPEs. Conclusion. This study highlights the complexities of assessing student development of noncognitive professional skills over the course of a curriculum. PMID:26430271
Kelley, Katherine A.; Secnik, Kristina; Boye, Mark E.
Investigated the capacity of the Pharmacy College Admissions Test (PCAT) to predict success in pharmacy school. Found demographic differences in PCAT scores, and that the PCAT used in combination with pre-pharmacy grade point average is meaningful in assessing applicants to pharmacy school; applicants with PCAT composite percentile scores below 40…
Delgado-Hurtado, Juan J.; Berger, Andrea; Bansal, Amit B.
Background Geisinger Health System implemented the Modified Early Warning Score (MEWS) in 2011 and is fully integrated to the Electronic Medical Record (EMR). Our objective was to assess whether the emergency department (ED) MEWS (auto-calculated by EMR) is associated with admission to the hospital, admission disposition, inpatient mortality, and length of stay (LOS) 4 years after its implementation. Methods A random sample of 3,000 patients’ first encounter in the ED was extracted in the study period (between January 1, 2014 and May 31, 2015). Logistic regression was done to analyze whether mean, maximum, and median ED MEWS is associated with admission disposition, mortality, and LOS. Results Mean, maximum, and median ED MEWS is associated with admission to the hospital, admission disposition, and mortality. It correlates weakly with LOS. Conclusion MEWS can be integrated to the EMR, and the score automatically generated still helps predict catastrophic events. MEWS can be used as a triage tool when deciding whether and where patients should be admitted. PMID:27124174
Romanowski, Kathleen S; Barsun, Alura; Pamlieri, Tina L; Greenhalgh, David G; Sen, Soman
With longer life expectancy, the number of burn injuries in the elderly continues to increase. Prediction of outcomes for the elderly is complicated by preinjury physical fitness and comorbid illness. The authors hypothesize that admission frailty assessment would be predictive of outcomes in the elderly burn population. Our primary aim was to determine if higher frailty scores were associated with higher risk of mortality for elderly burn patients. The secondary aims were to assess if higher frailty scores were associated with increased length of stay, increased needs for mechanical ventilation and poor discharge disposition. A 2-year retrospective chart review was performed of all admitted acute burn patients 65 years or older. Data collected included: age, gender, %TBSA of burn injury, presence of inhalation injury, in hospital mortality, hospital length of stay, ventilator days, ICU length of stay, surgical procedures, insurance status, and discharge disposition. Frailty scores were assessed from admission data and calculated using the Canadian Study of Health and Aging clinical frailty scale. A total of 89 patients met entry criteria. Mean age was 75.3 ± 8.1 years and consisted of 62 men and 27 women. Mean %TBSA was 9.6 ± 9.1% and mean frailty score (FS) was 4.5 ± 1.2. Eighty patients survived to discharge and nine died. Nonsurvivors had significantly higher FS compared to survivors (5.2 ± 1.2 vs 4.4 ± 1.2). FS were also significantly higher in patients discharged to skilled nursing facilities (SNF) (5.34 ± 0.9) compared to those who were discharged home (4.1 ± 1.2) or to physical rehabilitation facilities (4 ± 1.5). Multivariate linear regression analysis revealed that age (B = 0.04) and discharge to SNF (B = 1.2) are independently associated with higher FS. However, survivors were independently associated with a significantly lower FS (B = -1.3). Multivariate logistic regression analysis revealed high admission FS independently increased the risk of
Cherngchawano, Wirun; Jaturapitakkul, Natjiree
University Admission Tests in Thailand are important documents which reflect Thailand's education system. To study at a higher education level, all students generally need to take the University Admission Tests designed by the National Institute of Educational Testing Service (NIETS). For the English test, vocabulary and reading comprehension is…
This paper discusses one of the arguments that has been advanced against the use of standardized college admissions tests: the notion that their use leads admissions officers to reject non-Asian minority students on the basis of small and insignificant differences in scores. In the aggregate, the disadvantage minority students face as a result of…
Miller, Casey W.
About 180 graduate programs in physics are listed in the AIP Graduate Programs book. ~ 96% require the general GRE test; a quarter of these have an explicitly stated minimum score for admission, with the median stated cut-off being 700 (64th percentile) on GRE Quantitative; ~ 48% require the physics GRE; about half of these have an explicitly stated minimum score for admission, with the median being 600 (32nd percentile). It does not seem unreasonable to expect students to be among the top test scorers, until you dissect the test results by race and gender. In this talk, I will present data showing that the use of minimum acceptable scores on the GRE exam will have (have had?) a negative impact on diversity in Physics. I will remind the community that this practice is in opposition to ETS's Guide to the Use of Scores. I will make some suggestions for admissions committees, based in part on analyses I have performed. I will then pose challenges related to reducing the influence of GRE scores to the community, ranging from the department and university administration, to ranking bodies and professional societies. Supported by NSF.
Gender Differences in Prediction of Graduate Course Performance from Admissions Test Scores: An Empirical Example of Statistical Methods for Investigating Prediction Bias. AIR 1998 Annual Forum Paper.
House, J. Daniel
This study focused on gender differences in examining the extent to which Graduate Record Examination (GRE) scores predicted subsequent achievement. Data on 275 graduate students in professional psychology programs at a large midwestern university were collected and analyzed. Two methods for the identification of prediction bias were used and…
Kawahara, Nancy E.; Ethington, Corinna
Median polishing, an exploratory data statistical analysis technique, was used to study achievement patterns for men and women on the Pharmacy College Admission Test over a six-year period. In general, a declining trend in scores was found, and males performed better than females, with the largest differences found in chemistry and biology.…
This article describes the National Center Test for University Admissions, a unified national test in Japan, which is taken by 500,000 students every year. It states that implementation of the Center Test began in 1990, with the English component consisting only of the written section until 2005, when the listening section was first implemented…
Yang, Wen-Ling; Bontya, Andrea M.; Moses, Tim P.
Using self-reported but empirically verified repeater groups, we analyzed vast amounts of real test data across a wide range of administrations from a graduate admissions examination that was administered in a non-English language to investigate repeater effects on score equating using the nonequivalent groups with anchor test (NEAT) design. Both…
Morgan, Deanna L.
The introduction of the SAT Reasoning Test™ with a writing section in March 2005 and the concomitant elimination of the SAT® Subject Test in Writing after January 2005 have led many colleges and institutions to ask for guidance in using the new SAT Reasoning Test writing section scores for college placement and admissions. Standard-setting…
Chalhoub-Deville, Micheline; Turner, Carolyn E.
Familiarizes test users with issues to consider when employing assessments for screening and admission purposes. Examines the purpose, content, and scoring methods of three English-as-a-Second-Language admissions tests--the Cambridge certificate exams, International English Language Teaching System, and Test of English as a Foreign…
Kramer, Gene A.; Johnston, JoElle
A study examined the relationship between Optometry Admission Test scores and pre-optometry or undergraduate grade point average (GPA) with first and second year performance in optometry schools. The test's predictive validity was limited but significant, and comparable to those reported for other admission tests. In addition, the scores…
Providing Transparency and Credibility: The Selection of International Students for Australian Universities. An Examination of the Relationship between Scores in the International Student Admissions Test (ISAT), Final Year Academic Programs and an Australian University's Foundation Program
Lai, Kelvin; Nankervis, Susan; Story, Margot; Hodgson, Wayne; Lewenberg, Michael; Ball, Marita MacMahon
Throughout 2003-04 five cohorts of students in their final year of school studies in various Malaysian colleges and a group of students completing an Australian university foundation year in Malaysia sat the International Student Admissions Test (ISAT). The ISAT is a multiple-choice test of general academic abilities developed for students whose…
Examined the degree to which test scores reflect English proficiency using data on students whose native language was Spanish. They had taken the GRE, English proficiency tests, and the PAEG, a test in Spanish used for admission to graduate schools in Puerto Rico. Found that quantitative, analytical, and subject tests are more highly related to…
Warnapala, Yajni; Silva, Karishma
In the year 2001, the University Grants Commission of Sri Lanka successfully appealed to change the method of determining the cut-off scores for university admissions from raw scores to standardized z-scores. This standardization allegedly eliminated the discrepancy caused due to the assumption of equal difficulty levels across all subjects. This…
Tittle, Carol; Kay, Patricia
The establishment of an open admissions policy necessitated an evaluative procedure to identify groups requiring remedial instruction and to assist in estimating budgeting and staffing needs. This study was undertaken, therefore, to select tests in reading and mathematics which would: (1) discriminate adequately between non-college and college…
Stemler, Steven E.
University admissions tests should predict an applicant's ability to succeed in college, but how should this success be defined and measured? The status quo has been to use 1st-year grade point average (FYGPA) as the key indicator of college success, but a review of documents such as university mission statements reveals that universities expect…
Kramer, Gene A.
The relationship of Dental Admission Test (DAT) scales and predental grade point averages with freshman and sophomore dental school performance measures and National Board Dental Examination (NBDE) Part I averages were examined. The results indicated that the DAT scales had limited predictive validity. (Author/MLW)
Pawlow, Laura A.
This exercise aims to provide a hands-on, role-playing activity that requires students to evaluate the strengths and limitations of standardized tests in making admission decisions. Small groups pretend to be an admissions committee and review fictitious student applications containing both standardized test scores and other information admissions…
What strategies can improve test scores? According to research done by Amrein and Berliner, who studied 18 states with high stakes testing, their conclusion was that students did not necessarily score higher and often remained at the same level prior to the introduction of the high stakes testing. In other research done by Carnoy and Loeb, their…
Marini, Jessica P.; Shaw, Emily J.; Young, Linda; Walker, Michael
College admission offices receive many different pieces of information on each applicant to their institution. The job of considering this information becomes further complicated when the information is not consistent across applicants. An example of one such complication is receiving SAT scores for some applicants and receiving ACT scores for…
As more colleges move to "test optional" admissions policies, the debate over the utility and interpretation of standardized-test scores continues. In this article, the author interviews Daniel Koretz, a professor of education at Harvard University and author of "Measuring Up: What Educational Testing Really Tells Us". Koretz shares his thoughts…
van der Linden, Wim J.; Luecht, Richard M.
Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
Edwards, Janine C.; Maldonado, Filomeno G. Jr.; Calvin, James A.
Analysis of admission records for 439 applicants to Texas A&M University's medical school compared applicants actually admitted under a formula that equally weighted academic and interview scores, with applicants who would have been admitted under two different weighting formulas. Results indicate that readjusting criteria weights may help meet…
Ezeala, Christian C.; Swami, Niraj S.; Lal, Nilesh; Hussain, Shagufta
Secondary education in Fiji ends with the Form 7 examination. Predictive validity for academic success of Form 7 scores which form the basis for admission into the Bachelor of Medicine Bachelor of Surgery programme of the Fiji School of Medicine was examined via a cohort of 129 students. Success rates for year 1 in 2008, 2009, and 2010 were 90.7…
The goal of this study was to determine how ACT Aptitude Ratings (AAR, a pre-admission criterion) of first-year students are associated with various aspects of their first-year experiences. AAR scores were only weakly correlated with students' first-semester grade point averages (GPA), as well as their second-semester grades and attendance rates…
Rizzo, Monica Ellen
Most American colleges and universities require standardized entrance exams when making admissions decisions. Scores on these exams help determine if, when and where students will be allowed to pursue higher education. These scores are also used to determine eligibility for merit based financial aid. This testing persists even though half of the…
Shanley, Brian J.
In this essay, Father Brian J. Shanley discusses Providence College's pilot program to eliminate standardized test scores from the required components of an admission application. Building on the college's ninety-year history of opening the doors of higher education to underrepresented populations, Providence College's test-optional policy is…
Pendlebury, Sarah T.; Lovett, Nicola; Smith, Sarah C.; Cornish, Emily; Mehta, Ziyah; Rothwell, Peter M.
Background: reliable delirium risk stratification will aid recognition, anticipation and prevention and will facilitate targeting of resources in clinical practice as well as identification of at-risk patients for research. Delirium risk scores have been derived for acute medicine, but none has been prospectively validated in external cohorts. We therefore aimed to determine the reliability of externally derived risk scores in a consecutive cohort of older acute medicine patients. Methods: consecutive patients aged ≥65 over two 8-week periods (2010, 2012) were screened prospectively for delirium using the Confusion Assessment Method (CAM), and delirium was diagnosed using the DSM IV criteria. The reliability of existing delirium risk scores derived in acute medicine cohorts and simplified for use in routine clinical practice (USA, n = 2; Spain, n = 1; Indonesia, n = 1) was determined by the area under the receiver operating characteristic curve (AUC). Delirium was defined as prevalent (on admission), incident (occurring during admission) and any (prevalent + incident) delirium. Results: among 308 consecutive patients aged ≥65 (mean age/SD = 81/8 years, 164 (54%) female), existing delirium risk scores had AUCs for delirium similar to those reported in their original internal validations ranging from 0.69 to 0.76 for any delirium and 0.73 to 0.83 for incident delirium. All scores performed better than chance but no one score was clearly superior. Conclusions: externally derived delirium risk scores performed well in our independent acute medicine population with reliability unaffected by simplification and might therefore facilitate targeting of multicomponent interventions in routine clinical practice. PMID:26764396
Edwards, Daniel; Coates, Hamish; Friedman, Tim
This paper explores how admissions tests are used in different higher education systems around the world. This is a relatively new area of research, despite the fact that admissions processes are a key component of university practices and given the ever increasing globalisation of higher education. This paper shows that aptitude and achievement…
We present the approach to graduate admissions developed by the Fisk-Vanderbilt Masters-to-PhD Bridge Program. The approach emphasizes a careful examination of applicants' basic academic preparedness together with noncognitive tracers of future success -- so-called "grit" or "performance character" -- and does not rely upon standardized exam scores such as GREs. This approach has enabled the Fisk-Vanderbilt program to identify and select large numbers of underrepresented minority students who are succeeding at the PhD level, making the program the nation's top producer of underrepresented minority PhDs in astronomy. We highlight outcomes of the program utilizing this "enlightened approach" to admissions, and share tools developed by the program for use by others.
Powell, Brian; Steelman, Lala Carr
Compares men's and women's performance on the Law School Admission Test (LSAT), and suggests that the math section may have penalized women, since they scored equally to men on other sections. Questions the validity of mathematics performance as a predictor of success in law school and as a lawyer. (GC)
Linn, Robert L.
Atkinson and Geiser (2009) make a strong argument for moving to a new form of college admissions testing using curriculum-based achievement tests. In making their case, however, they exaggerate the weaknesses of current tests such as the ACT and SAT by minimizing these tests' predictive utility and claiming a stronger relationship to socioeconomic…
Soares, Joseph A., Ed.
What can a college admissions officer safely predict about the future of a 17-year-old? Are the best and the brightest students the ones who can check off the most correct boxes on a multiple-choice exam? Or are there better ways of measuring ability and promise? In this penetrating and revealing look at high-stakes standardized admissions tests,…
Levin, Henry M.
Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…
Atkinson, Richard C.; Geiser, Saul
The College Boards started as achievement tests designed to measure students' mastery of college preparatory subjects. Admissions testing has significantly changed since then with the introduction of the Scholastic Aptitude Test, Lindquist's creation of the ACT, renewed interest in subject-specific assessments, and current efforts to adapt K-12…
Kolen, Michael J.
Estimation/smoothing methods that are flexible enough to fit a wide variety of test score distributions are reviewed: kernel method, strong true-score model-based method, and method that uses polynomial log-linear models. Applications of these methods include describing/comparing test score distributions, estimating norms, and estimating…
De Ball, Suzanne; Sullivan, Kathleen; Horine, Julie; Duncan, William K.; Replogle, William
Comapred University of Mississippi dental student scores on the Dental Admission Test (DAT) and Part I of the National Board Dental Examinations (NBDE) and found that DAT reading comprehension was a statistically significant predictor of all four subtests of the NBDE. Also found that DAT biology and organic chemistry scores were predictors of NBDE…
Cascallar, Alicia S.; Dorans, Neil J.
This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…
Cascallar, Alicia S.; Dorans, Neil J.
This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…
Sternberg, Robert J.
This article describes two projects based on Robert J. Sternberg's theory of successful intelligence and designed to provide theory-based testing for university admissions. The first, Rainbow Project, provided a supplementary test of analytical, practical, and creative skills to augment the SAT in predicting college performance. The Rainbow…
The purpose of this study was to validate the use of a test to assess dental school applicants' critical thinking abilities. The intent was to include this test on the Dental Admission Test (DAT) if it was shown to enhance the DAT's validity. Correlation and regression analyses of undergraduate and dental school performance with scores on each of the tests on the DAT battery and the California Critical Thinking Skills Test (CCTST) were performed. Data were collected from 439 third- and fourth-year dental students who consented to participate and were enrolled at one of the ten accredited dental schools included in the study. These ten dental schools were from most regions of the United States. This study concluded that including the CCTST on the DAT did not significantly enhance the DAT's validity. PMID:24706684
Trimble, Susan; Gay, Anne; Matthews, Jan
Advances in technology available to access test data coupled with the challenges of No Child Left Behind (NCLB) are pushing schools to grapple with the complexities of test score data. With the current frenzy to raise test scores, there is little attention being paid to teacher development in learning to use data to improve learning. For the past…
Pandiani, John A.; Simon, Monica M.; Banks, Steven M.
This paper reports on an ongoing effort of the Vermont Mental Health Performance Indicator Project (PIP) to examine the relevance and utility of standardized test scores for evaluating community mental health programs. This analysis is of test scores from Vermont's first four years of statewide testing. The study is examining anonymous…
Drymalski, Walter M; Nunley, Michael R
The high prevalence of co-occurring mental health and substance use disorders indicates a need for integrated behavioral health treatment that addresses both types of disorder simultaneously. One component of this integrated treatment is the use of an assessment process that can concurrently identify the presence of each class of disorder. The Addiction Severity Index (ASI) has been extensively used and researched in the field of substance use disorders for over 30 years. The ASI has seven sections, including a section on substance use disorders and a section on psychiatric symptoms, making it a potential candidate for a co-occurring screen during intake. The following study utilized a receiver operating characteristic curve analysis to determine an optimal cutoff score on the ASI psychiatric composite score to identify which individuals seeking substance use disorder treatment were admitted to the Milwaukee County Behavioral Health Division's psychiatric hospital in the year subsequent to their ASI administration. Of the 19,320 individuals who completed an initial ASI in our system, 343 had an inpatient admission. The receiver operating characteristic curve was statistically significant, with an area under the curve of 0.75. A cutoff of 0.27 had a sensitivity of 0.77 and a specificity of 0.61, such that over 60% (11,963/19,320) of the sample was excluded. These results suggest that the ASI psychiatric composite score may be a useful initial screen to identify those with potential mental health problems/needs in a behavioral health system attempting to integrate addiction and mental health services. PMID:27580192
Hunter, William J.
An essential function of the school guidance worker is the translation of test results into plain language and/or concrete recommendations. To do so requires a thorough understanding of the various test scores publishers provide. (Author)
Simmenroth-Nayda, Anne; Meskauskas, Erik; Burckhardt, Gerhard; Görlich, Yvonne
Medical schools in Germany may select 60% of the student applicants through their own admission tests. The influence of the school-leaving examination grades (EGs) in each of the procedural steps is controversial. At Goettingen Medical School, we combine a structured interview and a communicative skills assessment. We analysed how many applicants succeeded in our admission test, compared to a model which only takes EGs into account. Admission scores were transferred into SPSS-21. Sociodemographic data were submitted by the Stiftung Hochschulstart. Besides descriptive statistics, we used Pearson-correlation and means comparisons (t-test, analysis of variance). 221 applicants (EGs 1.0-1.9) were invited in the winter semester 2013/14 and 222 applicants (EGs 1.1-1.8) in the summer semester 2014. The proportion of women was 68% (winter) and 74% (summer). Sixteen and 37 applicants had a medical vocational training and performed slightly better. The analysis showed that our test was gender neutral. EGs did not correlate with interviews or skills assessment. Despite a two-fold impact of EGs, 26 (winter) and 44 (summer) of the overall 181 applicants had EGs of 1.4 -1.9, which would have been too low for admission otherwise. If EGs were only considered once, 40 (winter) and 59 (summer) applicants would have succeeded. PMID:25499117
Graduate Management Admission Council, 2012
The 2011-2012 testing year held a number of significant milestones for the Graduate Management Admission Test exam. A total of 286,529 GMAT exams were administered, with 831,337 score reports sent to more than 5,200 graduate-level management programs around the world--all record numbers. This record volume reflects the increase in graduate-level…
The paper investigates if the provision of financial incentives has an impact on the performance of students in educational tests. The analysis is based on data from an experiment with high school students who answered multiple-choice items from the Third International Mathematics and Science Study (TIMSS). As in TIMSS, the setup did not…
Kuntz, Jennifer L; Smith, David H; Petrik, Amanda F; Yang, Xiuhai; Thorp, Micah L; Barton, Tracy; Barton, Karen; Labreche, Matthew; Spindel, Steven J; Johnson, Eric S
Introduction: Increasing morbidity and health care costs related to Clostridium difficile infection (CDI) have heightened interest in methods to identify patients who would most benefit from interventions to mitigate the likelihood of CDI. Objective: To develop a risk score that can be calculated upon hospital admission and used by antimicrobial stewards, including pharmacists and clinicians, to identify patients at risk for CDI who would benefit from enhanced antibiotic review and patient education. Methods: We assembled a cohort of Kaiser Permanente Northwest patients with a hospital admission from July 1, 2005, through December 30, 2012, and identified CDI in the six months following hospital admission. Using Cox regression, we constructed a score to identify patients at high risk for CDI on the basis of preadmission characteristics. We calculated and plotted the observed six-month CDI risk for each decile of predicted risk. Results: We identified 721 CDIs following 54,186 hospital admissions—a 6-month incidence of 13.3 CDIs/1000 patient admissions. Patients with the highest predicted risk of CDI had an observed incidence of 53 CDIs/1000 patient admissions. The score differentiated between patients who do and do not develop CDI, with values for the extended C-statistic of 0.75. Predicted risk for CDI agreed closely with observed risk. Conclusion: Our risk score accurately predicted six-month risk for CDI using preadmission characteristics. Accurate predictions among the highest-risk patient subgroups allow for the identification of patients who could be targeted for and who would likely benefit from review of inpatient antibiotic use or enhanced educational efforts at the time of discharge planning. PMID:26845084
van der Linden, Wim J.
Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.
This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
Al-Hattami, Abdulghani Ali Dawod
High school grade point average and college entrance test scores are two admission criteria that are currently used by most colleges in Yemen to select their prospective students. Given their widespread use, it is important to investigate their predictive validity to ensure the accuracy of the admission decisions in these institutions. This study…
Sisson, Harriet E.; Dizney, Henry F.
A regression equation incorporating grade point average (GPA), calculus grade, and grades in two sections of the Pharmacy College Admission Test was significantly more useful than using GPA alone. A minimum sum of scores in three PCAT sections was most effective. Cross-validation with new samples was recommended. (Author/CP)
Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan
The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…
McIntosh, James; Munk, Martin D.
Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…
Hilborn, Robert C.
Physics has played an important role in the preparation of future physicians and other health professionals for more than 100 years. Almost all pre-health students take a year of college-level physics as part of their preparation for medical, dental, and pharmacy school. In particular, the widely-used Medical College Admission Test (MCAT) contains a significant number of questions that require physics knowledge and skills. This paper describes the changes in the MCAT to be implemented in 2015, the role of physics in the revised MCAT, and implications for introductory physics courses for the life sciences.
Keiser, Heidi N; Sackett, Paul R; Kuncel, Nathan R; Brothen, Thomas
Women typically obtain higher subsequent college GPAs than men with the same admissions test score. A common reaction is to attribute this to a flaw in the admissions test. We explore the possibility that this underprediction of women's performance reflects gender differences in conscientiousness and college course-taking patterns. In Study 1, we focus on using the ACT to predict performance in a single, large course where performance is decomposed into cognitive (exam and quiz scores) and less cognitive, discretionary components (discussion and extra credit points). The ACT does not underpredict female's cognitive performance, but it does underpredict female performance on the less cognitive, discretionary components of academic performance, because it fails to measure and account for the personality trait of conscientiousness. In Study 2, we create 2 course-difficulty indices (Course Challenge and Mean Aptitude in Course) and add them to an HLM regression model to see if they reduce the degree to which SAT scores underpredict female performance. Including Course Challenge does result in a modest reduction of the gender coefficient; however, including Mean Aptitude in Course does not. Thus, differences in course-taking patterns is a partial (albeit small) explanation for the common finding of differential prediction by gender. (PsycINFO Database Record PMID:26653526
Grace, Catherine O'Neill
Psychologist Robert J. Sternberg's conviction that American standardized testing does not accurately reflect a child's intelligence or potential is far from theoretical. As an elementary school student in the 1950s, he scored poorly on the ubiquitous IQ test of the time, freezing up when the school psychologist entered the room. Thankfully for…
Miller, Steven C.
The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which…
Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…
Bracey, Gerald W.
Examines correlation between national test scores in mathematics from the Third International Mathematics and Science Study (TIMSS) and the Current Competitiveness Index (CCI). Finds, for example, that while the United States ranks 29th in TIMSS mathematics, it ranks second in competitiveness on the CCI. Korea ranks 3rd in mathematics, but 27th in…
Smith, Vernon G.; Szymanski, Antonia
This article is for practicing or aspiring school administrators. The demand for excellence in public education has lead to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…
Jencks, Christopher, Ed.; Phillips, Meredith, Ed.
The 15 chapters of this book address issues related to the continuing test score gap between black and white students. The editors argue against traditional explanations which emphasize differences in economic resources and demographic factors, and they urge that more emphasis be put on psychological and cultural factors. The book suggests studies…
According to the 2004 National Assessment of Educational Progress, males who have made it through 12 years of school have significantly poorer reading skills than their female peers. In every age group, boys have been scoring lower than girls annually for more than three decades on U.S. Department of Education reading tests. The longer boys are in…
Farrell, Elizabeth F.; Hoover, Eric
At the annual conference of the National Association for College Admission Counseling (Nacac), admissions deans and high-school counselors gathered in September 2007 to grapple with questions such as: (1) Rethinking the role of standardized tests in admissions (many attendees predict that psychometric giants ACT and SAT, will not always dominate…
Mitchell, Karen; Lewis, Richard S; Satterfield, Jason; Hong, Barry A
This year's applicants to medical school took a newly revised version of the Medical College Admission Test. Unlike applicants in the past, they were asked to demonstrate their knowledge and use of concepts commonly taught in introductory psychology courses. The new Psychological, Social, and Biological Foundations of Behavior Test asked applicants to demonstrate the ways in which psychological, social, and biological factors influence perceptions and reactions to the world; behavior and behavior change; what people think about themselves and others; the cultural and social differences that influence well-being; and the relationships among social stratification, access to resources, and well-being. Building from the classic biopsychosocial model, this article provides the rationale for testing psychology concepts in application to medical school. It describes the concepts and skills that the new exam tests and shows how they lay the foundation for learning in medical school about the behavioral and sociocultural determinants of health. This article discusses the implications of these changes for undergraduate psychology faculty and psychology curricula as well as their importance to the profession of psychology at large. PMID:26866988
Cantrell, Catherine E.
The admissions decisions of a university are one of its four "essential freedoms," and the courts, as a general rule, defer to universities' judgments regarding academic decisions. In many cases, the courts have said that admissions standards cannot be high-handed, arbitrary, or formulated in bad faith, and they must fall within constitutionally…
The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
Spencer, Bruce D.
Because test scores are ordinal not cordinal attributes, the average test score often is a misleading way to summarize the scores of a group of individuals. Similarly, correlation coefficients may be misleading summary measures of association between test scores. Proper, readily interpretable, summary statistics are developed from a theory of…
Kane, Michael T.
To validate an interpretation or use of test scores is to evaluate the plausibility of the claims based on the scores. An argument-based approach to validation suggests that the claims based on the test scores be outlined as an argument that specifies the inferences and supporting assumptions needed to get from test responses to score-based…
Ferreira, Juan Pablo; Hamui, Magali; Paganini, Agustina; Torres, Fernando A; Ossorio, María Fabiana; Eiguchi, Kumiko; Ferrero, Fernando
In the city of Buenos Aires (CABA), pediatric residents enter the residency program after taking a unified admission test. After completion of the program and passing a final test, the Universidad de Buenos Aires (UBA) provides a professional certification. The objective of this study is to determine if the results obtained in the residency admission test (RAT) and those of the professional certification test (PCT) correlated. This is a cross-sectional study, that included all subjects who passed the pediatrics RAT in CABA in 2004-2009, and that attended the pediatric PCT of the UBA. The score for each subject in both tests was obtained and the corresponding correlation was calculated. Results were divided in quintiles, and the proportion of subjects who improved their position in the PCT with respect to the RAT was calculated. Data from 303 subjects was obtained. The RAT showed a median of 45.0 (over 60 maximum) (IC-range: 43.0-48.7), and the PCT showed a median of 6 points (over 10 max.)(IC-range: 6-8). A significative correlation between results in RAT and PCT was observed (r = 0.37, p < 0.001). Based on their position in the RAT, 43.8% of subjects improved their position in the PCT, without differences between residents attending pediatric and general hospitals (45.6 vs. 31.5%; p = 0.1). In the case of pediatric residents, results of the residency admission test correlate with those obtained in the professional certification test. PMID:27295703
Background Entry from secondary school to Australian and New Zealand undergraduate medical schools has since the late 1990’s increasingly relied on the Undergraduate Medicine and Health Sciences Admission Test (UMAT) as one of the selection factors. The UMAT consists of 3 sections – logical reasoning and problem solving (UMAT-1), understanding people (UMAT-2) and non-verbal reasoning (UMAT-3). One of the goals of using this test has been to enhance equity in the selection of students with the anticipation of an increase in the socioeconomic diversity in student cohorts. However there has been limited assessment as to whether UMAT performance itself might be influenced by socioeconomic background. Methods Between 2000 and 2012, 158,909 UMAT assessments were completed. From these, 118,085 cases have been identified where an Australian candidate was sitting for the first time during that period. Predictors of the total UMAT score, UMAT-1, UMAT-2 and UMAT-3 scores were entered into regression models and included gender, age, school type, language used at home, deciles for the Index of Relative Socioeconomic Advantage and Disadvantage score, the Accessibility/Remoteness Index of Australia (ARIA), self-identification as being of Aboriginal or Torres Strait Islander origin (ATSI) and current Australian state or territory of abode. Results A lower UMAT score was predicted by living in an area of relatively higher social disadvantage and lower social advantage. Other socioeconomic indicators were consistent with this observation with lower scores in those who self-identified as being of ATSI origin and higher scores evident in those from fee-paying independent school backgrounds compared to government schools. Lower scores were seen with increasing age, female gender and speaking any language other than English at home. Divergent effects of rurality were observed, with increased scores for UMAT-1 and UMAT-2, but decreasing UMAT-3 scores with increasing ARIA score
Niu, Sunny X.; Tienda, Marta
Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success—high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reduce the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe. PMID:23788828
Achieve, Inc., 2007
Achieve launched this study to help inform the decisions states are making about high school assessments by providing greater insights into the world of college admissions and placement testing. Achieve analyzed more than 2,000 questions from college admissions and placement exams to determine how these tests compare to one another and how well…
National Association for College Admission Counseling, 2008
Standardized admission tests have become an increasingly important factor in undergraduate admission. At the same time, reforms in elementary and secondary education at both the state and federal level have elevated the importance of standardized tests as a tool to measure educational outcomes. Nevertheless, long-standing concerns with…
Background Internationally, tests of general mental ability are used in the selection of medical students. Examples include the Medical College Admission Test, Undergraduate Medicine and Health Sciences Admission Test and the UK Clinical Aptitude Test. The most widely used measure of their efficacy is predictive validity. A new tool, the Health Professions Admission Test- Ireland (HPAT-Ireland), was introduced in 2009. Traditionally, selection to Irish undergraduate medical schools relied on academic achievement. Since 2009, Irish and EU applicants are selected on a combination of their secondary school academic record (measured predominately by the Leaving Certificate Examination) and HPAT-Ireland score. This is the first study to report on the predictive validity of the HPAT-Ireland for early undergraduate assessments of communication and clinical skills. Method Students enrolled at two Irish medical schools in 2009 were followed up for two years. Data collected were gender, HPAT-Ireland total and subsection scores; Leaving Certificate Examination plus HPAT-Ireland combined score, Year 1 Objective Structured Clinical Examination (OSCE) scores (Total score, communication and clinical subtest scores), Year 1 Multiple Choice Questions and Year 2 OSCE and subset scores. We report descriptive statistics, Pearson correlation coefficients and Multiple linear regression models. Results Data were available for 312 students. In Year 1 none of the selection criteria were significantly related to student OSCE performance. The Leaving Certificate Examination and Leaving Certificate plus HPAT-Ireland combined scores correlated with MCQ marks. In Year 2 a series of significant correlations emerged between the HPAT-Ireland and subsections thereof with OSCE Communication Z-scores; OSCE Clinical Z-scores; and Total OSCE Z-scores. However on multiple regression only the relationship between Total OSCE Score and the Total HPAT-Ireland score remained significant; albeit the
Thomas, Michael C.; Draugalis, JoLaine R.
Examined significance of PCAT scores, prepharmacy GPA, math and science GPA, and attainment of a previous college degree in predicting first-year pharmacy GPA. Asserts that the best predictive model incorporated all variables, while the most parsimonious model included math and science GPA and the PCAT chemistry subscore. The PCAT was a…
Bills, James L.; VanHouten, Jacob; Grundy, Michelle M.; Chalkley, Roger; Dermody, Terence S.
The Medical College Admission Test (MCAT) is a quantitative metric used by MD and MD-PhD programs to evaluate applicants for admission. This study assessed the validity of the MCAT in predicting training performance measures and career outcomes for MD-PhD students at a single institution. The study population consisted of 153 graduates of the…
Sachar, Jane; Suppes, Patrick
The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)
This paper studies a natural experiment due to an unusual change in the college admission policy at a Chinese university, which brought a large number of low-score students into several academic departments in the university. Exploiting large variations in peer characteristics and strong interactions among peer groups, the analysis finds that…
Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas
The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…
Camilli, Gregory; Wang, Ming-mei; Fesq, Jacqueline
Assessed multidimensionality for six forms of the Law School Admission Test (LSAT) through factor analysis. Determined that the LSAT appears to measure two different reasoning abilities: inductive and deductive. Describes the techniques used to assess dimensionality. (SLD)
Mowsesian, Richard; Hays, William L.
The purpose of this study was to test the predictive efficiency of the Graduate Record Examination Analytical (GRE-A) score when used in combination with other predictors for doctoral candidacy decisions in an educational psychology department. It was hypothesized that the GRE-A would be a more efficient predictor than faculty ratings on a…
Koljatic, Mladen; Silva, Monica
The article describes a test-blind admission initiative in a Chilean research university aimed at expanding the inclusion of talented, albeit educationally and socially disadvantaged, students. The outcomes of the test-blind admission cohort were compared with those of students admitted via the regular admission procedure to the same academic…
This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…
Chevalier, Shirley A.
In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…
Biswas, Ajoy Kumar
This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
Wise, Vicki L.; Wise, Steven L.; Bhola, Dennison S.
Accountability for educational quality is a priority at all levels of education. Low-stakes testing is one way to measure the quality of education that students receive and make inferences about what students know and can do. Aggregate test scores from low-stakes testing programs are suspect, however, to the degree that these scores are influenced…
Sachar, Jane; Suppes, Patrick
It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the ll0-item Stanford Mental…
Wang, Xiang-Bo; Harris, Vincent; Roussos, Louis
Multidimensionality is known to affect the accuracy of item parameter and ability estimations, which subsequently influences the computation of item characteristic curves (ICCs) and true scores. By judiciously combining sections of a Law School Admission Test (LSAT), 11 sections of varying degrees of uni- and multidimensional structures are used…
Koys, Daniel J.
In this study, the author examined the validity of the Graduate Management Admissions Test (GMAT) for non-U.S. students (N = 75) through a predictive validation procedure in which applicants were given the predictor test but the test results were not used to admit students. The author's business school admitted students to three overseas MBA…
Carlson, Alfred B.
A factor analysis of the Law School Admission Test (LSAT) battery was undertaken to gain a better understanding of the specific abilities which contribute to performance on the tests. To determine whether greater amounts of testing time should be allocated to groups of items defined by the factor analysis, a validity study was conducted. Existing…
Grulick, Lawrence E.
A study was conducted at Florence-Darlington Technical College (FDTC) in South Carolina to assess the college's admissions testing program through a review of the current literature on placement testing and a statistical study comparing currently employed instruments, i.e., the Career Planning Program (CPP) test and the Scholastic Aptitude Test…
The context in which higher education admissions tests are used, both intended, validated uses and misuses, are examined. Possible legal problems of misuse are discussed in light of a recent court decision, Sharif v. New York State Education Department. Focus is on three tests: the Scholastic Assessment Test, Graduate Record Examinations, and the…
Center on Education Policy, 2010
This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…
Quereshi, M. Y.; Veeser, William R.
Investigates the influence of various scoring cutoffs on mental test performance as measured by the Michell General Ability Test (MGAT) and develops a rationale for selecting the optimum cutoff based on raw scores, internal consistency, stability, parallel-form reliability and concurrent validity estimates. (MB)
Brown, Steven M.; Walberg, Herbert J.
To examine the effect of motivational manipulated conditions on students' mathematics scores, elementary students received either ordinary standardized test instructions or special instructions (do as well as possible for themselves, parents, and teachers). Those given special instructions scored significantly higher in the test, implying that…
Integrating mathematics with family and consumer sciences (FCS) has enabled youth to pass the Minnesota 8th Grade Math Basic Skills test. The test focuses on the eight content areas: (1) problem solving with whole numbers and fractions; (2) problem solving with percentage/ratio; (3) number sense; (4) estimation; 5) measurement; (6) tables and…
Blai, Boris, Jr.
Reading test results and their interpretation are stressed because of their importance in student achievement. The Nelson-Denny Reading Test used at Harcum Junior College is a useful measuring instrument for predicting academic achievement, screening students, and diagnosing reading and learning problems. General hints for interpretation of the…
Purpose: The purpose of this paper is to evaluate the evidence on whether grade inflation has led to an increasing emphasis on standardized test scores as a criterion for law school admissions. Design/methodology/approach: Fit probabilistic models to admissions data for American law schools during the mid to late 1990s, a period during which…
Atkinson, Richard C.; Geiser, Saul
Standardized testing for college admissions has grown exponentially since the first administration of the old "College Boards" in 1901. This paper surveys major developments since then: the introduction of the "Scholastic Aptitude Test" in 1926, designed to tap students' general analytic ability; E.F. Lindquist's creation of the ACT in 1959 as a…
Wilson, Kenneth M.; Powers, Donald E.
This study was undertaken to clarify the internal structure of the Law School Admission Test (LSAT) and shed light on the ability or abilities measured by the three item types that make up the test (logical reasoning, analytical reasoning, and reading comprehension). The study used data for two forms of the LSAT for general samples of LSAT…
As a public school English teacher, the author observes standardized testing season each year with a sort of grim fascination. "So this is it," she thinks as she paces around her silent classroom, peering over kids' shoulders at articles about parasailing. Line graphs tracking the rainfall in Tulsa. Parts of speech. Functions of "x." "These are…
Turnipseed, Stephan; Darling-Hammond, Linda
The number one quality business leaders look for in employees is creativity and yet the U.S. education system undermines the development of the higher-order skills that promote creativity by its dogged focus on multiple-choice tests. Stephan Turnipseed and Linda DarlingHammond discuss the kind of rich accountability system that will help students…
College Entrance Examination Board, 2002
As an admissions test, the SAT II: Subject Tests in foreign languages allow students to demonstrate academic competence in the selected language area. As a placement tool, the SAT II: Subject Tests in foreign languages serve the same function as the SAT II: Subject Tests do in other academic areas such as world history, chemistry, or math. SAT II:…
Harrington, Phillip J.
An historical overview is presented of criticisms arising during an early period of large-scale standardized testing in the United States: the post World War II period when the Educational Testing Service was established. Not only is a brief description of the controversy provided, but the fact that many problems of assessment are not transient…
Tanner, John R.
State test scores administered for accountability purposes are regularly used to adjust instruction in nuanced ways. This is no accident--No Child Left Behind demanded that students' scores be returned quickly to teachers in order that this might be the case, and the idea of data-driven decision making continues as one way the promise of education…
Spencer, Harry E.
Explores the comparative performance of various segments of the student sample in general chemistry courses relative to their scores on the mathematical SAT test. Results indicate that mathematical skill measured by the SAT scores is an important factor in determining grades while factors that are not important in determining grades are gender…
Lyman, Howard B.
The first edition of this book was written to give information about testing to people whose work gave them access to test results, but whose training included little or nothing about the use and interpretation of tests. Later editions have been intended for a broader audience as the need for understanding what test scores really mean has…
Walpole, MaryBeth; McDonough, Patricia M.; Bauer, Constance J.; Gibson, Carolyn; Kanyi, Kamau T.; Toliver, Rita
This qualitative study focused on African American and Latino high school students perceptions of standardized admission tests, including the Scholastic Assessment Tests (I and II) and the ACT Assessment. Students enrolled in college preparatory classes were interviewed about these tests individually and in focus groups in fall 1998 in their…
Standardized Tests Used for Higher Education Admission and Placement in California during 1988. The Fourth in a Series of Annual Reports Published in Accordance with Senate Bill 1758 (Chapter 1505, Statutes of 1984). Report 89-16.
California State Postsecondary Education Commission, Sacramento.
California law requires sponsors of standardized tests used for admission or placement purposes in higher education to report certain data annually. Data on each of the 10 batteries of standardized tests affected by the law in 1987-88 are reported, and mean test scores are displayed for students nationally and in California if those averages are…
Haberman, Shelby J; Yao, Lili; Sinharay, Sandip
In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE(®) General Analytical Writing and until 2009 in the case of TOEFL(®) iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater(®). In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. PMID:25773314
Santelices, Maria Veronica; Wilson, Mark
This paper investigates the predictive validity of the Revised SAT (R-SAT) score, proposed by Freedle (2003) as an alternative to compensate minority students for the potential harm caused by the relationship between item difficulty and ethnic DIF observed in the SAT. The R-SAT score is the score minority students would have received if only the…
Meara, Kevin; Sireci, Stephen G.
To provide a better understanding of the structure of the Medical College Admission Test (MCAT) and to determine if there are structural differences across selected groups of MCAT examinees, several dimensionality analyses were conducted on data from recent administrations of the MCAT. The first set of analyses focused on the global structure of…
Li, Weichang; Mitchell, Karen J.
A substantially revised Medical College Admission Test (MCAT) was introduced in spring 1991. The new examination is designed to assess critical thinking skills, basic concepts and problem solving facility in science, and writing skills. This paper reports preliminary findings on the factor structure of the revised MCAT, which consists of four…
Rendon, Laura I.; Novack, Vincent; Dowell, David
This policy analysis article examines how California State University-Long Beach, an institution where upward of 22,000 student applied for roughly 3,400 freshman slots and where the transfer class had to be reduced because of mandatory enrollment reductions, tested race-neutral admissions models in accordance with Proposition 209, which prohibits…
Wamala, Robert; Buyinza, Mukadasi
Although the Graduate Management Admission Test (GMAT) is considered by leading business institutions worldwide as a predictor of success in graduate programs, an issue of contention is whether the introduction of the examination enhances the quality of education outcomes. This study sought to obtain an understanding of this issue, focusing on…
Perez Mejias, Paulina
In Chile, reports and research papers have shown that there is an achievement gap in college admissions tests mostly associated to students' gender, socioeconomic status and type of school attended. This gap represents a barrier for low-income and female students to access higher education, as well as for graduates of public schools. Prior…
Caroline, Jan D.; And Others
The results of a predictive validity study of the new Medical College Admission Test (MCAT) using criteria from the clinical years of undergraduate medical education are discussed. The criteria included course grades and faculty ratings of clerks in internal medicine, surgery, obstetrics and gynecology, pediatrics, and psychiatry. (Author/MLW)
Liu, Mei; Holland, Paul W.
The simplified version of the Dorans and Holland (2000) measure of population invariance, the root mean square difference (RMSD), is used to explore the degree of dependence of linking functions on the Law School Admission Test (LSAT) subpopulations defined by examinees' gender, ethnic background, geographic region, law school application status,…
Fischer, Franziska T.; Schult, Johannes; Hell, Benedikt
This is the first meta-analysis that investigates the differential prediction of undergraduate and graduate college admission tests for women and men. Findings on 130 independent samples representing 493,048 students are summarized. The underprediction of women's academic performance (d = 0.14) and the overprediction of men's academic performance…
Gorney, Barbara; Maury, Marcia
The purpose of this study was to determine if a relationship exists between scores and the times that medical students choose to take a computer-administered test. The results indicate that students who choose to take a test later within a given time period tend to perform less well than students who take the test earlier. Although the magnitude…
Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy
This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…
Zolandz, A. M.
The differences in average scores on standardized assessments between gender and minority subgroups of examinees is a continuing concern, both for Educational Testing Service (ETS) and the organizations that sponsor the assessments developed at ETS. The test development and analysis procedures used by ETS will be described, with an emphasis on those related to the fairness of the assessments for all subgroups and the understanding of the diffferences in performance among the groups. Relevant information for the SAT II and GRE Physics tests will be presented and discussed.
Towards a Diversified Legal Profession: An Inquiry into the Law School Admission Test, Grade Inflation, and Current Admissions Policies. [with] A Statement from the National Institute of Education "An Investigation into the Validity and Cultural Bias of the Law School Admission Test."
White, David M., Ed.
This is the final report and critique which investigated the law school admissions process, and especially the role of the Law School Admission Test (LSAT) within that process, for possible bias against minority applicants. The study involved the reanalysis of existing data. Results show that current admission policies unfairly limit the…
In this essay, Rebecca Zwick confronts the controversy surrounding the use of standardized tests in college admissions. She examines the degree to which the SAT and its lesser known cousin, the ACT, limit access to college, particularly for racial and ethnic minorities, and considers two alternative admissions policies that do not involve tests:…
Center on Education Policy, 2010
This paper profiles Maryland's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased at grades 4 and 8 in both reading and math. Average annual gains were larger on the state test than…
Center on Education Policy, 2010
This paper profiles Pennsylvania's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in…
The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…
A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…
Wise, Steven L.
Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…
Brown, Sarah Lee
The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…
Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H
Objective Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% for patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. Methods In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years. (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Results Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs−10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77–0.79 before and 0.82–0.86 after adjustment for significant covariates versus 0.64–0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Conclusions Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise. PMID:26835142
Kan, G; Visser, C A; Koolen, J J; Dunning, A J
A score of left ventricular segmental wall motion was used as a convenient rapid way to assess overall left ventricular function in acute myocardial infarction. Its success in risk stratification at admission was assessed by a blind review of cross sectional echocardiographic tape recordings from multiple acoustic windows. Sixty nine (20%) of the 345 patients died during hospital stay or within a one year follow up. The mean (SD) wall motion score in those who died was significantly higher than in those who survived (16.2 (5.9) vs 5.7 (3.9)). There were no differences between the group that died in hospital within three months of discharge and the group that died between three months and one year after discharge. Among the 31 patients who died in hospital, however, wall motion score was highest in 15 patients dying of cardiogenic shock (19.2 (4.2)). In 16 patients with lethal ruptures it was 13.5 (6.1). The nine patients with free wall ruptures had higher wall motion scores than those with ventricular septal rupture or papillary muscle rupture (15.7 (6.9) vs 8.5 (5.3)). Eight (3.3%) of 245 patients with a score less than 10 died, compared with 61 (61%) of 100 scoring greater than or equal to 10. The sensitivity of a score of greater than or equal to 10 in predicting death within one year was 88%, the specificity was 86%, the positive predictive value was 61%, and the negative predictive value was 97%. PMID:3790378
The prevalence of childhood overweight and obesity increased dramatically in the United States during the past three decades. This increase has adverse public health implications, but its implication for children's academic outcomes is less clear. This paper uses data from five waves of the Early Childhood Longitudinal Study-Kindergarten to examine how children's weight is related to their scores on standardized tests and to their teachers' assessments of their academic ability. The results indicate that children's weight is more negatively related to teacher assessments of their academic performance than to test scores. PMID:24014932
Walpole, Marybeth; McDonough, Patricia M.; Bauer, Constance J.; Gibson, Carolyn; Kanyi, Kamau; Toliver, Rita
This qualitative study explored the perceptions of, knowledge regarding, and preparation for standardized college admissions exams of 227 urban African American and Latino high school students. Findings include the students' lack of information about the test and their reliance on their relatively uninformed and unavailable school officials for…
Center on Education Policy, 2010
This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…
Center on Education Policy, 2010
This paper profiles Texas' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in reading at grades 4 and 8 and in math at grade 8. In grade 4 math, however, the percentage scoring…
Petrilli, Michael J.; Wright, Brandon L.
At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…
The justification for national standards is that test scores predict a nation's future economic success. There is no evidence that supports this assumption. There is evidence that it is wrong. For more than half a century, reformers have been trying to fix our schools with little success. The obvious conclusion is that something that can't be…
Klein, Stephen P.; Hamilton, Laura S.; McCaffrey, Daniel F.; Stecher, Brian M.
Compared results on the Texas Assessment of Academic Skills to Texas (TAAS) score changes on the National Assessment of Educational Progress (NAEP). Texas fourth graders did improve significantly more on the NAEP mathematics test than their counterparts nationally, but this gain was smaller than their TAAS gains, and a similar gain was not seen…
Fahle, Erin; Reardon, Sean
Describing the variation in test scores between and within school districts is critical for: (1) for policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) for methodological work planning future experiments or interventions. Intraclass…
To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools across California for…
Kposowa, Augustine J.; Valdez, Amanda D.
Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…
Homeschooling, one of the fastest growing educational alternatives, is enjoying increasing respect from educators and parents alike. This is partly because homeschooling children score as well and often better on standardized tests than their publicly schooled counterparts. However, the vast majority of homeschooled students come from the…
Grissom, Jason A.; Kalogrides, Demetra; Loeb, Susanna
Expansion of the use of student test score data to measure teacher performance has fueled recent policy interest in using those data to measure the effects of school administrators as well. However, little research has considered the capacity of student performance data to uncover principal effects. Filling this gap, this article identifies…
Bender, Robert C.
Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…
Report on Education Research, 1983
THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write Rebecca…
Brennan, Robert L.
Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas
This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
Holland, Paul W.; Thayer, Dorothy T.
Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…
Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane
Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…
Rangvid, Beatrice Schindler
We combine data from three studies for Denmark in the PISA 2000 framework to investigate differences in the native-immigrant test score gap by country of origin. In addition to the controls available from PISA data sources, we use student-level data on home background and individual migration histories linked from administrative registers. We find…
Bishop, N. Scott
This study examined the effects of different test administration conditions on reading comprehension test scores. Evidence of performance differences across district testing conditions might imply that the meanings and interpretations associated with the corresponding test scores have limited generalizability (i.e., knowing how well one reads…
Marder, M.; Bansal, D.
We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades. PMID:19805049
Zain, Zakiyah; Aziz, Nazrina; Ahmad, Yuhaniz
In clinical trials, the main purpose is often to compare efficacy between experimental and control treatments. Treatment comparisons often involve multiple endpoints, and this situation further complicates the analysis of survival data. In the case of tumor patients, endpoints concerning survival times include: times from tumor removal until the first, the second and the third tumor recurrences, and time to death. For each patient, these endpoints are correlated, and the estimation of the correlation between two score statistics is fundamental in derivation of overall treatment advantage. In this paper, the bivariate survival analysis method using the global score test methodology is extended to multivariate setting.
Liu, Liqun; Neilson, William S.
In this paper college admissions are based on test scores and students can exert two types of effort: real learning and exam preparation. The former improves skills but the latter is more effective in raising test scores. In this setting the students with the lowest skills are no longer the ones with the lowest aptitude, but instead are the ones…
A new option that allows students to choose which of their test scores to send to colleges has generated renewed criticism of the College Board. College Board officials tout the option, called Score Choice, as a way to ease test taker anxiety. Some prominent admissions officials have publicly described Score Choice as a sales tactic that will…
Validating Standardized Testing: The Role of the SAT and ACT in Undergraduate Admissions. [Hearing before the California Legislature, Senate Select Committee on Higher Education Admissions and Outreach, Sacramento, California, February 5, 1998].
California State Legislature, Sacramento. Senate Select Committee on Higher Education Admissions and Outreach.
This report contains testimony and materials presented at the California Senate Select Committee on Higher Education Admissions and Outreach hearing, February 1998, held to examine the role of the Scholastic Assessment Test (SAT) and the American College Test (ACT) in undergraduate admissions to the public universities of California. In her…
Spafford, Marlee M.
A five-year evaluation of the admissions decision process at the University of Waterloo (Ontario) School of Optometry found that when primary tools (i.e., university grades, Optometry Admission Test scores) did not differentiate candidates, there was an increased emphasis on secondary tools (i.e., interview, autobiographic sketch, prerequisite…
Martin, John D.; And Others
The degree of relationship between scores on the Barron Ego Strength Scale and the scores on the Bender-Gestalt Test was investigated on a sample of college students. Correlations were moderate to low. Racial differences were observed on the Bender-Gestalt Test. (Author/JKS)
Warne, Russell T.; Larsen, Ross; Anderson, Braydon; Odasso, Alyce J.
The Advanced Placement (AP) program is an educational program that permits high school students to take introductory college-level courses and receive college credit by passing a standardized end-of-course exam. Data were obtained from a statewide database of 2 high school graduating cohorts (N = 90,044). We used a series of propensity score…
Nelson, Paul A.; Monson, Terry D.
The average score of economics majors on the Graduate Management Admission Test (GMAT) exceeds those of nearly all humanities and arts, social sciences, and business undergraduate majors but not those of most science, engineering, and mathematics majors. (Contains 1 table.)
This study examined test-optional admission policies and their effect on the composition of the student body at King's College. According to the College Board (2009), students from underrepresented populations do not perform as well as White students on standardized tests. Therefore, higher education institutions have established test-optional…
Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun
Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions.Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments.This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = -1.6, 95% confidence interval [CI] = -2.8∼-0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = -1.1, 95% CI = -2.1∼-0.1, P = 0.04). We did not find an association between hyperhomocysteinemia and other
Hsu, Wen-Chuin; Chu, Yi-Chuan; Fung, Hon-Chung; Wai, Yau-Yau; Wang, Jiun-Jie; Lee, Jiann-Der; Chen, Yi-Chun
Abstract Mounting evidence shows that hyperhomocysteinemia is a risk factor for cognitive decline. This study enrolled subjects with normal serum levels of B12 and folate and performed thorough neuropsychological assessments to illuminate the independent role of homocysteine on cognitive functions. Participants between ages 50 and 85 were enrolled with Modified Hachinski ischemic score of <4, adequate visual and auditory acuity to allow neuropsychological testing, and good general health. Subjects with cognitive impairment resulting from secondary causes were excluded. Each of the participants completed evaluations of general intellectual function, including the Mini-Mental State Examination, Cognitive Abilities Screening Instrument, Clinical Dementia Rating, and a battery of neuropsychological assessments. This study enrolled 225 subjects (90 subjects younger than 65 years and 135 subjects aged 65 years or older). The sex proportion was similar between the 2 age groups. Years of education were significantly fewer in the elderly (7.49 ± 5.40 years) than in the young (9.76 ± 4.39 years, P = 0.001). There was no significant difference in body mass index or levels of vitamin B12 and folate between the 2 age groups. Homocysteine levels were significantly higher in the elderly group compared to the younger group (10.8 ± 2.7 vs. 9.5 ± 2.5 μmol/L, respectively, P = 0.0006). After adjusting for age, sex, and education, only the Digit Symbol Substitution (DSS) score was significantly lower in subjects with hyperhomocysteinemia (homocysteine >12 μmol/L) than those with homocysteine ≤12 μmol/L in the elderly group (DSS score: 7.1 ± 2.7 and 9.0 ± 3.0, respectively, beta = −1.6, 95% confidence interval [CI] = −2.8∼−0.5, P = 0.001) and borderline significance was noted in the combined age group (beta = −1.1, 95% CI = −2.1∼−0.1, P = 0.04). We did not find an association between
Gaddis, S Michael; Lauen, Douglas Lee
Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. PMID:24468431
Walter, Richard Barry
This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…
This study sought to determine whether the use of standardized test scores contributes any useful information regarding First Time in College (FTIC) students' probable success at USF, using more detailed analysis of underrepresented minorities and women, who Micceri (2009) shows, experience substantial negative bias relative to males and whites on…
Martin, John D.; And Others
The relationship between Elizur's Hostility Scoring on the Rorschach Test and the Acting-Out Score on the Hand Test was examined. Correlations between the two measures (using several scoring procedures) ranged from .40 to .64. (JKS)
Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.
Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Bracey, Gerald W.
Three former secretaries of education--William Bennett, Lauro Cavazos, and Terrel Bell--have touted state-level SAT scores as proof that educational financing does not matter. Recently, Brian Powell and Lala Carr Steelman adjusted scores for participation rate and detected a very strong relationship between expenditures and SAT scores. Bigger…
Cascallar, Alicia S.; Dorans, Neil J.
Score linkages between the Verbal and Math sections of the SAT® I: Reasoning Test and the corresponding sections of the new version of a Spanish-language admissions test, the Prueba de Aptitud Academica® (PAA™) were investigated. A bilingual group design was employed. A language proficiency measure (ESLAT) was used to define the bilingual group…
Donnelly, Michael B.; And Others
A statistical comparison of Medical College Admission Test scores with National Board of Medical Examiners (NBME) Part I scores of one school's students supported simple averaging of subtest scores rather than weighting as a predictor of NBME Part I performance. Similar results were obtained using basic science course examination scores. (MSE)
Zimmerman, Donald W.
Results of this study indicate that the correlation between half-test scores over repeated splits, over persons, and over repeated testings resulting in different sets of observed scores, is given by Kuder-Richardson Formula 21. (RF)
Sokhi, Dilraj Singh; Oxenham, Chantal; Coates, Rebecca; Forbes, Mhairi; Gupta, Nadi K.; Blackburn, Daniel J.
Background UK National Guidelines (UKNG) advise HIV testing in clinically indicated neurological presentations. We audited the impact of our practical strategies to increase uptake of HIV testing at a regional acute neurology admissions unit. Methods We audited HIV testing in 4 periods over 2 years: before we designed a UKNG-based “HIV testing in Neurology” protocol (“pre-protocol”); after dissemination of the protocol alone (“post-protocol”); post-protocol dissemination combined with both a tailored departmental admissions clerking proforma to prompt for HIV testing & consenting, and regular focussed tutorials to doctors on HIV testing in neurological patients (“post-proforma”); and finally one year after the post-proforma period (“+1 year”). We also looked at the total number of HIV tests sent from the unit during the two-year period. We assessed significance using Fisher’s exact test. Results 47.8% of all acute neurology non-stroke admissions were eligible for HIV testing during all the audit periods. Testing rates were as follows: pre-protocol 21.9%; post-protocol 36.6%; post-proforma 83.3%; and at +1 year 65.4% (p<0.05 for both post-protocol and +1 year when compared to pre-protocol). Documentation of consent for HIV testing improved from 25% to 67.6% with the HIV-tailored clerking proforma. The total number of HIV tests requested from the unit doubled in the post-proforma period compared to pre-protocol (p<0.05). Conclusion In conclusion: the combination of an HIV testing protocol, a tailored departmental clerking proforma and regular focussed teaching to doctors on indications for HIV testing led to a sustained increase in HIV testing uptake in our regional acute neurology admissions unit. PMID:26335351
Young, John W.
This research report is a review and analysis of all of the published studies during the past 25+ years (since 1974) in the area of differential validity/prediction and college admission testing. More specifically, this report includes 49 separate studies of differences in validity and/or prediction for different racial/ethnic groups and/or for…
Zenisky, April L.; Hambleton, Ronald K.
Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…
Hansen, Karsten; Heckman, James J.; Mullen, Kathleen J.
This study developed two methods for estimating the effect of schooling on achievement test scores that control for the endogeneity of schooling by postulating that both schooling and test scores are generated by a common unobserved latent ability. The methods were applied to data on schooling and test scores. Estimates from the two methods are in…
Khan, Junaid Sarfraz; Tabasum, Saima; Mukhtar, Osama; Iqbal, Maryam
Assessment is an indispensable part of an educational program. Multiple Choice Questions (MCQs) is an objective tool of assessment provided cheating is controlled. A method employed to reduce the chance of cheating is to scramble the sequence of the MCQs and responses in multiple papers having the same content. It is assumed that the performance of students is mainly dependent on the difficulty of the items and not the order in which they are placed within the instrument. The marks obtained by 1,02,211 candidates sitting in Medical Colleges Admission Test (MCAT) from 2008 to 2011 and given similar-content but scrambled-sequence question paper codes were analyzed using parametric tests. A significant difference amongst the mean marks of candidates in the different codes of MCAT 2008 (F = 22.15, p < 0.001) and MCAT 2011 (F = 3.85, p = 0.009) was identified. No significant difference was found in the mean marks of the candidates' each year for different codes in each centre. PMID:24305000
This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Rich, John D., Jr.; Fullard, William; Overton, Willis
One Hundred and Twelve Latino students from Philadelphia participated in this study, which examined the development of deductive reasoning across adolescence, and the relation of reasoning to test anxiety and standardized test scores. As predicted, 11th and ninth graders demonstrated significantly more advanced reasoning than seventh graders.…
This handbook provides guidelines for teaching test-taking skills to students of all grade levels to help the students raise their standardized test scores. Topics covered include: understanding instructions and following directions, efficient use of time, intelligent guessing, and application of special strategies for multiple-choice and…
Molenaar, Dylan; Borsboom, Denny
Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…
Severo, Milton; Gaio, A. Rita; Povo, Ana; Silva-Pereira, Fernanda; Ferreira, Maria Amélia
In theory the formula scoring methods increase the reliability of multiple-choice tests in comparison with number-right scoring. This study aimed to evaluate the impact of the formula scoring method in clinical anatomy multiple-choice examinations, and to compare it with that from the number-right scoring method, hoping to achieve an…
Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike
Assessment of adolescents' dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents' adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen's Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14-15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312
Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike
Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance to recommendations was seen for choosing water as beverage and limit the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health promoting work targeting adolescents should emphasize to increase the intake of recommended foods to approach nutritional guidelines. PMID:27483312
Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.
A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no recommendation to the test team. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary
Rudner, Lawrence M.
GMAT (Graduate Management Admission Test) scaled scores convey the same level of ability over time, and GMAT percentiles convey the competitiveness of scores relative to today's GMAT test takers. In an earlier column, the author discussed the role of the GMAT scaled scores and percentiles. Here, he gets more technical and discusses how GMAT scaled…
Matton, Nadine; Vautier, Stephane; Raufaste, Eric
Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…
Pope, Gregory A.; Wentzel, Carolyn; Braden, Brigitta; Anderson, Jordan
The purpose of this study was to investigate statistical relationships between gender and Alberta Achievement Testing Program scores. Achievement test scores from grades 3, 6, and 9 in all subject areas were investigated during a four-year period. Results showed statistically significant positive correlations between gender and scores in most…
Thompson, Simon M; Salmon, Lucy J; Webb, Justin M; Pinczewski, Leo A; Roe, Justin P
Consecutive patients undergoing knee arthroplasty completed questionnaires: FJS, Knee Injury and Osteoarthritis Outcome Score (KOOS) and WOMAC Score (mean 39 months after surgery), and were mailed a repeat questionnaire after 4 to 6 weeks. The test-retest reliability was almost perfect for the FJS (ICC = 0.97), and the FJS subdomains (ICC > 0.8). Convergent construct validity of the FJS was correlated with the KOOS Subscores of Quality of Life (0.63, P = 0.001), Symptom (0.33, P = 0.001), Pain (0.68, P = 0.001) and ADL (0.66, P = 0.001) and the Total WOMAC (0.70, P = 0.001). The FJS demonstrates high test-retest reliability and construct validity compared to the Normalised WOMAC and KOOS Subscales. The FJS does not demonstrate the ceiling effect of the WOMAC or KOOS pain scores so may have greater discriminatory ability following TKR. PMID:26027525
Bingold, Tobias M.; Lefering, Rolf; Zacharowski, Kai; Meybohm, Patrick; Waydhas, Christian; Rosenberger, Peter; Scheller, Bertram
Introduction Organ dysfunction or failure after the first days of ICU treatment and subsequent mortality with respect to the type of intensive care unit (ICU) admission is poorly elucidated. Therefore we analyzed the association of ICU mortality and admission for medical (M), scheduled surgery (ScS) or unscheduled surgery (US) patients mirrored by the occurrence of organ dysfunction/failure (OD/OF) after the first 72h of ICU stay. Methods For this retrospective cohort study (23,795 patients; DIVI registry; German Interdisciplinary Association for Intensive Care Medicine (DIVI)) organ dysfunction or failure were derived from the Sequential Organ Failure Assessment (SOFA) score (excluding the Glasgow Coma Scale). SOFA scores were collected on admission to ICU and 72h later. For patients with a length of stay of at least five days, a multivariate analysis was performed for individual OD/OF on day three. Results M patients had the lowest prevalence of cardiovascular failure (M 31%; ScS 35%; US 38%), and the highest prevalence of respiratory (M 24%; ScS 13%; US 17%) and renal failure (M 10%; ScS 6%; US 7%). Risk of death was highest for M- and ScS-patients in those with respiratory failure (OR; M 2.4; ScS 2.4; US 1.4) and for surgical patients with renal failure (OR; M 1.7; ScS 2.7; US 2.4). Conclusion The dynamic evolution of OD/OF within 72h after ICU admission and mortality differed between patients depending on their types of admission. This has to be considered to exclude a systematic bias during multi-center trials. PMID:26241475
Spencer, Harry E.
The relationships between mathematical SAT scores (SAT-M) and grades earned by students in eight consecutive years of first- and second-semester general chemistry courses at Oberlin College are reported. The academic years surveyed are 1987-1988 through 1994-1995. SAT-M scores are grouped within seven ranges from 450 and less to 710-800. Within any range of scores, students in both courses earned a wide variety of grades, but those within the higher ranges tended to earn higher grades and fewer failures relative to students in the lower ranges. For all students within each range of SAT-M scores, the fraction earning each grade are calculated. These fractions along with the numbers of students and their SAT-M scores in a subset are used to calculate grades expected for that subset. In the first-semester course, the expected and actual grades for subsets of males, females, first-year students, non-first-year students, Asians, Blacks, and Latinos are not significantly different. Those who eventually majored in chemistry or biochemistry attained grades very significantly higher than expected. Most students tended to achieve grades in the second-semester course that were similar to those earned in the first-semester course.
In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…
Viliūnas, V; Lukauskiene, R; Svegzda, A; Zukauskas, A
The scoring artefact in the Farnsworth-Munsell 100-Hue test, arising from the grouping of the caps into four boxes, was investigated. The traditional method of scoring performed with the numbers of the anchor caps disregarded and the alternative scoring performed with the numbers of the anchor caps employed, were compared. For the traditional method of scoring, we revealed an increase of the error score of the outside (end-box) caps when the total error score was above 240. On the contrary for scoring performed with the numbers of the anchor caps employed, the difference between the error score of the outside caps and the average error per cap is not significant. To mitigate the end-box artefact and to improve the reliability of the Farnsworth-Munsell 100-Hue test, corrections to the traditional method of scoring are proposed. PMID:17040422
Blackburn, McKinley L.
Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…
... 21 Food and Drugs 8 2011-04-01 2011-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...
... 21 Food and Drugs 8 2012-04-01 2012-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...
... 21 Food and Drugs 8 2013-04-01 2013-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...
... 21 Food and Drugs 8 2014-04-01 2014-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum...
Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R.
Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…
van der Linden, Wim J.
A constrained computerized adaptive testing (CAT) algorithm is presented that automatically equates the number-correct scores on adaptive tests. The algorithm can be used to equate number-correct scores across different administrations of the same adaptive test as well as to an external reference test. The constraints are derived from a set of…
Research documents that transient students who change schools frequently oftensuffer from low academic achievement. This article investigates standardized group measures by disentangling elementary achievement scores. Located in a highly transient area outside of Philadelphia, Pennsylvania, Main Street School had their fifth grade Pennsylvania…
Gentry, Ruben; Stokes, Dorothy
Many African Americans were imbued with the cliché that they must work twice as hard as others to be a success in life. Entering college, students with this belief put extensive effort into earning top grades to ensure quality preparation for their chosen career; yet, some fail to earn top scores. Why? This is the million dollar question, but the…
Cope, Ronald T.; Kolen, Michael J.
This study compared five density estimation techniques applied to samples from a population of 272,244 examinees' ACT English Usage and Mathematics Usage raw scores. Unsmoothed frequencies, kernel method, negative hypergeometric, four-parameter beta compound binomial, and Cureton-Tukey methods were applied to 500 replications of random samples of…
Carraway, Cassandra T.
A study was conducted to determine whether participation in a test-taking strategy seminar significantly decreased test anxiety in first-year nursing students. The study also sought to compare nursing test scores of first-year nursing students who participated in the seminar with those who did not. The sample consisted of 30 first-year nursing…
Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M
Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms. PMID:11827093
This paper presents the selective student recruitment and the admissions process implemented at the Department of Library and Information Science of Lund University, Sweden (BIVIL). The background is the profound changes in the library profession that have taken place during the past 10-15 years, with the development of information technology and…
Munson, James W.; Bourne, David W. A.
Correlations were made between grade point averages calculated after one year and PCAT scores and/or prepharmacy grade point averages. Findings are presented and a model is given for using PCAT scores as a predictor of academic success. (LBH)
Argues that economics education provides many benefits to students, including preparation for law school. Examines the ranking of economics majors on the Law School Admission Test (LSAT). Finds that among the 14 majors having more than 2,000 students take the LSAT, economics students received the highest average score. (DSK)
Troll, Lillian E.; And Others
After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…
Cross, Lawrence H.; And Others
A new scoring procedure for multiple choice tests attempts to assess partial knowledge and to restrict guessing. It is a variant of Coombs' elimination scoring method, adapted for use with the carbon-shield answer sheets commonly used with answer-until-correct scoring. Examinees are directed to erase the carbon shields of choices they are certain…
This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…
Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…
Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary
Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…
Olson, Alan C.
Institutions of higher education attempt to select and prepare the finest K-12 teachers possible. National, state, and local influences also search for the best and brightest teacher candidates to become K-12 teachers. The result has been increased accountability measures to ensure quality. The selection process for admission to teacher education…
Tsao, Ting Man
In 1999, the City University of New York (CUNY) abolished remedial programs in its four-year colleges and began to rely on standardized test scores as criteria both for exiting remediation and for admission to bachelor's programs. By doing that, the university has in effect eradicated its three-decade-old "open admissions" policy, argue CUNY…
Gavett, Brandon E
The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed. PMID:25784058
Carroll, John B.
The problem of determining relative weights for quantity and quality in scoring foreign language speaking and writing fluency tests is studied. French speaking and writing fluency tests were administered to students of French in several schools in England. Data from these tests was analyzed to support the suggestion that scoring formulas should…
Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica
Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…
Ho, Andrew D.
State test score trends are widely interpreted as indicators of educational improvement. To validate these interpretations, state test score trends are often compared to trends on other tests such as the National Assessment of Educational Progress (NAEP). These comparisons raise serious technical and substantive concerns. Technically, the most…
The relationship between the National League for Nursing (NLN) achievement test scores and performance on the State Board Test Pool Examination (SBTPE) was studied with 166 graduates of a diploma degree school of nursing between 1976 and 1978. It was found that NLN achievement test scores had a highly significant correlation with SBTPE results.…
Klesch, Heather S.
The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Das, Jishnu; Dercon, Stefan; Habyarimana, James; Krishnan, Pramila; Muralidharan, Karthik; Sundararaman, Venkatesh
Empirical studies of the relationship between school inputs and test scores typically do not account for the fact that households will respond to changes in school inputs. We present a dynamic household optimization model relating test scores to school and household inputs, and test its predictions in two very different low-income country…
Kamalakannan, Banupriya; Thulasingam, Mahalakshmy; Sampath, Srinivasan
M/IgG MAC ELISA was predominantly positive in convalescent phase sera. There were six deaths (2.3%) and the common causes of poor outcome were multiorgan failure, encephalopathy and refractory shock. Conclusion NS1 Ag assay is a useful early diagnostic marker for dengue fever but cannot be used as an early predictor of severe dengue infection. The criteria for admission in hospitals of cases of dengue fever should be based on clinical warning signs rather than positive NS1 Antigen test. PMID:27190911
Basco, William T., Jr.; Lancaster, Carol J.; Gilbert, Gregory E.; Carey, Maura E.; Blue, Amy V.
Background and purpose: Data supporting the predictive validity of the medical school admission interview are mixed. This study tested the hypothesis that the admission interview is predictive of interpersonal interactions between medical students and standardized patients. Method: We determined correlations between admission interview scores and…
Delaware State Dept. of Education, Dover. Assessment and Accountability Branch.
This guide is intended to help parents understand the Delaware Student Testing Program (DSTP) and the reports it generates. The DSTP tests are administered to provide an accurate measure of how well students are doing relative to Delaware's rigorous content standards. DSTP tests are administered in reading, writing, mathematics, science, and…
Marron, Joseph E.
Studies conducted by the United States Military Academy and Air Force Academy showed that many cadets who had attended preparatory schools which prepared high school graduates for college admission performed below what had been predicted by their scores on college entrance examinations. Based on these findings, about 1,200 students from 10…
Zou, Xiao-Ling; Chen, Yan-Min
The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…
Ramos, Erica; Alfonso, Vincent C.; Schermerhorn, Susan M.
The interpretation of cognitive test scores often leads to decisions concerning the diagnosis, educational placement, and types of interventions used for children. Therefore, it is important that practitioners administer and score cognitive tests without error. This study assesses the frequency and types of examiner errors that occur during the…