identification test scores: Topics by Science.gov

Sample records for identification test scores

Algorithm improvement program nuclide identification algorithm scoring criteria and scoring application.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Enghauser, Michael

2016-02-01

The goal of the Domestic Nuclear Detection Office (DNDO) Algorithm Improvement Program (AIP) is to facilitate gamma-radiation detector nuclide identification algorithm development, improvement, and validation. Accordingly, scoring criteria have been developed to objectively assess the performance of nuclide identification algorithms. In addition, a Microsoft Excel spreadsheet application for automated nuclide identification scoring has been developed. This report provides an overview of the equations, nuclide weighting factors, nuclide equivalencies, and configuration weighting factors used by the application for scoring nuclide identification algorithm performance. Furthermore, this report presents a general overview of the nuclide identification algorithm scoring application including illustrative examples.
Algorithm Improvement Program Nuclide Identification Algorithm Scoring Criteria And Scoring Application - DNDO.

DOE Office of Scientific and Technical Information (OSTI.GOV)

Enghauser, Michael

2015-02-01

The goal of the Domestic Nuclear Detection Office (DNDO) Algorithm Improvement Program (AIP) is to facilitate gamma-radiation detector nuclide identification algorithm development, improvement, and validation. Accordingly, scoring criteria have been developed to objectively assess the performance of nuclide identification algorithms. In addition, a Microsoft Excel spreadsheet application for automated nuclide identification scoring has been developed. This report provides an overview of the equations, nuclide weighting factors, nuclide equivalencies, and configuration weighting factors used by the application for scoring nuclide identification algorithm performance. Furthermore, this report presents a general overview of the nuclide identification algorithm scoring application including illustrative examples.
21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

Code of Federal Regulations, 2011 CFR

2011-04-01

... 21 Food and Drugs 8 2011-04-01 2011-04-01 false Ovarian adnexal mass assessment score test system... immunological Test Systems § 866.6050 Ovarian adnexal mass assessment score test system. (a) Identification. An ovarian/adnexal mass assessment test system is a device that measures one or more proteins in serum or...
Olfactory Impairment in Chronic Rhinosinusitis Using Threshold, Discrimination, and Identification Scores

PubMed Central

Kohli, Preeti; Storck, Kristina A.; Schlosser, Rodney J.

2016-01-01

Differences in testing modalities and cut-points used to define olfactory dysfunction contribute to the wide variability in estimates of the prevalence of olfactory dysfunction in chronic rhinosinusitis (CRS). The aim of this study is to report the prevalence of olfactory impairment using each component of the Sniffin’ Sticks test (threshold, discrimination, identification, and total score) with age-adjusted and ideal cut-points from normative populations. Patients meeting diagnostic criteria for CRS were enrolled from rhinology clinics at a tertiary academic center. Olfaction was assessed using the Sniffin’ Sticks test. The study population consisted of 110 patients. The prevalence of normosmia, hyposmia, and anosmia using total Sniffin’ Sticks score was 41.8%, 20.0%, and 38.2% using age-appropriate cut-points and 20.9%, 40.9%, and 38.2% using ideal cut-points. Olfactory impairment estimates for each dimension mirrored these findings, with threshold yielding the highest values. Threshold, discrimination, and identification were also found to be significantly correlated with each other (P < 0.001). In addition, computed tomography scores, asthma, allergy, and diabetes were found to be associated with olfactory dysfunction. In conclusion, the prevalence of olfactory dysfunction depends on the olfactory dimension tested and on whether age-adjusted cut-points are used. The method of olfactory testing should be chosen based upon specific clinical and research goals. PMID:27469973
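
As a quick arithmetic check on the prevalence figures in this record, the reported percentages can be converted back to whole-patient counts for the 110-patient cohort. The sketch below assumes each percentage is a rounded share of all 110 patients; the function name is illustrative and does not come from the study.

```python
def counts_from_percentages(percentages, n_total):
    """Convert rounded percentages back to whole-patient counts."""
    return [round(p / 100 * n_total) for p in percentages]


N_PATIENTS = 110  # study population reported in the abstract

# Normosmia, hyposmia, anosmia prevalence (%) as reported in the abstract.
age_adjusted = counts_from_percentages([41.8, 20.0, 38.2], N_PATIENTS)
ideal = counts_from_percentages([20.9, 40.9, 38.2], N_PATIENTS)

print("age-appropriate cut-points:", age_adjusted, "total", sum(age_adjusted))
print("ideal cut-points:", ideal, "total", sum(ideal))
```

Under that assumption, both sets of counts sum to 110, and the anosmia count is the same under either set of cut-points; only the split between normosmia and hyposmia moves.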
The relationship between selected standardized test scores and performance in advanced placement math and science exams: Analyzing the differential effectiveness of scores for course identification and placement

NASA Astrophysics Data System (ADS)

Urbina, Josue N.

There is a national need to increase the STEM-related workforce. Factors leading towards STEM careers include the number of advanced high school mathematics and science courses students complete. Florida's enrollment patterns in STEM-related Advanced Placement (AP) courses, however, reveal that only a small percentage of students enroll in these classes. Therefore, screening tools are needed to find more students who are academically ready for these courses yet have not been identified. The purpose of this study was to investigate the extent to which scores from a national standardized test, the Preliminary Scholastic Assessment Test/National Merit Scholarship Qualifying Test (PSAT/NMSQT), in conjunction with and compared to a state-mandated standardized test, the Florida Comprehensive Assessment Test (FCAT), are related to selected AP exam performance in Seminole County Public Schools. An ex post facto correlational study was conducted using 6,189 student records from the 2010-2012 academic years. Multiple regression analyses using simultaneous Full Model testing showed differential moderate to strong relationships between scores in eight of the nine AP courses (i.e., Biology, Environmental Science, Chemistry, Physics B, Physics C Electrical, Physics C Mechanical, Statistics, Calculus AB and BC) examined. For example, the significant unique contribution to overall variance in AP scores was a linear combination of PSAT Math (M), Critical Reading (CR) and FCAT Reading (R) for Biology and Environmental Science. Moderate relationships for Chemistry included a linear combination of PSAT M, W (Writing) and FCAT M; a combination of FCAT M and PSAT M was most significantly associated with Calculus AB performance. These findings have implications for both research and practice. FCAT scores, in conjunction with PSAT scores, can potentially be used for specific STEM-related AP courses, as part of a systematic approach towards AP course identification and placement. For courses with
Reliability Generalization of the Alcohol Use Disorder Identification Test.

ERIC Educational Resources Information Center

Shields, Alan L.; Caruso, John C.

2002-01-01

Evaluated the reliability of scores from the Alcohol Use Disorders Identification Test (AUDIT; J. Saunders and others, 1993) in a reliability generalization study based on 17 empirical journal articles. Results show AUDIT scores to be generally reliable for basic assessment. (SLD)
Incorporating sequence information into the scoring function: a hidden Markov model for improved peptide identification.

PubMed

Khatun, Jainab; Hamlett, Eric; Giddings, Morgan C

2008-03-01

The identification of peptides by tandem mass spectrometry (MS/MS) is a central method of proteomics research, but due to the complexity of MS/MS data and the large databases searched, the accuracy of peptide identification algorithms remains limited. To improve the accuracy of identification we applied a machine-learning approach using a hidden Markov model (HMM) to capture the complex and often subtle links between a peptide sequence and its MS/MS spectrum. Our model, HMM_Score, represents ion types as HMM states and calculates the maximum joint probability for a peptide/spectrum pair using emission probabilities from three factors: the amino acids adjacent to each fragmentation site, the mass dependence of ion types and the intensity dependence of ion types. The Viterbi algorithm is used to calculate the most probable assignment between ion types in a spectrum and a peptide sequence, then a correction factor is added to account for the propensity of the model to favor longer peptides. An expectation value is calculated based on the model score to assess the significance of each peptide/spectrum match. We trained and tested HMM_Score on three data sets generated by two different mass spectrometer types. For a reference data set recently reported in the literature and validated using seven identification algorithms, HMM_Score produced 43% more positive identification results at a 1% false positive rate than the best of two other commonly used algorithms, Mascot and X!Tandem. HMM_Score is a highly accurate platform for peptide identification that works well for a variety of mass spectrometer and biological sample types. The program is freely available on ProteomeCommons via an OpenSource license. See http://bioinfo.unc.edu/downloads/ for the download link.
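
For readers unfamiliar with the Viterbi step mentioned in this record, the following is a minimal textbook sketch of Viterbi decoding for a discrete HMM, written in log space. It is not the HMM_Score implementation: the two states ("b", "y"), the intensity bins ("low", "mid", "high"), and all probabilities are hypothetical toy values chosen only to make the example runnable.

```python
import math


def viterbi(observations, states, log_start, log_trans, log_emit):
    """Return (best_log_prob, best_path) for a discrete HMM, in log space."""
    # best[s] = log-probability of the most probable path ending in state s
    best = {s: log_start[s] + log_emit[s][observations[0]] for s in states}
    back = []  # one dict of best predecessors per transition step
    for obs in observations[1:]:
        prev, best, pointers = best, {}, {}
        for s in states:
            p = max(states, key=lambda r: prev[r] + log_trans[r][s])
            best[s] = prev[p] + log_trans[p][s] + log_emit[s][obs]
            pointers[s] = p
        back.append(pointers)
    # Trace back from the best final state to recover the full path.
    last = max(states, key=lambda s: best[s])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return best[last], path


def to_log(table):
    """Convert a (possibly nested) dict of probabilities to log-probabilities."""
    return {k: to_log(v) if isinstance(v, dict) else math.log(v)
            for k, v in table.items()}


# Hypothetical toy model: states stand in for ion types, symbols for intensity bins.
states = ["b", "y"]
log_start = to_log({"b": 0.6, "y": 0.4})
log_trans = to_log({"b": {"b": 0.7, "y": 0.3}, "y": {"b": 0.4, "y": 0.6}})
log_emit = to_log({"b": {"low": 0.5, "mid": 0.4, "high": 0.1},
                   "y": {"low": 0.1, "mid": 0.3, "high": 0.6}})

log_prob, path = viterbi(["low", "mid", "high"], states,
                         log_start, log_trans, log_emit)
print(path, math.exp(log_prob))
```

Working in log space avoids numerical underflow on long observation sequences, which matters when spectra contain many peaks.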
[Full Sibling Identification by IBS Scoring Method and Establishment of the Query Table of Its Critical Value].

PubMed

Li, R; Li, C T; Zhao, S M; Li, H X; Li, L; Wu, R G; Zhang, C C; Sun, H Y

2017-04-01

To establish a query table of IBS critical values and identification power for detection systems with different numbers of STR loci under different false judgment standards. Samples of 267 pairs of full siblings and 360 pairs of unrelated individuals were collected, and 19 autosomal STR loci were genotyped with the Goldeneye™ 20A system. Full siblings were determined using the IBS scoring method according to the 'Regulation for biological full sibling testing'. The critical values and identification power for detection systems with different numbers of STR loci under different false judgment standards were calculated by theoretical methods. According to the formal IBS scoring criteria, the identification power for full siblings and unrelated individuals was 0.7640 and the rate of false judgment was 0. The results of the theoretical calculation were consistent with those of sample observation. The query table of IBS critical value for identification of full sibling detection systems with different numbers of STR loci was successfully established. The IBS scoring method defined by the regulation has high detection efficiency and a low false judgment rate, and it provides a relatively conservative result. The query table of IBS critical value for identification of full sibling detection systems with different numbers of STR loci provides important reference data for judging the results of full sibling testing and has considerable practical value. Copyright© by the Editorial Department of Journal of Forensic Medicine
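
The abstract does not spell out how an IBS score is counted. A minimal sketch, assuming the conventional definition (at each locus, the number of alleles two people share identical-by-state is 0, 1, or 2, and the total score is the sum over loci), might look like the following. The function names and the example genotypes are hypothetical, and no critical value is shown because the abstract does not report one.

```python
from collections import Counter


def ibs_at_locus(genotype_a, genotype_b):
    """Number of alleles (0, 1, or 2) shared identical-by-state at one locus."""
    shared = Counter(genotype_a) & Counter(genotype_b)  # multiset intersection
    return sum(shared.values())


def ibs_score(profile_a, profile_b):
    """Total IBS score summed over the loci typed in both profiles."""
    common_loci = profile_a.keys() & profile_b.keys()
    return sum(ibs_at_locus(profile_a[locus], profile_b[locus])
               for locus in common_loci)


# Hypothetical three-locus profiles: locus name -> pair of allele repeat numbers.
person_a = {"D3S1358": (15, 16), "vWA": (17, 17), "FGA": (21, 24)}
person_b = {"D3S1358": (15, 16), "vWA": (17, 18), "FGA": (22, 23)}

print(ibs_score(person_a, person_b))  # 2 + 1 + 0 shared alleles
```

The multiset intersection handles homozygous genotypes correctly: (17, 17) against (17, 18) shares one allele, not two.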
A between-subjects test of the lower-identification/ higher-priming paradox.

PubMed

Rubino, I Alex; Rociola, Giuseppe; Di Lorenzo, Giorgio; Magni, Valentina; Ribolsi, Michele; Mancini, Valentina; Saya, Anna; Pezzarossa, Bianca; Siracusano, Alberto; Suslow, Thomas

2013-01-01

An under-recognised U-shaped model states that unconscious and conscious perceptual effects are functionally exclusive and that unconscious perceptual effects manifest themselves only at the objective detection threshold, when conscious perception is completely absent. We tested the U-shaped line model with a between-subjects paradigm. Angry, happy, neutral faces, or blank slides were flashed for 5.5 ms and 19.5 ms before Chinese ideographs in a darkened room. A group of volunteers (n = 84) were asked to rate how much they liked each ideograph and performed an identification task. Two subgroups were formed according to the median identification score: one with identification scores of 50% or below (n = 31), and one with identification scores above 50% (n = 53). The hypothesised U-shaped line was confirmed by the findings. Affective priming was found only at the two extreme points: the 5.5 ms condition of the low-identification group (subliminal perception) and the 19.5 ms condition of the high-identification (> 50%) group (supraliminal perception). The two intermediate points (19.5 ms of the low-identification group and 5.5 ms of the high-identification group) did not correspond to significant priming effects. These results confirm that a complete absence of conscious perception is the condition for the deployment of unconscious perceptual effects.
Do Test Scores Buy Happiness?

ERIC Educational Resources Information Center

McCluskey, Neal

2017-01-01

Since at least the enactment of No Child Left Behind in 2002, standardized test scores have served as the primary measures of public school effectiveness. Yet, such scores fail to measure the ultimate goal of education: maximizing happiness. This exploratory analysis assesses nation level associations between test scores and happiness, controlling…
Development of an International Odor Identification Test for Children: The Universal Sniff Test.

PubMed

Schriever, Valentin A; Agosin, Eduardo; Altundag, Aytug; Avni, Hadas; Cao Van, Helene; Cornejo, Carlos; de Los Santos, Gonzalo; Fishman, Gad; Fragola, Claudio; Guarneros, Marco; Gupta, Neelima; Hudson, Robyn; Kamel, Reda; Knaapila, Antti; Konstantinidis, Iordanis; Landis, Basile N; Larsson, Maria; Lundström, Johan N; Macchi, Alberto; Mariño-Sánchez, Franklin; Martinec Nováková, Lenka; Mori, Eri; Mullol, Joaquim; Nord, Marie; Parma, Valentina; Philpott, Carl; Propst, Evan J; Rawan, Ahmed; Sandell, Mari; Sorokowska, Agnieszka; Sorokowski, Piotr; Sparing-Paschke, Lisa-Marie; Stetzler, Carolin; Valder, Claudia; Vodicka, Jan; Hummel, Thomas

2018-07-01

To assess olfactory function in children and to create and validate an odor identification test to diagnose olfactory dysfunction in children, which we called the Universal Sniff (U-Sniff) test. This is a multicenter study involving 19 countries. The U-Sniff test was developed in 3 phases including 1760 children aged 5-7 years. Phase 1: identification of potentially recognizable odors; phase 2: selection of odorants for the odor identification test; and phase 3: evaluation of the test and acquisition of normative data. Test-retest reliability was evaluated in a subgroup of children (n = 27), and the test was validated using children with congenital anosmia (n = 14). Twelve odors were familiar to children and, therefore, included in the U-Sniff test. Children scored a mean ± SD of 9.88 ± 1.80 points out of 12. Normative data were obtained and reported for each country. The U-Sniff test demonstrated a high test-retest reliability (r₂₇ = 0.83, P < .001) and enabled discrimination between normosmia and children with congenital anosmia with a sensitivity of 100% and specificity of 86%. The U-Sniff is a valid and reliable method of testing olfaction in children and can be used internationally. Copyright © 2018 Elsevier Inc. All rights reserved.
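
The sensitivity and specificity figures in this record follow from simple ratios over a confusion table. The sketch below shows those ratios; the count of 14 anosmic children comes from the abstract, while the control-group counts (86 of 100) are purely hypothetical placeholders, since the abstract does not report how many normosmic children were used for the specificity estimate.

```python
def sensitivity(true_pos, false_neg):
    """Share of truly affected cases that the test flags."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of truly unaffected cases that the test clears."""
    return true_neg / (true_neg + false_pos)


# 100% sensitivity with n = 14 means all 14 anosmic children were flagged.
print(sensitivity(true_pos=14, false_neg=0))

# Hypothetical control counts, chosen only to reproduce an 86% specificity.
print(specificity(true_neg=86, false_pos=14))
```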
Predicting occupational personality test scores.

PubMed

Furnham, A; Drakeley, R

2000-01-01

The relationship between students' actual test scores and their self-estimated scores on the Hogan Personality Inventory (HPI; R. Hogan & J. Hogan, 1992), an omnibus personality questionnaire, was examined. Despite being given descriptive statistics and explanations of each of the dimensions measured, the students tended to overestimate their scores; yet all correlations between actual and estimated scores were positive and significant. Correlations between self-estimates and actual test scores were highest for sociability, ambition, and adjustment (r = .62 to r = .67). The results are discussed in terms of employers' use and abuse of personality assessment for job recruitment.
Exploring a Source of Uneven Score Equity across the Test Score Range

ERIC Educational Resources Information Center

Huggins-Manley, Anne Corinne; Qiu, Yuxi; Penfield, Randall D.

2018-01-01

Score equity assessment (SEA) refers to an examination of population invariance of equating across two or more subpopulations of test examinees. Previous SEA studies have shown that score equity may be present for examinees scoring at particular test score ranges but absent for examinees scoring at other score ranges. No studies to date have…
Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?

ERIC Educational Resources Information Center

Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.

2011-01-01

This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…
A Cross-Cultural Adaptation of the Sniffin' Sticks Olfactory Identification Test for US children.

PubMed

Cavazzana, Annachiara; Wesarg, Christiane; Schriever, Valentin A; Hummel, Thomas; Lundström, Johan N; Parma, Valentina

2017-02-01

Disorders associated with smell loss are common in adolescents. However, current odor identification tests focus on children aged 6 and older, and no cross-cultural test has to date been validated and fully implemented. Here, we aimed to investigate how 3-to-11-year-old US children performed on an adapted and shortened (11 odors instead of 14) version of a European odor identification test, the Sniffin' Kids (Schriever VA, Mori E, Petters W, Boerner C, Smitka M, Hummel T. 2014. The "Sniffin'Kids" test: a 14-item odor identification test for children. PLoS One. 9:e101086.). Results confirmed that cued odor identification performance increases with age and revealed little to no differences between girls and boys. Scores below 3 and below 6 may raise hyposmia concerns in US children aged 3-7 years and 8-10 years, respectively. Even though the completion rate of the task reached 88%, suggesting that children below age 5 were able to finish the test, their performance was relatively poor. In comparing the overall identification performance of US children with that of German children, for whom the test was specifically developed, significant differences emerged, with higher scores obtained by the German sample. Analysis of errors indicated that a lack of semantic knowledge for the olfactory-presented objects may be at the root of poor identification skills in US children and therefore constitutes a problem in the development of an odor identification test for younger children valid across cultures. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
How Accurate Is a Test Score?

ERIC Educational Resources Information Center

Doppelt, Jerome E.

1956-01-01

The standard error of measurement as a means for estimating the margin of error that should be allowed for in test scores is discussed. The true score measures the performance that is characteristic of the person tested; the variations, plus and minus, around the true score describe a characteristic of the test. When the standard deviation is used…
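
This abstract is cut off before it gives a formula. In classical test theory the standard error of measurement is commonly computed as SD × sqrt(1 − reliability); the sketch below uses that textbook form, which is an assumption about what the truncated sentence goes on to say. The function names and the example values (SD of 10, reliability of 0.91, observed score of 100) are hypothetical.

```python
import math


def standard_error_of_measurement(sd, reliability):
    """Classical SEM: score standard deviation scaled by sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed, sd, reliability, n_sem=1.0):
    """Interval of +/- n_sem standard errors around an observed score."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - n_sem * sem, observed + n_sem * sem


# Hypothetical test: SD of 10 points and a reliability coefficient of 0.91.
sem = standard_error_of_measurement(sd=10.0, reliability=0.91)
low, high = score_band(observed=100.0, sd=10.0, reliability=0.91)
print(round(sem, 2), round(low, 2), round(high, 2))
```

A more reliable test gives a smaller SEM and therefore a narrower band around any single observed score.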
Context-Sensitive Markov Models for Peptide Scoring and Identification from Tandem Mass Spectrometry

PubMed Central

Grover, Himanshu; Wallstrom, Garrick; Wu, Christine C.

2013-01-01

Peptide and protein identification via tandem mass spectrometry (MS/MS) lies at the heart of proteomic characterization of biological samples. Several algorithms are able to search, score, and assign peptides to large MS/MS datasets. Most popular methods, however, underutilize the intensity information available in the tandem mass spectrum due to the complex nature of the peptide fragmentation process, thus contributing to loss of potential identifications. We present a novel probabilistic scoring algorithm called Context-Sensitive Peptide Identification (CSPI) based on highly flexible Input-Output Hidden Markov Models (IO-HMM) that capture the influence of peptide physicochemical properties on their observed MS/MS spectra. We use several local and global properties of peptides and their fragment ions from the literature. Comparison with two popular algorithms, Crux (re-implementation of SEQUEST) and X!Tandem, on multiple datasets of varying complexity, shows that peptide identification scores from our models are able to achieve greater discrimination between true and false peptides, identifying up to ∼25% more peptides at a False Discovery Rate (FDR) of 1%. We evaluated two alternative normalization schemes for fragment ion intensities, one global and rank-based, the other local and window-based. Our results indicate the importance of appropriate normalization methods for learning superior models. Further, combining our scores with Crux using a state-of-the-art procedure, Percolator, we demonstrate the utility of using scoring features from intensity-based models, identifying ∼4-8% additional identifications over Percolator at 1% FDR. IO-HMMs offer a scalable and flexible framework with several modeling choices to learn complex patterns embedded in MS/MS data. PMID:23289783
What Do Test Scores Really Mean? A Latent Class Analysis of Danish Test Score Performance

ERIC Educational Resources Information Center

McIntosh, James; Munk, Martin D.

2014-01-01

Latent class Poisson count models are used to analyse a sample of Danish test score results from a cohort of individuals born in 1954-1955, tested in 1968, and followed until 2011. The procedure takes account of unobservable effects as well as excessive zeros in the data. We show that the test scores measure manifest or measured ability as it has…
Evaluation of the alcohol use disorders identification test and the drug use disorders identification test among patients at a Norwegian psychiatric emergency ward.

PubMed

Gundersen, Oystein Hoel; Mordal, Jon; Berman, Anne H; Bramness, Jørgen G

2013-01-01

High rates of substance use disorders (SUD) among psychiatric patients are well documented. This study explores the usefulness of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) in identifying SUD in emergency psychiatric patients. Of 287 patients admitted consecutively, 256 participants (89%) were included, and 61-64% completed the questionnaires and the Mini-International Neuropsychiatric Interview (MINI), used as the reference standard. Both AUDIT and DUDIT were valid (area under the curve above 0.92) and reliable (Cronbach's alpha above 0.89) in psychotic and nonpsychotic men and women. The suitable cutoff scores for AUDIT were higher among the psychotic than nonpsychotic patients, with 12 versus 10 in men and 8 versus 5 in women. The suitable cutoff scores for DUDIT were 1 in both psychotic and nonpsychotic women, and 5 versus 1 in psychotic and nonpsychotic men, respectively. This study shows that AUDIT and DUDIT may provide precise information about emergency psychiatric patients' problematic alcohol and drug use. Copyright © 2013 S. Karger AG, Basel.
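
The group-specific cutoffs reported in this record can be captured in a small lookup table. The sketch below encodes the cutoff values exactly as stated in the abstract; the function name and the rule that a score at or above the cutoff counts as a positive screen are assumptions, since the abstract does not say whether the comparison is inclusive.

```python
# Cutoffs from the abstract, keyed by (sex, psychotic).
AUDIT_CUTOFFS = {
    ("men", True): 12, ("men", False): 10,
    ("women", True): 8, ("women", False): 5,
}
DUDIT_CUTOFFS = {
    ("men", True): 5, ("men", False): 1,
    ("women", True): 1, ("women", False): 1,
}


def screens_positive(score, sex, psychotic, cutoffs):
    """True if the score reaches the group-specific cutoff (assumed inclusive)."""
    return score >= cutoffs[(sex, psychotic)]


# The same AUDIT score of 11 screens differently in the two male subgroups.
print(screens_positive(11, "men", psychotic=True, cutoffs=AUDIT_CUTOFFS))
print(screens_positive(11, "men", psychotic=False, cutoffs=AUDIT_CUTOFFS))
```

Keeping the cutoffs in data rather than in branching logic makes it easy to swap in values from another validation study.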
Validating Test Score Meaning and Defending Test Score Use: Different Aims, Different Methods

ERIC Educational Resources Information Center

Cizek, Gregory J.

2016-01-01

Advances in validity theory and alacrity in validation practice have suffered because the term "validity" has been used to refer to two incompatible concerns: (1) the degree of support for specified interpretations of test scores (i.e. intended score meaning) and (2) the degree of support for specified applications (i.e. intended test…

State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP

ERIC Educational Resources Information Center

Chudowsky, Naomi; Chudowsky, Victor

2010-01-01

In recent years, scores on the annual state reading and mathematics tests used for accountability have gone up in most states. These trends in state test scores do not always coincide, however, with trends on the National Assessment of Educational Progress (NAEP), the federally sponsored assessment that is administered periodically to…
Estimating Total-Test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

1980-01-01

The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students and 60 items of the 110-item Stanford Mental Arithmetic Test. Three methods yielded fairly good estimates of the total-test score. (Author/RL)
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Washington

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Washington's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) decreased in grade 4 reading. In grade 4 math, the percentage scoring proficient on the state test decreased…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Utah

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Utah's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading. In grade 4 reading, the percentage scoring proficient on the state test showed a…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arkansas

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Arkansas's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) went up in math at grades 4 and 8. In reading, the percentages scoring proficient on the state test went up at…
EDUCATION AND PSYCHOLOGICAL TEST SCORES

PubMed Central

Pershad, Dwarka; Verma, S. K.

1980-01-01

Education, a long-neglected variable affecting psychological test scores, deserves renewed emphasis. Some evidence for this has accumulated on the psychological tests constructed and standardized at the Department of Psychiatry, P.G.I., Chandigarh. Tentative education-wise norms for adults aged 16-50 on the WAIS-Verbal section, PGI-Memory Scale, Proverb and Similarity Tests, Psychoticism Questionnaire, and PGI MQN 2 are reported. The results showed marked differences in the mean scores of different educational categories and thus stressed the need for reporting norms separately for different educational levels. PMID:22064617
Prediction of true test scores from observed item scores and ancillary data.

PubMed

Haberman, Shelby J; Yao, Lili; Sinharay, Sandip

2015-05-01

In many educational tests which involve constructed responses, a traditional test score is obtained by adding together item scores obtained through holistic scoring by trained human raters. For example, this practice was used until 2008 in the case of GRE® General Analytical Writing and until 2009 in the case of TOEFL® iBT Writing. With use of natural language processing, it is possible to obtain additional information concerning item responses from computer programs such as e-rater®. In addition, available information relevant to examinee performance may include scores on related tests. We suggest application of standard results from classical test theory to the available data to obtain best linear predictors of true traditional test scores. In performing such analysis, we require estimation of variances and covariances of measurement errors, a task which can be quite difficult in the case of tests with limited numbers of items and with multiple measurements per item. As a consequence, a new estimation method is suggested based on samples of examinees who have taken an assessment more than once. Such samples are typically not random samples of the general population of examinees, so that we apply statistical adjustment methods to obtain the needed estimated variances and covariances of measurement errors. To examine practical implications of the suggested methods of analysis, applications are made to GRE General Analytical Writing and TOEFL iBT Writing. Results obtained indicate that substantial improvements are possible both in terms of reliability of scoring and in terms of assessment reliability. © 2015 The British Psychological Society.
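
The simplest instance of a best linear predictor of a true score in classical test theory is Kelley's formula, which shrinks an observed score toward the group mean in proportion to the test's reliability. The sketch below shows only that single-predictor case; the study described here combines several predictors and estimates error covariances from repeat test takers, none of which is reproduced. The function name and example values are hypothetical.

```python
def kelley_true_score(observed, group_mean, reliability):
    """Single-predictor best linear estimate of the true score (Kelley's formula)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return group_mean + reliability * (observed - group_mean)


# Hypothetical essay score of 5.0 on a scale whose group mean is 3.5,
# with an assumed score reliability of 0.8.
estimate = kelley_true_score(observed=5.0, group_mean=3.5, reliability=0.8)
print(round(estimate, 2))
```

With perfect reliability the estimate equals the observed score; with zero reliability it collapses to the group mean.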
Genetic variation of the growth hormone secretagogue receptor gene is associated with alcohol use disorders identification test scores and smoking.

PubMed

Suchankova, Petra; Nilsson, Staffan; von der Pahlen, Bettina; Santtila, Pekka; Sandnabba, Kenneth; Johansson, Ada; Jern, Patrick; Engel, Jörgen A; Jerlhag, Elisabet

2016-03-01

The multifaceted gut-brain peptide ghrelin and its receptor (GHSR-1a) are implicated in mechanisms regulating not only the energy balance but also the reward circuitry. In our pre-clinical models, we have shown that ghrelin increases whereas GHSR-1a antagonists decrease alcohol consumption and the motivation to consume alcohol in rodents. Moreover, ghrelin signaling is required for the rewarding properties of addictive drugs including alcohol and nicotine in rodents. Given the hereditary component underlying addictive behaviors and disorders, we sought to investigate whether single nucleotide polymorphisms (SNPs) located in the pre-proghrelin gene (GHRL) and GHSR-1a gene (GHSR) are associated with alcohol use, measured by the alcohol use disorders identification test (AUDIT) and smoking. Two SNPs located in GHRL, rs4684677 (Gln90Leu) and rs696217 (Leu72Met), and one in GHSR, rs2948694, were genotyped in a subset (n = 4161) of a Finnish population-based cohort, the Genetics of Sexuality and Aggression project. The effect of these SNPs on AUDIT scores and smoking was investigated using linear and logistic regressions, respectively. We found that the minor allele of the rs2948694 SNP was nominally associated with higher AUDIT scores (P = 0.0204, recessive model) and smoking (P = 0.0002, dominant model). Furthermore, post hoc analyses showed that this risk allele was also associated with increased likelihood of having high level of alcohol problems as determined by AUDIT scores ≥ 16 (P = 0.0043, recessive model). These convergent findings lend further support for the hypothesized involvement of ghrelin signaling in addictive disorders. © 2015 Society for the Study of Addiction.
Genetic variation of the growth hormone secretagogue receptor gene is associated with alcohol use disorders identification test scores and smoking

PubMed Central

Nilsson, Staffan; von der Pahlen, Bettina; Santtila, Pekka; Sandnabba, Kenneth; Johansson, Ada; Jern, Patrick; Engel, Jörgen A.; Jerlhag, Elisabet

2015-01-01

The multifaceted gut‐brain peptide ghrelin and its receptor (GHSR‐1a) are implicated in mechanisms regulating not only the energy balance but also the reward circuitry. In our pre‐clinical models, we have shown that ghrelin increases whereas GHSR‐1a antagonists decrease alcohol consumption and the motivation to consume alcohol in rodents. Moreover, ghrelin signaling is required for the rewarding properties of addictive drugs including alcohol and nicotine in rodents. Given the hereditary component underlying addictive behaviors and disorders, we sought to investigate whether single nucleotide polymorphisms (SNPs) located in the pre‐proghrelin gene (GHRL) and GHSR‐1a gene (GHSR) are associated with alcohol use, measured by the alcohol use disorders identification test (AUDIT) and smoking. Two SNPs located in GHRL, rs4684677 (Gln90Leu) and rs696217 (Leu72Met), and one in GHSR, rs2948694, were genotyped in a subset (n = 4161) of a Finnish population‐based cohort, the Genetics of Sexuality and Aggression project. The effect of these SNPs on AUDIT scores and smoking was investigated using linear and logistic regressions, respectively. We found that the minor allele of the rs2948694 SNP was nominally associated with higher AUDIT scores (P = 0.0204, recessive model) and smoking (P = 0.0002, dominant model). Furthermore, post hoc analyses showed that this risk allele was also associated with increased likelihood of having high level of alcohol problems as determined by AUDIT scores ≥ 16 (P = 0.0043, recessive model). These convergent findings lend further support for the hypothesized involvement of ghrelin signaling in addictive disorders. PMID:26059200
Test/score/report: Simulation techniques for automating the test process

NASA Technical Reports Server (NTRS)

Hageman, Barbara H.; Sigman, Clayton B.; Koslosky, John T.

1994-01-01

A Test/Score/Report capability is currently being developed for the Transportable Payload Operations Control Center (TPOCC) Advanced Spacecraft Simulator (TASS) system which will automate testing of the Goddard Space Flight Center (GSFC) Payload Operations Control Center (POCC) and Mission Operations Center (MOC) software in three areas: telemetry decommutation, spacecraft command processing, and spacecraft memory load and dump processing. Automated computer control of the acceptance test process is one of the primary goals of a test team. With the proper simulation tools and user interface, the task of acceptance testing, regression testing, and repeatability of specific test procedures of a ground data system can be a simpler task. Ideally, the goal for complete automation would be to plug the operational deliverable into the simulator, press the start button, execute the test procedure, accumulate and analyze the data, score the results, and report the results to the test team along with a go/no-go recommendation. In practice, this may not be possible because of inadequate test tools, pressures of schedules, limited resources, etc. Most tests are accomplished using a certain degree of automation and test procedures that are labor intensive. This paper discusses some simulation techniques that can improve the automation of the test process. The TASS system tests the POCC/MOC software and provides a score based on the test results. The TASS system displays statistics on the success of the POCC/MOC system processing in each of the three areas as well as event messages pertaining to the Test/Score/Report processing. The TASS system also provides formatted reports documenting each step performed during the tests and the results of each step. A prototype of the Test/Score/Report capability is available and currently being used to test some POCC/MOC software deliveries. When this capability is fully operational it should greatly reduce the time necessary
A scoring scheme for evaluating magnetofossil identifications

NASA Astrophysics Data System (ADS)

Kopp, R. E.; Kirschvink, J. L.

2007-12-01

In many Quaternary lacustrine and marine settings, fossil magnetotactic bacteria are a major contributor to sedimentary magnetization [1]. Magnetite particles produced by magnetotactic bacteria have traits, shaped by natural selection, that increase the efficiency with which the bacteria utilize iron and also facilitate the recognition of the particles' biological origin. In particular, magnetotactic bacteria generally produce particles with characteristic shapes and narrow size and shape distributions that lie within the single domain stability field. The particles have effective positive magnetic anisotropy, produced by alignment in chains and frequently by particle elongation. In addition, the crystals are often nearly stoichiometric and have few crystallographic defects. Yet, despite these distinctive traits, there are few identified magnetofossils that predate the Quaternary, and many putative identifications are highly controversial. We propose a six-criteria scoring scheme for evaluating identifications based on the quality of the geological, magnetic, and electron microscopic evidence. Our criteria are: (1) whether the geological context is well-constrained stratigraphically, and whether paleomagnetic evidence suggests a primary magnetization; (2) whether magnetic or microscopic evidence supports the presence of significant single-domain magnetite; (3) whether magnetic or ferromagnetic resonance evidence indicates narrow size and shape distributions, and whether microscopic evidence reveals single-domain particles with truncated edges, elongate single-domain particles, and/or narrow size and shape distributions; (4) whether ferromagnetic resonance, low-temperature magnetic, or electron microscopic evidence reveals the presence of chains; (5) whether low-temperature magnetometry, energy dispersive X-ray spectroscopy, or other techniques demonstrate the near-stoichiometry of the particles; and (6) whether high-resolution TEM indicates the near-absence of
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP

ERIC Educational Resources Information Center

Chudowsky, Naomi; Chudowsky, Victor

2010-01-01

This report compares state math and reading proficiency scores in grades 4 and 8 to National Assessment of Educational Progress (NAEP) basic scores for the period of 2005 to 2009. The study found that scores on state tests and NAEP have increased in most states with sufficient data. Also included with the report are profiles for the 23 states that…
Estimating Total-test Scores from Partial Scores in a Matrix Sampling Design.

ERIC Educational Resources Information Center

Sachar, Jane; Suppes, Patrick

It is sometimes desirable to obtain an estimated total-test score for an individual who was administered only a subset of the items in a total test. The present study compared six methods, two of which utilize the content structure of items, to estimate total-test scores using 450 students in grades 3-5 and 60 items of the 110-item Stanford Mental…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Ohio

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Ohio's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and grade 8 math. In grade 8 reading, the percentage of students scoring proficient…
Estimating the Reliability of a Test Battery Composite or a Test Score Based on Weighted Item Scoring

ERIC Educational Resources Information Center

Feldt, Leonard S.

2004-01-01

In some settings, the validity of a battery composite or a test score is enhanced by weighting some parts or items more heavily than others in the total score. This article describes methods of estimating the total score reliability coefficient when differential weights are used with items or parts.
ITC Guidelines on Quality Control in Scoring, Test Analysis, and Reporting of Test Scores

ERIC Educational Resources Information Center

Allalouf, Avi

2014-01-01

The Quality Control (QC) Guidelines are intended to increase the efficiency, precision, and accuracy of the scoring, analysis, and reporting process of testing. The QC Guidelines focus on large-scale testing operations where multiple forms of tests are created for use on set dates. However, they may also be used for a wide variety of other testing…
Summary of Score Changes (in other Tests).

ERIC Educational Resources Information Center

Cleary, T. Anne; McCandless, Sam A.

Scholastic Aptitude Test (SAT) scores have declined during the last 14 years. Similar score declines have been observed in many different testing programs, many groups, and tested areas. The declines, while not large in any given year, have been consistent over time, area, and group. The period around 1965 is critical for the interpretation of…
Testing Intelligently Includes Double-Checking Wechsler IQ Scores

ERIC Educational Resources Information Center

Kuentzel, Jeffrey G.; Hetterscheidt, Lesley A.; Barnett, Douglas

2011-01-01

The rigors of standardized testing make for numerous opportunities for examiner error, including simple computational mistakes in scoring. Although experts recommend that test scoring be double-checked, the extent to which independent double-checking would reduce scoring errors is not known. A double-checking procedure was established at a…
Test Scores and Stereotypes.

ERIC Educational Resources Information Center

Gose, Ben

1995-01-01

A psychologist's research suggests that black and female students may have lower standardized test scores and academic achievement because they have accepted stereotypes concerning their ability. Critics feel the researcher, Claude M. Steele, may be overlooking other factors. Steele has developed a program at Stanford University (California) to…
Using Patterns of Summed Scores in Paper-and-Pencil Tests and Computer-Adaptive Tests to Detect Misfitting Item Score Patterns

ERIC Educational Resources Information Center

Meijer, Rob R.

2004-01-01

Two new methods have been proposed to determine unexpected sum scores on sub-tests (testlets) both for paper-and-pencil tests and computer adaptive tests. A method based on a conservative bound using the hypergeometric distribution, denoted p, was compared with a method where the probability for each score combination was calculated using a…

Does Test Preparation Work? Implications for Score Validity

ERIC Educational Resources Information Center

Xie, Qin

2013-01-01

This article reports an empirical study that examined the pattern of test preparation for College English Test Band 4 (CET4) and the differential effects of test preparation practices on its scores, thereby drawing implications for CET4 score validity. Data collection involved 1,003 test takers of CET4. A pretest was administered at the beginning…
Facilitating the Interpretation of English Language Proficiency Scores: Combining Scale Anchoring and Test Score Mapping Methodologies

ERIC Educational Resources Information Center

Powers, Donald; Schedl, Mary; Papageorgiou, Spiros

2017-01-01

The aim of this study was to develop, for the benefit of both test takers and test score users, enhanced "TOEFL ITP"® test score reports that go beyond the simple numerical scores that are currently reported. To do so, we applied traditional scale anchoring (proficiency scaling) to item difficulty data in order to develop performance…
The Truth about Scores Children Achieve on Tests.

ERIC Educational Resources Information Center

Brown, Jonathan R.

1989-01-01

The importance of using the standard error of measurement (SEm) in determining reliability in test scores is emphasized. The SEm is compared to the hypothetical true score for standardized tests, and procedures for calculation of the SEm are explained. (JDD)
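The abstract does not reproduce the calculation itself. As a minimal sketch, the code below uses the standard classical-test-theory formula SEm = SD × sqrt(1 − reliability); whether the article uses exactly this form is an assumption. The function names and the example values (SD of 15, reliability of 0.91, observed score of 100) are hypothetical.

```python
import math


def standard_error_of_measurement(sd: float, reliability: float) -> float:
    """Classical SEm: score standard deviation times sqrt(1 - reliability)."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie between 0 and 1")
    return sd * math.sqrt(1.0 - reliability)


def score_band(observed: float, sd: float, reliability: float, z: float = 1.0):
    """Return (low, high): the observed score plus or minus z SEm units."""
    sem = standard_error_of_measurement(sd, reliability)
    return observed - z * sem, observed + z * sem


# Hypothetical test with SD = 15 and reliability = 0.91.
sem = standard_error_of_measurement(15.0, 0.91)
low, high = score_band(100.0, 15.0, 0.91)
print(round(sem, 2))                   # 4.5
print(round(low, 1), round(high, 1))   # 95.5 104.5
```

Reporting the band rather than the single observed score makes the measurement uncertainty visible to the reader of a score report.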
The diagnostic performance of the Mass Restricted (MR) score in the identification of microbial invasion of the amniotic cavity or intra-amniotic inflammation is not superior to amniotic fluid interleukin-6

PubMed Central

Romero, Roberto; Kadar, Nicholas; Miranda, Jezid; Korzeniewski, Steven J.; Schwartz, Alyse G.; Chaemsaithong, Piya; Rogers, Wade; Soto, Eleazar; Gotsch, Francesca; Yeo, Lami; Hassan, Sonia S.; Chaiworapongsa, Tinnakorn

2018-01-01

Objective: Intra-amniotic infection/inflammation are major causes of spontaneous preterm labor and delivery. However, diagnosis of intra-amniotic infection is challenging because most are subclinical and amniotic fluid (AF) cultures take several days before results are available. Several tests have been proposed for the rapid diagnosis of microbial invasion of the amniotic cavity (MIAC) or intra-amniotic inflammation. The aim of this study was to examine the diagnostic performance of the AF Mass Restricted (MR) score in comparison with interleukin-6 (IL-6) and matrix metalloproteinase-8 (MMP-8) for the identification of MIAC or inflammation. Methods: AF samples were collected from patients with singleton gestations and symptoms of preterm labor (n = 100). Intra-amniotic inflammation was defined as >100 white blood cells/mm³ (WBCs) in AF; MIAC was defined as a positive AF culture. AF IL-6 and MMP-8 were determined using ELISA. The MR score was obtained using the Surface-Enhanced Laser Desorption Ionization Time of Flight (SELDI-TOF) mass spectrometry. Sensitivity and specificity were calculated and logistic regression models were fit to construct receiver-operating characteristic (ROC) curves for the identification of each outcome. The McNemar’s test and paired sample non-parametric statistical techniques were used to test for differences in diagnostic performance metrics. Results: (1) The prevalence of MIAC and intra-amniotic inflammation was 34% (34/100) and 40% (40/100), respectively; (2) there were no significant differences in sensitivity of the three tests under study (MR score, IL-6 or MMP-8) in the identification of either MIAC or intra-amniotic inflammation (using the following cutoffs: MR score >2, IL-6 >11.4 ng/mL, and MMP-8 >23 ng/mL); (3) there was no significant difference in the sensitivity among the three tests for the same outcomes when the false positive rate was fixed at 15%; (4) the specificity for IL-6 was not significantly different from that of
The diagnostic performance of the Mass Restricted (MR) score in the identification of microbial invasion of the amniotic cavity or intra-amniotic inflammation is not superior to amniotic fluid interleukin-6.

PubMed

Romero, Roberto; Kadar, Nicholas; Miranda, Jezid; Korzeniewski, Steven J; Schwartz, Alyse G; Chaemsaithong, Piya; Rogers, Wade; Soto, Eleazar; Gotsch, Francesca; Yeo, Lami; Hassan, Sonia S; Chaiworapongsa, Tinnakorn

2014-05-01

Intra-amniotic infection/inflammation are major causes of spontaneous preterm labor and delivery. However, diagnosis of intra-amniotic infection is challenging because most are subclinical and amniotic fluid (AF) cultures take several days before results are available. Several tests have been proposed for the rapid diagnosis of microbial invasion of the amniotic cavity (MIAC) or intra-amniotic inflammation. The aim of this study was to examine the diagnostic performance of the AF Mass Restricted (MR) score in comparison with interleukin-6 (IL-6) and matrix metalloproteinase-8 (MMP-8) for the identification of MIAC or inflammation. AF samples were collected from patients with singleton gestations and symptoms of preterm labor (n = 100). Intra-amniotic inflammation was defined as >100 white blood cells/mm³ (WBCs) in AF; MIAC was defined as a positive AF culture. AF IL-6 and MMP-8 were determined using ELISA. The MR score was obtained using the Surface-Enhanced Laser Desorption Ionization Time of Flight (SELDI-TOF) mass spectrometry. Sensitivity and specificity were calculated and logistic regression models were fit to construct receiver-operating characteristic (ROC) curves for the identification of each outcome. The McNemar's test and paired sample non-parametric statistical techniques were used to test for differences in diagnostic performance metrics. (1) The prevalence of MIAC and intra-amniotic inflammation was 34% (34/100) and 40% (40/100), respectively; (2) there were no significant differences in sensitivity of the three tests under study (MR score, IL-6 or MMP-8) in the identification of either MIAC or intra-amniotic inflammation (using the following cutoffs: MR score >2, IL-6 >11.4 ng/mL, and MMP-8 >23 ng/mL); (3) there was no significant difference in the sensitivity among the three tests for the same outcomes when the false positive rate was fixed at 15%; (4) the specificity for IL-6 was not significantly different from that of the MR score in
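Sensitivity and specificity, as used in this abstract, follow directly from a 2×2 table of test result against reference outcome. The sketch below applies a cutoff to a marker value and tallies the table. Only the cutoff of 11.4 ng/mL (IL-6) comes from the abstract; the function names and the six sample values are hypothetical and are not study data.

```python
def confusion_counts(values, outcomes, cutoff):
    """Tally (tp, fp, fn, tn), treating value > cutoff as a positive test."""
    tp = fp = fn = tn = 0
    for value, has_outcome in zip(values, outcomes):
        positive = value > cutoff
        if positive and has_outcome:
            tp += 1
        elif positive and not has_outcome:
            fp += 1
        elif not positive and has_outcome:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def sensitivity(tp: int, fn: int) -> float:
    """Proportion of true cases that the test flags as positive."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Proportion of non-cases that the test flags as negative."""
    return tn / (tn + fp)


# Hypothetical IL-6 concentrations (ng/mL) and culture results (True = MIAC).
il6 = [25.0, 3.2, 14.1, 0.8, 9.5, 30.3]
miac = [True, False, False, False, True, True]
tp, fp, fn, tn = confusion_counts(il6, miac, cutoff=11.4)
print(tp, fp, fn, tn)  # 2 1 1 2
print(round(sensitivity(tp, fn), 3), round(specificity(tn, fp), 3))  # 0.667 0.667
```

Sweeping `cutoff` over the observed values and plotting sensitivity against 1 − specificity gives an empirical ROC curve. The study itself fit logistic regression models to construct its curves, which this sketch does not attempt.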
The Probability of Obtaining Two Statistically Different Test Scores as a Test Index

ERIC Educational Resources Information Center

Muller, Jorg M.

2006-01-01

A new test index is defined as the probability of obtaining two randomly selected test scores (PDTS) as statistically different. After giving a concept definition of the test index, two simulation studies are presented. The first analyzes the influence of the distribution of test scores, test reliability, and sample size on PDTS within classical…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Nevada

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Nevada's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in both subjects. Trends in average (mean) test scores…
Do Gains in Test Scores Explain Labor Market Outcomes?

ERIC Educational Resources Information Center

Rose, Heather

2006-01-01

Using data from the National Education Longitudinal Study of 1988, this article investigates whether students who made relatively large test score gains during high school had larger earnings 7 years after high school compared to students whose scores improved little. In models that control for pre-high school test scores, family background, and…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Louisiana

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Louisiana's test score trends through 2008-09. Between 2005 and 2009, trends on state tests and NAEP (National Assessment of Educational Progress) sometimes differed. On the state test, the percentages of students reaching the proficient level increased at grades 4 and 8 in both reading and math. On NAEP, the percentage of…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Tennessee

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Tennessee's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. At grade 4, trends on the state test and NAEP differed somewhat. In…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Maryland

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Maryland's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased at grades 4 and 8 in both reading and math. Average annual gains were larger on the state test than…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Pennsylvania

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Pennsylvania's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 8 reading and math. Average annual gains were larger on the state test than on NAEP in…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Nebraska

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Nebraska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the percentages reaching the basic level on NAEP (National Assessment of Educational Progress) increased at grade 4 in both reading and math. At grade 8, however, the percentages…
Equating Scores from Adaptive to Linear Tests

ERIC Educational Resources Information Center

van der Linden, Wim J.

2006-01-01

Two local methods for observed-score equating are applied to the problem of equating an adaptive test to a linear test. In an empirical study, the methods were evaluated against a method based on the test characteristic function (TCF) of the linear test and traditional equipercentile equating applied to the ability estimates on the adaptive test…
Stability of scores for the Slosson Full-Range Intelligence Test.

PubMed

Williams, Thomas O; Eaves, Ronald C; Woods-Groves, Suzanne; Mariano, Gina

2007-08-01

The test-retest stability of the Slosson Full-Range Intelligence Test by Algozzine, Eaves, Mann, and Vance was investigated with test scores from a sample of 103 students. With a mean interval of 13.7 mo. and different examiners for each of the two test administrations, the test-retest reliability coefficients for the Full-Range IQ, Verbal Reasoning, Abstract Reasoning, Quantitative Reasoning, and Memory were .93, .85, .80, .80, and .83, respectively. Mean differences from the test-retest scores were not statistically significantly different for any of the scales. Results suggest that Slosson scores are stable over time even when different examiners administer the test.
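A test-retest reliability coefficient of the kind reported here is typically a Pearson correlation between scores from the two administrations; the abstract does not state the exact estimator, so that choice is an assumption. The sketch below computes the coefficient from first principles. The function name and the scores are hypothetical.

```python
import math


def pearson_r(first, second):
    """Pearson correlation between two equal-length lists of scores."""
    if len(first) != len(second) or len(first) < 2:
        raise ValueError("need two equal-length lists with at least 2 scores")
    mean_x = sum(first) / len(first)
    mean_y = sum(second) / len(second)
    # Sum of cross-products and sums of squares of the deviations.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(first, second))
    sxx = sum((x - mean_x) ** 2 for x in first)
    syy = sum((y - mean_y) ** 2 for y in second)
    return sxy / math.sqrt(sxx * syy)


# Hypothetical IQ scores for six students tested twice.
time1 = [95, 102, 110, 88, 120, 105]
time2 = [97, 100, 113, 90, 118, 104]
print(round(pearson_r(time1, time2), 2))
```

A high correlation shows that students keep their rank order across administrations. It says nothing about a shift in the mean, which is why the abstract reports mean differences separately.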
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Alaska

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Alaska's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in math and grade 8 in reading. In grade 4 reading, the percentage reaching the…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Massachusetts

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Massachusetts' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 math. Average annual gains were larger on the state test…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. California

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles California's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were larger on the state test…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Montana

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Montana's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and grade 8 reading. In grade 8 math, however, the percentage proficient…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Colorado

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Colorado's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on NAEP than…

State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Wisconsin

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Wisconsin's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in math at grades 4 and 8 and in reading at grade 8. In grade 4 reading, the percentage scoring…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Alabama

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Alabama's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Texas

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Texas' test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in reading at grades 4 and 8 and in math at grade 8. In grade 4 math, however, the percentage scoring…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Florida

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Florida's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Arizona

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Arizona's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were generally larger on the state…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. Iowa

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles Iowa's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 reading and math and in grade 8 math. In grade 8 reading, the percentage of students reaching…
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats.

PubMed

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. This study aimed to assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions.
Modified Balance Error Scoring System (M-BESS) test scores in athletes wearing protective equipment and cleats

PubMed Central

Azad, Aftab Mohammad; Al Juma, Saad; Bhatti, Junaid Ahmad; Delaney, J Scott

2016-01-01

Background: Balance testing is an important part of the initial concussion assessment. There is no research on the differences in Modified Balance Error Scoring System (M-BESS) scores when tested in real world as compared to control conditions. Objective: To assess the difference in M-BESS scores in athletes wearing their protective equipment and cleats on different surfaces as compared to control conditions. Methods: This cross-sectional study examined university North American football and soccer athletes. Three observers independently rated athletes performing the M-BESS test in three different conditions: (1) wearing shorts and T-shirt in bare feet on firm surface (control); (2) wearing athletic equipment with cleats on FieldTurf; and (3) wearing athletic equipment with cleats on firm surface. Mean M-BESS scores were compared between conditions. Results: 60 participants were recruited: 39 from football (all males) and 21 from soccer (11 males and 10 females). Average age was 21.1 years (SD=1.8). Mean M-BESS scores were significantly lower (p<0.001) for cleats on FieldTurf (mean=26.3; SD=2.0) and for cleats on firm surface (mean=26.6; SD=2.1) as compared to the control condition (mean=28.4; SD=1.5). Females had lower scores than males for cleats on FieldTurf condition (24.9 (SD=1.9) vs 27.3 (SD=1.6), p=0.005). Players who had taping or bracing on their ankles/feet had lower scores when tested with cleats on firm surface condition (24.6 (SD=1.7) vs 26.9 (SD=2.0), p=0.002). Conclusions: Total M-BESS scores for athletes wearing protective equipment and cleats standing on FieldTurf or a firm surface are around two points lower than M-BESS scores performed on the same athletes under control conditions. PMID:27900181
THE EFFECTS ON ACHIEVEMENT TEST RESULTS OF VARYING CONDITIONS OF EXPERIMENTAL ATMOSPHERE, NOTICE OF TEST, TEST ADMINISTRATION, AND TEST SCORING.

ERIC Educational Resources Information Center

GOODWIN, WILLIAM L.; AND OTHERS

NULL HYPOTHESES WERE TESTED TO DETERMINE THE DIFFERENTIAL EFFECTS OF (1) EXPERIMENTAL ATMOSPHERE AND ABSENCE OF SAME, (2) NOTICE OF TEST (10 SCHOOL DAYS) AND NO NOTICE (1 SCHOOL DAY), (3) TEACHER ADMINISTRATION AND OUTSIDE ADMINISTRATION OF TESTS, AND (4) TEACHER SCORING AND OUTSIDE SCORING OF TESTS. SIXTH-GRADE CLASSES (N=64), EACH FROM A…
Identification of Balance Deficits in People with Parkinson Disease; is the Sensory Organization Test Enough?

PubMed

Gera, G; Freeman, D L; Blackinton, M T; Horak, F B; King, L

2016-02-01

Balance deficits in people with Parkinson's disease can affect any of the multiple systems encompassing balance control. Thus, identification of the specific deficit is crucial in customizing balance rehabilitation. The sensory organization test, a test of sensory integration for balance control, is sometimes used in isolation to identify balance deficits in people with Parkinson's disease. More recently, the Mini-Balance Evaluations Systems Test, a clinical scale that tests multiple domains of balance control, has begun to be used to assess balance in patients with Parkinson's disease. The purpose of our study was to compare the use of Sensory Organization Test and Mini-Balance Evaluations Systems Test in identifying balance deficits in people with Parkinson's disease. 45 participants (27M, 18F; 65.2 ± 8.2 years) with idiopathic Parkinson's disease participated in the cross-sectional study. Balance assessment was performed using the Sensory Organization Test and the Mini-Balance Evaluations Systems Test. People were classified into normal and abnormal balance based on the established cutoff scores (normal balance: Sensory Organization Test >69; Mini-Balance Evaluations Systems Test >73). More subjects were classified as having abnormal balance with the Mini-Balance Evaluations Systems Test (71% abnormal) than with the Sensory Organization Test (24% abnormal) in our cohort of people with Parkinson's disease. There were no subjects with a normal Mini-Balance Evaluations Systems Test score but abnormal Sensory Organization Test score. In contrast, there were 21 subjects who had an abnormal Mini-Balance Evaluations Systems Test score but normal Sensory Organization Test scores. Findings from this study suggest that investigation of sensory integration deficits, alone, may not be able to identify all types of balance deficits found in patients with Parkinson's disease. Thus, a comprehensive approach should be used to test multiple balance systems to provide
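The comparison in this abstract amounts to cross-classifying each participant by two cutoffs. The sketch below uses the cutoffs quoted in the abstract (Sensory Organization Test >69 and Mini-Balance Evaluations Systems Test >73 indicate normal balance). The function names and the four score pairs are hypothetical and are not study data.

```python
SOT_CUTOFF = 69        # from the abstract: a score above this is normal
MINI_BEST_CUTOFF = 73  # from the abstract: a score above this is normal


def label(score: float, cutoff: float) -> str:
    """Classify a score as 'normal' (above cutoff) or 'abnormal'."""
    return "normal" if score > cutoff else "abnormal"


def cross_classify(pairs):
    """Count participants in each (SOT label, Mini-BESTest label) cell."""
    counts = {
        ("normal", "normal"): 0,
        ("normal", "abnormal"): 0,
        ("abnormal", "normal"): 0,
        ("abnormal", "abnormal"): 0,
    }
    for sot_score, mini_best_score in pairs:
        cell = (label(sot_score, SOT_CUTOFF),
                label(mini_best_score, MINI_BEST_CUTOFF))
        counts[cell] += 1
    return counts


# Hypothetical (SOT, Mini-BESTest) score pairs for four participants.
scores = [(75, 80), (72, 60), (60, 50), (80, 70)]
table = cross_classify(scores)
print(table[("normal", "abnormal")])  # 2
print(table[("abnormal", "normal")])  # 0
```

The figures reported in the abstract are mutually consistent under this scheme. About 32 of 45 participants (71%) were abnormal on the Mini-Balance Evaluations Systems Test and about 11 of 45 (24%) on the Sensory Organization Test. With no one in the abnormal-SOT, normal-Mini cell, the difference of 32 − 11 = 21 matches the 21 subjects flagged by the Mini-Balance Evaluations Systems Test alone.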
Score Equating and Nominally Parallel Language Tests.

ERIC Educational Resources Information Center

Moy, Raymond

Score equating requires that the forms to be equated are functionally parallel. That is, the two test forms should rank order examinees in a similar fashion. In language proficiency testing situations, this assumption is often put into doubt because of the numerous tests that have been proposed as measures of language proficiency and the…
Reporting Diagnostic Scores in Educational Testing: Temptations, Pitfalls, and Some Solutions

ERIC Educational Resources Information Center

Sinharay, Sandip; Puhan, Gautam; Haberman, Shelby J.

2010-01-01

Diagnostic scores are of increasing interest in educational testing due to their potential remedial and instructional benefit. Naturally, the number of educational tests that report diagnostic scores is on the rise, as are the number of research publications on such scores. This article provides a critical evaluation of diagnostic score reporting…
Neuropsychological test scores, academic performance, and developmental disorders in Spanish-speaking children.

PubMed

Rosselli, M; Ardila, A; Bateman, J R; Guzmán, M

2001-01-01

Limited information is currently available about performance of Spanish-speaking children on different neuropsychological tests. This study was designed to (a) analyze the effects of age and sex on different neuropsychological test scores of a randomly selected sample of Spanish-speaking children, (b) analyze the value of neuropsychological test scores for predicting school performance, and (c) describe the neuropsychological profile of Spanish-speaking children with learning disabilities (LD). Two hundred ninety (141 boys, 149 girls) 6- to 11-year-old children were selected from a school in Bogotá, Colombia. Three age groups were distinguished: 6- to 7-, 8- to 9-, and 10- to 11-year-olds. Performance was measured utilizing the following neuropsychological tests: Seashore Rhythm Test, Finger Tapping Test (FTT), Grooved Pegboard Test, Children's Category Test (CCT), California Verbal Learning Test-Children's Version (CVLT-C), Benton Visual Retention Test (BVRT), and Bateria Woodcock Psicoeducativa en Español (Woodcock, 1982). Normative scores were calculated. Age effect was significant for most of the test scores. A significant sex effect was observed for 3 test scores. Intercorrelations were performed between neuropsychological test scores and academic areas (science, mathematics, Spanish, social studies, and music). In a post hoc analysis, children presenting very low scores on the reading, writing, and arithmetic achievement scales of the Woodcock battery were identified in the sample, and their neuropsychological test scores were compared with a matched normal group. Finally, a comparison was made between Colombian and American norms.
Score Reporting for the 1991 Medical College Admission Test.

ERIC Educational Resources Information Center

Mitchell, Karen J.; Haynes, Robert

1990-01-01

Data used in a major review of the system for reporting scores on the Medical College Admission Test (MCAT) are presented and discussed. The data demonstrated the value of the current score-reporting system and led to retention of the 15-point MCAT score scale in 1991. (Author/MSE)
Extended version of the "Sniffin' Sticks" identification test: test-retest reliability and validity.

PubMed

Sorokowska, A; Albrecht, E; Haehner, A; Hummel, T

2015-03-30

The extended, 32-item version of the Sniffin' Sticks identification test was developed in order to create a precise tool enabling repeated, longitudinal testing of individual olfactory subfunctions. Odors of the previous test version had to be changed for technical reasons, and the odor identification test needed re-investigation in terms of reliability, validity, and normative values. In our study we investigated olfactory abilities of a group of 100 patients with olfactory dysfunction and 100 controls. We reconfirmed the high test-retest reliability of the extended version of the Sniffin' Sticks identification test and high correlations between the new and the original part of this tool. In addition, we confirmed the validity of the test as it discriminated clearly between controls and patients with olfactory loss. The additional set of 16 odor identification sticks can be either included in the current olfactory test, thus creating a more detailed diagnosis tool, or it can be used separately, making it possible to follow olfactory function over time. Additionally, the normative values presented in our paper might provide useful guidelines for interpretation of the extended identification test results. The revised version of the Sniffin' Sticks 32-item odor identification test is a reliable and valid tool for the assessment of olfactory function. Copyright © 2015 Elsevier B.V. All rights reserved.
Teacher Greetings Increase College Students' Test Scores

ERIC Educational Resources Information Center

Weinstein, Lawrence; Laverghetta, Antonio; Alexander, Ralph; Stewart, Megan

2009-01-01

The current study is an extension of a previous investigation dealing with teacher greetings to students. The present investigation used teacher greetings with college students and academic performance (test scores). We report data using university students and in-class test performance. Students in introductory psychology who received teachers'…
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. New Mexico

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles New Mexico's test score trends through 2008-09. Between 2005 and 2009, the percentages of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grade 4 math and grade 8 reading and math. In grade 4 reading, the percentage basic on NAEP …
State Test Score Trends through 2008-09, Part 1: Rising Scores on State Tests and NAEP. North Dakota

ERIC Educational Resources Information Center

Center on Education Policy, 2010

2010-01-01

This paper profiles North Dakota's test score trends through 2008-09. Between 2005 and 2009, the percentage of students reaching the proficient level on the state test and the basic level on NAEP (National Assessment of Educational Progress) increased in grades 4 and 8 in both reading and math. Average annual gains were larger on the state test…
A process dissociation approach to objective-projective test score interrelationships.

PubMed

Bornstein, Robert F

2002-02-01

Even when self-report and projective measures of a given trait or motive both predict theoretically related features of behavior, scores on the 2 tests correlate modestly with each other. This article describes a process dissociation framework for personality assessment, derived from research on implicit memory and learning, which can resolve these ostensibly conflicting results. Research on interpersonal dependency is used to illustrate 3 key steps in the process dissociation approach: (a) converging behavioral predictions, (b) modest test score intercorrelations, and (c) delineation of variables that differentially affect self-report and projective test scores. Implications of the process dissociation framework for personality assessment and test development are discussed.
A prognostic scoring system for arm exercise stress testing.

PubMed

Xie, Yan; Xian, Hong; Chandiramani, Pooja; Bainter, Emily; Wan, Leping; Martin, Wade H

2016-01-01

Arm exercise stress testing may be an equivalent or better predictor of mortality outcome than pharmacological stress imaging for the ≥50% of patients unable to perform leg exercise. Thus, our objective was to develop an arm exercise ECG stress test scoring system, analogous to the Duke Treadmill Score, for predicting outcome in these individuals. In this retrospective observational cohort study, arm exercise ECG stress tests were performed in 443 consecutive veterans aged 64.1 (11.1) years (mean (SD)) between 1997 and 2002. From multivariate Cox models, arm exercise scores were developed for prediction of 5-year and 12-year all-cause and cardiovascular mortality and 5-year cardiovascular mortality or myocardial infarction (MI). Arm exercise capacity in resting metabolic equivalents (METs), 1 min heart rate recovery (HRR) and ST segment depression ≥1 mm were the stress test variables independently associated with all-cause and cardiovascular mortality by step-wise Cox analysis (all p<0.01). A score based on the relation HRR (bpm)+7.3×METs-10.5×ST depression (0=no; 1=yes) prognosticated 5-year cardiovascular mortality with a C-statistic of 0.81 before and 0.88 after adjustment for significant demographic and clinical covariates. Arm exercise scores for the other outcome end points yielded C-statistic values of 0.77-0.79 before and 0.82-0.86 after adjustment for significant covariates versus 0.64-0.72 for best fit pharmacological myocardial perfusion imaging models in a cohort of 1730 veterans who were evaluated over the same time period. Arm exercise scores, analogous to the Duke Treadmill Score, have good power for prediction of mortality or MI in patients who cannot perform leg exercise.
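
The score relation quoted in the record above, HRR (bpm) + 7.3 × METs − 10.5 × ST depression, is simple enough to write out as a function. The sketch below is illustrative only: the function and parameter names are hypothetical, the input values are made up, and the abstract gives no risk thresholds, so none are applied here.

```python
def arm_exercise_score(hrr_bpm, mets, st_depression):
    """Evaluate HRR (bpm) + 7.3 x METs - 10.5 x ST depression.

    hrr_bpm:        1 min heart rate recovery, in beats per minute
    mets:           arm exercise capacity, in resting metabolic equivalents
    st_depression:  1 if ST segment depression >= 1 mm was present, else 0
    """
    if st_depression not in (0, 1):
        raise ValueError("st_depression must be 0 (no) or 1 (yes)")
    return hrr_bpm + 7.3 * mets - 10.5 * st_depression


# Hypothetical inputs for illustration; not values reported in the study.
score_without_st = arm_exercise_score(hrr_bpm=20, mets=5.0, st_depression=0)
score_with_st = arm_exercise_score(hrr_bpm=20, mets=5.0, st_depression=1)
```

With all else equal, the presence of ST depression lowers the score by 10.5 points, which is the only interpretation the formula itself supports.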

Test Operations Procedure (TOP) 03-2-827 Test Procedures for Video Target Scoring Using Calibration Lights

DTIC Science & Technology

2016-04-04

This Test Operations Procedure (TOP) describes typical equipment and procedures to set up and operate a Video Target Scoring System (VTSS) to...lights. Subject terms: Video Target Scoring System, VTSS, witness screens, camera, target screen, light pole.
Semi-Quantitative Scoring of an Immunochromatographic Test for Circulating Filarial Antigen

PubMed Central

Chesnais, Cédric B.; Missamou, François; Pion, Sébastien D. S.; Bopda, Jean; Louya, Frédéric; Majewski, Andrew C.; Weil, Gary J.; Boussinesq, Michel

2013-01-01

The value of a semi-quantitative scoring of the filarial antigen test (Binax Now Filariasis card test, ICT) results was evaluated during a field survey in the Republic of Congo. One hundred and thirty-four (134) of 774 tests (17.3%) were clearly positive and were scored 1, 2, or 3; and 11 (1.4%) had questionable results. Wuchereria bancrofti microfilariae (mf) were detected in 41 of those 133 individuals with an ICT test score ≥ 1 who also had a night blood smear; none of the 11 individuals with questionable ICT results harbored night mf. Cuzick's test showed a significant trend for higher microfilarial densities in groups with higher ICT scores (P < 0.001). The ICT scores were also significantly correlated with blood mf counts. Because filarial antigen levels provide an indication of adult worm infection intensity, our results suggest that semi-quantitative reading of the ICT may be useful for grading the intensity of filarial infections in individuals and populations. PMID:24019435
Stimulus Picture Identification in Articulation Testing

ERIC Educational Resources Information Center

Mullen, Patricia A.; Whitehead, Robert L.

1977-01-01

Compared with 20 normal speaking and 20 articulation defective Ss (7 and 8 years old) was the percent of correct initial identification of stimulus pictures on the Goldman-Fristoe Test of Articulation with the percent correct identification on the Arizona Articulation Proficiency Scale. (Author/IM)
Critical Thinking: More than Test Scores

ERIC Educational Resources Information Center

Smith, Vernon G.; Szymanski, Antonia

2013-01-01

This article is for practicing or aspiring school administrators. The demand for excellence in public education has led to an emphasis on standardized test scores. This article explores the development of a professional enhancement program designed to prepare teachers to teach higher order thinking skills. Higher order thinking is the primary…
The Black-White Test Score Gap.

ERIC Educational Resources Information Center

Jencks, Christopher, Ed.; Phillips, Meredith, Ed.

The 15 chapters of this book address issues related to the continuing test score gap between black and white students. The editors argue against traditional explanations which emphasize differences in economic resources and demographic factors, and they urge that more emphasis be put on psychological and cultural factors. The book suggests studies…
Test Takers and the Validity of Score Interpretations

ERIC Educational Resources Information Center

Kopriva, Rebecca J.; Thurlow, Martha L.; Perie, Marianne; Lazarus, Sheryl S.; Clark, Amy

2016-01-01

This article argues that test takers are as integral to determining validity of test scores as defining target content and conditioning inferences on test use. A principled sustained attention to how students interact with assessment opportunities is essential, as is a principled sustained evaluation of evidence confirming the validity or calling…
ANOVA Analysis of Student Daily Test Scores in Multi-Day Test Periods

ERIC Educational Resources Information Center

Mouritsen, Matthew L.; Davis, Jefferson T.; Jones, Steven C.

2016-01-01

Instructors are often concerned when giving multiple-day tests because students taking the test later in the exam period may have an advantage over students taking the test early in the exam period due to information leakage. However, exam scores seemed to decline as students took the same test later in a multi-day exam period (Mouritsen and…
Scoring Yes-No Vocabulary Tests: Reaction Time vs. Nonword Approaches

ERIC Educational Resources Information Center

Pellicer-Sanchez, Ana; Schmitt, Norbert

2012-01-01

Despite a number of research studies investigating the Yes-No vocabulary test format, one main question remains unanswered: What is the best scoring procedure to adjust for testee overestimation of vocabulary knowledge? Different scoring methodologies have been proposed based on the inclusion and selection of nonwords in the test. However, there…
Increased correlation coefficient between the written test score and tutors' performance test scores after training of tutors for assessment of medical students during problem-based learning course in Malaysia.

PubMed

Jaiprakash, Heethal; Min, Aung Ko Ko; Ghosh, Sarmishtha

2016-03-01

This paper is aimed at finding if there was a change of correlation between the written test score and tutors' performance test scores in the assessment of medical students during a problem-based learning (PBL) course in Malaysia. This is a cross-sectional observational study, conducted among 264 medical students in two groups from November 2010 to November 2012. The first group's tutors did not receive tutor training; while the second group's tutors were trained in the PBL process. Each group was divided into high, middle and low achievers based on their end-of-semester exam scores. PBL scores were taken which included written test scores and tutors' performance test scores. Pearson correlation coefficient was calculated between the two kinds of scores in each group. The correlation coefficient between the written scores and tutors' scores in group 1 was 0.099 (p<0.001) and for group 2 was 0.305 (p<0.001). The higher correlation coefficient in the group where tutors received the PBL training reinforces the importance of tutor training before their participation in the PBL course.
The Effect of Pretest Exercise on Baseline Computerized Neurocognitive Test Scores.

PubMed

Pawlukiewicz, Alec; Yengo-Kahn, Aaron M; Solomon, Gary

2017-10-01

Baseline neurocognitive assessment plays a critical role in return-to-play decision making following sport-related concussions. Prior studies have assessed the effect of a variety of modifying factors on neurocognitive baseline test scores. However, relatively little investigation has been conducted regarding the effect of pretest exercise on baseline testing. The aim of our investigation was to determine the effect of pretest exercise on baseline Immediate Post-Concussion Assessment and Cognitive Testing (ImPACT) scores in adolescent and young adult athletes. We hypothesized that athletes undergoing self-reported strenuous exercise within 3 hours of baseline testing would perform more poorly on neurocognitive metrics and would report a greater number of symptoms than those who had not completed such exercise. Cross-sectional study; Level of evidence, 3. The ImPACT records of 18,245 adolescent and young adult athletes were retrospectively analyzed. After application of inclusion and exclusion criteria, participants were dichotomized into groups based on a positive (n = 664) or negative (n = 6609) self-reported history of strenuous exercise within 3 hours of the baseline test. Participants with a positive history of exercise were then randomly matched, based on age, sex, education level, concussion history, and hours of sleep prior to testing, on a 1:2 basis with individuals who had reported no pretest exercise. The baseline ImPACT composite scores of the 2 groups were then compared. Significant differences were observed for the ImPACT composite scores of verbal memory, visual memory, reaction time, and impulse control as well as for the total symptom score. No significant between-group difference was detected for the visual motor composite score. Furthermore, pretest exercise was associated with a significant increase in the overall frequency of invalid test results. Our results suggest a statistically significant difference in ImPACT composite scores between
Proposed Confidence Scale and ID Score in the Identification of Known-Unknown Compounds Using High Resolution MS Data

NASA Astrophysics Data System (ADS)

Rochat, Bertrand

2017-04-01

High-resolution (HR) MS instruments recording HR full scans allow analysts to go beyond pre-acquisition choices. Untargeted acquisition can reveal unexpected compounds or concentrations and can be performed as a preliminary diagnostic attempt. The revealed compounds then have to be identified for interpretation. Whereas reference standards are mandatory to confirm an identification, the diverse information collected from HRMS allows unknown compounds to be identified with a relatively high degree of confidence without reference standards injected in the same analytical sequence. However, the degree of confidence in putative identifications needs to be evaluated, possibly before further targeted analyses. This is why a confidence scale and a score for the identification of (non-peptidic) known-unknowns, defined as compounds with entries in databases, are proposed for (LC-)HRMS data. The scale is based on two representative documents edited by the European Commission (2007/657/EC) and the Metabolomics Standard Initiative (MSI), in an attempt to build a bridge between the communities of metabolomics and screening labs. With this confidence scale, an identification (ID) score is determined as [a number, a letter, and a number] (e.g., 2D3), from the following three criteria: I, a General Identification Category (1, confirmed, 2, putatively identified, 3, annotated compounds/classes, and 4, unknown); II, a Chromatography Class based on the relative retention time (from the narrowest tolerance, A, to no chromatographic references, D); and III, an Identification Point Level (1, very high, 2, high, and 3, normal level) based on the number of identification points collected. Three putative identification examples of known-unknowns will be presented.
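
The [number, letter, number] ID score described in the record above (e.g., 2D3) lends itself to a small encode/decode sketch. The code below is an illustration, not the author's implementation: the function names are hypothetical, and the treatment of B and C as valid intermediate chromatography classes is an assumption, since the abstract only defines the endpoints A and D.

```python
# Labels taken from the abstract.
CATEGORIES = {
    1: "confirmed",
    2: "putatively identified",
    3: "annotated compounds/classes",
    4: "unknown",
}
POINT_LEVELS = {1: "very high", 2: "high", 3: "normal"}
# Assumption: B and C are intermediate classes between A (narrowest
# retention-time tolerance) and D (no chromatographic references).
CHROMATOGRAPHY_CLASSES = ("A", "B", "C", "D")


def format_id_score(category, chrom_class, point_level):
    """Build an ID score string such as '2D3' from the three criteria."""
    if category not in CATEGORIES:
        raise ValueError(f"unknown identification category: {category!r}")
    if chrom_class not in CHROMATOGRAPHY_CLASSES:
        raise ValueError(f"unknown chromatography class: {chrom_class!r}")
    if point_level not in POINT_LEVELS:
        raise ValueError(f"unknown identification point level: {point_level!r}")
    return f"{category}{chrom_class}{point_level}"


def parse_id_score(score):
    """Split an ID score string into (category, chrom_class, point_level)."""
    if len(score) != 3 or not (score[0].isdigit() and score[2].isdigit()):
        raise ValueError(f"malformed ID score: {score!r}")
    category, chrom_class, point_level = int(score[0]), score[1], int(score[2])
    # Re-use the formatter's validation so both directions agree.
    format_id_score(category, chrom_class, point_level)
    return category, chrom_class, point_level


category, chrom_class, point_level = parse_id_score("2D3")
description = (CATEGORIES[category], chrom_class, POINT_LEVELS[point_level])
```

Read this way, the abstract's example 2D3 decodes to a putatively identified compound, with no chromatographic references, at the normal identification point level.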
Observed-Score Equating as a Test Assembly Problem.

ERIC Educational Resources Information Center

van der Linden, Wim J.; Luecht, Richard M.

1998-01-01

Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
A Review of Scoring Algorithms for Ability and Aptitude Tests.

ERIC Educational Resources Information Center

Chevalier, Shirley A.

In conventional practice, most educators and educational researchers score cognitive tests using a dichotomous right-wrong scoring system. Although simple and straightforward, this method does not take into consideration other factors, such as partial knowledge or guessing tendencies and abilities. This paper discusses alternative scoring models:…
Score tests for independence in semiparametric competing risks models.

PubMed

Saïd, Mériem; Ghazzali, Nadia; Rivest, Louis-Paul

2009-12-01

A popular model for competing risks postulates the existence of a latent unobserved failure time for each risk. Assuming that these underlying failure times are independent is attractive since it allows standard statistical tools for right-censored lifetime data to be used in the analysis. This paper proposes simple independence score tests for the validity of this assumption when the individual risks are modeled using semiparametric proportional hazards regressions. It assumes that covariates are available, making the model identifiable. The score tests are derived for alternatives that specify that copulas are responsible for a possible dependency between the competing risks. The test statistics are constructed by adding to the partial likelihoods for the individual risks an explanatory variable for the dependency between the risks. A variance estimator is derived by writing the score function and the Fisher information matrix for the marginal models as stochastic integrals. Pitman efficiencies are used to compare test statistics. A simulation study and a numerical example illustrate the methodology proposed in this paper.
Odour identification test and its relation to cardiac 123I‐metaiodobenzylguanidine in patients with drug induced parkinsonism

PubMed Central

Lee, Phil Hyu; Yeo, Seung Hyeon; Yong, Seok Woo; Kim, Yun Joong

2007-01-01

We investigated olfactory function and its relation to cardiac 123I‐metaiodobenzylguanidine (MIBG) uptake in 15 patients with drug induced parkinsonism (DIP). The mean Cross Cultural Smell Identification (CCSI) score was significantly greater in patients with DIP than in those with Parkinson's disease (PD: 6.9 (1.6) vs 4.4 (2.2); p<0.001); however, the mean CCSI score in patients with DIP was not significantly different from controls. One patient with DIP, whose CCSI score was significantly reduced, also exhibited decreased cardiac MIBG uptake. DIP patients with CCSI scores within the normal range had normal cardiac MIBG uptake. Our study suggests that an olfactory function test may be a useful tool for detecting DIP unrelated to PD and for identifying patients with DIP who have subclinical PD. PMID:17557797
10 CFR 707.7 - Random drug testing requirements and identification of testing designated positions.

Code of Federal Regulations, 2012 CFR

2012-01-01

... 10 Energy 4 2012-01-01 2012-01-01 false Random drug testing requirements and identification of... PROGRAMS AT DOE SITES Procedures § 707.7 Random drug testing requirements and identification of testing... evidence of the use of illegal drugs of employees in testing designated positions identified in this...
10 CFR 707.7 - Random drug testing requirements and identification of testing designated positions.

Code of Federal Regulations, 2014 CFR

2014-01-01

... 10 Energy 4 2014-01-01 2014-01-01 false Random drug testing requirements and identification of... PROGRAMS AT DOE SITES Procedures § 707.7 Random drug testing requirements and identification of testing... evidence of the use of illegal drugs of employees in testing designated positions identified in this...
10 CFR 707.7 - Random drug testing requirements and identification of testing designated positions.

Code of Federal Regulations, 2011 CFR

2011-01-01

... 10 Energy 4 2011-01-01 2011-01-01 false Random drug testing requirements and identification of... PROGRAMS AT DOE SITES Procedures § 707.7 Random drug testing requirements and identification of testing... evidence of the use of illegal drugs of employees in testing designated positions identified in this...
10 CFR 707.7 - Random drug testing requirements and identification of testing designated positions.

Code of Federal Regulations, 2013 CFR

2013-01-01

... 10 Energy 4 2013-01-01 2013-01-01 false Random drug testing requirements and identification of... PROGRAMS AT DOE SITES Procedures § 707.7 Random drug testing requirements and identification of testing... evidence of the use of illegal drugs of employees in testing designated positions identified in this...
10 CFR 707.7 - Random drug testing requirements and identification of testing designated positions.

Code of Federal Regulations, 2010 CFR

2010-01-01

... 10 Energy 4 2010-01-01 2010-01-01 false Random drug testing requirements and identification of... PROGRAMS AT DOE SITES Procedures § 707.7 Random drug testing requirements and identification of testing... evidence of the use of illegal drugs of employees in testing designated positions identified in this...

Olfactory identification and Stroop interference converge in schizophrenia.

PubMed Central

Purdon, S E

1998-01-01

OBJECTIVE: To test the discriminant validity of a model predicting a dissociation between measures of right and left frontal lobe function in people with schizophrenia. PARTICIPANTS: Twenty-one clinically stable outpatients with schizophrenia. INTERVENTIONS: Patients were administered the University of Pennsylvania Smell Identification Test (UPSIT), the Stroop Color-Word Test (Stroop), and the Positive and Negative Syndrome Scale (PANSS). OUTCOME MEASURES: Scores on these tests and relation among scores. RESULTS: There was a convergence of UPSIT and Stroop interference scores consistent with a common cerebral basis for limitations in olfactory identification and inhibition of distraction. There was also a divergence of UPSIT and Stroop reading scores suggesting that the olfactory identification limitation is distinct from a general limitation of attention or a dysfunction of the left dorsolateral prefrontal cortex. Most notable was the 81% classification convergence between the UPSIT and Stroop incongruous colour naming scores compared with the near-random 57% classification convergence of the UPSIT and Stroop reading scores. CONCLUSIONS: These data are consistent with a right orbitofrontal dysfunction in a subgroup of patients with schizophrenia, although the involvement of mesial temporal structures in both tasks must be ruled out with further study. A multifactorial model depicting contributions from diverse cerebral structures is required to describe the pathophysiology of schizophrenia. Valid behavioural methods for classifying suspected subgroups of patients with particular cerebral dysfunction would be of value in the construction of this model. PMID:9595890
Relationships of Declining Test Scores and Grade Inflation.

ERIC Educational Resources Information Center

Bellott, Fred K.

The relationship between declining scores on national standardized tests and grade inflation is explored. Grade inflation refers to the indicated measure of evaluation of student performance having higher placement than is usual based on the performances. Data for this study were taken from the American College Testing (ACT) Program Class Profile…
D.C. Student Test Scores Show Uneven Progress. Data Snapshot

ERIC Educational Resources Information Center

DuPre, Mary

2011-01-01

Over the past five years, both DC Public Schools (DCPS) and public charter schools (PCS) have seen significant growth in secondary reading and math scores on the state test known as the District of Columbia Comprehensive Assessment System (DC CAS). However, scores have not improved as much at the elementary level. Reading and math scores for DCPS…
Reliability of Total Test Scores When Considered as Ordinal Measurements

ERIC Educational Resources Information Center

Biswas, Ajoy Kumar

2006-01-01

This article studies the ordinal reliability of (total) test scores. This study is based on a classical-type linear model of observed score (X), true score (T), and random error (E). Based on the idea of Kendall's tau-a coefficient, a measure of ordinal reliability for small-examinee populations is developed. This measure is extended to large…
Correlation of Simulation Examination to Written Test Scores for Advanced Cardiac Life Support Testing: Prospective Cohort Study.

PubMed

Strom, Suzanne L; Anderson, Craig L; Yang, Luanna; Canales, Cecilia; Amin, Alpesh; Lotfipour, Shahram; McCoy, C Eric; Osborn, Megan Boysen; Langdorf, Mark I

2015-11-01

Traditional Advanced Cardiac Life Support (ACLS) courses are evaluated using written multiple-choice tests. High-fidelity simulation is a widely used adjunct to didactic content, and has been used in many specialties as a training resource as well as an evaluative tool. There are no data to our knowledge that compare simulation examination scores with written test scores for ACLS courses. To compare and correlate a novel high-fidelity simulation-based evaluation with traditional written testing for senior medical students in an ACLS course. We performed a prospective cohort study to determine the correlation between simulation-based evaluation and traditional written testing in a medical school simulation center. Students were tested on a standard acute coronary syndrome/ventricular fibrillation cardiac arrest scenario. Our primary outcome measure was correlation of exam results for 19 volunteer fourth-year medical students after a 32-hour ACLS-based Resuscitation Boot Camp course. Our secondary outcome was comparison of simulation-based vs. written outcome scores. The composite average score on the written evaluation was substantially higher (93.6%) than the simulation performance score (81.3%, absolute difference 12.3%, 95% CI [10.6-14.0%], p<0.00005). We found a statistically significant moderate correlation between simulation scenario test performance and traditional written testing (Pearson r=0.48, p=0.04), validating the new evaluation method. Simulation-based ACLS evaluation methods correlate with traditional written testing and demonstrate resuscitation knowledge and skills. Simulation may be a more discriminating and challenging testing method, as students scored higher on written evaluation methods compared to simulation.
Between-District Test Score Variation, 2009-2012

ERIC Educational Resources Information Center

Fahle, Erin; Reardon, Sean

2016-01-01

Describing the variation in test scores between and within school districts is critical for: (1) policy-related and descriptive work that investigates the sorting of students among districts and the differential effectiveness of those districts; and (2) methodological work planning future experiments or interventions. Intraclass…
Deficits in recognition, identification, and discrimination of facial emotions in patients with bipolar disorder.

PubMed

Benito, Adolfo; Lahera, Guillermo; Herrera, Sara; Muncharaz, Ramón; Benito, Guillermo; Fernández-Liria, Alberto; Montes, José Manuel

2013-01-01

To analyze the recognition, identification, and discrimination of facial emotions in a sample of outpatients with bipolar disorder (BD). Forty-four outpatients with diagnosis of BD and 48 matched control subjects were selected. Both groups were assessed with tests for recognition (Emotion Recognition-40 - ER40), identification (Facial Emotion Identification Test - FEIT), and discrimination (Facial Emotion Discrimination Test - FEDT) of facial emotions, as well as a theory of mind (ToM) verbal test (Hinting Task). Differences between groups were analyzed, controlling the influence of mild depressive and manic symptoms. Patients with BD scored significantly lower than controls on recognition (ER40), identification (FEIT), and discrimination (FEDT) of emotions. Regarding the verbal measure of ToM, a lower score was also observed in patients compared to controls. Patients with mild syndromal depressive symptoms obtained outcomes similar to patients in euthymia. A significant correlation between FEDT scores and global functioning (measured by the Functioning Assessment Short Test, FAST) was found. These results suggest that, even in euthymia, patients with BD experience deficits in recognition, identification, and discrimination of facial emotions, with potential functional implications.
Efficacy of the alcohol use disorders identification test as a screening tool for hazardous alcohol intake and related disorders in primary care: a validity study.

PubMed Central

Piccinelli, M.; Tessari, E.; Bortolomasi, M.; Piasere, O.; Semenzin, M.; Garzotto, N.; Tansella, M.

1997-01-01

OBJECTIVE: To determine the properties of the alcohol use disorders identification test in screening primary care attenders for alcohol problems. DESIGN: A validity study among consecutive primary care attenders aged 18-65 years. Every third subject completed the alcohol use disorders identification test (a 10 item self report questionnaire on alcohol intake and related problems) and was interviewed by an investigator with the composite international diagnostic interview alcohol use module (a standardised interview for the independent assessment of alcohol intake and related disorders). SETTING: 10 primary care clinics in Verona, north eastern Italy. PATIENTS: 500 subjects were approached and 482 (96.4%) completed evaluation. RESULTS: When the alcohol use disorders identification test was used to detect subjects with alcohol problems the area under the receiver operating characteristic curve was 0.95. The cut off score of 5 was associated with a sensitivity of 0.84, a specificity of 0.90, and a positive predictive value of 0.60. The screening ability of the total score derived from summing the responses to the five items minimising the probability of misclassification between subjects with and without alcohol problems provided an area under the receiver operating characteristic curve of 0.93. A score of 5 or more on the five items was associated with a sensitivity of 0.79, a specificity of 0.95, and a positive predictive value of 0.73. CONCLUSIONS: The alcohol use disorders identification test performs well in detecting subjects with formal alcohol disorders and those with hazardous alcohol intake. Using five of the 10 items on the questionnaire gives reasonable accuracy, and these are recommended as questions of choice to screen patients for alcohol problems. PMID:9040389
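
The relationship among the sensitivity, specificity, and positive predictive value reported above can be sketched with Bayes' rule. This is a minimal illustration, not the authors' analysis: the function names are hypothetical, and the prevalence is back-calculated from the rounded figures in the abstract, so it is only approximate.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """PPV = P(condition | positive test), via Bayes' rule."""
    true_pos = sensitivity * prevalence
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    return true_pos / (true_pos + false_pos)


def implied_prevalence(sensitivity, specificity, ppv):
    """Solve the PPV formula for prevalence (hypothetical helper)."""
    false_pos_rate = 1.0 - specificity
    return (ppv * false_pos_rate) / (
        sensitivity * (1.0 - ppv) + ppv * false_pos_rate
    )


# Rounded figures from the abstract: full 10-item test, cut-off score of 5.
prev_full = implied_prevalence(sensitivity=0.84, specificity=0.90, ppv=0.60)
# Rounded figures from the abstract: five-item version, score of 5 or more.
prev_short = implied_prevalence(sensitivity=0.79, specificity=0.95, ppv=0.73)

print(f"implied prevalence (10 items): {prev_full:.3f}")
print(f"implied prevalence (5 items):  {prev_short:.3f}")
```

Both sets of figures imply a similar prevalence of alcohol problems in the sample, which is a quick consistency check on the reported values. It also shows why a positive predictive value depends on the population screened, not only on the test.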
The Persisting Racial Scoring Gap on Graduate and Professional School Admission Tests.

ERIC Educational Resources Information Center

Journal of Blacks in Higher Education, 2003

2003-01-01

Discusses the racial scoring gap on tests for admission to medical, business, law, and other graduate programs, noting that in the highest-scoring brackets on the Medical College Admission Test (MCAT), the racial gap is even larger. Whites are five times, twelve times, and seven times more likely, respectively, to score higher on the MCAT, Law…
Identification of dynapenia in older adults through the use of grip strength t-scores.

PubMed

Bohannon, Richard W; Magasi, Susan

2015-01-01

The aim of this study was to generate reference values and t-scores (1.0-2.5 standard deviations below average) for grip strength for healthy young adults and to examine the utility of t-scores from this group for the identification of dynapenia in older adults. Our investigation was a population-based, general community secondary analysis of cross-sectional grip strength data utilizing the NIH Toolbox Assessment norming sample. Participants consisted of community-dwelling adults, with age ranges of 20-40 years (n = 558) and 60-85 years (n = 390). The main outcome measure was grip strength using a Jamar plus dynamometer. Maximum grip strengths were consistent over the 20-40-year age group [men 108.0 (SD 22.6) pounds, women 65.8 (SD 14.6) pounds]. Comparison of older group grip strengths to those of the younger reference group revealed (depending on age strata) that 46.2-87.1% of older men and 50.0-82.4% of older women could be designated as dynapenic on the basis of t-scores. The use of reference value t-scores from younger adults is a promising method for determining dynapenia in older adults. © 2014 Wiley Periodicals, Inc.
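
The t-score idea in this abstract can be sketched as follows: an older adult's grip strength is expressed in standard deviations from the young-adult reference mean. The reference means and SDs are taken from the abstract; the function names and the example measurement are assumptions for illustration. The abstract gives a range of thresholds (1.0 to 2.5 SD below average) rather than a single cut-off, so the sketch simply lists several.

```python
# Young-adult (20-40 years) reference values from the abstract, in pounds.
REFERENCE = {
    "men": {"mean": 108.0, "sd": 22.6},
    "women": {"mean": 65.8, "sd": 14.6},
}


def t_score(grip_lb, sex):
    """Standard deviations above (+) or below (-) the young-adult mean."""
    ref = REFERENCE[sex]
    return (grip_lb - ref["mean"]) / ref["sd"]


def cutoff(sex, sds_below):
    """Grip strength (lb) lying `sds_below` SDs under the reference mean."""
    ref = REFERENCE[sex]
    return ref["mean"] - sds_below * ref["sd"]


for sex in ("men", "women"):
    for k in (1.0, 1.5, 2.0, 2.5):
        print(f"{sex}: {k} SD below average = {cutoff(sex, k):.1f} lb")

# Hypothetical measurement: an older man with a grip strength of 70 lb.
print(f"t-score: {t_score(70.0, 'men'):.2f}")
```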
Comparability of IQ Scores on Five Widely Used Intelligence Tests

ERIC Educational Resources Information Center

Hieronymus, A. N.; Stroud, James B.

1969-01-01

Attempts to fill research gap on testing by obtaining comparisons of deviation scores, at grade levels four, seven, and ten, from the California Test of Mental Maturity, Henmon-Nelson Tests, and Lorge-Thorndike Intelligence tests. Results tabulated. (CJ)
Effect of Septorhinoplasty on Olfactory Function: Assessment Using the Brief Smell Identification Test.

PubMed

Dengiz, Ramazan; Haytoğlu, Süheyl; Görgülü, Orhan; Doğru, Mehmet; Arıkan, Osman Kürşat

2015-03-01

Septorhinoplasty (SRP), one of the most commonly performed rhinologic surgery procedures, can affect olfactory function; however, the findings of studies investigating smell following SRP are controversial. We used a culturally adapted modified Brief Smell Identification Test (B-SIT) to investigate the long- and short-term effects of SRP on olfactory function. We enrolled 59 patients admitted to the Ear-Nose-Throat Clinic, who were complaining of external nasal deformity and nasal obstruction. Functional SRP was performed on all cases. The B-SIT was administered prior to surgery and at 4 and 12 weeks post-surgery. The smell identification score (SIS) reflected the number of correct answers. In addition, we investigated the effects of gender and smoking on olfactory function and whether the SRP procedure changed these associations. The mean preoperative, 4-week, and 12-week postoperative SISs were 10.15±1.30, 10.21±1.52, and 10.92±0.95, respectively. The difference between the preoperative and 4-week postoperative SISs was not statistically significant; however, the 12-week postoperative score was significantly different from the preoperative and 4-week postoperative scores. Furthermore, the repeated measures analysis according to gender and smoking habit revealed a significant difference between the 4- and 12-week postoperative SISs. One patient developed postoperative anosmia; however, the patient recovered in the 12-week postoperative period. SRP surgery is a safe procedure in terms of olfactory function. In addition, olfactory function may increase following surgery as a result of improved nasal airflow.
Exploration of the (Interrater) Reliability and Latent Factor Structure of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) in a Sample of Dutch Probationers.

PubMed

Hildebrand, Martin; Noteborn, Mirthe G C

2015-01-01

The use of brief, reliable, valid, and practical measures of substance use is critical for conducting individual (risk and need) assessments in probation practice. In this exploratory study, the basic psychometric properties of the Alcohol Use Disorders Identification Test (AUDIT) and the Drug Use Disorders Identification Test (DUDIT) are evaluated. The instruments were administered as an oral interview instead of a self-report questionnaire. The sample comprised 383 offenders (339 men, 44 women). A subset of 56 offenders (49 men, 7 women) participated in the interrater reliability study. Data collection took place between September 2011 and November 2012. Overall, both instruments have acceptable levels of interrater reliability for total scores and acceptable to good interrater reliabilities for most of the individual items. Confirmatory factor analyses (CFA) indicated that the a priori one-, two- and three-factor solutions for the AUDIT did not fit the observed data very well. Principal axis factoring (PAF) supported a two-factor solution for the AUDIT that included a level of alcohol consumption/consequences factor (Factor 1) and a dependence factor (Factor 2), with both factors explaining substantial variance in AUDIT scores. For the DUDIT, CFA and PAF suggest that a one-factor solution is the preferred model (accounting for 62.61% of total variance). The Dutch language versions of the AUDIT and the DUDIT are reliable screening instruments for use with probationers and both instruments can be reliably administered by probation officers in probation practice. However, future research on concurrent and predictive validity is warranted.
Sex Differences in Cognitive Abilities Test Scores: A UK National Picture

ERIC Educational Resources Information Center

Strand, Steve; Deary, Ian J.; Smith, Pauline

2006-01-01

Background and aims: There is uncertainty about the extent or even existence of sex differences in the mean and variability of reasoning test scores (Jensen, 1998; Lynn, 1994; Mackintosh, 1996). This paper analyses the Cognitive Abilities Test (CAT) scores of a large and representative sample of UK pupils to determine the extent of any sex…
Teacher Use of Achievement Test Score Data

ERIC Educational Resources Information Center

Miller, Steven C.

2012-01-01

The Wyoming Department of Education (WDE) has invested time and money developing standardized achievement test score reports designed to give teachers data about each of their students' levels of mastery of particular concepts in order to differentiate their instruction. The purpose of this study was to determine the extent to which eighth-grade…
Generalization of the Lord-Wingersky Algorithm to Computing the Distribution of Summed Test Scores Based on Real-Number Item Scores

ERIC Educational Resources Information Center

Kim, Seonghoon

2013-01-01

With known item response theory (IRT) item parameters, Lord and Wingersky provided a recursive algorithm for computing the conditional frequency distribution of number-correct test scores, given proficiency. This article presents a generalized algorithm for computing the conditional distribution of summed test scores involving real-number item…
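
The classic Lord-Wingersky recursion for dichotomous (0/1) items, which the abstract takes as its starting point, can be sketched as below. This is the standard number-correct version, not the generalized real-number algorithm the article presents. The function name is hypothetical, and the per-item probabilities of a correct response are assumed to have been computed already from an IRT model at a fixed proficiency.

```python
def lord_wingersky(p_correct):
    """Distribution of the number-correct score for independent 0/1 items.

    p_correct[i] is the probability of answering item i correctly at a
    fixed proficiency. Returns a list where entry s is P(score == s).
    """
    dist = [1.0]  # before any item is added, the score is 0 with certainty
    for p in p_correct:
        new_dist = [0.0] * (len(dist) + 1)
        for score, prob in enumerate(dist):
            new_dist[score] += prob * (1.0 - p)  # item answered incorrectly
            new_dist[score + 1] += prob * p      # item answered correctly
        dist = new_dist
    return dist


# Hypothetical item probabilities at one proficiency level.
probs = [0.9, 0.7, 0.5, 0.3]
for score, prob in enumerate(lord_wingersky(probs)):
    print(f"P(score = {score}) = {prob:.4f}")
```

Adding items one at a time keeps the cost quadratic in the number of items, whereas enumerating every response pattern grows exponentially.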
The Alcohol Use Disorders Identification Test (AUDIT): reliability and validity of the Greek version.

PubMed

Moussas, George; Dadouti, Georgia; Douzenis, Athanassios; Poulis, Evangelos; Tzelembis, Athanassios; Bratis, Dimitris; Christodoulou, Christos; Lykouras, Lefteris

2009-05-14

Problems associated with alcohol abuse are recognised by the World Health Organization as a major health issue, which according to most recent estimations is responsible for 1.4% of the total world burden of morbidity and has been proven to increase mortality risk by 50%. Because of the size and severity of the problem, early detection is very important. This requires easy to use and specific tools. One of these is the Alcohol Use Disorders Identification Test (AUDIT). This study aims to standardise the questionnaire in a Greek population. AUDIT was translated and back-translated from its original language by two English-speaking psychiatrists. The tool contains 10 questions. A score ≥ 11 is an indication of serious abuse/dependence. In the study, 218 subjects took part: 128 were males and 90 females. The average age was 40.71 years (± 11.34). From the 218 individuals, 109 (75 male, 34 female) fulfilled the criteria for alcohol dependence according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV), and presented requesting admission; 109 subjects (53 male, 56 female) were healthy controls. Internal reliability (Cronbach alpha) was 0.80 for the controls and 0.80 for the alcohol-dependent individuals. Controls had significantly lower average scores (t test P < 0.001) when compared to the alcoholics. The questionnaire's sensitivity for scores >8 was 0.98 and its specificity was 0.94 for the same score. For the alcohol-dependent sample 3% scored as false negatives and from the control group 1.8% scored false positives. In the alcohol-dependent sample there was no difference between males and females in their average scores (t test P > 0.05). The Greek version of AUDIT has increased internal reliability and validity. It detects 97% of the alcohol-dependent individuals and has a high sensitivity and specificity. AUDIT is easy to use, quick and reliable and can be very useful in detecting alcohol problems in sensitive populations.
Normative Performance on the Brief Smell Identification Test (BSIT) in a Multi-Ethnic Bilingual Cohort: A Project FRONTIER Study

PubMed Central

Menon, Chloe; Westervelt, Holly James; Jahn, Danielle R.; Dressel, Jeffrey A.; O’Bryant, Sid E.

2013-01-01

The Brief Smell Identification Test (BSIT) is a commonly used measure of olfactory functioning in elderly populations. Few studies have provided normative data for this measure, and minimal data are available regarding the impact of sociodemographic factors on test scores. This study presents normative data for the BSIT in a sample of English- and Spanish-speaking Hispanic and non-Hispanic Whites. A Rasch analysis was also conducted to identify the items that best discriminated between varying levels of olfactory functioning, as measured by the BSIT. The total sample included 302 older adults seen as part of an ongoing study of rural cognitive aging, Project FRONTIER. Hierarchical regression analyses revealed that BSIT scores require adjustment by age and gender, but years of education, ethnicity, and language did not significantly influence BSIT performance. Four items best discriminated between varying levels of smell identification, accounting for 59.44% of total information provided by the measure. However, items did not represent a continuum of difficulty on the BSIT. The results of this study indicate that the BSIT appears to be well-suited for assessing odor identification deficits in older adults of diverse backgrounds, but that fine-tuning of this instrument may be recommended in light of its items’ difficulty and discrimination parameters. Clinical and empirical implications are discussed. PMID:23634698
Yeast identification: reassessment of assimilation tests as sole universal identifiers.

PubMed

Spencer, J; Rawling, S; Stratford, M; Steels, H; Novodvorska, M; Archer, D B; Chandra, S

2011-11-01

To assess whether assimilation tests in isolation remain a valid method of identification of yeasts, when applied to a wide range of environmental and spoilage isolates. Seventy-one yeast strains were isolated from a soft drinks factory. These were identified using assimilation tests and by D1/D2 rDNA sequencing. When compared to sequencing, assimilation test identifications (MicroLog™) were 18·3% correct, a further 14·1% correct within the genus and 67·6% were incorrectly identified. The majority of the latter could be attributed to the rise in newly reported yeast species. Assimilation tests alone are unreliable as a universal means of yeast identification, because of numerous new species, variability of strains and increasing coincidence of assimilation profiles. Assimilation tests still have a useful role in the identification of common species, such as the majority of clinical isolates. It is probable, based on these results, that many yeast identifications reported in older literature are incorrect. This emphasizes the crucial need for accurate identification in present and future publications. © 2011 The Authors. Letters in Applied Microbiology © 2011 The Society for Applied Microbiology.
Misidentifying Factors Underlying Singapore's High Test Scores

ERIC Educational Resources Information Center

Usiskin, Zalman

2012-01-01

Singapore students have scored exceedingly well on international tests in mathematics. In response, there has been a desire in the United States--both at the policy level and at the school level--to emulate Singapore. Because what can be identified most easily about Singapore's school mathematics can be gleaned from curriculum documents from the…

Effects of age and cognition on a cross-cultural paediatric adaptation of the Sniffin' Sticks Identification Test.

PubMed

Bastos, Laís Orrico Donnabella; Guerreiro, Marilisa Mantovani; Lees, Andrew John; Warner, Thomas T; Silveira-Moriyama, Laura

2015-01-01

To study the effects of age and cognition on the performance of children aged 3 to 18 years on a culturally adapted version of the 16 item smell identification test from Sniffin' Sticks (SS16). A series of pilots were conducted on 29 children aged 3 to 18 years old and 23 adults to produce an adapted version of the SS16 suitable for Brazilian children (SS16-Child). A final version was applied to 51 children alongside a picture identification test (PIT-SS16-Child) to assess cognitive abilities involved in the smell identification task. In addition, 20 adults performed the same tasks as a comparison group. The final adapted SS16-Child was applied to 51 children with a mean age of 9.9 years (range 3-18 years, SD=4.25 years), of which 68.3% were girls. There was an independent effect of age (p<0.05) and PIT-SS16-Child (p<0.001) on the performance on the SS16-Child, and older children reached the ceiling for scoring in the cognitive and olfactory test. Pre-school children had difficulties identifying items of the test. A cross-culturally adapted version of the SS16 can be used to test olfaction in children but interpretation of the results must take age and cognitive abilities into consideration.
A weighted generalized score statistic for comparison of predictive values of diagnostic tests

PubMed Central

Kosinski, Andrzej S.

2013-01-01

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations which are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we present, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic which incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, it always reduces to the score statistic in the independent samples situation, and it preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the weighted generalized score test statistic in a general GEE setting. PMID:22912343
Using Raters from India to Score a Large-Scale Speaking Test

ERIC Educational Resources Information Center

Xi, Xiaoming; Mollaun, Pam

2011-01-01

We investigated the scoring of the Speaking section of the Test of English as a Foreign Language[TM] Internet-based (TOEFL iBT[R]) test by speakers of English and one or more Indian languages. We explored the extent to which raters from India, after being trained and certified, were able to score the TOEFL examinees with mixed first languages…
The impact of testing accommodations on MCAT scores: descriptive results.

PubMed

Julian, Ellen R; Ingersoll, Deborah J; Etienne, Patricia M; Hilger, Anthony E

2004-04-01

Medical College Admission Test (MCAT) examinees with disabilities who receive accommodations receive flagged scores indicating nonstandard administration. This report compares MCAT examinees who received accommodations and their performances with standard examinees. Aggregate history records of all 1994-2000 MCAT examinees were identified as flagged (2,401) or standard (297,880), then further sorted by race/ethnicity (broadly identified as underrepresented minority and non-URM, at the time of testing) and gender. Those with flagged scores were also classified by disability (LD = learning disability, ADHD = attention deficit hyperactivity disorder, LD/ADHD = learning disability and attention deficit hyperactivity disorder, and Other = other disability) and type of accommodation. Mean MCAT scores were calculated for all groups. A group of 866 examinees took the MCAT first as a standard administration and subsequently with accommodations. In a separate analysis, their two sets of scores were compared. Less than 1% of examinees (2,401) had accommodations; of these, 55% were LD, 17% ADHD, 5% LD/ADHD, and 23% Other. Extended time was the most frequently provided accommodation. Mean flagged scores slightly exceeded mean standard scores on all MCAT sections. Examinees who retook the MCAT with accommodations after a standard administration increased their scores by six points, quadrupling the average gain of a Standard-Standard retest cohort from another study. The small but statistically significant advantage of flagged scores may reflect either appropriate compensation or overly generous accommodations. Extended time had a positive impact on the scores of those who retested with this accommodation. The validity of the flagged MCAT in predicting success in medical school is not known, and further investigation is underway.
Leveraging Gender Differences to Boost Test Scores

ERIC Educational Resources Information Center

Costello, Bill

2008-01-01

According to the 2004 National Assessment of Educational Progress, males who have made it through 12 years of school have significantly poorer reading skills than their female peers. In every age group, boys have been scoring lower than girls annually for more than three decades on U.S. Department of Education reading tests. The longer boys are in…
Identification of Dynapenia in Older Adults Through the Use of Grip Strength T-Scores

PubMed Central

Bohannon, Richard W; Magasi, Susan

2014-01-01

Objective: To generate reference values and t-scores (1.0 to 2.5 standard deviations below average) for grip strength for healthy young adults and to examine the utility of t-scores from this group for the identification of dynapenia in older adults. Design: Secondary analysis of cross-sectional grip strength data from the NIH Toolbox norming sample. Setting: Population-based general community sample. Participants: Community-dwelling adults between the ages of 20 and 40 years (n=558) and 60 and 85 years (n=390). Main Outcome Measures: Grip strength measured with a Jamar plus dynamometer. Results: Maximum grip strengths were consistent over the 20–40 year age span. For men they were 108.0 lbs (S.D. 22.6). For women, they were 65.8 lbs (S.D. 14.6). Comparison of older participant grip strengths to those of the younger reference group revealed (depending on age strata) that 46.2–87.1% of older men and 50.0–82.4% of older women could be designated as dynapenic on the basis of t-scores. Conclusion: The use of reference value t-scores from younger adults is a promising method for determining dynapenia in older adults. PMID:24729356
Test Score Stability and the Relationship of Adult Manifest Anxiety Scale-College Version Scores to External Variables among Graduate Students

ERIC Educational Resources Information Center

Lowe, Patricia A.; Peyton, Vicki; Reynolds, Cecil R.

2007-01-01

A sample of 79 individuals participated in the present study to evaluate the test score stability (8-week test-retest interval) and construct validity of the scores of the Adult Manifest Anxiety Scale-College Version, a new measure used to assess anxiety in college students, for application to graduate-level students. Results of the study…
An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments

ERIC Educational Resources Information Center

Dimitrov, Dimiter M.

2016-01-01

This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…
Developing Test Score Reports that Work: The Process and Best Practices for Effective Communication

ERIC Educational Resources Information Center

Zenisky, April L.; Hambleton, Ronald K.

2012-01-01

Test scores matter these days. Test-takers want to understand how they performed, and test score reports, particularly those for individual examinees, are the vehicles by which most people get the bulk of this information. Historically, score reports have not always met the examinees' information or usability needs, but this is clearly changing…
Testing Students with Special Educational Needs in Large-Scale Assessments – Psychometric Properties of Test Scores and Associations with Test Taking Behavior

PubMed Central

Pohl, Steffi; Südkamp, Anna; Hardt, Katinka; Carstensen, Claus H.; Weinert, Sabine

2016-01-01

Assessing competencies of students with special educational needs in learning (SEN-L) poses a challenge for large-scale assessments (LSAs). For students with SEN-L, the available competence tests may fail to yield test scores of high psychometric quality, which are—at the same time—measurement invariant to test scores of general education students. We investigated whether we can identify a subgroup of students with SEN-L, for which measurement invariant competence measures of adequate psychometric quality may be obtained with tests available in LSAs. We furthermore investigated whether differences in test-taking behavior may explain dissatisfying psychometric properties and measurement non-invariance of test scores within LSAs. We relied on person fit indices and mixture distribution models to identify students with SEN-L for whom test scores with satisfactory psychometric properties and measurement invariance may be obtained. We also captured differences in test-taking behavior related to guessing and missing responses. As a result we identified a subgroup of students with SEN-L for whom competence scores of adequate psychometric quality that are measurement invariant to those of general education students were obtained. Concerning test taking behavior, there was a small number of students who unsystematically picked response options. Removing these students from the sample slightly improved item fit. Furthermore, two different patterns of missing responses were identified that explain to some extent problems in the assessments of students with SEN-L. PMID:26941665
Flow and diffusion of high-stakes test scores.

PubMed

Marder, M; Bansal, D

2009-10-13

We apply visualization and modeling methods for convective and diffusive flows to public school mathematics test scores from Texas. We obtain plots that show the most likely future and past scores of students, the effects of random processes such as guessing, and the rate at which students appear in and disappear from schools. We show that student outcomes depend strongly upon economic class, and identify the grade levels where flows of different groups diverge most strongly. Changing the effectiveness of instruction in one grade naturally leads to strongly nonlinear effects on student outcomes in subsequent grades.
Scoring systems for the Clock Drawing Test: A historical review

PubMed Central

Spenciere, Bárbara; Alves, Heloisa; Charchat-Fichman, Helenice

2017-01-01

The Clock Drawing Test (CDT) is a simple neuropsychological screening instrument that is well accepted by patients and has solid psychometric properties. Several different CDT scoring methods have been developed, but no consensus has been reached regarding which scoring method is the most accurate. This article reviews the literature on these scoring systems and the changes they have undergone over the years. Historically, different types of scoring systems emerged. Initially, the focus was on screening for dementia, and the methods were both quantitative and semi-quantitative. Later, the need for an early diagnosis called for a scoring system that can detect subtle errors, especially those related to executive function. Therefore, qualitative analyses began to be used for both differential and early diagnoses of dementia. A widely used qualitative method was proposed by Rouleau et al. (1992). Tracing the historical path of these scoring methods is important for developing additional scoring systems and furthering dementia prevention research. PMID:29213488
Effects of Test Media on Different EFL Test-Takers in Writing Scores and in the Cognitive Writing Process

ERIC Educational Resources Information Center

Zou, Xiao-Ling; Chen, Yan-Min

2016-01-01

The effects of computer and paper test media on EFL test-takers with different computer familiarity in writing scores and in the cognitive writing process have been comprehensively explored from the learners' aspect as well as on the basis of related theories and practice. The results indicate significant differences in test scores among the…
The Effect of Schooling and Ability on Achievement Test Scores. NBER Working Paper Series.

ERIC Educational Resources Information Center

Hansen, Karsten; Heckman, James J.; Mullen, Kathleen J.

This study developed two methods for estimating the effect of schooling on achievement test scores that control for the endogeneity of schooling by postulating that both schooling and test scores are generated by a common unobserved latent ability. The methods were applied to data on schooling and test scores. Estimates from the two methods are in…
Descriptive Statistics for Modern Test Score Distributions: Skewness, Kurtosis, Discreteness, and Ceiling Effects.

PubMed

Ho, Andrew D; Yu, Carol C

2015-06-01

Many statistical analyses benefit from the assumption that unconditional or conditional distributions are continuous and normal. More than 50 years ago in this journal, Lord and Cook chronicled departures from normality in educational tests, and Micceri similarly showed that the normality assumption is met rarely in educational and psychological practice. In this article, the authors extend these previous analyses to state-level educational test score distributions that are an increasingly common target of high-stakes analysis and interpretation. Among 504 scale-score and raw-score distributions from state testing programs from recent years, nonnormal distributions are common and are often associated with particular state programs. The authors explain how scaling procedures from item response theory lead to nonnormal distributions as well as unusual patterns of discreteness. The authors recommend that distributional descriptive statistics be calculated routinely to inform model selection for large-scale test score data, and they illustrate consequences of nonnormality using sensitivity studies that compare baseline results to those from normalized score scales.
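
The routine distributional descriptives recommended above can be sketched for a list of scores as follows. This is a minimal sketch under stated assumptions: it uses the simple population (biased) moment estimators, which may differ from the estimators the authors use, and the function name and sample scores are hypothetical.

```python
def skewness_and_excess_kurtosis(scores):
    """Moment-based skewness and excess kurtosis (population formulas)."""
    n = len(scores)
    mean = sum(scores) / n
    m2 = sum((x - mean) ** 2 for x in scores) / n  # variance
    m3 = sum((x - mean) ** 3 for x in scores) / n  # third central moment
    m4 = sum((x - mean) ** 4 for x in scores) / n  # fourth central moment
    skewness = m3 / m2 ** 1.5
    excess_kurtosis = m4 / m2 ** 2 - 3.0  # 0 for a normal distribution
    return skewness, excess_kurtosis


# Hypothetical raw scores piled up near a maximum of 40 (a ceiling effect).
scores = [40, 40, 40, 39, 39, 38, 37, 35, 32, 25]
skew, kurt = skewness_and_excess_kurtosis(scores)
print(f"skewness = {skew:.2f}, excess kurtosis = {kurt:.2f}")
```

A clearly negative skewness in such a sample would be one signal of the ceiling effects named in the title.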
Principles and Practices of Test Score Equating. Research Report. ETS RR-10-29

ERIC Educational Resources Information Center

Dorans, Neil J.; Moses, Tim P.; Eignor, Daniel R.

2010-01-01

Score equating is essential for any testing program that continually produces new editions of a test and for which the expectation is that scores from these editions have the same meaning over time. Particularly in testing programs that help make high-stakes decisions, it is extremely important that test equating be done carefully and accurately.…
The Role of Test Scores in Explaining Race and Gender Differences in Wages

ERIC Educational Resources Information Center

Blackburn, McKinley L.

2004-01-01

Previous research has suggested that skills reflected in test-score performance on tests such as the Armed Forces Qualification Test (AFQT) can account for some of the racial differences in average wages. I use a more complete set of test scores available with the National Longitudinal Survey of Youth 1979 Cohort to reconsider this evidence, and…
The Alcohol Use Disorders Identification Test for Consumption (AUDIT-C) is more useful than pre-existing laboratory tests for predicting hazardous drinking: a cross-sectional study.

PubMed

Fujii, Hideki; Nishimoto, Naoki; Yamaguchi, Seiko; Kurai, Osamu; Miyano, Masato; Ueda, Wataru; Oba, Hiroko; Aoki, Tetsuya; Kawada, Norifumi; Okawa, Kiyotaka

2016-05-10

It is important to screen for alcohol consumption and drinking customs in a standardized manner. The aim of this study was 1) to investigate whether the AUDIT score is useful for predicting hazardous drinking using optimal cutoff scores and 2) to use multivariate analysis to evaluate whether the AUDIT score was more useful than pre-existing laboratory tests for predicting hazardous drinking. A cross-sectional study using the Alcohol Use Disorders Identification Test (AUDIT) was conducted in 334 outpatients who consulted our internal medicine department. The patients completed self-reported questionnaires and underwent a diagnostic interview, physical examination, and laboratory testing. Forty (23 %) male patients reported daily alcohol consumption ≥ 40 g, and 16 (10 %) female patients reported consumption ≥ 20 g. The optimal cutoff values of hazardous drinking were calculated using a 10-fold cross validation, resulting in an optimal AUDIT score cutoff of 8.2, with a sensitivity of 95.5 %, specificity of 87.0 %, false positive rate of 13.0 %, false negative rate of 4.5 %, and area under the receiver operating characteristic curve of 0.97. Multivariate analysis revealed that the most popular short version of the AUDIT consisting solely of its three consumption items (AUDIT-C) and patient sex were significantly associated with hazardous drinking. The aspartate transaminase (AST)/alanine transaminase (ALT) ratio and mean corpuscular volume (MCV) were weakly significant. This study showed that the AUDIT score and particularly the AUDIT-C score were more useful than the AST/ALT ratio and MCV for predicting hazardous drinking.
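The false negative and false positive rates reported above are the complements of sensitivity and specificity. A minimal sketch of that consistency check, using only the percentages quoted in the abstract (the function name `error_rates` is hypothetical and not from the study):

```python
def error_rates(sensitivity_pct, specificity_pct):
    """Return (false_negative_pct, false_positive_pct).

    Both rates are complements of the corresponding accuracy measures:
    FNR = 100 - sensitivity, FPR = 100 - specificity.
    """
    fnr = round(100.0 - sensitivity_pct, 1)
    fpr = round(100.0 - specificity_pct, 1)
    return fnr, fpr


# Values quoted in the abstract for the optimal AUDIT cutoff.
fnr, fpr = error_rates(95.5, 87.0)
print(f"false negative rate: {fnr} %, false positive rate: {fpr} %")
```

The result matches the 4.5 % and 13.0 % given in the abstract, so the four figures are mutually consistent.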
Does breastfeeding contribute to the racial gap in reading and math test scores?

PubMed

Peters, Kristen E; Huang, Jin; Vaughn, Michael G; Witko, Christopher

2013-10-01

The aim of this study was to examine the impact of divergent breastfeeding practices between Caucasian and African American mothers on the lingering achievement test gap between Caucasian and African American children. The Child Development Supplement of the Panel Study of Income Dynamics, beginning in 1997, followed a cohort of 3563 children aged 0-12 years. Reading and math test scores from 2002 for 1928 children were linked with breastfeeding history. Regression analysis was used to examine associations between ever having been breastfed and duration of breastfeeding and test scores, controlling for characteristics of child, mother, and household. African American students scored significantly lower than Caucasian children by 10.6 and 10.9 points on reading and math tests, respectively. After accounting for the impact of having been breastfed during infancy, the racial test gap decreased by 17% for reading scores and 9% for math scores. Study findings indicate that breastfeeding explains 17% and 9% of the observed gaps in reading and math scores, respectively, between African Americans and Caucasians, an effect larger than most recent educational policy interventions. Renewed efforts around policies and clinical practices that promote and remove barriers for African American mothers to breastfeed should be implemented. Copyright © 2013 Elsevier Inc. All rights reserved.
Identifying Aboriginal-specific AUDIT-C and AUDIT-3 cutoff scores for at-risk, high-risk, and likely dependent drinkers using measures of agreement with the 10-item Alcohol Use Disorders Identification Test.

PubMed

Calabria, Bianca; Clifford, Anton; Shakeshaft, Anthony P; Conigrave, Katherine M; Simpson, Lynette; Bliss, Donna; Allan, Julaine

2014-09-01

The Alcohol Use Disorders Identification Test (AUDIT) is a 10-item alcohol screener that has been recommended for use in Aboriginal primary health care settings. The time it takes respondents to complete AUDIT, however, has proven to be a barrier to its routine delivery. Two shorter versions, AUDIT-C and AUDIT-3, have been used as screening instruments in primary health care. This paper aims to identify the AUDIT-C and AUDIT-3 cutoff scores that most closely identify individuals classified as being at-risk drinkers, high-risk drinkers, or likely alcohol dependent by the 10-item AUDIT. Two cross-sectional surveys were conducted from June 2009 to May 2010 and from July 2010 to June 2011. Aboriginal Australian participants (N = 156) were recruited through an Aboriginal Community Controlled Health Service, and a community-based drug and alcohol treatment agency in rural New South Wales (NSW), and through community-based Aboriginal groups in Sydney NSW. Sensitivity, specificity, and positive and negative predictive values of each score on the AUDIT-C and AUDIT-3 were calculated, relative to cutoff scores on the 10-item AUDIT for at-risk, high-risk, and likely dependent drinkers. Receiver operating characteristic (ROC) curve analyses were conducted to measure the detection characteristics of AUDIT-C and AUDIT-3 for the three categories of risk. The areas under the receiver operating characteristic (AUROC) curves were high for drinkers classified as being at-risk, high-risk, and likely dependent. Recommended cutoff scores for Aboriginal Australians are as follows: at-risk drinkers AUDIT-C ≥ 5, AUDIT-3 ≥ 1; high-risk drinkers AUDIT-C ≥ 6, AUDIT-3 ≥ 2; and likely dependent drinkers AUDIT-C ≥ 9, AUDIT-3 ≥ 3. Adequate sensitivity and specificity were achieved for recommended cutoff scores. AUROC curves were above 0.90.

Discrepancies between modified Medical Research Council dyspnea score and COPD assessment test score in patients with COPD

PubMed Central

Rhee, Chin Kook; Kim, Jin Woo; Hwang, Yong Il; Lee, Jin Hwa; Jung, Ki-Suck; Lee, Myung Goo; Yoo, Kwang Ha; Lee, Sang Haak; Shin, Kyeong-Cheol; Yoon, Hyoung Kyu

2015-01-01

Background and objective: According to the Global Initiative for Chronic Obstructive Lung Disease (GOLD) guidelines, either a modified Medical Research Council (mMRC) dyspnea score of ≥2 or a chronic obstructive pulmonary disease (COPD) assessment test (CAT) score of ≥10 is considered to represent COPD patients who are more symptomatic. We aimed to identify the ideal CAT score that exhibits minimal discrepancy with the mMRC score. Methods: A receiver operating characteristic curve of the CAT score was generated for mMRC scores of 1 and 2. A concordance analysis was applied to quantify the association between the frequencies of patients categorized into GOLD groups A–D using symptom cutoff points. A κ-coefficient was calculated. Results: For an mMRC score of 2, a CAT score of 15 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.70 and 0.66, respectively (area under the receiver operating characteristic curve [AUC] 0.74; 95% confidence interval [CI], 0.70–0.77). For an mMRC score of 1, a CAT score of 10 showed the maximum value of Youden’s index with a sensitivity and specificity of 0.77 and 0.65, respectively (AUC 0.77; 95% CI, 0.72–0.83). The κ value for concordance was highest between an mMRC score of 1 and a CAT score of 10 (0.66), followed by an mMRC score of 2 and a CAT score of 15 (0.56), an mMRC score of 2 and a CAT score of 10 (0.47), and an mMRC score of 1 and a CAT score of 15 (0.43). Conclusion: A CAT score of 10 was most concordant with an mMRC score of 1 when classifying patients with COPD into GOLD groups A–D. However, a discrepancy remains between the CAT and mMRC scoring systems. PMID:26316736
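Youden's index, used above to select the CAT cutoffs, is defined as sensitivity + specificity − 1. A minimal sketch that recomputes it from the sensitivity and specificity pairs quoted in the abstract (the dictionary layout and function name are illustrative assumptions; the abstract does not report the index values themselves):

```python
def youden_index(sensitivity, specificity):
    """Youden's J statistic: J = sensitivity + specificity - 1."""
    return sensitivity + specificity - 1.0


# (sensitivity, specificity) pairs as quoted in the abstract.
reported = {
    "mMRC 2 vs CAT 15": (0.70, 0.66),
    "mMRC 1 vs CAT 10": (0.77, 0.65),
}

for label, (sens, spec) in reported.items():
    print(label, round(youden_index(sens, spec), 2))
```

In a full analysis the index would be evaluated at every candidate cutoff along the ROC curve and the maximum taken; only the two reported operating points are available here.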
An Investigation into the Relationships Between Cloze Test Scores and Informal Reading Inventory Scores of Fifth Grade Pupils.

ERIC Educational Resources Information Center

Walter, Richard Barry

This study investigated the relationship between instructional level scores as determined by a cloze test and instructional level scores as determined by an informal reading inventory (IRI). Fifty male and 50 female subjects were randomly selected from the total fifth grade population of five schools chosen from a total of 22 midwestern elementary…
Accountancy, teaching methods, sex, and American College Test scores.

PubMed

Heritage, J; Harper, B S; Harper, J P

1990-10-01

This study examines the significance of sex, methodology, academic preparation, and age as related to development of judgmental and problem-solving skills. Sex, American College Test (ACT) Mathematics scores, Composite ACT scores, grades in course work, grade point average (GPA), and age were used in studying the effects of teaching method on 96 students' ability to analyze data in financial statements. Results reflect positively on accounting students compared to the general college population and the women students in particular.
Factor structure and invariance test of the alcohol use disorder identification test (AUDIT): Comparison and further validation in a U.S. and Philippines college student sample.

PubMed

Tuliao, Antover P; Landoy, Bernice Vania N; McChargue, Dennis E

2016-01-01

The Alcohol Use Disorder Identification Test's factor structure varies depending on population and culture. Because of this inconsistency, this article examined the factor structure of the test and conducted a factorial invariance test between a U.S. and a Philippines college sample. Confirmatory factor analyses indicated that a three-factor solution outperforms the one- and two-factor solution in both samples. Factorial invariance analyses further supports the confirmatory findings by showing that factor loadings were generally invariant across groups; however, item intercepts show non-invariance. Country differences between factors show that Filipino consumption factor mean scores were significantly lower than their U.S. counterparts.
Reduce, Reuse, Recycle: The Longitudinal Value of Local Cut Scores Using State Test Data

ERIC Educational Resources Information Center

Nelson, Peter M.; Van Norman, Ethan R.; VanDerHeyden, Amanda

2017-01-01

We used existing reading (n = 1,498) and math (n = 2,260) data to evaluate state test scores for screening middle school students. In Phase 1, state test data were used to create a research-derived cut score that was optimal for predicting state test performance the following year. In Phase 2, those cut scores were applied with future cohorts.…
Online pre-race education improves test scores for volunteers at a marathon.

PubMed

Maxwell, Shane; Renier, Colleen; Sikka, Robby; Widstrom, Luke; Paulson, William; Christensen, Trent; Olson, David; Nelson, Benjamin

2017-09-01

This study examined whether an online course would lead to increased knowledge about the medical issues volunteers encounter during a marathon. Health care professionals who volunteered to provide medical coverage for an annual marathon were eligible for the study. Demographic information about medical volunteers including profession, specialty, education level and number of marathons they had volunteered for was collected. A 15-question test about the most commonly encountered medical issues was created by the authors and administered before and after the volunteers took the online educational course and compared to a pilot study the previous year. Seventy-four subjects completed the pre-test. Those who participated in the pilot study last year (N = 15) had pre-test scores that were an average of 2.4 points higher than those who did not (mean ranks: pilot study = 51.6 vs. non-pilot = 33.9, p = 0.004). Of the 74 subjects who completed the pre-test, 54 also completed the post-test. The overall post-pre mean score difference was 3.8 ± 2.7 (t = 10.5 df = 53 p < 0.001). While subjects with all levels of volunteer experience demonstrated improvement, only change among first time marathon volunteers was significantly different from the others. Subjects reporting all degree/certification levels demonstrated improvement, but no difference in improvement was found between degree/certification levels. In this follow-up to the previous year's pilot study, online education demonstrated a long-term (one-year) increase in test scores. Testing also continued to show short-term improvement in post-course test scores, compared to pre-course test scores. In general, marathon medical volunteers who had no volunteer experience demonstrated greater improvement than those who had prior volunteer experience.
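The reported paired comparison can be roughly checked from the summary statistics. The sketch below assumes that "3.8 ± 2.7" denotes the mean and standard deviation of the post-minus-pre differences for the 54 subjects who completed both tests; the abstract does not state this explicitly, and the function name is hypothetical.

```python
import math


def paired_t(mean_diff, sd_diff, n):
    """Paired t statistic from summary data: t = mean / (sd / sqrt(n))."""
    standard_error = sd_diff / math.sqrt(n)
    return mean_diff / standard_error, n - 1


# Assumption: 2.7 is the SD of the paired differences, n = 54 pairs.
t, df = paired_t(3.8, 2.7, 54)
print(f"t = {t:.2f}, df = {df}")
```

Under that assumption the statistic comes out near 10.3 with 53 degrees of freedom, close to the reported t = 10.5; the small gap is plausibly due to rounding of the published mean and standard deviation.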
Examining the Validity of GED® Tests Scores with Scheduling and Setting Accommodations. GED Testing Service Research Studies, 2004-1

ERIC Educational Resources Information Center

George-Ezzelle, Carol E.; Skaggs, Gary

2004-01-01

Current testing standards call for test developers to provide evidence that testing procedures and test scores, and the inferences made based on the test scores, show evidence of validity and are comparable across subpopulations (American Educational Research Association [AERA], American Psychological Association [APA], & National Council on…
Early Identification of Children at Risk for Academic Difficulties Using Standardized Assessment: Stability and Predictive Validity of Preschool Math and Language Scores

ERIC Educational Resources Information Center

Frans, Niek; Post, Wendy J.; Huisman, Mark; Oenema-Mostert, Ineke C. E.; Keegstra, Anne L.; Minnaert, Alexander E. M. G.

2017-01-01

Despite the claim by several researchers that variability in performance may complicate the identification of "at-risk" children, variability in the academic performance of young children remains an undervalued area of research. The goal of this study is to examine the predictive validity for future scores and the score stability of two…
Predictors of Olfactory Dysfunction in Rhinosinusitis Using the Brief Smell Identification Test

PubMed Central

Alt, Jeremiah A.; Mace, Jess C.; Buniel, Maria C. F.; Soler, Zachary M.; Smith, Timothy L.

2014-01-01

Objective: Associations of olfactory function with quality of life (QOL) and disease severity in patients with rhinosinusitis are poorly understood. We sought to evaluate and compare olfactory function between subgroups of patients with rhinosinusitis using the Brief Smell Identification Test (BSIT). Study Design: Cross-sectional evaluation of a multi-center cohort. Methods: Patients with recurrent acute sinusitis (RARS) and chronic rhinosinusitis (CRS) with and without nasal polyposis were prospectively enrolled from three academic tertiary care sites. Each subject completed the BSIT, in addition to measures of disease-specific QOL. Patient demographics, comorbidities, and clinical measures of disease severity were compared between patients with normal (BSIT; ≥9) and abnormal (BSIT; <9) olfaction scores. Regression modeling was used to identify potential risk factors associated with olfactory impairment. Results: Patients with rhinosinusitis (n=445) were found to suffer olfactory dysfunction as measured by the BSIT (28.3%). Subgroups of rhinosinusitis differed in the degree of olfactory dysfunction reported. Worse disease severity, measured by computed tomography and nasal endoscopy, correlated with worse olfaction. Olfactory scores did not consistently correlate with Rhinosinusitis Disability Index or Sinonasal Outcome Test scores. Regression models demonstrated nasal polyposis was the strongest predictor of olfactory dysfunction. Recalcitrant disease and aspirin intolerance were strongly predictive of worse olfactory function. Conclusion: Olfactory dysfunction is a complex, multi-factorial process found to be differentially expressed within subgroups of rhinosinusitis. Olfaction was associated with disease severity as measured by imaging and endoscopy, with only weak associations to disease-specific QOL measures. PMID:24402746
Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

PubMed Central

Hayes, Taylor R.; Petrov, Alexander A.; Sederberg, Per B.

2014-01-01

Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training—now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven’s Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future. PMID:25395695
Do We Really Become Smarter When Our Fluid-Intelligence Test Scores Improve?

PubMed

Hayes, Taylor R; Petrov, Alexander A; Sederberg, Per B

2015-01-01

Recent reports of training-induced gains on fluid intelligence tests have fueled an explosion of interest in cognitive training, now a billion-dollar industry. The interpretation of these results is questionable because score gains can be dominated by factors that play marginal roles in the scores themselves, and because intelligence gain is not the only possible explanation for the observed control-adjusted far transfer across tasks. Here we present novel evidence that the test score gains used to measure the efficacy of cognitive training may reflect strategy refinement instead of intelligence gains. A novel scanpath analysis of eye movement data from 35 participants solving Raven's Advanced Progressive Matrices on two separate sessions indicated that one-third of the variance of score gains could be attributed to test-taking strategy alone, as revealed by characteristic changes in eye-fixation patterns. When the strategic contaminant was partialled out, the residual score gains were no longer significant. These results are compatible with established theories of skill acquisition suggesting that procedural knowledge tacitly acquired during training can later be utilized at posttest. Our novel method and result both underline a reason to be wary of purported intelligence gains, but also provide a way forward for testing for them in the future.
A weighted generalized score statistic for comparison of predictive values of diagnostic tests.

PubMed

Kosinski, Andrzej S

2013-03-15

Positive and negative predictive values are important measures of a medical diagnostic test performance. We consider testing equality of two positive or two negative predictive values within a paired design in which all patients receive two diagnostic tests. The existing statistical tests for testing equality of predictive values are either Wald tests based on the multinomial distribution or the empirical Wald and generalized score tests within the generalized estimating equations (GEE) framework. As presented in the literature, these test statistics have considerably complex formulas without clear intuitive insight. We propose their re-formulations that are mathematically equivalent but algebraically simple and intuitive. As is clearly seen with a new re-formulation we presented, the generalized score statistic does not always reduce to the commonly used score statistic in the independent samples case. To alleviate this, we introduce a weighted generalized score (WGS) test statistic that incorporates empirical covariance matrix with newly proposed weights. This statistic is simple to compute, always reduces to the score statistic in the independent samples situation, and preserves type I error better than the other statistics as demonstrated by simulations. Thus, we believe that the proposed WGS statistic is the preferred statistic for testing equality of two predictive values and for corresponding sample size computations. The new formulas of the Wald statistics may be useful for easy computation of confidence intervals for difference of predictive values. The introduced concepts have potential to lead to development of the WGS test statistic in a general GEE setting. Copyright © 2012 John Wiley & Sons, Ltd.
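For orientation, the two quantities being compared are simple ratios from a 2×2 diagnostic table. The sketch below computes them from hypothetical counts (not data from the paper) and does not implement the weighted generalized score statistic itself, whose formula is not given in the abstract.

```python
def predictive_values(tp, fp, fn, tn):
    """Return (PPV, NPV) from 2x2 counts.

    PPV = TP / (TP + FP): share of positive test results that are true.
    NPV = TN / (TN + FN): share of negative test results that are true.
    """
    ppv = tp / (tp + fp)
    npv = tn / (tn + fn)
    return ppv, npv


# Hypothetical counts chosen only for illustration.
ppv, npv = predictive_values(tp=80, fp=20, fn=10, tn=90)
print(f"PPV = {ppv:.2f}, NPV = {npv:.2f}")
```

In the paired design described above, each patient contributes a result from both diagnostic tests, so the two PPVs (or NPVs) are correlated; this is why a dedicated test statistic is needed rather than a comparison of two independent proportions.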
Validity of GRE General Test scores and TOEFL scores for graduate admission to a technical university in Western Europe

NASA Astrophysics Data System (ADS)

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the Master's programme grade point average (GGPA) with and without the addition of the undergraduate GPA (UGPA) and the TOEFL score, and of GRE scores for study completion and Master's thesis performance. GRE scores explained 20% of the variation in the GGPA, while additional 7% were explained by the TOEFL score and 3% by the UGPA. Contrary to common belief, the GRE quantitative reasoning score showed only little explanatory power. GRE scores were also weakly related to study progress but not to thesis performance. Nevertheless, GRE and TOEFL scores were found to be sensible admissions instruments. Rigorous methodology was used to obtain highly reliable results.
Proficiency Standards and Cut-Scores for Language Proficiency Tests.

ERIC Educational Resources Information Center

Moy, Raymond H.

1984-01-01

Discusses the problems associated with "grading on a curve," the approach often used for standard setting on language proficiency tests. Proposes four main steps presented in the setting of a non-arbitrary cut-score. These steps not only establish a proficiency standard checked by external criteria, but also check to see that the test covers the…
Effort Analysis: Individual Score Validation of Achievement Test Data

ERIC Educational Resources Information Center

Wise, Steven L.

2015-01-01

Whenever the purpose of measurement is to inform an inference about a student's achievement level, it is important that we be able to trust that the student's test score accurately reflects what that student knows and can do. Such trust requires the assumption that a student's test event is not unduly influenced by construct-irrelevant factors…
Subcritical flutter testing and system identification

NASA Technical Reports Server (NTRS)

Houbolt, J. C.

1974-01-01

Treatment is given of system response evaluation, especially in application to subcritical flight and wind tunnel flutter testing of aircraft. An evaluation is made of various existing techniques, in conjunction with a companion survey which reports theoretical and analog experiments made to study the identification of system response characteristics. Various input excitations are considered, and new techniques for analyzing response are explored, particularly in reference to the prevalent practical case where unwanted input noise is present, such as caused by gusts or wind tunnel turbulence. Further developments are also made of system parameter identification techniques.
Student Laptop Use and Scores on Standardized Tests

ERIC Educational Resources Information Center

Kposowa, Augustine J.; Valdez, Amanda D.

2013-01-01

Objectives: The primary objective of the study was to investigate the relationship between ubiquitous laptop use and academic achievement. It was hypothesized that students with ubiquitous laptops would score on average higher on standardized tests than those without such computers. Methods: Data were obtained from two sources. First, demographic…
Two rapid pigmentation tests for identification of Cryptococcus neoformans.

PubMed Central

Kaufmann, C S; Merz, W G

1982-01-01

Two tests were developed for the rapid identification of Cryptococcus neoformans based on pigment produced by the organism's phenoloxidase activity. Caffeic acid was incorporated into cornmeal agar, a medium used routinely for yeast identification. When tested on this medium, only C. neoformans isolates produced brown pigment. All other yeasts maintained their normal morphology and did not produce the reaction product. A non-medium-based test was developed for same-day identification of C. neoformans isolates. Paper strips saturated with a buffered L-beta-3,4-dihydroxyphenylalanine-ferric citrate solution were inoculated with isolates and incubated at 37 degrees C. Pigment production occurred only with C. neoformans isolates, many within 60 to 90 min. All other yeasts remained negative. PMID:7040452
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach

PubMed Central

Xu, Jian

2017-01-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers’ listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms. PMID:29312063
The Mediating Effect of Listening Metacognitive Awareness between Test-Taking Motivation and Listening Test Score: An Expectancy-Value Theory Approach.

PubMed

Xu, Jian

2017-01-01

The present study investigated test-taking motivation in L2 listening testing context by applying Expectancy-Value Theory as the framework. Specifically, this study was intended to examine the complex relationships among expectancy, importance, interest, listening anxiety, listening metacognitive awareness, and listening test score using data from a large-scale and high-stakes language test among Chinese first-year undergraduates. Structural equation modeling was used to examine the mediating effect of listening metacognitive awareness on the relationship between expectancy, importance, interest, listening anxiety, and listening test score. According to the results, test takers' listening scores can be predicted by expectancy, interest, and listening anxiety significantly. The relationship between expectancy, interest, listening anxiety, and listening test score was mediated by listening metacognitive awareness. The findings have implications for test takers to improve their test taking motivation and listening metacognitive awareness, as well as for L2 teachers to intervene in L2 listening classrooms.

The Dynamics of the Evolution of the Black-White Test Score Gap

ERIC Educational Resources Information Center

Sohn, Kitae

2012-01-01

We apply a quantile version of the Oaxaca-Blinder decomposition to estimate the counterfactual distribution of the test scores of Black students. In the Early Childhood Longitudinal Study, Kindergarten Class of 1998-1999 (ECLS-K), we find that the gap initially appears only at the top of the distribution of test scores. As children age, however,…
The Dental Hygiene Aptitude Tests and the American College Testing Program Tests as Predictors of Scores on the National Board Dental Hygiene Examination.

ERIC Educational Resources Information Center

Longenbecker, Sueann; Wood, Peter H.

1984-01-01

Scores from the National Board Dental Hygiene Examination (NBDHE) served as the criterion variable in a comparison of the predictive validity of the Dental Hygiene Aptitude Tests (DHAT) and the ACT Assessment tests. The DHAT-Science and Verbal tests combined to produce the highest multiple correlation with NBDHE scores. (Author/DWH)
Comparing Graphical and Verbal Representations of Measurement Error in Test Score Reports

ERIC Educational Resources Information Center

Zwick, Rebecca; Zapata-Rivera, Diego; Hegarty, Mary

2014-01-01

Research has shown that many educators do not understand the terminology or displays used in test score reports and that measurement error is a particularly challenging concept. We investigated graphical and verbal methods of representing measurement error associated with individual student scores. We created four alternative score reports, each…
Rank score and permutation testing alternatives for regression quantile estimates

USGS Publications Warehouse

Cade, B.S.; Richards, J.D.; Mielke, P.W.

2006-01-01

Performance of quantile rank score tests used for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1) was evaluated by simulation for models with p = 2 and 6 predictors, moderate collinearity among predictors, homogeneous and heterogeneous errors, small to moderate samples (n = 20–300), and central to upper quantiles (0.50–0.99). Test statistics evaluated were the conventional quantile rank score T statistic distributed as a χ² random variable with q degrees of freedom (where q parameters are constrained by H0) and an F statistic with its sampling distribution approximated by permutation. The permutation F-test maintained better Type I errors than the T-test for homogeneous error models with smaller n and more extreme quantiles τ. An F distributional approximation of the F statistic provided some improvements in Type I errors over the T-test for models with > 2 parameters, smaller n, and more extreme quantiles but not as much improvement as the permutation approximation. Both rank score tests required weighting to maintain correct Type I errors when heterogeneity under the alternative model increased to 5 standard deviations across the domain of X. A double permutation procedure was developed to provide valid Type I errors for the permutation F-test when null models were forced through the origin. Power was similar for conditions where both T- and F-tests maintained correct Type I errors but the F-test provided some power at smaller n and extreme quantiles when the T-test had no power because of excessively conservative Type I errors. When the double permutation scheme was required for the permutation F-test to maintain valid Type I errors, power was less than for the T-test with decreasing sample size and increasing quantiles. Confidence intervals on parameters and tolerance intervals for future predictions were constructed based on test inversion for an example application
Test Plan for Cask Identification Detector

DOE Office of Scientific and Technical Information (OSTI.GOV)

Rauch, Eric Benton

2016-09-29

This document serves to outline the testing of a Used Fuel Cask Identification Detector (CID) currently being designed under the DOE-NE MPACT Campaign. A bench-scale prototype detector will be constructed and tested using surrogate neutron sources. The testing will serve to inform the design of the full detector that is to be used as a way of fingerprinting used fuel storage casks based on the neutron signature produced by the used fuel inside the cask.
Contributions of Hamstring Stiffness to Straight-Leg-Raise and Sit-and-Reach Test Scores.

PubMed

Miyamoto, Naokazu; Hirata, Kosuke; Kimura, Noriko; Miyamoto-Mikami, Eri

2018-02-01

The passive straight-leg-raise (PSLR) and the sit-and-reach (SR) tests have been widely used to assess hamstring extensibility. However, it remains unclear to what extent hamstring stiffness (a measure of material properties) contributes to PSLR and SR test scores. Therefore, we aimed to clarify the relationship between hamstring stiffness and PSLR and SR scores using ultrasound shear wave elastography. Ninety-eight healthy subjects completed the study. Each subject completed PSLR testing, and classic and modified SR testing of the right leg. Muscle shear modulus of the biceps femoris, semitendinosus, and semimembranosus was quantified as an index of muscle stiffness. The relationships between shear modulus of each muscle and PSLR or SR scores were calculated using Pearson's product-moment correlation coefficients. Shear modulus of the semitendinosus and semimembranosus showed negative correlations with the two PSLR and two SR scores (absolute r value≤0.484). Shear modulus of the biceps femoris was significantly correlated with the PSLR score determined by the examiner and the modified SR score (absolute r value≤0.308). The present findings suggest that PSLR and SR test scores are strongly influenced by factors other than hamstring stiffness and therefore might not accurately evaluate hamstring stiffness. © Georg Thieme Verlag KG Stuttgart · New York.
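The abstract above reports Pearson's product-moment correlation coefficients between muscle shear modulus and test scores. As a minimal sketch of how such a coefficient is computed, the following uses made-up illustrative values; the variable names and numbers are assumptions, not data from the study.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Deviations from the mean for each variable
    dx = [a - mean_x for a in x]
    dy = [b - mean_y for b in y]
    cov = sum(a * b for a, b in zip(dx, dy))
    var_x = sum(a * a for a in dx)
    var_y = sum(b * b for b in dy)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical values for illustration only (not study data)
shear_modulus_kpa = [10.0, 12.0, 14.0, 16.0, 18.0]
pslr_angle_deg = [90.0, 85.0, 86.0, 78.0, 75.0]
r = pearson_r(shear_modulus_kpa, pslr_angle_deg)
print(f"r = {r:.3f}")  # negative: stiffer muscle, smaller angle in this toy data
```

A negative r here only reflects how the toy data were chosen; it says nothing about the strength of the relationship reported in the study.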
Manual for Scoring the Test of Directed Imagination.

ERIC Educational Resources Information Center

Veldman, Donald J.; And Others

A scoring manual for the Directed Imagination Test, a projective technique wherein the subject is instructed to write four fictional stories (four minutes are allowed for each) about teachers and their experiences, is presented. The manual provides detailed instructions for rating each story by fifteen dimensions relevant to teacher education…
AP Trends: Tests Soar, Scores Slip--Gaps between Groups Spur Equity Concerns

ERIC Educational Resources Information Center

Cech, Scott J.

2008-01-01

More students are taking Advanced Placement tests, but the proportion of tests receiving what is deemed a passing score has dipped, and the mean score is down for the fourth year in a row. Data released here this week by the New York City-based nonprofit organization that owns the AP brand shows that a greater-than-ever proportion of students…
Generalized likelihood ratios for quantitative diagnostic test scores.

PubMed

Tandberg, D; Deely, J J; O'Malley, A J

1997-11-01

The reduction of quantitative diagnostic test scores to the dichotomous case is a wasteful and unnecessary simplification in the era of high-speed computing. Physicians could make better use of the information embedded in quantitative test results if modern generalized curve estimation techniques were applied to the likelihood functions of Bayes' theorem. Hand calculations could be completely avoided and computed graphical summaries provided instead. Graphs showing posttest probability of disease as a function of pretest probability with confidence intervals (POD plots) would enhance acceptance of these techniques if they were immediately available at the computer terminal when test results were retrieved. Such constructs would also provide immediate feedback to physicians when a valueless test had been ordered.
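The abstract above describes graphs of posttest probability as a function of pretest probability. A minimal sketch of the underlying calculation is the odds form of Bayes' theorem, in which the posttest odds equal the pretest odds multiplied by the likelihood ratio. The function name and the example likelihood ratio are assumptions for illustration; estimating the likelihood ratio from a quantitative score (the curve estimation the authors discuss) and the confidence intervals are outside this sketch.

```python
def posttest_probability(pretest_probability, likelihood_ratio):
    """Convert a pretest probability to a posttest probability via odds."""
    if not 0.0 < pretest_probability < 1.0:
        raise ValueError("pretest probability must lie strictly between 0 and 1")
    if likelihood_ratio < 0.0:
        raise ValueError("likelihood ratio must be non-negative")
    pretest_odds = pretest_probability / (1.0 - pretest_probability)
    posttest_odds = pretest_odds * likelihood_ratio
    return posttest_odds / (1.0 + posttest_odds)


# Hypothetical likelihood ratio for one observed test score
example_lr = 4.0

# Tabulate posttest probability over a grid of pretest probabilities
curve = [(p / 10, posttest_probability(p / 10, example_lr)) for p in range(1, 10)]
for pretest, posttest in curve:
    print(f"pretest {pretest:.1f} -> posttest {posttest:.3f}")
```

A likelihood ratio of 1 leaves the probability unchanged, which corresponds to the "valueless test" case mentioned in the abstract.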
Validity of GRE General Test Scores and TOEFL Scores for Graduate Admission to a Technical University in Western Europe

ERIC Educational Resources Information Center

Zimmermann, Judith; von Davier, Alina A.; Buhmann, Joachim M.; Heinimann, Hans R.

2018-01-01

Graduate admission has become a critical process in tertiary education, whereby selecting valid admissions instruments is key. This study assessed the validity of Graduate Record Examination (GRE) General Test scores for admission to Master's programmes at a technical university in Europe. We investigated the indicative value of GRE scores for the…
The Formalization of Fairness: Issues in Testing for Measurement Invariance Using Subtest Scores

ERIC Educational Resources Information Center

Molenaar, Dylan; Borsboom, Denny

2013-01-01

Measurement invariance is an important prerequisite for the adequate comparison of group differences in test scores. In psychology, measurement invariance is typically investigated by means of linear factor analyses of subtest scores. These subtest scores typically result from summing the item scores. In this paper, we discuss 4 possible problems…
Estimating Achievement Gaps from Test Scores Reported in Ordinal "Proficiency" Categories

ERIC Educational Resources Information Center

Ho, Andrew D.; Reardon, Sean F.

2012-01-01

Test scores are commonly reported in a small number of ordered categories. Examples of such reporting include state accountability testing, Advanced Placement tests, and English proficiency tests. This paper introduces and evaluates methods for estimating achievement gaps on a familiar standard-deviation-unit metric using data from these ordered…
A Seven-Year Follow-Up of Intelligence Test Scores of Foster Grandparents

ERIC Educational Resources Information Center

Troll, Lillian E.; And Others

1976-01-01

After seven years, a group (N=32) of originally nonemployed poverty-level older people (over 60) now employed as foster grandparents were retested with the WAIS. Three subtest scores showed stability and Digit Span showed a statistically significant drop. Neither age nor initial level of health or WAIS scores was related to test-score changes over…
Explaining the black-white gap in cognitive test scores: Toward a theory of adverse impact.

PubMed

Cottrell, Jonathan M; Newman, Daniel A; Roisman, Glenn I

2015-11-01

In understanding the causes of adverse impact, a key parameter is the Black-White difference in cognitive test scores. To advance theory on why Black-White cognitive ability/knowledge test score gaps exist, and on how these gaps develop over time, the current article proposes an inductive explanatory model derived from past empirical findings. According to this theoretical model, Black-White group mean differences in cognitive test scores arise from the following racially disparate conditions: family income, maternal education, maternal verbal ability/knowledge, learning materials in the home, parenting factors (maternal sensitivity, maternal warmth and acceptance, and safe physical environment), child birth order, and child birth weight. Results from a 5-wave longitudinal growth model estimated on children in the NICHD Study of Early Child Care and Youth Development from ages 4 through 15 years show significant Black-White cognitive test score gaps throughout early development that did not grow significantly over time (i.e., significant intercept differences, but not slope differences). Importantly, the racially disparate conditions listed above can account for the relation between race and cognitive test scores. We propose a parsimonious 3-Step Model that explains how cognitive test score gaps arise, in which race relates to maternal disadvantage, which in turn relates to parenting factors, which in turn relate to cognitive test scores. This model and results offer to fill a need for theory on the etiology of the Black-White ethnic group gap in cognitive test scores, and attempt to address a missing link in the theory of adverse impact. (c) 2015 APA, all rights reserved.
The rat whole embryo culture assay using the Dysmorphology Score system.

PubMed

Zhang, Cindy; Panzica-Kelly, Julie; Augustine-Rauch, Karen

2013-01-01

The rat whole embryo culture (WEC) system has been used extensively for characterizing teratogenic properties of test chemicals. In this chapter, we describe the methodology for culturing rat embryos as well as a new morphological score system, the Dysmorphology Score (DMS) system for assessing morphology of mid gestation (gestational day 11) rat embryos. In contrast to the developmental stage focused scoring associated with the Brown and Fabro score system, this new score system assesses the respective degree of severity of dysmorphology, which delineates normal from abnormal morphology of specific embryonic structures and organ systems. This score system generates an approach that allows rapid identification and quantification of adverse developmental findings, making it conducive for characterization of compounds for teratogenic properties and screening activities.
RAId_DbS: Peptide Identification using Database Searches with Realistic Statistics

PubMed Central

Alves, Gelio; Ogurtsov, Aleksey Y; Yu, Yi-Kuo

2007-01-01

Background: The key to mass-spectrometry-based proteomics is peptide identification. A major challenge in peptide identification is to obtain realistic E-values when assigning statistical significance to candidate peptides. Results: Using a simple scoring scheme, we propose a database search method with theoretically characterized statistics. Taking into account possible skewness in the random variable distribution and the effect of finite sampling, we provide a theoretical derivation for the tail of the score distribution. For every experimental spectrum examined, we collect the scores of peptides in the database, and find good agreement between the collected score statistics and our theoretical distribution. Using Student's t-tests, we quantify the degree of agreement between the theoretical distribution and the score statistics collected. The t-tests may be used to measure the reliability of reported statistics. When combined with the reported P-value for a peptide hit using a score distribution model, this new measure prevents exaggerated statistics. Another feature of RAId_DbS is its capability of detecting multiple co-eluted peptides. The peptide identification performance and statistical accuracy of RAId_DbS are assessed and compared with several other search tools. The executables and data related to RAId_DbS are freely available upon request. PMID:17961253
Simple exercise test score versus cardiac stress test for the prediction of coronary artery disease in patients with type 2 diabetes.

PubMed

Pikto-Pietkiewicz, Witold; Przewłocka, Monika; Chybowska, Barbara; Cyciwa, Alona; Pasierski, Tomasz

2014-01-01

Type 2 diabetes markedly increases the risk of coronary heart disease (CHD), and screening for CHD is suggested by the guidelines. The aim of the study was to compare the diagnostic usefulness of the simple exercise test score, incorporating the clinical data and cardiac stress test results, with the standard stress test in patients with type 2 diabetes. A total of 62 consecutive patients (aged 65.4 ± 8.5 years; 32 men) with type 2 diabetes and clinical symptoms suggesting CHD underwent a stress test followed by coronary angiography. The simple score was calculated for all patients. Significant coronary stenosis was observed in 41 patients (66.1%). Stress test results were positive in 36 patients (58.1%). The mean simple score was high (65.5 ± 14.3 points). A positive linear relationship was observed between the score and the prevalence of CHD (R² = 0.19; P < 0.001) as well as its severity (R² = 0.23; P < 0.001). The area under the receiver-operating characteristic curve for the simple score was 0.74 (95% confidence interval [CI], 0.62-0.86). At the original cut-off value of 60 points, the score had a similar prognostic value to that of the standard stress test. However, in a multivariate analysis, only the simple score (odds ratio [OR], 1.46; 95% CI, 1.11-1.94; P < 0.01 for an increase in the score by 1 point) and male sex (OR, 1.57; 95% CI, 1.24-1.98; P < 0.001) remained independent predictors of CHD. In patients with type 2 diabetes, the simple score correlated with the prevalence and severity of CHD. However, the cut-off value of 60 points was inadequate in the population of diabetic patients with high risk of CHD. The simple score used instead of or together with the stress test was a better predictor of CHD than the stress test alone.
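The abstract above summarizes the score's discrimination as an area under the receiver-operating characteristic curve. One way to read that number: it equals the probability that a randomly chosen patient with the disease scores higher than a randomly chosen patient without it, with ties counted as one half. The sketch below computes it by direct pairwise comparison; the function name and the scores are hypothetical and are not taken from the study.

```python
def roc_auc(scores_positive, scores_negative):
    """Area under the ROC curve via pairwise comparison (ties count 0.5)."""
    if not scores_positive or not scores_negative:
        raise ValueError("both groups must contain at least one score")
    wins = 0.0
    for pos in scores_positive:
        for neg in scores_negative:
            if pos > neg:
                wins += 1.0
            elif pos == neg:
                wins += 0.5
    return wins / (len(scores_positive) * len(scores_negative))


# Hypothetical simple-score values for illustration only
with_stenosis = [70, 65, 80, 55]
without_stenosis = [50, 60, 66]
auc = roc_auc(with_stenosis, without_stenosis)
print(f"AUC = {auc:.2f}")
```

The pairwise form is quadratic in sample size, which is acceptable for small cohorts; rank-based formulas give the same value more efficiently.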
A Maturing Global Testing Regime Meets the World Economy: Test Scores and Economic Growth, 1960-2012

ERIC Educational Resources Information Center

Kamens, David H.

2015-01-01

This article considers the growth of the international testing regime. It discusses sources of growth and empirically examines two related sets of issues: (1) the stability of countries' achievement scores, and (2) the influence of those national scores on subsequent economic development over different time lags. The article suggests that…
Music identification skills of children with specific language impairment.

PubMed

Mari, Giorgia; Scorpecci, Alessandro; Reali, Laura; D'Alatri, Lucia

2016-03-01

To date very few studies have investigated the musical skills of children with specific language impairment (SLI). There is growing evidence that SLI affects areas other than language, and it is therefore reasonable to hypothesize that children with this disorder may have difficulties in perceiving musical stimuli appropriately. To compare melody and song identification skills in a group of children with SLI and in a control group of children with typical language development (TD); and to study possible correlations between music identification skills and language abilities in the SLI group. This is a prospective case control study. Two groups of children were enrolled: one meeting DSM-IV-TR(®) diagnostic criteria for SLI and the other comprising an age-matched group of children with TD. All children received a melody and a song identification test, together with a test battery assessing receptive and productive language abilities. 30 children with SLI (mean age = 56 ± 9 months) and 23 with TD (mean age = 60 ± 10 months) were included. Melody and song identification scores among SLI children were significantly lower than those of TD children, and in both groups song identification scores were significantly higher than melody identification scores. Song identification skills bore a significant correlation to chronological age in both groups (TD: r = 0.529, p = 0.009; SLI: r = 0.506, p = 0.004). Whereas no other variables were found explaining the variability of melody or song identification scores in either group, the correlation between language comprehension and song identification in the SLI group approached significance (r = 0.166, p = 0.076). The poorer music perception skills of SLI children as compared with TD ones suggests that SLI may also affect music perception. Therefore, training programmes that simultaneously stimulate via language and music may prove useful in the rehabilitation of children affected by SLI. © 2015 Royal College of Speech and
Assessment Test Scores of Incoming Students, Fall 2001.

ERIC Educational Resources Information Center

Negron, Maggie; Breindel, Matthew

This assessment of placement test scores in reading, math, and sentence skills from incoming students at College of the Desert (California) shows that students are overwhelmingly underprepared for study at the college. Only 15% of students were prepared in sentence skills, 27% in reading skills, 7% in math skills; only 3% were prepared in all 3…

Test Score Stability and Construct Validity of the Adult Manifest Anxiety Scale-College Version Scores among College Students: A Brief Report

ERIC Educational Resources Information Center

Lowe, Patricia A.; Papanastasiou, Elena C.; DeRuyck, Kimberly A.; Reynolds, Cecil R.

2005-01-01

In this study, the authors investigated the temporal stability and construct validity of the Adult Manifest Anxiety Scale-College Version (AMAS-C; C. R. Reynolds, B. O. Richmond, & P. A. Lowe, 2003b) scores. Results indicated that the AMAS-C scores had adequate to excellent test score stability, and evidence supported the construct validity of the…
The Validity of IQ Scores Derived from Readiness Screening Tests

ERIC Educational Resources Information Center

Telegdy, Gabriel A.

1976-01-01

The Screening Test of Academic Readiness (STAR) and the Peabody Picture Vocabulary Test (PPVT) were administered to 52 kindergarten children to reveal the convergent validity of IQ scores derived from the STAR. The findings raise doubts about the validity of the deviation IQs derived from the STAR. (Author)
Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

ERIC Educational Resources Information Center

Kolen, Michael J.; Lee, Won-Chan

2011-01-01

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
The Comparison of Accuracy Scores on the Paper and Pencil Testing vs. Computer-Based Testing

ERIC Educational Resources Information Center

Retnawati, Heri

2015-01-01

This study aimed to compare the accuracy of the test scores as results of Test of English Proficiency (TOEP) based on paper and pencil test (PPT) versus computer-based test (CBT). Using the participants' responses to the PPT documented from 2008-2010 and data of CBT TOEP documented in 2013-2014 on the sets of 1A, 2A, and 3A for the Listening and…
Pain scores for intravenous cannulation and arterial blood gas test among emergency department patients.

PubMed

Ballesteros-Peña, Sendoa; Vallejo-De la Hoz, Gorka; Fernández-Aedo, Irrintzi

2017-12-23

To analyse vein catheterisation and blood gas test-related pain among adult patients in the emergency department and to explore pain score-related factors. An observational and multicentre research study was performed. Patients undergoing vein catheterisation or arterial puncture for gas test were included consecutively. After each procedure, patients scored the pain experienced using the NRS-11. 780 vein catheterisations and 101 blood gas tests were analysed. Venipuncture was scored with an average score of 2.8 (95% CI: 2.6-3), and arterial puncture with 3.6 (95% CI: 3.1-4). Iatrogenic pain scores were associated with moderate-to-high difficulty procedures (P<.001) and, in the gas test, with the choice of the humeral rather than the radial artery (P=.02); in venipunctures they were correlated with baseline pain (P<.001). Pain scores related to other variables such as sex, place of origin or needle gauge did not present statistically significant differences. Vein catheterisation and blood gas testing can be considered mildly to moderately painful and moderately painful procedures, respectively. The pain score is associated with certain variables such as the difficulty of the procedure, the anatomic area of the puncture or baseline pain. A better understanding of painful effects related to emergency nursing procedures and the factors associated with pain self-perception could help to determine when and how to act to mitigate this undesired effect. Copyright © 2017 Elsevier España, S.L.U. All rights reserved.
Normative data for the "Sniffin' Sticks" including tests of odor identification, odor discrimination, and olfactory thresholds: an upgrade based on a group of more than 3,000 subjects.

PubMed

Hummel, T; Kobal, G; Gudziol, H; Mackay-Sim, A

2007-03-01

"Sniffin' Sticks" is a test of nasal chemosensory function that is based on pen-like odor dispensing devices, introduced some 10 years ago by Kobal and co-workers. It consists of tests for odor threshold, discrimination, and identification. Previous work established its test-retest reliability and validity. Results of the test are presented as "TDI score", the sum of results obtained for threshold, discrimination, and identification measures. While normative data have been established, they are based on a relatively small number of subjects, especially with regard to subjects older than 55 years where data from only 30 healthy subjects have been used. The present study aimed to remedy this situation. Now data are available from 3,282 subjects as compared to data from 738 subjects published previously. Disregarding sex-related differences, the TDI score at the tenth percentile was 24.9 in subjects younger than 15 years, 30.3 for ages from 16 to 35 years, 27.3 for ages from 36 to 55 years, and 19.6 for subjects older than 55 years. Because the tenth percentile has been defined to separate hyposmia from normosmia, these data can be used as a guide to estimate individual olfactory ability in relation to subject's age. Absolute hyposmia was defined as the tenth percentile score of 16-35 year old subjects. Unlike previous reports, the present norms are also sex-differentiated, with women outperforming men in the three olfactory tests. Further, the present data suggest specific changes of individual olfactory functions in relation to age, with odor thresholds declining most dramatically compared to odor discrimination and odor identification.
Effects of Classroom Ventilation Rate and Temperature on Students' Test Scores.

PubMed

Haverinen-Shaughnessy, Ulla; Shaughnessy, Richard J

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students' mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) per each liter per second per person increase in ventilation rate within the range of 0.9-7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12-13 points per each 1°C decrease in temperature within the observed range of 20-25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students.
Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

ERIC Educational Resources Information Center

Lee, Guemin; Lee, Won-Chan

2016-01-01

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…
Optimal Scoring Methods of Hand-Strength Tests in Patients with Stroke

ERIC Educational Resources Information Center

Huang, Sheau-Ling; Hsieh, Ching-Lin; Lin, Jau-Hong; Chen, Hui-Mei

2011-01-01

The purpose of this study was to determine the optimal scoring methods for measuring strength of the more-affected hand in patients with stroke by examining the effect of reducing measurement errors. Three hand-strength tests of grip, palmar pinch, and lateral pinch were administered at two sessions in 56 patients with stroke. Five scoring methods…
Score Reporting in Teacher Certification Testing: A Review, Design, and Interview/Focus Group Study

ERIC Educational Resources Information Center

Klesch, Heather S.

2010-01-01

The reporting of scores on educational tests is at times misunderstood, misinterpreted, and potentially confusing to examinees and other stakeholders who may need to interpret test scores. In reporting test results to examinees, there is a need for clarity in the message communicated. As pressure rises for students to demonstrate performance at a…
Identifying Aboriginal-specific AUDIT-C and AUDIT-3 cutoff scores for at-risk, high-risk, and likely dependent drinkers using measures of agreement with the 10-item Alcohol Use Disorders Identification Test

PubMed Central

2014-01-01

Background The Alcohol Use Disorders Identification Test (AUDIT) is a 10-item alcohol screener that has been recommended for use in Aboriginal primary health care settings. The time it takes respondents to complete AUDIT, however, has proven to be a barrier to its routine delivery. Two shorter versions, AUDIT-C and AUDIT-3, have been used as screening instruments in primary health care. This paper aims to identify the AUDIT-C and AUDIT-3 cutoff scores that most closely identify individuals classified as being at-risk drinkers, high-risk drinkers, or likely alcohol dependent by the 10-item AUDIT. Methods Two cross-sectional surveys were conducted from June 2009 to May 2010 and from July 2010 to June 2011. Aboriginal Australian participants (N = 156) were recruited through an Aboriginal Community Controlled Health Service, and a community-based drug and alcohol treatment agency in rural New South Wales (NSW), and through community-based Aboriginal groups in Sydney NSW. Sensitivity, specificity, and positive and negative predictive values of each score on the AUDIT-C and AUDIT-3 were calculated, relative to cutoff scores on the 10-item AUDIT for at-risk, high-risk, and likely dependent drinkers. Receiver operating characteristic (ROC) curve analyses were conducted to measure the detection characteristics of AUDIT-C and AUDIT-3 for the three categories of risk. Results The areas under the receiver operating characteristic (AUROC) curves were high for drinkers classified as being at-risk, high-risk, and likely dependent. Conclusions Recommended cutoff scores for Aboriginal Australians are as follows: at-risk drinkers AUDIT-C ≥ 5, AUDIT-3 ≥ 1; high-risk drinkers AUDIT-C ≥ 6, AUDIT-3 ≥ 2; and likely dependent drinkers AUDIT-C ≥ 9, AUDIT-3 ≥ 3. Adequate sensitivity and specificity were achieved for recommended cutoff scores. AUROC curves were above 0.90. PMID:25179547
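The abstract above evaluates each candidate cutoff on a short screener against the classification from the full 10-item instrument, using sensitivity, specificity, and positive and negative predictive values. A minimal sketch of that calculation follows; the function name, the scores, and the reference labels are hypothetical and are not study data.

```python
def screening_metrics(short_scores, reference_positive, cutoff):
    """Compare 'short score >= cutoff' against a reference classification."""
    if len(short_scores) != len(reference_positive):
        raise ValueError("scores and reference labels must have equal length")
    tp = fp = tn = fn = 0
    for score, is_positive in zip(short_scores, reference_positive):
        flagged = score >= cutoff
        if flagged and is_positive:
            tp += 1
        elif flagged and not is_positive:
            fp += 1
        elif not flagged and is_positive:
            fn += 1
        else:
            tn += 1

    def ratio(numerator, denominator):
        # None marks a metric that is undefined for this sample
        return numerator / denominator if denominator else None

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
    }


# Hypothetical short-form scores and reference (full-instrument) classifications
short_scores = [2, 5, 6, 3, 7, 4, 9, 1]
reference_positive = [False, True, True, False, True, True, True, False]
metrics = screening_metrics(short_scores, reference_positive, cutoff=5)
print(metrics)
```

Repeating the call over a range of cutoffs and plotting sensitivity against one minus specificity would trace out the ROC curve referred to in the abstract.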
Clinical experience of scoring criteria for Familial Hypercholesterolaemia (FH) genetic testing in Wales.

PubMed

Haralambos, K; Whatley, S D; Edwards, R; Gingell, R; Townsend, D; Ashfield-Watt, P; Lansberg, P; Datta, D B N; McDowell, I F W

2015-05-01

Familial Hypercholesterolaemia (FH) is caused by mutations in genes of the Low Density Lipoprotein (LDL) receptor pathway. A definitive diagnosis of FH can be made by the demonstration of a pathogenic mutation. The Wales FH service has developed scoring criteria to guide selection of patients for DNA testing, for those referred to clinics with hypercholesterolaemia. The criteria are based on a modification of the Dutch Lipid Clinic scoring criteria and utilise a combination of lipid values, physical signs, personal and family history of premature cardiovascular disease. They are intended to provide clinical guidance and enable resources to be targeted in a cost effective manner. 623 patients who presented to lipid clinics across Wales had DNA testing following application of these criteria. The proportion of patients with a pathogenic mutation ranged from 4% in those scoring 5 or less up to 85% in those scoring 15 or more. LDL-cholesterol was the strongest discriminatory factor. Scores gained from physical signs, family history, coronary heart disease, and triglycerides also showed a gradient in mutation pick-up rate according to the score. These criteria provide a useful tool to guide selection of patients for DNA testing when applied by health professionals who have clinical experience of FH. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
Increasing Racial Isolation and Test Score Gaps in Mathematics: A 30-Year Perspective

ERIC Educational Resources Information Center

Berends, Mark; Penaloza, Roberto V.

2010-01-01

Background/Context: Although there has been progress in closing the test score gaps among student groups over past decades, that progress has stalled. Many researchers have speculated why the test score gaps closed between the early 1970s and the early 1990s, but only a few have been able to empirically study how changes in school factors and…
Peer Effects and the Indigenous/Non-Indigenous Early Test-Score Gap in Peru

ERIC Educational Resources Information Center

Sakellariou, Chris

2008-01-01

This paper assesses the magnitude of the non-indigenous/indigenous test-score gap for third-year and fourth-year primary school pupils in Peru, in relation to the main family, school and peer inputs contributing to the test-score gap using the estimation method of feasible generalized least squares. The article then decomposes the gap into its…
The Uses and Misuses of Test Scores: Technical Assistance Perspective.

ERIC Educational Resources Information Center

Echternacht, Gary

The uses and misuses of standardized test results used for program evaluation as seen by a staff member of an Elementary Secondary Education Act (ESEA) Title I Technical Assistance Center are described. In ESEA Title I, test scores are used to select students for the program. Although federal requirements do not require using standardized test…
Motivating High School Students to Score Proficient on State Tests

ERIC Educational Resources Information Center

Brown, Sarah Lee

2015-01-01

The researcher interviewed two groups of eleventh grade students, in a rural Appalachian setting, who tended to score low on the state mandated high stakes/low stakes test to discover their efforts on the test, specifically in reading, and to obtain their opinions concerning the effects of a specific incentive or consequence. Before the eleventh…
Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry for Combined Species Identification and Drug Sensitivity Testing in Mycobacteria.

PubMed

Ceyssens, Pieter-Jan; Soetaert, Karine; Timke, Markus; Van den Bossche, An; Sparbier, Katrin; De Cremer, Koen; Kostrzewa, Markus; Hendrickx, Marijke; Mathys, Vanessa

2017-02-01

Species identification and drug susceptibility testing (DST) of mycobacteria are important yet complex processes traditionally reserved for reference laboratories. Recent technical improvements in matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) have started to facilitate routine mycobacterial identifications in clinical laboratories. In this paper, we investigate the possibility of performing phenotypic MALDI-based DST in mycobacteriology using the recently described MALDI Biotyper antibiotic susceptibility test rapid assay (MBT-ASTRA). We randomly selected 72 clinical Mycobacterium tuberculosis and nontuberculous mycobacterial (NTM) strains, subjected them to MBT-ASTRA methodology, and compared its results to current gold-standard methods. Drug susceptibility was tested for rifampin, isoniazid, linezolid, and ethambutol (M. tuberculosis, n = 39), and clarithromycin and rifabutin (NTM, n = 33). Combined species identification was performed using the Biotyper Mycobacteria Library 4.0. Mycobacterium-specific MBT-ASTRA parameters were derived (calculation window, m/z 5,000 to 13,000, area under the curve [AUC] of >0.015, relative growth [RG] of <0.5; see the text for details). Using these settings, MBT-ASTRA analyses returned 175/177 M. tuberculosis and 65/66 NTM drug resistance profiles which corresponded to standard testing results. Turnaround times were not significantly different in M. tuberculosis testing, but the MBT-ASTRA method delivered results on average a week faster than routine DST in NTM. Database searches returned 90.4% correct species-level identifications, which increased to 98.6% when score thresholds were lowered to 1.65. In conclusion, the MBT-ASTRA technology holds promise to facilitate and speed up mycobacterial DST and to combine it directly with high-confidence species-level identifications. Given the ease of interpretation, its application in NTM typing might be the first in finding its way to current
A Comparison of Standardized Achievement Test Scores on Right and Left Brain Dominant Fourth-Grade Students.

ERIC Educational Resources Information Center

Bell, Michael L.; Roubinek, Darrell L.

1989-01-01

Compares fourth-graders' subtest scores on the Stanford Achievement Test (SAT), the Iowa Test of Basic Skills (ITBS), and the Metropolitan Achievement Test (MAT). Finds right-brain dominant students scored better on four SAT subtests, and left-brain dominant students scored better on four ITBS subtests and two MAT subtests. (NH)
Generation of GHS Scores from TEST and online sources ...

EPA Pesticide Factsheets

Alternatives assessment frameworks such as DfE (Design for the Environment) evaluate chemical alternatives in terms of human health effects, ecotoxicity, and fate. T.E.S.T. (Toxicity Estimation Software Tool) can be utilized to evaluate human health in terms of acute oral rat toxicity, developmental toxicity, endocrine activity, and mutagenicity. It can be used to evaluate ecotoxicity (in terms of acute fathead minnow toxicity) and fate (in terms of bioconcentration factor). It can also be used to estimate a variety of key physicochemical properties such as melting point, boiling point, vapor pressure, water solubility, and bioconcentration factor. A web-based version of T.E.S.T. is currently being developed to allow predictions to be made from other web tools. Online data sources such as from NCCT’s Chemistry Dashboard, REACH dossiers, or from ChemHat.org can also be utilized to obtain GHS (Global Harmonization System) scores for comparing alternatives. The purpose of this talk is to show how GHS (Global Harmonization Score) data can be obtained from literature sources and from T.E.S.T. (Toxicity Estimation Software Tool). This data will be used to compare chemical alternatives in the alternatives assessment dashboard (a 2018 CSS product).
High Test Scores: The Wrong Road to National Economic Success

ERIC Educational Resources Information Center

Baker, Keith

2011-01-01

A widely held view is that good schools are essential to a nation's international economic success and that high test scores on international tests of academic skills and knowledge indicate how good a nation's schools are. The widespread belief that good schools are an important contributor to a nation's economic success in the world is supported…

Optical Automatic Car Identification (OACI) Field Test Program

DOT National Transportation Integrated Search

1976-05-01

The results of the Optical Automatic Car Identification (OACI) tests at Chicago conducted from August 16 to September 4, 1975 are presented. The main purpose of this test was to determine the suitability of optics as a principle of operation for an a...
Psychometric properties of the Turkish versions of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in the prison setting.

PubMed

Evren, Cuneyt; Ogel, Kultegin; Evren, Bilge; Bozkurt, Muge

2014-01-01

The aim of this study was to evaluate psychometric properties of the Drug Use Disorders Identification Test (DUDIT) and the Drug Abuse Screening Test (DAST-10) in prisoners with (n = 124) or without (n = 78) drug use disorder. Participants were evaluated with the DUDIT, the DAST-10, and the Addiction Profile Index-Short (API-S). The DUDIT and the DAST-10 were found to be psychometrically sound drug abuse screening measures with high convergent validity when compared with each other (r = 0.86), and API-S (r = 0.88 and r = 0.84, respectively), and to have a Cronbach's α of 0.93 and 0.87, respectively. In addition, a single component accounted for 58.28% of total variance for DUDIT, whereas this was 47.10% for DAST-10. The DUDIT had sensitivity and specificity scores of 0.95 and 0.79, respectively, when using the optimal cut-off score of 10, whereas these scores were 0.88 and 0.74 for the DAST-10 when using the optimal cut-off score of 4. Additionally, both the DUDIT and the DAST-10 showed good discriminant validity as they differentiated prisoners with drug use disorder from those without. Findings support the Turkish versions of both the DUDIT and the DAST-10 as reliable and valid drug abuse screening instruments that measure unidimensional constructs.
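As a rough illustration of how sensitivity and specificity at a cut-off score are obtained, the sketch below uses made-up scores, not the study's data. The function name `sensitivity_specificity` and the rule that a score at or above the cut-off counts as a positive screen are assumptions made for this example.

```python
def sensitivity_specificity(scores, has_disorder, cutoff):
    """Return (sensitivity, specificity), treating score >= cutoff as a positive screen."""
    tp = fn = tn = fp = 0
    for score, disorder in zip(scores, has_disorder):
        positive = score >= cutoff
        if disorder and positive:
            tp += 1          # disorder present, screen positive
        elif disorder:
            fn += 1          # disorder present, screen negative
        elif positive:
            fp += 1          # disorder absent, screen positive
        else:
            tn += 1          # disorder absent, screen negative
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical screening scores and diagnoses (illustration only).
scores = [12, 15, 9, 22, 3, 0, 11, 4]
has_disorder = [True, True, True, True, False, False, False, False]

sens, spec = sensitivity_specificity(scores, has_disorder, cutoff=10)
print(sens, spec)
```

Raising the cut-off generally trades sensitivity for specificity, which is why an "optimal" cut-off is reported separately for each instrument.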
Commentary: Student Cognition, the Situated Learning Context, and Test Score Interpretation

ERIC Educational Resources Information Center

La Marca, Paul M.

2006-01-01

Although it is assumed that student cognition contributes to student performance on achievement tests, it may be that current testing models lack the degree of specification necessary to warrant such inferences. With test score interpretations as the referent, the authors in this special issue address the role of student cognition in learning and…
Relationships between spatial activities and scores on the mental rotation test as a function of sex.

PubMed

Ginn, Sheryl R; Pickens, Stefanie J

2005-06-01

Previous results suggested that female college students' scores on the Mental Rotations Test might be related to their prior experience with spatial tasks. For example, women who played video games scored better on the test than their non-game-playing peers, whereas playing video games was not related to men's scores. The present study examined whether participation in different types of spatial activities would be related to women's performance on the Mental Rotations Test. 31 men and 59 women enrolled at a small, private church-affiliated university and majoring in art or music as well as students who participated in intercollegiate athletics completed the Mental Rotations Test. Women's scores on the Mental Rotations Test benefitted from experience with spatial activities; the more types of experience the women had, the better their scores. Thus women who were athletes, musicians, or artists scored better than those women who had no experience with these activities. The opposite results were found for the men. Efforts are currently underway to assess how length of experience and which types of experience are related to scores.
The value of Bayes' theorem for interpreting abnormal test scores in cognitively healthy and clinical samples.

PubMed

Gavett, Brandon E

2015-03-01

The base rates of abnormal test scores in cognitively normal samples have been a focus of recent research. The goal of the current study is to illustrate how Bayes' theorem uses these base rates--along with the same base rates in cognitively impaired samples and prevalence rates of cognitive impairment--to yield probability values that are more useful for making judgments about the absence or presence of cognitive impairment. Correlation matrices, means, and standard deviations were obtained from the Wechsler Memory Scale--4th Edition (WMS-IV) Technical and Interpretive Manual and used in Monte Carlo simulations to estimate the base rates of abnormal test scores in the standardization and special groups (mixed clinical) samples. Bayes' theorem was applied to these estimates to identify probabilities of normal cognition based on the number of abnormal test scores observed. Abnormal scores were common in the standardization sample (65.4% scoring below a scaled score of 7 on at least one subtest) and more common in the mixed clinical sample (85.6% scoring below a scaled score of 7 on at least one subtest). Probabilities varied according to the number of abnormal test scores, base rates of normal cognition, and cutoff scores. The results suggest that interpretation of base rates obtained from cognitively healthy samples must also account for data from cognitively impaired samples. Bayes' theorem can help neuropsychologists answer questions about the probability that an individual examinee is cognitively healthy based on the number of abnormal test scores observed.
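The Bayesian step described above can be sketched with the two base rates reported in the abstract. This is a simplified illustration, not the study's procedure: it conditions only on "at least one abnormal subtest score" rather than on the number of abnormal scores, it treats the mixed clinical sample as the impaired group, and the 80% base rate of normal cognition is a hypothetical value chosen for the example.

```python
def posterior_normal(p_sign_given_normal, p_sign_given_impaired, base_rate_normal):
    """P(normal cognition | sign observed), by Bayes' theorem."""
    joint_normal = p_sign_given_normal * base_rate_normal
    joint_impaired = p_sign_given_impaired * (1.0 - base_rate_normal)
    return joint_normal / (joint_normal + joint_impaired)


# Reported rates of at least one subtest below a scaled score of 7.
P_ABNORMAL_NORMAL = 0.654    # standardization sample
P_ABNORMAL_CLINICAL = 0.856  # mixed clinical sample
BASE_RATE_NORMAL = 0.80      # hypothetical base rate, for illustration only

# Probability of normal cognition given at least one abnormal score.
p_given_abnormal = posterior_normal(
    P_ABNORMAL_NORMAL, P_ABNORMAL_CLINICAL, BASE_RATE_NORMAL
)
# Probability of normal cognition given no abnormal scores.
p_given_clean = posterior_normal(
    1 - P_ABNORMAL_NORMAL, 1 - P_ABNORMAL_CLINICAL, BASE_RATE_NORMAL
)
print(round(p_given_abnormal, 3), round(p_given_clean, 3))
```

Under these assumptions a single abnormal score lowers the probability of normal cognition only modestly (from 0.80 to about 0.75), because abnormal scores are common in both groups, while a profile with no abnormal scores raises it to about 0.91.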
The Influence of Foreign Language Learning during Early Childhood on Standardized Test Scores

ERIC Educational Resources Information Center

Shaw, Tommetta

2010-01-01

Increasing standardized test scores in reading and math is of high importance to the California Department of Education to meet requirements mandated by the No Child Left Behind (NCLB) act of 2001. More research is needed to understand the best ways to improve tests scores to meet concerns of the NCLB act. The purpose of the study was to evaluate…
More than Just Test Scores

ERIC Educational Resources Information Center

Levin, Henry M.

2012-01-01

Around the world we hear considerable talk about creating world-class schools. Usually the term refers to schools whose students get very high scores on the international comparisons of student achievement such as PISA or TIMSS. The practice of restricting the meaning of exemplary schools to the narrow criterion of achievement scores is usually…
Predictive effects of teachers and schools on test scores, college attendance, and earnings

PubMed Central

Chamberlain, Gary E.

2013-01-01

I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference. PMID:24101492
Predictive effects of teachers and schools on test scores, college attendance, and earnings.

PubMed

Chamberlain, Gary E

2013-10-22

I studied predictive effects of teachers and schools on test scores in fourth through eighth grade and outcomes later in life such as college attendance and earnings. For example, predict the fraction of a classroom attending college at age 20 given the test score for a different classroom in the same school with the same teacher and given the test score for a classroom in the same school with a different teacher. I would like to have predictive effects that condition on averages over many classrooms, with and without the same teacher. I set up a factor model that, under certain assumptions, makes this feasible. Administrative school district data in combination with tax data were used to calculate estimates and do inference.
Presumptive identification of streptococci with a new test system.

PubMed Central

Facklam, R R; Thacker, L G; Fox, B; Eriquez, L

1982-01-01

A test is described that could replace bacitracin susceptibility for presumptive identification of group A streptococci as well as 6.5% NaCl agar tolerance for presumptive identification of enterococcal streptococci. The L-pyrrolidonyl-beta-naphthylamide test, based on hydrolysis of pyrrolidonyl-beta-naphthylamide, was used in conjunction with the CAMP and bile-esculin tests to presumptively identify the streptococci. Among the beta-hemolytic streptococci, 98% of 50 group A, 98% of 46 group B, and 100% of 70 strains that were not group A, B, or D were correctly identified by the new presumptive test scheme. Among the non-beta-hemolytic streptococci, 96% of 74 group D enterococcal, 100% of 30 group D nonenterococcal, and 82% of 112 viridans strains were correctly identified by the new presumptive test scheme. PMID:7050157
Background Variables, Levels of Aggregation, and Standardized Test Scores

ERIC Educational Resources Information Center

Paulson, Sharon E.; Marchant, Gregory J.

2009-01-01

This article examines the role of student demographic characteristics in standardized achievement test scores at both the individual level and aggregated at the state, district, school levels. For several data sets, the majority of the variance among states, districts, and schools was related to demographic characteristics. Where these background…
What's in a Teacher Test? Assessing the Relationship between Teacher Test Scores and Student Secondary STEM Achievement. CEDR Working Paper. WP #2016-4

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the predictive validity of teacher credential test scores for student performance in secondary STEM classrooms in Washington state. After replicating earlier findings that teacher basic skills licensure test scores are a modest and statistically significant predictor of student math test score gains in elementary grades, we focus on…
Effects of Classroom Ventilation Rate and Temperature on Students’ Test Scores

PubMed Central

2015-01-01

Using a multilevel approach, we estimated the effects of classroom ventilation rate and temperature on academic achievement. The analysis is based on measurement data from a 70 elementary school district (140 fifth grade classrooms) from the Southwestern United States, and student level data (N = 3109) on socioeconomic variables and standardized test scores. There was a statistically significant association between ventilation rates and mathematics scores, and it was stronger when the six classrooms with high ventilation rates that were indicated as outliers were filtered (> 7.1 l/s per person). The association remained significant when prior year test scores were included in the model, resulting in less unexplained variability. Students’ mean mathematics scores (average 2286 points) were increased by up to eleven points (0.5%) for each liter per second per person increase in ventilation rate within the range of 0.9–7.1 l/s per person (estimated effect size 74 points). There was an additional increase of 12–13 points for each 1°C decrease in temperature within the observed range of 20–25°C (estimated effect size 67 points). Effects of similar magnitude but higher variability were observed for reading and science scores. In conclusion, maintaining adequate ventilation and thermal comfort in classrooms could significantly improve academic achievement of students. PMID:26317643
The effects of calculator-based laboratories on standardized test scores

NASA Astrophysics Data System (ADS)

Stevens, Charlotte Bethany Rains

Nationwide, the goal of providing a productive science and math education to our youth in today's educational institutions is centering itself around the technology being utilized in these classrooms. In this age of digital technology, educational software and calculator-based laboratories (CBL) have become significant devices in the teaching of science and math for many states across the United States. The Texas Instruments graphing calculator and the Vernier LabPro interface are among the calculator-based laboratories becoming increasingly popular among middle and high school science and math teachers in many school districts across this country. In Tennessee, however, it is reported that this type of technology is not regularly utilized at the student level in most high school science classrooms, especially in the area of Physical Science (Vernier, 2006). This research explored the effect of calculator-based laboratory instruction on standardized test scores. The purpose of this study was to determine the effect of traditional teaching methods versus graphing calculator teaching methods on the state mandated End-of-Course (EOC) Physical Science exam based on ability, gender, and ethnicity. The sample included 187 total tenth and eleventh grade physical science students, 101 of which belonged to a control group and 87 of which belonged to the experimental group. Physical Science End-of-Course scores obtained from the Tennessee Department of Education during the spring of 2005 and the spring of 2006 were used to examine the hypotheses. The findings of this research study suggested the type of teaching method, traditional or calculator-based, did not have an effect on standardized test scores. However, the students' ability level, as demonstrated on the End-of-Course test, had a significant effect on End-of-Course test scores. This study focused on a limited population of high school physical science students in the middle Tennessee
Can Percentiles Replace Raw Scores in the Statistical Analysis of Test Data?

ERIC Educational Resources Information Center

Zimmerman, Donald W.; Zumbo, Bruno D.

2005-01-01

Educational and psychological testing textbooks typically warn of the inappropriateness of performing arithmetic operations and statistical analysis on percentiles instead of raw scores. This seems inconsistent with the well-established finding that transforming scores to ranks and using nonparametric methods often improves the validity and power…
Situational Effects May Account for Gain Scores in Cognitive Ability Testing: A Longitudinal SEM Approach

ERIC Educational Resources Information Center

Matton, Nadine; Vautier, Stephane; Raufaste, Eric

2009-01-01

Mean gain scores for cognitive ability tests between two sessions in a selection setting are now a robust finding, yet not fully understood. Many authors do not attribute such gain scores to an increase in the target abilities. Our approach consists of testing a longitudinal SEM model suitable to this view. We propose to model the scores' changes…
Effects of Targeted Test Preparation on Scores of Two Tests of Oral English as a Second Language

ERIC Educational Resources Information Center

Farnsworth, Tim

2013-01-01

This study investigated the effect of targeted test preparation, or coaching, on oral English as a second language test scores. The tests in question were the Basic English Skills Test Plus (BEST Plus), a scripted oral interview published by the Center for Applied Linguistics, and the Versant English Test (VET), a computer-administered and…
Φ-score: A cell-to-cell phenotypic scoring method for sensitive and selective hit discovery in cell-based assays.

PubMed

Guyon, Laurent; Lajaunie, Christian; Fer, Frédéric; Bhajun, Ricky; Sulpice, Eric; Pinna, Guillaume; Campalans, Anna; Radicella, J Pablo; Rouillier, Philippe; Mary, Mélissa; Combe, Stéphanie; Obeid, Patricia; Vert, Jean-Philippe; Gidrol, Xavier

2015-09-18

Phenotypic screening monitors phenotypic changes induced by perturbations, including those generated by drugs or RNA interference. Currently-used methods for scoring screen hits have proven to be problematic, particularly when applied to physiologically relevant conditions such as low cell numbers or inefficient transfection. Here, we describe the Φ-score, which is a novel scoring method for the identification of phenotypic modifiers or hits in cell-based screens. Φ-score performance was assessed with simulations, a validation experiment and its application to gene identification in a large-scale RNAi screen. Using robust statistics and a variance model, we demonstrated that the Φ-score showed better sensitivity, selectivity and reproducibility compared to classical approaches. The improved performance of the Φ-score paves the way for cell-based screening of primary cells, which are often difficult to obtain from patients in sufficient numbers. We also describe a dedicated merging procedure to pool scores from small interfering RNAs targeting the same gene so as to provide improved visualization and hit selection.
A knowledge-based theory of rising scores on "culture-free" tests.

PubMed

Fox, Mark C; Mitchum, Ainsley L

2013-08-01

Secular gains in intelligence test scores have perplexed researchers since they were documented by Flynn (1984, 1987). Gains are most pronounced on abstract, so-called culture-free tests, prompting Flynn (2007) to attribute them to problem-solving skills availed by scientifically advanced cultures. We propose that recent-born individuals have adopted an approach to analogy that enables them to infer higher level relations requiring roles that are not intrinsic to the objects that constitute initial representations of items. This proposal is translated into item-specific predictions about differences between cohorts in pass rates and item-response patterns on the Raven's Matrices (Flynn, 1987), a seemingly culture-free test that registers the largest Flynn effect. Consistent with predictions, archival data reveal that individuals born around 1940 are less able to map objects at higher levels of relational abstraction than individuals born around 1990. Polytomous Rasch models verify predicted violations of measurement invariance, as raw scores are found to underestimate the number of analogical rules inferred by members of the earlier cohort relative to members of the later cohort who achieve the same overall score. The work provides a plausible cognitive account of the Flynn effect, furthers understanding of the cognition of matrix reasoning, and underscores the need to consider how test-takers select item responses. PsycINFO Database Record (c) 2013 APA, all rights reserved.
A Latent Class Approach to Estimating Test-Score Reliability

ERIC Educational Resources Information Center

van der Ark, L. Andries; van der Palm, Daniel W.; Sijtsma, Klaas

2011-01-01

This study presents a general framework for single-administration reliability methods, such as Cronbach's alpha, Guttman's lambda-2, and method MS. This general framework was used to derive a new approach to estimating test-score reliability by means of the unrestricted latent class model. This new approach is the latent class reliability…
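For orientation, Cronbach's alpha, one of the single-administration methods named above, can be computed as in the sketch below. It shows only the standard alpha formula, not the latent class reliability approach the study proposes; the function name `cronbach_alpha` and the response data are hypothetical.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores)
    """
    k = len(responses[0])                      # number of items
    items = list(zip(*responses))              # transpose: one tuple per item
    item_variance_sum = sum(variance(item) for item in items)
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical data: 4 respondents answering 3 items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [3, 3, 4],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

The sample variance is used for both the items and the total score; because alpha depends only on their ratio, using the population variance throughout gives the same value.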

Experiential Awareness of the Effects of Test Score Reports.

ERIC Educational Resources Information Center

Bender, Robert C.

Because most counselors have experienced a significant amount of success, they often have difficulty understanding the impact of test scores on persons who do not perform well. Counselor educators must develop experiential awareness in an area normally outside the realm of their students. To provide such an experience, 25 counselor trainees took…
What We Lose in Winning the Test Score Race

ERIC Educational Resources Information Center

Jorgenson, Olaf

2012-01-01

To achieve perpetually better test results each year as mandated by the No Child Left Behind Act (NCLB), teachers in successful schools such as Leroy Anderson Elementary in San Jose, California, will "try anything" to raise scores, as the school's principal stated in an interview with "The San Jose Mercury News." In schools…
Benefits of Coaching on Test Scores Seen as Negligible.

ERIC Educational Resources Information Center

Report on Education Research, 1983

1983-01-01

THE FOLLOWING IS THE FULL TEXT OF THIS DOCUMENT: A new study by a pair of Harvard University researchers discounts earlier findings that coaching can substantially improve student performance on the Scholastic Aptitude Test (SAT). "There is simply insufficient evidence that large score increases are a result of a coaching program," write…
Structured didactic teaching sessions improve medical student neurology clerkship test scores: a pilot study.

PubMed

Menkes, Daniel L; Reed, Mary

2008-01-01

To determine the effectiveness of didactic case-based instruction methodology to improve medical student comprehension of common neurological illnesses and neurological emergencies. Neurology department, academic university. 415 third and fourth year medical students performing a required four week neurology clerkship. Raw test scores on a 1 hour, 50-item clinical vignette based examination and open-ended questions in a post-clerkship feedback session. There was a statistically significant improvement in overall test scores (p<0.001). Didactic teaching sessions have a significant positive impact on neurology student clerkship test score performance and perception of their educational experience. Confirmation of these results across multiple specialties in a multi-center trial is warranted.
Estimating Conditional Distributions of Scores on an Alternate Form of a Test. Research Report. ETS RR-15-18

ERIC Educational Resources Information Center

Livingston, Samuel A.; Chen, Haiwen H.

2015-01-01

Quantitative information about test score reliability can be presented in terms of the distribution of equated scores on an alternate form of the test for test takers with a given score on the form taken. In this paper, we describe a procedure for estimating that distribution, for any specified score on the test form taken, by estimating the joint…
Computerized scoring algorithms for the Autobiographical Memory Test.

PubMed

Takano, Keisuke; Gutenbrunner, Charlotte; Martens, Kris; Salmon, Karen; Raes, Filip

2018-02-01

Reduced specificity of autobiographical memories is a hallmark of depressive cognition. Autobiographical memory (AM) specificity is typically measured by the Autobiographical Memory Test (AMT), in which respondents are asked to describe personal memories in response to emotional cue words. Due to this free descriptive responding format, the AMT relies on experts' hand scoring for subsequent statistical analyses. This manual coding potentially impedes research activities in big data analytics such as large epidemiological studies. Here, we propose computerized algorithms to automatically score AM specificity for the Dutch (adult participants) and English (youth participants) versions of the AMT by using natural language processing and machine learning techniques. The algorithms showed reliable performances in discriminating specific and nonspecific (e.g., overgeneralized) autobiographical memories in independent testing data sets (area under the receiver operating characteristic curve > .90). Furthermore, outcome values of the algorithms (i.e., decision values of support vector machines) showed a gradient across similar (e.g., specific and extended memories) and different (e.g., specific memory and semantic associates) categories of AMT responses, suggesting that, for both adults and youth, the algorithms well capture the extent to which a memory has features of specific memories. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
A seven-year follow-up of intelligence test scores of foster grandparents.

PubMed

Troll, L E; Saltz, R; Dunin-Markiewicz, A

1976-09-01

After 7 years, a group of originally nonemployed poverty-level older people (over 60) who had been employed as foster grandparents were retested with the WAIS. Four WAIS subtests - Vocabulary, Similarities, Digit Span, and Block Design - were employed. Of the original group of 39, complete data were available for 28; 18 of these were still working on the project, and the other 10 had dropped out. Dropouts as a group tested lower originally and also showed more deterioration in functional health ratings over time. For the total group of 32 foster grandparents, three subtest scores showed stability over the 7 years. Only Digit Span showed a statistically significant drop. Neither age nor the initial level of health or WAIS scores was related to test-score changes over time.
Robust joint score tests in the application of DNA methylation data analysis.

PubMed

Li, Xuan; Fu, Yuejiao; Wang, Xiaogang; Qiu, Weiliang

2018-05-18

Recently, differential variability has been shown to be valuable in evaluating the association of DNA methylation to the risks of complex human diseases. The statistical tests based on both differential methylation level and differential variability can be more powerful than those based only on differential methylation level. Anh and Wang (2013) proposed a joint score test (AW) to simultaneously detect differential methylation and differential variability. However, AW's method seems to be quite conservative and has not been fully compared with existing joint tests. We proposed three improved joint score tests, namely iAW.Lev, iAW.BF, and iAW.TM, and have made extensive comparisons with the joint likelihood ratio test (jointLRT), the Kolmogorov-Smirnov (KS) test, and the AW test. Systematic simulation studies showed that: 1) the three improved tests performed better (i.e., having larger power, while keeping nominal Type I error rates) than the other three tests for data with outliers and having different variances between cases and controls; 2) for data from normal distributions, the three improved tests had slightly lower power than jointLRT and AW. The analyses of two Illumina HumanMethylation27 data sets GSE37020 and GSE20080 and one Illumina Infinium MethylationEPIC data set GSE107080 demonstrated that the three improved tests had higher true validation rates than those from jointLRT, KS, and AW. The three proposed joint score tests are robust against the violation of the normality assumption and the presence of outlying observations in comparison with the other three existing tests. Among the three proposed tests, iAW.BF seems to be the most robust and effective one for all simulated scenarios and also in real data analyses.
Peptide identification

DOEpatents

Jarman, Kristin H [Richland, WA; Cannon, William R [Richland, WA; Jarman, Kenneth D [Richland, WA; Heredia-Langner, Alejandro [Richland, WA

2011-07-12

Peptides are identified from a list of candidates using collision-induced dissociation tandem mass spectrometry data. A probabilistic model for the occurrence of spectral peaks corresponding to frequently observed partial peptide fragment ions is applied. As part of the identification procedure, a probability score is produced that indicates the likelihood of any given candidate being the correct match. The statistical significance of the score is known without necessarily having reference to the actual identity of the peptide. In one form of the invention, a genetic algorithm is applied to candidate peptides using an objective function that takes into account the number of shifted peaks appearing in the candidate spectrum relative to the test spectrum.
The Relationship between Deductive Reasoning Ability, Test Anxiety, and Standardized Test Scores in a Latino Sample

ERIC Educational Resources Information Center

Rich, John D., Jr.; Fullard, William; Overton, Willis

2011-01-01

One hundred and twelve Latino students from Philadelphia participated in this study, which examined the development of deductive reasoning across adolescence, and the relation of reasoning to test anxiety and standardized test scores. As predicted, 11th and ninth graders demonstrated significantly more advanced reasoning than seventh graders.…
Identifying Speech Acts in E-Mails: Toward Automated Scoring of the "TOEIC"® E-Mail Task. Research Report. ETS RR-12-16

ERIC Educational Resources Information Center

De Felice, Rachele; Deane, Paul

2012-01-01

This study proposes an approach to automatically score the "TOEIC"® Writing e-mail task. We focus on one component of the scoring rubric, which notes whether the test-takers have used particular speech acts such as requests, orders, or commitments. We developed a computational model for automated speech act identification and tested it…
A Diet Score Assessing Norwegian Adolescents’ Adherence to Dietary Recommendations—Development and Test-Retest Reproducibility of the Score

PubMed Central

Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

2016-01-01

Assessment of adolescents’ dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents’ adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen’s Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14–15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Highest compliance with recommendations was seen for choosing water as a beverage and limiting the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health-promoting work targeting adolescents should emphasize increasing the intake of recommended foods to approach nutritional guidelines. PMID:27483312
A Diet Score Assessing Norwegian Adolescents' Adherence to Dietary Recommendations-Development and Test-Retest Reproducibility of the Score.

PubMed

Handeland, Katina; Kjellevold, Marian; Wik Markhus, Maria; Eide Graff, Ingvild; Frøyland, Livar; Lie, Øyvind; Skotheim, Siv; Stormark, Kjell Morten; Dahl, Lisbeth; Øyen, Jannike

2016-07-29

Assessment of adolescents' dietary habits is challenging. Reliable instruments to monitor dietary trends are required to promote healthier behaviours in this group. The purpose of this cross-sectional study was to assess adolescents' adherence to Norwegian dietary recommendations with a diet score and to report results from, and test-retest reliability of, the score. The diet score involved seven food groups and one physical activity indicator, and was applied to answers from a semi-quantitative food frequency questionnaire (FFQ) administered twice. Reproducibility of the score was assessed with Cohen's Kappa (κ statistics) at an interval of three months. The setting was eight lower-secondary schools in Hordaland County, Norway, and subjects were adolescents (n = 472) aged 14-15 years and their caregivers. Results showed that the proportion of adolescents consistently classified by the diet score was 87.6% (κ = 0.465). For food groups, proportions ranged from 74.0% to 91.6% (κ = 0.249 to κ = 0.573). Less than 40% of the participants were found to adhere to recommendations for frequencies of eating fruits, vegetables, added sugar, and fish. Compliance with recommendations was highest for choosing water as a beverage and limiting the intake of red meat. The score was associated with parental socioeconomic status. The diet score was found to be reproducible at an acceptable level. Health-promotion work targeting adolescents should emphasize increasing the intake of recommended foods to approach nutritional guidelines.
A Bad Idea: National Standards Based on Test Scores

ERIC Educational Resources Information Center

Baker, Keith

2010-01-01

The justification for national standards is that test scores predict a nation's future economic success. There is no evidence that supports this assumption. There is evidence that it is wrong. For more than half a century, reformers have been trying to fix our schools with little success. The obvious conclusion is that something that can't be…
America's Mediocre Test Scores: Education Crisis or Poverty Crisis?

ERIC Educational Resources Information Center

Petrilli, Michael J.; Wright, Brandon L.

2016-01-01

At a time when the national conversation is focused on lagging upward mobility, it is no surprise that many educators point to poverty as the explanation for mediocre test scores among U.S. students compared to those of students in other countries. If American teachers in struggling U.S. schools taught in Finland, says Finnish educator Pasi…
A new scoring function for top-down spectral deconvolution

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kou, Qiang; Wu, Si; Liu, Xiaowen

2014-12-18

Background: Top-down mass spectrometry plays an important role in intact protein identification and characterization. Top-down mass spectra are more complex than bottom-up mass spectra because they often contain many isotopomer envelopes from highly charged ions, which may overlap with one another. As a result, spectral deconvolution, which converts a complex top-down mass spectrum into a monoisotopic mass list, is a key step in top-down spectral interpretation. Results: In this paper, we propose a new scoring function, L-score, for evaluating isotopomer envelopes. By combining L-score with MS-Deconv, a new software tool, MS-Deconv+, was developed for top-down spectral deconvolution. Experimental results showed that MS-Deconv+ outperformed existing software tools in top-down spectral deconvolution. Conclusions: L-score shows high discriminative ability in identification of isotopomer envelopes. Using L-score, MS-Deconv+ reports many correct monoisotopic masses missed by other software tools, which are valuable for proteoform identification and characterization.
Using Heteroskedastic Ordered Probit Models to Recover Moments of Continuous Test Score Distributions from Coarsened Data

ERIC Educational Resources Information Center

Reardon, Sean F.; Shear, Benjamin R.; Castellano, Katherine E.; Ho, Andrew D.

2017-01-01

Test score distributions of schools or demographic groups are often summarized by frequencies of students scoring in a small number of ordered proficiency categories. We show that heteroskedastic ordered probit (HETOP) models can be used to estimate means and standard deviations of multiple groups' test score distributions from such data. Because…
Power and sample size evaluation for the Cochran-Mantel-Haenszel mean score (Wilcoxon rank sum) test and the Cochran-Armitage test for trend.

PubMed

Lachin, John M

2011-11-10

The power of a chi-square test, and thus the required sample size, are a function of the noncentrality parameter that can be obtained as the limiting expectation of the test statistic under an alternative hypothesis specification. Herein, we apply this principle to derive simple expressions for two tests that are commonly applied to discrete ordinal data. The Wilcoxon rank sum test for the equality of distributions in two groups is algebraically equivalent to the Mann-Whitney test. The Kruskal-Wallis test applies to multiple groups. These tests are equivalent to a Cochran-Mantel-Haenszel mean score test using rank scores for a set of C-discrete categories. Although various authors have assessed the power function of the Wilcoxon and Mann-Whitney tests, herein it is shown that the power of these tests with discrete observations, that is, with tied ranks, is readily provided by the power function of the corresponding Cochran-Mantel-Haenszel mean scores test for two and R > 2 groups. These expressions yield results virtually identical to those derived previously for rank scores and also apply to other score functions. The Cochran-Armitage test for trend assesses whether there is a monotonically increasing or decreasing trend in the proportions with a positive outcome or response over the C-ordered categories of an ordinal independent variable, for example, dose. Herein, it is shown that the power of the test is a function of the slope of the response probabilities over the ordinal scores assigned to the groups that yields simple expressions for the power of the test. Copyright © 2011 John Wiley & Sons, Ltd.
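For orientation, the Cochran-Armitage trend statistic itself (not the power expressions derived in the paper) can be sketched as below. The sketch uses the usual textbook form of the Z statistic; the function name, the group scores and the counts are assumptions made for illustration only.

```python
import math


def cochran_armitage_z(successes, totals, scores):
    """Textbook Cochran-Armitage Z statistic for a trend in proportions
    across ordered groups (e.g. dose levels) with assigned scores."""
    n_total = sum(totals)
    p_bar = sum(successes) / n_total  # pooled proportion under the null
    # Score-weighted departure of observed from expected successes.
    numerator = sum(s * (r - n * p_bar)
                    for s, r, n in zip(scores, successes, totals))
    sum_ns = sum(n * s for n, s in zip(totals, scores))
    sum_ns2 = sum(n * s * s for n, s in zip(totals, scores))
    variance = p_bar * (1 - p_bar) * (sum_ns2 - sum_ns ** 2 / n_total)
    return numerator / math.sqrt(variance)


# Hypothetical data: three dose groups of 50 subjects with 10, 20 and 30
# positive responses, scored 0, 1, 2.
z = cochran_armitage_z([10, 20, 30], [50, 50, 50], [0, 1, 2])
print(f"Z = {z:.3f}")
```

Under the null hypothesis Z is approximately standard normal, and Z squared is the one-degree-of-freedom chi-square form whose noncentrality parameter drives the power calculation discussed in the abstract.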
The Implications of Family Size and Birth Order for Test Scores and Behavioral Development

ERIC Educational Resources Information Center

Silles, Mary A.

2010-01-01

This article, using longitudinal data from the National Child Development Study, presents new evidence on the effects of family size and birth order on test scores and behavioral development at age 7, 11 and 16. Sibling size is shown to have an adverse causal effect on test scores and behavioral development. For any given family size, first-borns…
The Weighted Airman Promotion System: Standardizing Test Scores

DTIC Science & Technology

2008-01-01

This document and trademark(s) contained herein are protected by law as indicated in a notice appearing later in this work. Title: The Weighted Airman Promotion System: Standardizing Test Scores. Performing organization: Rand Corporation, PO Box 2138, Santa Monica.

Affirm VPIII microbial identification test can be used to detect Gardnerella vaginalis, Candida albicans and Trichomonas vaginalis microbial infections in Korean women.

PubMed

Byun, Seung Won; Park, Yeon Joon; Hur, Soo Young

2016-04-01

The aim of this study was to compare Affirm VPIII Microbial Identification Test results for Korean women to those obtained for Gardnerella vaginalis through Nugent score, Candida albicans based on vaginal culture and Trichomonas vaginalis based on wet smear diagnostic standards. Study participants included 195 women with symptomatic or asymptomatic vulvovaginitis under hospital obstetric or gynecologic care. A definite diagnosis was made based on Nugent score for Gardnerella, vaginal culture for Candida and wet prep for Trichomonas vaginalis. Affirm VPIII Microbial Identification Test results were then compared to diagnostic standard results. Of the 195 participants, 152 were symptomatic, while 43 were asymptomatic. Final diagnosis revealed 68 (34.87%) cases of Gardnerella, 29 (14.87%) cases of Candida, one (0.51%) case of Trichomonas, and 10 (5.10%) cases of mixed infections. The detection rates achieved by each detection method (Affirm assay vs diagnostic standard) for Gardnerella and Candida were not significantly different (33.33% vs 34.87% for Gardnerella, 13.33% vs 14.87% for Candida, respectively). The sensitivity and specificity of the Affirm test for Gardnerella compared to the diagnostic standard were 75.0% and 88.98%, respectively. For Candida, the sensitivity and specificity of the Affirm test compared to the diagnostic standard were 82.76% and 98.80%, respectively. The number of Trichomonas cases was too small (1 case) to be statistically analyzed. The Affirm test is a quick tool that can help physicians diagnose and treat patients with infectious vaginitis at the point of care. © 2016 Japan Society of Obstetrics and Gynecology.
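The Gardnerella figures above can be checked with a small sensitivity and specificity calculation. The abstract does not report the 2x2 cell counts; the counts below are back-calculated from 68 reference-positive cases among 195 women and the stated percentages, so they are an assumption, and the function name is hypothetical.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    return tp / (tp + fn), tn / (tn + fp)


# Assumed cell counts (inferred, not reported): 68 reference-positive and
# 127 reference-negative women out of 195.
tp, fn, tn, fp = 51, 17, 113, 14
sens, spec = sensitivity_specificity(tp, fn, tn, fp)
affirm_detection_rate = (tp + fp) / (tp + fn + tn + fp)
reference_rate = (tp + fn) / (tp + fn + tn + fp)

print(f"sensitivity = {sens:.2%}, specificity = {spec:.2%}")
print(f"Affirm positives = {affirm_detection_rate:.2%}, "
      f"reference positives = {reference_rate:.2%}")
```

Under these assumed counts the sketch gives the sensitivity, specificity and both Gardnerella detection rates quoted in the abstract.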
Qualitative Dimensions in Scoring the Rey Visual Memory Test of Malingering.

ERIC Educational Resources Information Center

Griffin, G. A. Elmer; And Others

1996-01-01

A new qualitative scoring system for the Rey Visual Memory Test was tested for its ability to distinguish between malingerers and nonmalingerers. The new system, based on the types of errors made, was able to distinguish between 53 psychiatrically disabled and 64 normal nonmalingerers, and between nonmalingerers and 91 possible malingerers. (SLD)
Correlations between the Hand Test Pathology score and Personality Assessment Inventory scales for pain clinic patients.

PubMed

George, J M; Wagner, E E

1995-06-01

Pearson correlations between the Hand Test Pathology (PATH) score and Personality Assessment Inventory scales produced a cluster of relationships characteristic of an antisocial orientation. Likewise, PATH significantly differentiated between a "P" (Pathology) group flagged by a high Negative Impression score on the inventory, and an "N" (Normal) group of 100 pain patients. It was suggested that the interpretive simplicity of Hand Test scores renders the scores amenable to further correlational studies involving the inventory.
Improving Test Score Reporting: Perspectives from the ETS Score Reporting Conference. Research Report. ETS RR-11-45

ERIC Educational Resources Information Center

Zapata-Rivera, Diego, Ed.; Zwick, Rebecca, Ed.

2011-01-01

This volume includes 3 papers based on presentations at a workshop on communicating assessment information to particular audiences, held at Educational Testing Service (ETS) on November 4th, 2010, to explore some issues that influence score reports and new advances that contribute to the effectiveness of these reports. Jessica Hullman, Rebecca…
Stability of the alcohol use disorders identification test in practical service settings.

PubMed

Sahker, Ethan; Lancianese, Donna A; Arndt, Stephan

2017-01-01

The purpose of the present study is to explore the stability of the Alcohol Use Disorders Identification Test (AUDIT) in a clinical setting by comparing prescreening heavy drinking questions and AUDIT scores over time. Because instrument stability is equal to test-retest reliability at worst, investigating the stability of the AUDIT would help better understand patient behavior change in context and the appropriateness of the AUDIT in a clinical setting. This was a retrospective exploratory analysis of Visit 1 to Visit 2 AUDIT stability (n=1,099; male [75.4%], female [24.6%]) from all patients with first-time and second-time records in the Iowa Screening, Brief Intervention, and Referral to Treatment project, October 2012 to July 7, 2015 (N=17,699; male [40.6%], female [59.4%]). The AUDIT demonstrated moderate stability (intraclass correlation=0.56, 95% confidence interval: 0.52-0.60). In a multiple regression predicting the (absolute) difference between the two AUDIT scores, the participants' age was highly significant, t (1,092)=6.23, p <0.001. Younger participants clearly showed less stability than their older counterparts. Results are limited/biased by the observational nature of the study design and the use of clinical service data. The present findings contribute to the literature by demonstrating that the AUDIT changes are moderately dependable from Visit 1 to Visit 2 while taking into account patient drinking behavior variability. It is important to know the stability of the AUDIT for continued use in Screening, Brief Intervention, and Referral to Treatment programming.
Standardized Testing of Special Education Students: A Comparison of Service Type and Test Scores

ERIC Educational Resources Information Center

Hogan-Young, Christine

2013-01-01

The purpose of this study was to determine if there was a difference in Tennessee Comprehensive Assessment Program Modified Academic Achievement Standards (TCAP MAAS) achievement test scores for special education students who receive their instruction in the resource classroom or in an inclusion classroom. The study involved third, fourth, and…
Φ-score: A cell-to-cell phenotypic scoring method for sensitive and selective hit discovery in cell-based assays

PubMed Central

Guyon, Laurent; Lajaunie, Christian; Fer, Frédéric; Bhajun, Ricky; Sulpice, Eric; Pinna, Guillaume; Campalans, Anna; Radicella, J. Pablo; Rouillier, Philippe; Mary, Mélissa; Combe, Stéphanie; Obeid, Patricia; Vert, Jean-Philippe; Gidrol, Xavier

2015-01-01

Phenotypic screening monitors phenotypic changes induced by perturbations, including those generated by drugs or RNA interference. Currently-used methods for scoring screen hits have proven to be problematic, particularly when applied to physiologically relevant conditions such as low cell numbers or inefficient transfection. Here, we describe the Φ-score, which is a novel scoring method for the identification of phenotypic modifiers or hits in cell-based screens. Φ-score performance was assessed with simulations, a validation experiment and its application to gene identification in a large-scale RNAi screen. Using robust statistics and a variance model, we demonstrated that the Φ-score showed better sensitivity, selectivity and reproducibility compared to classical approaches. The improved performance of the Φ-score paves the way for cell-based screening of primary cells, which are often difficult to obtain from patients in sufficient numbers. We also describe a dedicated merging procedure to pool scores from small interfering RNAs targeting the same gene so as to provide improved visualization and hit selection. PMID:26382112
Screening for Behavioral Risk: Identification of High Risk Cut Scores within the Social, Academic, and Emotional Behavior Risk Screener (SAEBRS)

ERIC Educational Resources Information Center

Kilgus, Stephen P.; Taylor, Crystal N.; von der Embse, Nathaniel P.

2018-01-01

The purpose of this study was to support the identification of Social, Academic, and Emotional Behavior Risk Screener (SAEBRS) cut scores that could be used to detect high-risk students. Teachers rated students across two time points (Time 1 n = 1,242 students; Time 2 n = 704) using the SAEBRS and the Behavioral and Emotional Screening System…
Validity of Alternative Cut-Off Scores for the Back-Saver Sit and Reach Test

ERIC Educational Resources Information Center

Looney, Marilyn A.; Gilbert, Jennie

2012-01-01

The purpose of the study was to determine if currently used FITNESSGRAM® cut-off scores for the Back Saver Sit and Reach Test had the best criterion-referenced validity evidence for 6-12 year old children. Secondary analyses of an existing data set focused on the passive straight leg raise and Back Saver Sit and Reach Test flexibility scores of…
Critical overview of applications of genetic testing in sport talent identification.

PubMed

Roth, Stephen M

2012-12-01

Talent identification for future sport performance is of paramount interest for many groups given the challenges of finding and costs of training potential elite athletes. Because genetic factors have been implicated in many performance-related traits (strength, endurance, etc.), a natural inclination is to consider the addition of genetic testing to talent identification programs. While the importance of genetic factors to sport performance is generally not disputed, whether genetic testing can positively inform talent identification is less certain. The present paper addresses the science behind the genetic tests that are now commercially available (some under patent protection) and aimed at predicting future sport performance potential. Also discussed are the challenging ethical issues that emerge from the availability of these tests. The potential negative consequences associated with genetic testing of young athletes will very likely outweigh any positive benefit for sport performance prediction at least for the next several years. The paper ends by exploring the future possibilities for genetic testing as the science of genomics in sport matures over the coming decade(s).
Interpretation and Utilization of Scores on the Air Force Officer Qualifying Test.

ERIC Educational Resources Information Center

Miller, Robert E.

The report summarizes a large body of data relevant to the proper interpretation and use of aptitude scores on the Air Force Officer Qualifying Test (AFOQT). Included are descriptions of the AFOQT testing program and the test itself. Technical data include an extensive sampling of validation studies covering predictors of success in pilot…
Stochastic Processes as True-Score Models for Highly Speeded Mental Tests.

ERIC Educational Resources Information Center

Moore, William E.

The previous theoretical development of the Poisson process as a strong model for the true-score theory of mental tests is discussed, and additional theoretical properties of the model from the standpoint of individual examinees are developed. The paper introduces the Erlang process as a family of test theory models and shows in the context of…
A physical function test for use in the intensive care unit: validity, responsiveness, and predictive utility of the physical function ICU test (scored).

PubMed

Denehy, Linda; de Morton, Natalie A; Skinner, Elizabeth H; Edbrooke, Lara; Haines, Kimberley; Warrillow, Stephen; Berney, Sue

2013-12-01

Several tests have recently been developed to measure changes in patient strength and functional outcomes in the intensive care unit (ICU). The original Physical Function ICU Test (PFIT) demonstrates reliability and sensitivity. The aims of this study were to further develop the original PFIT, to derive an interval score (the PFIT-s), and to test the clinimetric properties of the PFIT-s. A nested cohort study was conducted. One hundred forty-four and 116 participants performed the PFIT at ICU admission and discharge, respectively. Original test components were modified using principal component analysis. Rasch analysis examined the unidimensionality of the PFIT, and an interval score was derived. Correlations tested validity, and multiple regression analyses investigated predictive ability. Responsiveness was assessed using the effect size index (ESI), and the minimal clinically important difference (MCID) was calculated. The shoulder lift component was removed. Unidimensionality of combined admission and discharge PFIT-s scores was confirmed. The PFIT-s displayed moderate convergent validity with the Timed "Up & Go" Test (r=-.60), the Six-Minute Walk Test (r=.41), and the Medical Research Council (MRC) sum score (rho=.49). The ESI of the PFIT-s was 0.82, and the MCID was 1.5 points (interval scale range=0-10). A higher admission PFIT-s score was predictive of: an MRC score of ≥48, increased likelihood of discharge home, reduced likelihood of discharge to inpatient rehabilitation, and reduced acute care hospital length of stay. Scoring of sit-to-stand assistance required is subjective, and cadence cutpoints used may not be generalizable. The PFIT-s is a safe and inexpensive test of physical function with high clinical utility. It is valid, responsive to change, and predictive of key outcomes. It is recommended that the PFIT-s be adopted to test physical function in the ICU.
Univariate and Bivariate Loglinear Models for Discrete Test Score Distributions.

ERIC Educational Resources Information Center

Holland, Paul W.; Thayer, Dorothy T.

2000-01-01

Applied the theory of exponential families of distributions to the problem of fitting the univariate histograms and discrete bivariate frequency distributions that often arise in the analysis of test scores. Considers efficient computation of the maximum likelihood estimates of the parameters using Newton's Method and computationally efficient…
Allele-sharing models: LOD scores and accurate linkage tests.

PubMed

Kong, A; Cox, N J

1997-11-01

Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested.
Allele-sharing models: LOD scores and accurate linkage tests.

PubMed Central

Kong, A; Cox, N J

1997-01-01

Starting with a test statistic for linkage analysis based on allele sharing, we propose an associated one-parameter model. Under general missing-data patterns, this model allows exact calculation of likelihood ratios and LOD scores and has been implemented by a simple modification of existing software. Most important, accurate linkage tests can be performed. Using an example, we show that some previously suggested approaches to handling less than perfectly informative data can be unacceptably conservative. Situations in which this model may not perform well are discussed, and an alternative model that requires additional computations is suggested. PMID:9345087
Beyond Correlations: Usefulness of High School GPA and Test Scores in Making College Admissions Decisions

ERIC Educational Resources Information Center

Sawyer, Richard

2013-01-01

Correlational evidence suggests that high school GPA is better than admission test scores in predicting first-year college GPA, although test scores have incremental predictive validity. The usefulness of a selection variable in making admission decisions depends in part on its predictive validity, but also on institutions' selectivity and…
Graduate Students' Administration and Scoring Errors on the Woodcock-Johnson III Tests of Cognitive Abilities

ERIC Educational Resources Information Center

Ramos, Erica; Alfonso, Vincent C.; Schermerhorn, Susan M.

2009-01-01

The interpretation of cognitive test scores often leads to decisions concerning the diagnosis, educational placement, and types of interventions used for children. Therefore, it is important that practitioners administer and score cognitive tests without error. This study assesses the frequency and types of examiner errors that occur during the…
Evaluating the Stability of Test Score Means for the "TOEIC"® Speaking and Writing Tests. Research Report. ETS RR-17-50

ERIC Educational Resources Information Center

Qu, Yanxuan; Huo, Yan; Chan, Eric; Shotts, Matthew

2017-01-01

For educational tests, it is critical to maintain consistency of score scales and to understand the sources of variation in score means over time. This practice helps to ensure that interpretations about test takers' abilities are comparable from one administration (or one form) to another. This study examines the consistency of reported scores…
Reliability and validity of a talent identification test battery for seated and standing Paralympic throws.

PubMed

Spathis, Jemima Grace; Connick, Mark James; Beckman, Emma Maree; Newcombe, Peter Anthony; Tweedy, Sean Michael

2015-01-01

Paralympic throwing events for athletes with physical impairments comprise seated and standing javelin, shot put, discus and seated club throwing. Identification of talented throwers would enable prediction of future success and promote participation; however, a valid and reliable talent identification battery for Paralympic throwing has not been reported. This study evaluates the reliability and validity of a talent identification battery for Paralympic throws. Participants were non-disabled so that impairment would not confound analyses, and results would provide an indication of normative performance. Twenty-eight non-disabled participants (13 M; 15 F) aged 23.6 years (±5.44) performed five kinematically distinct criterion throws (three seated, two standing) and nine talent identification tests (three anthropometric, six motor); 23 were tested a second time to evaluate test-retest reliability. Talent identification test-retest reliability was evaluated using Intra-class Correlation Coefficient (ICC) and Bland-Altman plots (Limits of Agreement). Spearman's correlation assessed strength of association between criterion throws and talent identification tests. Reliability was generally acceptable (mean ICC = 0.89), but two seated talent identification tests require more extensive familiarisation. Correlation strength (mean rs = 0.76) indicated that the talent identification tests can be used to validly identify individuals with competitively advantageous attributes for each of the five kinematically distinct throwing activities. Results facilitate further research in this understudied area.
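The Bland-Altman limits of agreement mentioned above are conventionally the mean test-retest difference plus or minus 1.96 standard deviations of the differences. A minimal sketch follows, assuming paired scores from two sessions; the function name and the sample values are hypothetical and not drawn from the study.

```python
from statistics import mean, stdev


def limits_of_agreement(test, retest):
    """Return (lower limit, bias, upper limit) for paired measurements,
    using bias +/- 1.96 * SD of the pairwise differences."""
    diffs = [a - b for a, b in zip(test, retest)]
    bias = mean(diffs)
    sd = stdev(diffs)  # sample standard deviation (n - 1 denominator)
    return bias - 1.96 * sd, bias, bias + 1.96 * sd


# Hypothetical scores for four participants tested twice.
session_1 = [10.0, 12.0, 11.0, 13.0]
session_2 = [9.0, 12.5, 11.5, 12.0]
lower, bias, upper = limits_of_agreement(session_1, session_2)
print(f"bias = {bias:.2f}, limits of agreement = [{lower:.2f}, {upper:.2f}]")
```

A bias near zero with narrow limits suggests that a test is repeatable; wide limits would be one sign that more familiarisation is needed.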

The effect of human immunodeficiency virus type 1 antibody status on military applicant aptitude test scores.

PubMed

Arday, D R; Brundage, J F; Gardner, L I; Goldenbaum, M; Wann, F; Wright, S

1991-06-15

The authors conducted a population-based study to attempt to estimate the effect of human immunodeficiency virus type 1 (HIV-1) seropositivity on Armed Services Vocational Aptitude Battery test scores in otherwise healthy individuals with early HIV-1 infection. The Armed Services Vocational Aptitude Battery is a 10-test written multiple aptitude battery administered to all civilian applicants for military enlistment prior to serologic screening for HIV-1 antibodies. A total of 975,489 induction testing records containing both Armed Services Vocational Aptitude Battery and HIV-1 results from October 1985 through March 1987 were examined. An analysis data set (n = 7,698) was constructed by choosing five controls for each of the 1,283 HIV-1-positive cases, matched on five-digit ZIP code, and a multiple linear regression analysis was performed to control for demographic and other factors that might influence test scores. Years of education was the strongest predictor of test scores, raising an applicant's score on a composite test nearly 0.16 standard deviation per year. The HIV-1-positive effect on the composite score was -0.09 standard deviation (99% confidence interval -0.17 to -0.02). Separate regressions on each component test within the battery showed HIV-1 effects between -0.39 and +0.06 standard deviation. The two Armed Services Vocational Aptitude Battery component tests felt a priori to be the most sensitive to HIV-1-positive status showed the least decrease with seropositivity. Much of the variability in test scores was not predicted by either HIV-1 serostatus or the demographic and other factors included in the model. There appeared to be little evidence of a strong HIV-1 effect.
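The size of the analysis data set follows directly from the matching design described above, as this short check shows. The variable names are illustrative.

```python
# 1:5 matched design: each HIV-1-positive case contributes itself plus
# five matched controls to the analysis data set.
cases = 1283
controls_per_case = 5
analysis_n = cases * (1 + controls_per_case)
print(analysis_n)
```

The result, 7,698, matches the analysis data set size given in the abstract.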
Low aerobic fitness and obesity are associated with lower standardized test scores in children.

PubMed

Roberts, Christian K; Freed, Benjamin; McCarthy, William J

2010-05-01

To investigate whether aerobic fitness and obesity in school children are associated with standardized test performance. Ethnically diverse (n = 1989) 5th, 7th, and 9th graders attending California schools comprised the sample. Aerobic fitness was determined by a 1-mile run/walk test; body mass index (BMI) was obtained from state-mandated measurements. California standardized test scores were obtained from the school district. Students whose mile run/walk times exceeded California Fitnessgram standards or whose BMI exceeded Centers for Disease Control sex- and age-specific body weight standards scored lower on California standardized math, reading, and language tests than students with desirable BMI status or fitness level, even after controlling for parent education among other covariates. Ethnic differences in standardized test scores were consistent with ethnic differences in obesity status and aerobic fitness. BMI-for-age was no longer a significant multivariate predictor when covariates included fitness level. Low aerobic fitness is common among youth and varies among ethnic groups, and aerobic fitness level predicts performance on standardized tests across ethnic groups. More research is needed to uncover the physiological mechanisms by which aerobic fitness may contribute to performance on standardized academic tests.
Scoring clustering solutions by their biological relevance.

PubMed

Gat-Viks, I; Sharan, R; Shamir, R

2003-12-12

A central step in the analysis of gene expression data is the identification of groups of genes that exhibit similar expression patterns. Clustering gene expression data into homogeneous groups was shown to be instrumental in functional annotation, tissue classification, regulatory motif identification, and other applications. Although there is a rich literature on clustering algorithms for gene expression analysis, very few works addressed the systematic comparison and evaluation of clustering results. Typically, different clustering algorithms yield different clustering solutions on the same data, and there is no agreed upon guideline for choosing among them. We developed a novel statistically based method for assessing a clustering solution according to prior biological knowledge. Our method can be used to compare different clustering solutions or to optimize the parameters of a clustering algorithm. The method is based on projecting vectors of biological attributes of the clustered elements onto the real line, such that the ratio of between-groups and within-group variance estimators is maximized. The projected data are then scored using a non-parametric analysis of variance test, and the score's confidence is evaluated. We validate our approach using simulated data and show that our scoring method outperforms several extant methods, including the separation to homogeneity ratio and the silhouette measure. We apply our method to evaluate results of several clustering methods on yeast cell-cycle gene expression data. The software is available from the authors upon request.
The Emphasis of Student Test Scores in Teacher Appraisal Systems

ERIC Educational Resources Information Center

Smith, William C.; Kubacka, Katarzyna

2017-01-01

Over the past 30 years teachers have been held increasingly accountable for the quality of education in their classroom. During this transition, the line between teacher appraisals, traditionally an instrument for continuous formative teacher feedback, and summative teacher evaluations has blurred. Student test scores, as an "objective"…
Rising Stars: High School's Change Process Produces Higher Test Scores.

ERIC Educational Resources Information Center

McCown, Claire; Runnebaum, Robert

2001-01-01

Presents Bishop Ward High School (Kansas) as a case study that has seen great improvements in standardized testing results by changing its approach. States that realignment of curriculum, adjusting instructional strategies, and accommodating students with special needs are important aspects of raising assessment scores in high schools. (CJW)
Direct structural parameter identification by modal test results

NASA Technical Reports Server (NTRS)

Chen, J.-C.; Kuo, C.-P.; Garba, J. A.

1983-01-01

A direct identification procedure is proposed to obtain the mass and stiffness matrices based on the test measured eigenvalues and eigenvectors. The method is based on the theory of matrix perturbation in which the correct mass and stiffness matrices are expanded in terms of analytical values plus a modification matrix. The simplicity of the procedure enables real time operation during the structural testing.
Comparing the Effects of Elementary Music and Visual Arts Lessons on Standardized Mathematics Test Scores

ERIC Educational Resources Information Center

King, Molly Elizabeth

2016-01-01

The purpose of this quantitative, causal-comparative study was to compare the effect elementary music and visual arts lessons had on third through sixth grade standardized mathematics test scores. Inferential statistics were used to compare the differences between test scores of students who took in-school, elementary, music instruction during the…
Many Children Left Behind? Textbooks and Test Scores in Kenya. NBER Working Paper No. 13300

ERIC Educational Resources Information Center

Glewwe, Paul; Kremer, Michael; Moulin, Sylvie

2007-01-01

A randomized evaluation suggests that a program which provided official textbooks to randomly selected rural Kenyan primary schools did not increase test scores for the average student. In contrast, the previous literature suggests that textbook provision has a large impact on test scores. Disaggregating the results by students' initial academic…
Relationship of Elementary and Secondary School Achievement Test Scores to Later Academic Success.

ERIC Educational Resources Information Center

Loyd, Brenda H.; And Others

1980-01-01

This study investigated the relationship between achievement test scores on the Iowa Tests of Basic Skills (ITBS) and Iowa Tests of Educational Development (ITED), and high school and college grade point average. Support for the predictive validity of the ITBS and ITED achievement test batteries is provided. (Author/GK)
21 CFR 866.6050 - Ovarian adnexal mass assessment score test system.

Code of Federal Regulations, 2013 CFR

2013-04-01

... surgery is planned, is malignant. The test is for adjunctive use, in the context of a negative primary clinical and radiological evaluation, to augment the identification of patients whose gynecologic surgery... § 866.1(e). (c) Black box warning. Under section 520(e) of the Federal Food, Drug, and Cosmetic Act...
The Impact of Inclusion and Resource Instruction on Standardized Test Scores of Special Education Students

ERIC Educational Resources Information Center

Derico, Vontrice L.

2017-01-01

The purpose of the proposed quasi-experimental quantitative study was to determine if students who were taught in the inclusive setting yielded higher standardized test scores compared to students who were taught in the resource setting. The researcher analyzed the standardized test scores, in the areas of Language Arts, Reading, and Mathematics…
STABILITY OF ACADEMIC APTITUDE AND READING TEST SCORES OF MOBILE AND NON-MOBILE DISADVANTAGED CHILDREN.

ERIC Educational Resources Information Center

JUSTMAN, JOSEPH

CHANGES IN ACADEMIC APTITUDE AND ACHIEVEMENT TEST SCORES OF PUPILS ATTENDING PUBLIC SCHOOLS IN DISADVANTAGED AREAS IN NEW YORK CITY WERE INVESTIGATED. AN ATTEMPT WAS MADE TO DETERMINE WHETHER VARYING DEGREES OF MOBILITY WERE ASSOCIATED WITH VARIATION IN CHANGES IN TEST SCORES. THE CUMULATIVE RECORD CARDS OF SIXTH-GRADE PUPILS WERE EXAMINED TO…
Kindergarten Black-White Test Score Gaps: Replicating and Updating Previous Findings with New National Data

ERIC Educational Resources Information Center

Quinn, David

2014-01-01

A substantial body of evidence has shown large academic test score gaps between black and white students in early childhood. These gaps remain, and probably grow, as students progress through school. Many researchers have sought to explain these persistent test score gaps, and particularly, to understand the role of students' socio-economic status…
The Influence of an NCLB Accountability Plan on the Distribution of Student Test Score Gains

ERIC Educational Resources Information Center

Springer, Matthew G.

2008-01-01

Previous research on the effect of accountability programs on the distribution of student test score gains is decidedly mixed. This study examines the issue by estimating an educational production function in which test score gains are a function of the incentives schools have to focus instruction on below-proficient students. NCLB's threat of…
Test and Score Data Summary for TOEFL[R] Internet-Based and Paper-Based Tests. January 2008-December 2008 Test Data

ERIC Educational Resources Information Center

Educational Testing Service, 2008

2008-01-01

The Test of English as a Foreign Language[TM], better known as TOEFL[R], is designed to measure the English-language proficiency of people whose native language is not English. TOEFL scores are accepted by more than 6,000 colleges, universities, and licensing agencies in 130 countries. The test is also used by governments, and scholarship and…
Use of Standardized Test Scores to Predict Success in a Computer Applications Course

ERIC Educational Resources Information Center

Harris, Robert V.; King, Stephanie B.

2016-01-01

The purpose of this study was to see if a relationship existed between American College Testing (ACT) scores (i.e., English, reading, mathematics, science reasoning, and composite) and student success in a computer applications course at a Mississippi community college. The study showed that while the ACT scores were excellent predictors of…
A Comparison of the Approaches of Generalizability Theory and Item Response Theory in Estimating the Reliability of Test Scores for Testlet-Composed Tests

ERIC Educational Resources Information Center

Lee, Guemin; Park, In-Yong

2012-01-01

Previous assessments of the reliability of test scores for testlet-composed tests have indicated that item-based estimation methods overestimate reliability. This study was designed to address issues related to the extent to which item-based estimation methods overestimate the reliability of test scores composed of testlets and to compare several…
Evaluation of heparin-induced thrombocytopenia (HIT) laboratory testing and the 4Ts scoring system in the intensive care unit.

PubMed

Pierce, Wesly; Mazur, Joseph; Greenberg, Charles; Mueller, Joan; Foster, Joyce; Lazarchick, John

2013-01-01

Over-diagnosis of heparin-induced thrombocytopenia (HIT) results in costly and unnecessary laboratory screening and treatment with direct thrombin inhibitors. Our aim was to evaluate the utility of the 4Ts scoring system to predict HIT in multiple ICU settings and to characterize our treatment of these cases. Eighty-two patients from multiple ICU settings who underwent laboratory testing for HIT were classified as low-, intermediate-, or high-risk patients based on retrospectively adjudicated 4Ts scores. These results were compared with platelet-factor 4 enzyme-linked immunosorbent assays (PF4 ELISAs), optical density (OD) values, and serotonin-release assays (SRAs) to assess the utility of the 4Ts score to rule out ICU-related HIT and reduce laboratory and drug expenditures. Of the 82 patients reviewed, only 12 (11.4%) were PF4-positive and only 1 (1.2%) was SRA-positive for HIT. Heparin was discontinued in only 63.4% of patients suspected to have HIT. There were no significant differences in mean day of platelet fall, mean platelet nadir, and mean percent fall in platelet count between PF4-positive and negative patients (all p > 0.2). There was, however, a significantly higher proportion of patients with an intermediate to high 4Ts score in the PF4-positive group than in the PF4-negative group (66% vs. 30%, respectively; p = 0.02). The mean PF4 OD value in patients with intermediate to high 4Ts scores was significantly higher than in patients with low 4Ts scores (0.658 vs. 0.258, respectively; p < 0.001). The negative predictive values of the 4Ts score relative to the PF4 and SRA were 92% and 100%, respectively. The estimated laboratory and pharmacologic cost avoidance potential of the scoring system in this cohort was $21,450. Our modified 4Ts scoring system appears to be an effective tool for predicting HIT in the ICU and could avoid significant drug and laboratory expenditures if implemented prospectively. The clinical management of patients suspected of HIT
How Changes in Families and Schools Are Related to Trends in Black-White Test Scores

ERIC Educational Resources Information Center

Berends, Mark; Lucas, Samuel R.; Penaloza, Roberto V.

2008-01-01

Through several decades of research, a great deal has been written about trends in black-white test scores and the factors that may explain the gaps in different subject areas. Only a few studies have examined the changing relationships between gaps in students' test scores and family and school measures in nationally representative data over…
Clock Drawing Test and the diagnosis of amnestic mild cognitive impairment: can more detailed scoring systems do the work?

PubMed

Rubínová, Eva; Nikolai, Tomáš; Marková, Hana; Siffelová, Kamila; Laczó, Jan; Hort, Jakub; Vyhnálek, Martin

2014-01-01

The Clock Drawing Test is a frequently used cognitive screening test with several scoring systems in elderly populations. We compare simple and complex scoring systems and evaluate the usefulness of the combination of the Clock Drawing Test with the Mini-Mental State Examination to detect patients with mild cognitive impairment. Patients with amnestic mild cognitive impairment (n = 48) and age- and education-matched controls (n = 48) underwent neuropsychological examinations, including the Clock Drawing Test and the Mini-Mental State Examination. Clock drawings were scored by three blinded raters using one simple (6-point scale) and two complex (17- and 18-point scales) systems. The sensitivity and specificity of these scoring systems used alone and in combination with the Mini-Mental State Examination were determined. Complex scoring systems, but not the simple scoring system, were significant predictors of the amnestic mild cognitive impairment diagnosis in logistic regression analysis. At equal levels of sensitivity (87.5%), the Mini-Mental State Examination showed higher specificity (31.3%, compared with 12.5% for the 17-point Clock Drawing Test scoring scale). The combination of Clock Drawing Test and Mini-Mental State Examination scores increased the area under the curve (0.72; p < .001) and increased specificity (43.8%), but did not increase sensitivity, which remained high (85.4%). A simple 6-point scoring system for the Clock Drawing Test did not differentiate between healthy elderly and patients with amnestic mild cognitive impairment in our sample. Complex scoring systems were slightly more efficient, yet still were characterized by high rates of false-positive results. We found psychometric improvement using combined scores from the Mini-Mental State Examination and the Clock Drawing Test when complex scoring systems were used. The results of this study support the benefit of using combined scores from simple methods.
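For readers less familiar with the two accuracy measures compared in this abstract, here is a minimal sketch of how sensitivity and specificity follow from classification counts. The counts are not reported in the abstract; they are back-calculated from the stated percentages and the group sizes (48 patients, 48 controls) and should be read as an assumption.

```python
def sensitivity(true_pos, false_neg):
    """Share of true cases (patients) that the test flags as positive."""
    return true_pos / (true_pos + false_neg)

def specificity(true_neg, false_pos):
    """Share of non-cases (controls) that the test flags as negative."""
    return true_neg / (true_neg + false_pos)

# Assumed counts, inferred from the reported percentages with n = 48 per group.
mmse_sens = sensitivity(42, 6)    # 42 of 48 patients detected
mmse_spec = specificity(15, 33)   # 15 of 48 controls correctly cleared
cdt17_spec = specificity(6, 42)   # 6 of 48 controls correctly cleared
```

Under these assumed counts the values match the reported 87.5% sensitivity and the 31.3% versus 12.5% specificities, which makes the high false-positive rate of the 17-point scale explicit.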

The Relationship between Academic Averages of Primary School Science and Technology Class and Test Sub-Test Scores of Placement Test of Science

ERIC Educational Resources Information Center

Guzeller, Cem Oktay

2012-01-01

This research examined the relationship between written exam scores of science and technology class of 6th, 7th, and 8th grades, project, participation in class activities and performance work, year-end academic success point averages, and sub-test raw scores of LDT science of 6th, 7th and 8th grades. Academic success point averages were used as…
Racial Differences in Mathematics Test Scores for Advanced Mathematics Students

ERIC Educational Resources Information Center

Minor, Elizabeth Covay

2016-01-01

Research on achievement gaps has found that achievement gaps are larger for students who take advanced mathematics courses compared to students who do not. Focusing on the advanced mathematics student achievement gap, this study found that African American advanced mathematics students have significantly lower test scores and are less likely to be…
Commentary on "Validating the Interpretations and Uses of Test Scores"

ERIC Educational Resources Information Center

Brennan, Robert L.

2013-01-01

Kane's paper "Validating the Interpretations and Uses of Test Scores" is the most complete and clearest discussion yet available of the argument-based approach to validation. At its most basic level, validation as formulated by Kane is fundamentally a simply-stated two-step enterprise: (1) specify the claims inherent in a particular interpretation…
Using Test Scores from Students with Disabilities in Teacher Evaluation

ERIC Educational Resources Information Center

Buzick, Heather M.; Jones, Nathan D.

2015-01-01

Much of the recent focus of educational policymakers has been on improving the measurement of teacher effectiveness. Linking student growth to teacher effects has been a large part of reform efforts. To date, neither researchers nor practitioners have arrived at a consensus on how to treat test scores from students with disabilities in…
Piloting a Polychotomous Partial-Credit Scoring Procedure in a Multiple-Choice Test

ERIC Educational Resources Information Center

Tsopanoglou, Antonios; Ypsilandis, George S.; Mouti, Anna

2014-01-01

Multiple-choice (MC) tests are frequently used to measure language competence because they are quick, economical and straightforward to score. While degrees of correctness have been investigated for partially correct responses in combined-response MC tests, degrees of incorrectness in distractors and the role they play in determining the…
What's in a Teacher Test? Assessing the Relationship between Teacher Licensure Test Scores and Student STEM Achievement and Course-Taking. Working Paper 158

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the relationship between teacher licensure test scores and student test achievement and high school course-taking. We focus on three subject/grade combinations--middle school math, ninth-grade algebra and geometry, and ninth-grade biology--and find evidence that a teacher's basic skills test scores are modestly predictive of student…
The Bender Gestalt Test with the Human Figure Drawing Test for Young School Children. A Manual for Use with the Koppitz Scoring System.

ERIC Educational Resources Information Center

Koppitz, Elizabeth Munsterberg

Presented is a manual for scoring the Bender Gestalt Test and the Human Figure Drawing Test for screening and diagnostic uses with emotionally disturbed, brain damaged, or perceptually handicapped 5- to 11-year-old children. Given are suggestions for administering and scoring the Bender test which examines distortion of shape, rotation,…
A pretest prognostic score to assess patients undergoing exercise or pharmacological stress testing.

PubMed

Morise, Anthony; Evans, Matthew; Jalisi, Farrukh; Shetty, Rajendra; Stauffer, Marc

2007-02-01

A previously developed pretest score was validated to stratify patients presenting for exercise testing with suspected coronary disease according to the presence of angiographic coronary disease. Our goal was to determine how well this pretest score risk stratified patients undergoing pharmacological and exercise stress tests concerning prognostic endpoints. Retrospective cohort analysis. University hospital stress laboratory. 7452 unselected ambulatory patients with symptoms of suspected coronary disease undergoing stress testing between 1995 and 2004. All-cause death, cardiac death and non-fatal myocardial infarction. The rate of all-cause death was 5.5% (CI 5.0 to 6.1) with 4.3 (SD 2.4) years of follow-up (Exercise 2.8% (CI 2.3 to 3.2) v Pharmacological group 11.9% (CI 10.5 to 13.3); p<0.001). The rate of cardiac death/myocardial infarction was 2.6% (CI 2.2 to 3.0) (Exercise 1.4% (CI 1.1 to 1.8) v Pharmacological group 5.3% (CI 4.3 to 6.2); p<0.001). In both groups, stratification by pretest score was significant for all-cause death and the combined endpoint. However, stratification was more effective in the pharmacological group using the combined endpoint rather than all-cause death. Pharmacological stress patients in intermediate and high risk groups were at higher risk than their respective exercise test cohorts. Referral for pharmacological stress testing was found to be an independent predictor of time to death (2.7 (CI 2.0 to 3.6); p<0.001). A pretest score previously validated to stratify according to angiographic outcomes, effectively risk stratified pharmacological and exercise stress patients according to the combined endpoint of cardiac death/myocardial infarction.
TOEFL iBT Speaking Test Scores as Indicators of Oral Communicative Language Proficiency

ERIC Educational Resources Information Center

Bridgeman, Brent; Powers, Donald; Stone, Elizabeth; Mollaun, Pamela

2012-01-01

Scores assigned by trained raters and by an automated scoring system (SpeechRater[TM]) on the speaking section of the TOEFL iBT[TM] were validated against a communicative competence criterion. Specifically, a sample of 555 undergraduate students listened to speech samples from 184 examinees who took the Test of English as a Foreign Language…
Association between the gait pattern characteristics of older people and their two-step test scores.

PubMed

Kobayashi, Yoshiyuki; Ogata, Toru

2018-04-27

The Two-Step test is one of three official tests authorized by the Japanese Orthopedic Association to evaluate the risk of locomotive syndrome (a condition of reduced mobility caused by an impairment of the locomotive organs). It has been reported that the Two-Step test score has a good correlation with one's walking ability; however, its association with the gait pattern of older people during normal walking is still unknown. Therefore, this study aims to clarify the associations between the gait patterns of older people observed during normal walking and their Two-Step test scores. We analyzed the whole waveforms obtained from the lower-extremity joint angles and joint moments of 26 older people in various stages of locomotive syndrome using principal component analysis (PCA). The PCA was conducted using a 260 × 2424 input matrix constructed from the participants' time-normalized pelvic and right-lower-limb-joint angles along three axes (ten trials of 26 participants, 101 time points, 4 angles, 3 axes, and 2 variable types per trial). The Pearson product-moment correlation coefficient between the scores of the principal component vectors (PCVs) and the scores of the Two-Step test revealed that only one PCV (PCV 2) among the 61 obtained relevant PCVs is significantly related to the score of the Two-Step test. We therefore concluded that the joint angles and joint moments related to PCV 2 (ankle plantar-flexion, ankle plantar-flexor moments during the late stance phase, and ranges of motion and moments on the hip, knee, and ankle joints in the sagittal plane during the entire stance phase) are the motions associated with the Two-Step test.
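The matrix dimensions quoted in this abstract can be checked with simple bookkeeping, and the correlation step can be sketched in a few lines. The code below is an illustrative sketch only: the variable names and the `pearson_r` helper are assumptions, the sample data are made up, and the PCA itself is not reproduced.

```python
from math import sqrt

# Dimensions as stated in the abstract.
participants, trials = 26, 10
time_points, angles, axes, variable_types = 101, 4, 3, 2

rows = participants * trials                            # one row per trial
cols = time_points * angles * axes * variable_types     # one column per waveform sample

def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length sequences."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

# Made-up example: component scores that rise exactly in step with test scores.
example_r = pearson_r([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
```

The products reproduce the 260 × 2424 shape given in the abstract; in the study, a correlation of this kind would be computed between each principal component score and the Two-Step test score.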
Mixed handedness and achievement test scores of middle school boys.

PubMed

Sarma, P S B

2008-10-01

The purpose of the study was to replicate findings of an earlier study of fourth grade boys manifesting mixed handedness with a sample of middle school boys. Among 32 mixed-handed boys in Grades 6 to 8, the right-handed writer, left-handed thrower group obtained low spelling scores (Normal Curve Equivalent Scores) on the California Achievement Test significantly more frequently than the left-handed writer, right-handed thrower group. These findings are consistent with data for Grade 4 boys in the earlier study. Findings strengthen the hypotheses that mixed handedness is not a unitary neuropsychological entity and that boys who write with the right hand and throw with the left hand might be at risk for certain academic deficits.
Validity and reliability of Abbreviated Mental Test Score (AMTS) among older Iranian.

PubMed

Foroughan, Mahshid; Wahlund, Lars-Olof; Jafari, Zahra; Rahgozar, Mehdi; Farahani, Ida G; Rashedi, Vahid

2017-11-01

Cognitive impairment is common among older people and is associated with increased morbidity and mortality. The main aim of this study was to evaluate the validity of the Persian version of the Abbreviated Mental Test Score (AMTS) as a screening tool for dementia. Data were obtained from a cross-sectional study. One hundred and one older adults who were members of Iranian Alzheimer Association and 101 of their siblings were entered into this study by convenience sampling. The Diagnostic and Statistical Manual of Mental Disorders, 4th edition, criteria for diagnosing dementia and the Mini-Mental State Examination were used as the study tools. The gathered data were analyzed by the Mann-Whitney U-test, the Kruskal-Wallis test, Spearman's rank correlation coefficient, and the receiver-operating characteristic. The AMTS could successfully differentiate the dementia group from the non-dementia group. Scores were significantly correlated with Diagnostic and Statistical Manual of Mental Disorders diagnosis for dementia and Mini-Mental State Examination scores (P < 0.001). Educational level (P < 0.001) and male sex (P = 0.015) were positively associated with AMTS, whereas (P < 0.001) was negatively associated with AMTS. Total Cronbach's α coefficient was 0.90. The scores 6 and 7 showed the optimum balance between sensitivity (99% and 94%, respectively) and specificity (85% and 86%, respectively). The Persian version of the AMTS is a valid cognitive assessment tool for older Iranian adults and can be used for dementia screening in Iran. © 2017 Japanese Psychogeriatric Society.
A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

PubMed

Bersabé, Rosa; Rivas, Teresa

2010-05-01

The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.
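The equal-likelihood condition described in this abstract can be sketched from the standard baseline-category form of the MLR model, in which the log-odds of category j against the reference category are a_j + b_j·x. Setting the probabilities of categories j and j+1 equal gives a_j + b_j·x = a_{j+1} + b_{j+1}·x, so x = (a_j − a_{j+1}) / (b_{j+1} − b_j). This is a sketch under that assumption and may differ in notation from the authors' published equation; the function name and all coefficients below are hypothetical.

```python
def cutoff(a_j, b_j, a_next, b_next):
    """Test score at which categories j and j+1 are equally likely.

    Assumes a baseline-category multinomial logit with one predictor x,
    where log(P_j / P_ref) = a_j + b_j * x.
    """
    if b_next == b_j:
        raise ValueError("equal slopes: the two categories never cross")
    return (a_j - a_next) / (b_next - b_j)

# Hypothetical coefficients (not taken from the study), three ordinal categories:
# the reference category has intercept 0 and slope 0 by construction.
asymptomatic = (0.0, 0.0)
symptomatic = (-4.0, 0.25)
eating_disorder = (-10.0, 0.5)

cut_1 = cutoff(*asymptomatic, *symptomatic)       # asymptomatic vs. symptomatic
cut_2 = cutoff(*symptomatic, *eating_disorder)    # symptomatic vs. eating disorder
```

With these made-up coefficients the two cut-offs fall at scores of 16 and 24, giving three score bands for the three ordinal categories.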
School accountability and the black-white test score gap.

PubMed

Gaddis, S Michael; Lauen, Douglas Lee

2014-03-01

Since at least the 1960s, researchers have closely examined the respective roles of families, neighborhoods, and schools in producing the black-white achievement gap. Although many researchers minimize the ability of schools to eliminate achievement gaps, the No Child Left Behind Act (NCLB) increased pressure on schools to do so by 2014. In this study, we examine the effects of NCLB's subgroup-specific accountability pressure on changes in black-white math and reading test score gaps using a school-level panel dataset on all North Carolina public elementary and middle schools between 2001 and 2009. Using difference-in-difference models with school fixed effects, we find that accountability pressure reduces black-white achievement gaps by raising mean black achievement without harming mean white achievement. We find no differential effects of accountability pressure based on the racial composition of schools, but schools with more affluent populations are the most successful at reducing the black-white math achievement gap. Thus, our findings suggest that school-based interventions have the potential to close test score gaps, but differences in school composition and resources play a significant role in the ability of schools to reduce racial inequality. Copyright © 2013 Elsevier Inc. All rights reserved.
Report: States See Test-Score Gains

ERIC Educational Resources Information Center

Viadero, Debra

2004-01-01

This article discusses a report from Education Trust, a Washington-based research and advocacy group. The report says almost half the states have seen rising math scores on their state exams for elementary school pupils since the federal No Child Left Behind law was enacted. It also states that reading scores have improved among 4th and 5th…
School Choice in Suburbia: Test Scores, Race, and Housing Markets

ERIC Educational Resources Information Center

Dougherty, Jack; Harelson, Jeffrey; Maloney, Laura; Murphy, Drew; Smith, Russell; Snow, Michael; Zannoni, Diane

2009-01-01

Home buyers exercise school choice when shopping for a private residence due to its location in a public school district or attendance area. In this quantitative study of one Connecticut suburban district, we measure the effect of elementary school test scores and racial composition on home buyers' willingness to purchase single-family homes over…
The Effect of Mobility on Texas Assessment of Knowledge and Skills Test Scores

ERIC Educational Resources Information Center

Alvarez, Ray

2006-01-01

This research studies the effects of mobility on the high-stakes test scores of a Title I South Central Texas school district. The study involved 10, 5th-grade elementary feeder school populations graduating to the 6th grade in 3 middle schools. The researcher compared the 1st administration scores of the Texas Assessment of Knowledge and Skills…
Effects of correcting for prematurity on cognitive test scores in childhood.

PubMed

Wilson-Ching, Michelle; Pascoe, Leona; Doyle, Lex W; Anderson, Peter J

2014-03-01

The American Academy of Pediatrics recommends that test scores should be corrected for prematurity up to 3 years of age, but this practice varies greatly in both clinical and research settings. The aim of this study was to contrast the effects of using chronological age and those of using corrected age on measures of cognitive outcome across childhood. A theoretical model was constructed using norms from the Bayley Scales of Infant and Toddler Development, Third Edition; the Wechsler Preschool and Primary Scale of Intelligence, Third Edition Australian; and the Wechsler Intelligence Scales for Children, Fourth Edition Australian. Baseline scores representing different levels of functioning (70, below average; 85, borderline; and 100, average) were recalculated using the normative data for ages 6 months to 16 years to account for 1, 2, 3 and 4 months of prematurity. The model created depicted the difference in standardised scores between chronological and corrected age. Compared with scores corrected for prematurity, the absolute reduction in scores using chronological age was greater for increasing degree of prematurity, younger ages at assessment and higher baseline scores and was substantial even beyond 3 years of age. However, the pattern was erratic, with considerable fluctuation evident across different ages and baseline scores. Chronological age results in a lowering of scores at all ages for preterm-born subjects that is greater in the first few years and in those born at earlier gestational ages. Whether or not to correct for prematurity depends upon the context of the assessment. © 2014 The Authors. Journal of Paediatrics and Child Health © 2014 Paediatrics and Child Health Division (Royal Australasian College of Physicians).
How Parents Can Help Kids Improve Test Scores: Taking the Stakes out of Literacy Testing

ERIC Educational Resources Information Center

Schneider, Steven

2006-01-01

In order to meet the goals of No Child Left Behind, standardized testing is preeminent as the sole indicator determining whether states all across America demonstrate adequate yearly progress regarding the improvement of student achievement in literacy education. This book will help teachers and parents raise children's scores on standardized…
The Effects of Group Members' Personalities on a Test Taker's L2 Group Oral Discussion Test Scores

ERIC Educational Resources Information Center

Ockey, Gary J.

2009-01-01

The second language group oral is a test of second language speaking proficiency, in which a group of three or more English language learners discuss an assigned topic without interaction with interlocutors. Concerns expressed about the extent to which test takers' personal characteristics affect the scores of others in the group have limited its…

Noncognitive Skills and the Gender Disparities in Test Scores and Teacher Assessments: Evidence from Primary School

ERIC Educational Resources Information Center

Cornwell, Christopher; Mustard, David B.; Van Parys, Jessica

2013-01-01

Using data from the 1998-99 ECLS-K cohort, we show that the grades awarded by teachers are not aligned with test scores. Girls in every racial category outperform boys on reading tests, while boys score at least as well on math and science tests as girls. However, boys in all racial categories across all subject areas are not represented in…
Web-based training and interrater reliability testing for scoring the Hamilton Depression Rating Scale.

PubMed

Rosen, Jules; Mulsant, Benoit H; Marino, Patricia; Groening, Christopher; Young, Robert C; Fox, Debra

2008-10-30

Despite the importance of establishing shared scoring conventions and assessing interrater reliability in clinical trials in psychiatry, these elements are often overlooked. Obstacles to rater training and reliability testing include logistic difficulties in providing live training sessions, or mailing videotapes of patients to multiple sites and collecting the data for analysis. To address some of these obstacles, a web-based interactive video system was developed. It uses actors of diverse ages, gender and race to train raters how to score the Hamilton Depression Rating Scale and to assess interrater reliability. This system was tested with a group of experienced and novice raters within a single site. It was subsequently used to train raters of a federally funded multi-center clinical trial on scoring conventions and to test their interrater reliability. The advantages and limitations of using interactive video technology to improve the quality of clinical trials are discussed.
Use of Enzyme Tests in Characterization and Identification of Aerobic and Facultatively Anaerobic Gram-Positive Cocci

PubMed Central

Bascomb, Shoshana; Manafi, Mammad

1998-01-01

The contribution of enzyme tests to the accurate and rapid routine identification of gram-positive cocci is introduced. The current taxonomy of the genera of aerobic and facultatively anaerobic cocci based on genotypic and phenotypic characterization is reviewed. The clinical and economic importance of members of these taxa is briefly summarized. Tables summarizing test schemes and kits available for the identification of staphylococci, enterococci, and streptococci on the basis of general requirements, number of tests, number of taxa, test classes, and completion times are discussed. Enzyme tests included in each scheme are compared on the basis of their synthetic moiety. The current understanding of the activity of enzymes important for classification and identification of the major groups, methods of testing, and relevance to the ease and speed of identification are reviewed. Publications describing the use of different identification kits are listed, and overall identification successes and problems are discussed. The relationships between the results of conventional biochemical and rapid enzyme tests are described and considered. The use of synthetic substrates for the detection of glycosidases and peptidases is reviewed, and the advantages of fluorogenic synthetic moieties are discussed. The relevance of enzyme tests to accurate and meaningful rapid routine identification is discussed. PMID:9564566
Opportunity to learn: Investigating possible predictors for pre-course Test Of Astronomy STandards TOAST scores

NASA Astrophysics Data System (ADS)

Berryhill, Katie J.

As astronomy education researchers become more interested in experimentally testing innovative teaching strategies to enhance learning in introductory astronomy survey courses ("ASTRO 101"), scholars are placing increased attention toward better understanding factors impacting student gain scores on the widely used Test Of Astronomy STandards (TOAST). Usually used in a pre-test and post-test research design, one might naturally assume that the pre-course differences observed between high- and low-scoring college students might be due in large part to their pre-existing motivation, interest, experience in science, and attitudes about astronomy. To explore this notion, 11 non-science majoring undergraduates taking ASTRO 101 at west coast community colleges were interviewed in the first few weeks of the course to better understand students' pre-existing affect toward learning astronomy with an eye toward predicting student success. In answering this question, we hope to contribute to our understanding of the incoming knowledge of students taking undergraduate introductory astronomy classes, but also gain insight into how faculty can best meet those students' needs and assist them in achieving success. Perhaps surprisingly, there was only weak correlation between students' motivation toward learning astronomy and their pre-test scores. Instead, the most fruitful predictor of TOAST pre-test scores was the quantity of pre-existing, informal, self-directed astronomy learning experiences.
Individual Differences in Digit Span, Susceptibility to Proactive Interference, and Aptitude/Achievement Test Scores.

ERIC Educational Resources Information Center

Dempster, Frank N.; Cooney, John B.

1982-01-01

Individual differences in digit span, susceptibility to proactive interference, and various aptitude/achievement test scores were investigated in two experiments with college students. Results indicated that digit span was strongly correlated with aptitude/achievement scores, but did not indicate that susceptibility to proactive interference…
A pretest prognostic score to assess patients undergoing exercise or pharmacological stress testing

PubMed Central

Morise, Anthony; Evans, Matthew; Jalisi, Farrukh; Shetty, Rajendra; Stauffer, Marc

2007-01-01

Objective A previously developed pretest score was validated to stratify patients presenting for exercise testing with suspected coronary disease according to the presence of angiographic coronary disease. Our goal was to determine how well this pretest score risk stratified patients undergoing pharmacological and exercise stress tests concerning prognostic endpoints. Design Retrospective cohort analysis. Setting University hospital stress laboratory. Patients 7452 unselected ambulatory patients with symptoms of suspected coronary disease undergoing stress testing between 1995 and 2004. Main outcomes measures All‐cause death, cardiac death and non‐fatal myocardial infarction. Results The rate of all‐cause death was 5.5% (CI 5.0 to 6.1) with 4.3 (SD 2.4) years of follow‐up (Exercise 2.8% (CI 2.3 to 3.2) v Pharmacological group 11.9% (CI 10.5 to 13.3); p<0.001). The rate of cardiac death/myocardial infarction was 2.6% (CI 2.2 to 3.0) (Exercise 1.4% (CI 1.1 to 1.8) v Pharmacological group 5.3% (CI 4.3 to 6.2); p<0.001). In both groups, stratification by pretest score was significant for all‐cause death and the combined endpoint. However, stratification was more effective in the pharmacological group using the combined endpoint rather than all‐cause death. Pharmacological stress patients in intermediate and high risk groups were at higher risk than their respective exercise test cohorts. Referral for pharmacological stress testing was found to be an independent predictor of time to death (2.7 (CI 2.0 to 3.6); p<0.001). Conclusion A pretest score previously validated to stratify according to angiographic outcomes, effectively risk stratified pharmacological and exercise stress patients according to the combined endpoint of cardiac death/myocardial infarction. PMID:17228070
Construction of an Exome-Wide Risk Score for Schizophrenia Based on a Weighted Burden Test.

PubMed

Curtis, David

2018-01-01

Polygenic risk scores obtained as a weighted sum of associated variants can be used to explore association in additional data sets and to assign risk scores to individuals. The methods used to derive polygenic risk scores from common SNPs are not suitable for variants detected in whole exome sequencing studies. Rare variants, which may have major effects, are seen too infrequently to judge whether they are associated and may not be shared between training and test subjects. A method is proposed whereby variants are weighted according to their frequency, their annotations and the genes they affect. A weighted sum across all variants provides an individual risk score. Scores constructed in this way are used in a weighted burden test and are shown to be significantly different between schizophrenia cases and controls using a five-way cross-validation procedure. This approach represents a first attempt to summarise exome sequence variation into a summary risk score, which could be combined with risk scores from common variants and from environmental factors. It is hoped that the method could be developed further. © 2017 John Wiley & Sons Ltd/University College London.
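The weighted sum described in this abstract can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration, not the author's published method: the linear frequency weighting, the multiplication of the three weight components, and all names (`Variant`, `frequency_weight`, `risk_score`) are hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, Sequence


@dataclass
class Variant:
    gene: str                 # gene the variant affects
    allele_frequency: float   # frequency of the variant allele
    annotation_weight: float  # hypothetical weight derived from annotation


def frequency_weight(allele_frequency: float, rare_weight: float = 10.0) -> float:
    """Hypothetical scheme: weight falls linearly from rare_weight at
    frequency 0 to 1.0 at frequency 0.5, so rarer variants count more."""
    af = min(max(allele_frequency, 0.0), 0.5)
    return rare_weight - (rare_weight - 1.0) * (af / 0.5)


def risk_score(variants: Sequence[Variant],
               allele_counts: Sequence[int],
               gene_weights: Dict[str, float]) -> float:
    """Weighted sum across all variants carried by one subject.
    Combining the three weights by multiplication is an assumption."""
    total = 0.0
    for variant, count in zip(variants, allele_counts):
        weight = (frequency_weight(variant.allele_frequency)
                  * variant.annotation_weight
                  * gene_weights.get(variant.gene, 1.0))
        total += weight * count
    return total


# Hypothetical subject carrying one rare and two copies of a common variant.
variants = [Variant("GENE_A", 0.0, 2.0), Variant("GENE_B", 0.5, 1.0)]
score = risk_score(variants, [1, 2], {"GENE_A": 1.5})
```

Per-subject scores of this kind could then be compared between cases and controls, which is the role the abstract gives to the weighted burden test.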
Can Machine Scoring Deal with Broad and Open Writing Tests as Well as Human Readers?

ERIC Educational Resources Information Center

McCurry, Doug

2010-01-01

This article considers the claim that machine scoring of writing test responses agrees with human readers as much as humans agree with other humans. These claims about the reliability of machine scoring of writing are usually based on specific and constrained writing tasks, and there is reason for asking whether machine scoring of writing requires…
Pediatric residents' learning styles and temperaments and their relationships to standardized test scores.

PubMed

Tuli, Sanjeev Y; Thompson, Lindsay A; Saliba, Heidi; Black, Erik W; Ryan, Kathleen A; Kelly, Maria N; Novak, Maureen; Mellott, Jane; Tuli, Sonal S

2011-12-01

Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board examinations in pediatric residents.
Spinal appearance questionnaire: factor analysis, scoring, reliability, and validity testing.

PubMed

Carreon, Leah Y; Sanders, James O; Polly, David W; Sucato, Daniel J; Parent, Stefan; Roy-Beaudry, Marjolaine; Hopkins, Jeffrey; McClung, Anna; Bratcher, Kelly R; Diamond, Beverly E

2011-08-15

Cross sectional. This study presents the factor analysis of the Spinal Appearance Questionnaire (SAQ) and its psychometric properties. Although the SAQ has been administered to a large sample of patients with adolescent idiopathic scoliosis (AIS) treated surgically, its psychometric properties have not been fully evaluated. This study presents the factor analysis and scoring of the SAQ and evaluates its psychometric properties. The SAQ and the Scoliosis Research Society-22 (SRS-22) were administered to AIS patients who were being observed, braced or scheduled for surgery. Standard demographic data and radiographic measures including Lenke type and curve magnitude were also collected. Of the 1802 patients, 83% were female; with a mean age of 14.8 years and mean initial Cobb angle of 55.8° (range, 0°-123°). From the 32 items of the SAQ, 15 loaded on two factors with consistent and significant correlations across all Lenke types. There is an Appearance (items 1-10) and an Expectations factor (items 12-15). Responses are summed giving a range of 5 to 50 for the Appearance domain and 5 to 20 for the Expectations domain. The Cronbach's α was 0.88 for both domains and Total score with a test-retest reliability of 0.81 for Appearance and 0.91 for Expectations. Correlations with major curve magnitude were higher for the SAQ Appearance and SAQ Total scores compared to correlations between the SRS Appearance and SRS Total scores. The SAQ and SRS-22 Scores were statistically significantly different in patients who were scheduled for surgery compared to those who were observed or braced. The SAQ is a valid measure of self-image in patients with AIS with greater correlation to curve magnitude than SRS Appearance and Total score. It also discriminates between patients who require surgery from those who do not.
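The internal-consistency statistic reported in this abstract, Cronbach's α, can be computed from a respondents-by-items table with the standard formula α = k/(k-1) · (1 - Σ item variances / variance of total scores). The sketch below uses hypothetical responses, not SAQ data, and the function name `cronbach_alpha` is chosen here for illustration.

```python
from statistics import variance
from typing import Sequence


def cronbach_alpha(responses: Sequence[Sequence[float]]) -> float:
    """Cronbach's alpha for a table with one row per respondent
    and one column per item (sample variances throughout)."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    item_columns = list(zip(*responses))
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1.0 - item_variance_sum / total_variance)


# Hypothetical responses: 4 respondents x 3 items on a 1-5 scale.
example = [[1, 2, 1], [2, 2, 3], [4, 3, 4], [5, 5, 4]]
alpha = cronbach_alpha(example)
```

Values closer to 1 indicate that the items vary together, which is what the reported 0.88 suggests for the two SAQ domains.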
Effect of Item Arrangement, Knowledge of Arrangement, and Test Anxiety on Two Scoring Methods.

ERIC Educational Resources Information Center

Plake, Barbara S.; And Others

1981-01-01

Number right and elimination scores were analyzed on a college level mathematics exam assembled from pretest data. Anxiety measures were administered along with the experimental forms to undergraduates. Results suggest that neither test scores nor attitudes are influenced by item order, knowledge thereof, or anxiety level. (Author/GK)
ACER Mathematics Profile Series: Number Test. (Test Booklet, Answer and Record Sheet, Score Key, and Teachers Handbook).

ERIC Educational Resources Information Center

Cornish, Greg; Wines, Robin

The Number Test of the ACER Mathematics Profile Series contains 30 items for each of three suggested grade levels: 7-8, 8-9, and 9-10. Raw scores on all tests in the ACER Mathematics Profile Series (Number, Operations, Space and Measurement) are converted to a common scale called MAPS, a major feature of the Series. Based on the Rasch Model,…
Linking Scores from Tests of Similar Content Given in Different Languages: An Illustration Involving Methodological Alternatives

ERIC Educational Resources Information Center

Cascallar, Alicia S.; Dorans, Neil J.

2005-01-01

This study compares two methods commonly used (concordance and prediction) to establish linkages between scores from tests of similar content given in different languages. Score linkages between the Verbal and Math sections of the SAT I and the corresponding sections of the Spanish-language admissions test, the Prueba de Aptitud Academica (PAA),…
The Alcohol Use Disorders Identification Scale (AUDIT) normative scores for a multiracial sample of Rhodes University residence students.

PubMed

Young, Charles; Mayson, Tamara

2010-06-01

The objective of this research is to obtain accurate drinking norms for students living in the university residences in preparation for future social norms interventions that would allow individual students to compare their drinking to an appropriate reference group. Random cluster sampling was used to obtain data from 318 residence students who completed the Alcohol Use Disorders Identification Test (AUDIT), a brief, reliable and valid screening measure designed by the World Health Organisation (Babor et al. 2001). The Cronbach alpha coefficient of 0.83 reported for this multicultural sample is high, suggesting that the AUDIT may be reliably used in this and similar contexts. Normative scores are reported in the form of percentiles. Comparisons between the portions of students drinking safely and hazardously according to race and gender indicate that while male students are drinking no more hazardously than female students, white students drink far more hazardously than black students. These differences suggest that both race- and gender-specific norms would be essential for an effective social norms intervention in this multicultural South African context. Finally, the racialised drinking patterns might reflect an informal segregation of social space at Rhodes University.
Identification of the Quality Spot Welding used Non Destructive Test-Ultrasonic Testing: (Effect of Welding Time)

NASA Astrophysics Data System (ADS)

Sifa, A.; Endramawan, T.; Badruzzaman

2017-03-01

Resistance Spot Welding (RSW) is frequently used in manufacturing processes, especially in the automotive industry [4][5][6][7]. Several parameters influence the spot welding process. Determining the quality of a weld requires testing, either destructive or non-destructive. This experimental study identified the quality of the weld nugget using Non-Destructive Test (NDT) Ultrasonic Testing (UT). Weld quality was identified from the thickness of the worksheet after welding, measured by NDT-UT on worksheets of the same material with increased thickness; a single worksheet plate is 1 mm thick, whereas the standard propagation capability of UT is limited to > 3 mm [1]. The welding process parameters were a welding time varied between 1 and 10 s and a welding current of 8 KV. Visually, the Heat Affected Zone (HAZ) differs according to the length of the welding time. The UT probe had a frequency of 4 MHz, a diameter of 10 mm and a range of 100, and oil was used as the couplant. Identification used the 6 dB drop technique with a sound velocity of 2267 m/s for Fe. The results show that welding time affects the size of the HAZ, and that even with the shortest welding time of 1 s the joint could be identified as joined by NDT-UT.
Effect of vowel context on test-retest nasalance score variability in children with and without cleft palate.

PubMed

Ha, Seunghee; Jung, Seungeun; Koh, Kyung S

2018-06-01

The purpose of this study was to determine whether test-retest nasalance score variability differs between Korean children with and without cleft palate (CP) and vowel context influences variability in nasalance score. Thirty-four 3-to-5-year-old children with and without CP participated in the study. Three 8-syllable speech stimuli devoid of nasal consonants were used for data collection. Each stimulus was loaded with high, low, or mixed vowels, respectively. All participants were asked to repeat the speech stimuli twice after the examiner, and an immediate test-retest nasalance score was assessed with no headgear change. Children with CP exhibited significantly greater absolute difference in nasalance scores than children without CP. Variability in nasalance scores was significantly different for the vowel context, and the high vowel sentence showed a significantly larger difference in nasalance scores than the low vowel sentence. The cumulative frequencies indicated that, for children with CP in the high vowel sentence, only 8 of 17 (47%) repeated nasalance scores were within 5 points. Test-retest nasalance score variability was greater for children with CP than children without CP, and there was greater variability for the high vowel sentence(s) for both groups. Copyright © 2018 Elsevier B.V. All rights reserved.
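The variability measure in this abstract, the absolute difference between two immediately repeated nasalance scores and the share of repeats falling within 5 points, can be sketched as follows. The score pairs are hypothetical, the function name is chosen here for illustration, and treating "within 5 points" as inclusive of exactly 5 is an assumption.

```python
from typing import Sequence, Tuple


def within_threshold_rate(pairs: Sequence[Tuple[float, float]],
                          threshold: float = 5.0) -> Tuple[int, int, float]:
    """Count test-retest pairs whose absolute difference is at most
    `threshold` points; returns (count within, total pairs, proportion)."""
    diffs = [abs(first - second) for first, second in pairs]
    within = sum(1 for d in diffs if d <= threshold)
    return within, len(diffs), within / len(diffs)


# Hypothetical (test, retest) nasalance scores for three children.
result = within_threshold_rate([(10, 12), (20, 30), (15, 20)])

# The proportion reported in the abstract: 8 of 17 repeats within 5 points.
reported_percent = round(100 * 8 / 17)
```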
Scoring Method of a Situational Judgment Test: Influence on Internal Consistency Reliability, Adverse Impact and Correlation with Personality?

ERIC Educational Resources Information Center

De Leng, W. E.; Stegers-Jager, K. M.; Husbands, A.; Dowell, J. S.; Born, M. Ph.; Themmen, A. P.

2017-01-01

Situational Judgment Tests (SJTs) are increasingly used for medical school selection. Scoring an SJT is more complicated than scoring a knowledge test, because there are no objectively correct answers. The scoring method of an SJT may influence the construct and concurrent validity and the adverse impact with respect to non-traditional students.…
Decision making under internal uncertainty: the case of multiple-choice tests with different scoring rules.

PubMed

Bereby-Meyer, Yoella; Meyer, Joachim; Budescu, David V

2003-02-01

This paper assesses framing effects on decision making with internal uncertainty, i.e., partial knowledge, by focusing on examinees' behavior in multiple-choice (MC) tests with different scoring rules. In two experiments participants answered a general-knowledge MC test that consisted of 34 solvable and 6 unsolvable items. Experiment 1 studied two scoring rules involving Positive (only gains) and Negative (only losses) scores. Although answering all items was the dominating strategy for both rules, the results revealed a greater tendency to answer under the Negative scoring rule. These results are in line with the predictions derived from Prospect Theory (PT) [Econometrica 47 (1979) 263]. The second experiment studied two scoring rules, which allowed respondents to exhibit partial knowledge. Under the Inclusion-scoring rule the respondents mark all answers that could be correct, and under the Exclusion-scoring rule they exclude all answers that might be incorrect. As predicted by PT, respondents took more risks under the Inclusion rule than under the Exclusion rule. The results illustrate that the basic process that underlies choice behavior under internal uncertainty and especially the effect of framing is similar to the process of choice under external uncertainty and can be described quite accurately by PT. Copyright 2002 Elsevier Science B.V.
Effects of Scoring by Section and Independent Scorers' Patterns on Scorer Reliability in Biology Essay Tests

ERIC Educational Resources Information Center

Ebuoh, Casmir N.; Ezeudu, S. A.

2015-01-01

The study investigated the effects of scoring by section, use of independent scorers and conventional patterns on scorer reliability in Biology essay tests. The literature review revealed that the conventional pattern of scoring all items at a time in essay tests had been criticized for not being reliable. The study was a true experimental study…
An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

ERIC Educational Resources Information Center

Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

2013-01-01

Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

Pediatric Residents' Learning Styles and Temperaments and Their Relationships to Standardized Test Scores

PubMed Central

Tuli, Sanjeev Y.; Thompson, Lindsay A.; Saliba, Heidi; Black, Erik W.; Ryan, Kathleen A.; Kelly, Maria N.; Novak, Maureen; Mellott, Jane; Tuli, Sonal S.

2011-01-01

Background Board certification is an important professional qualification and a prerequisite for credentialing, and the Accreditation Council for Graduate Medical Education (ACGME) assesses board certification rates as a component of residency program effectiveness. To date, research has shown that preresidency measures, including National Board of Medical Examiners scores, Alpha Omega Alpha Honor Medical Society membership, or medical school grades poorly predict postresidency board examination scores. However, learning styles and temperament have been identified as factors that affect test-taking performance. The purpose of this study is to characterize the learning styles and temperaments of pediatric residents and to evaluate their relationships to yearly in-service and postresidency board examination scores. Methods This cross-sectional study analyzed the learning styles and temperaments of current and past pediatric residents by administration of 3 validated tools: the Kolb Learning Style Inventory, the Keirsey Temperament Sorter, and the Felder-Silverman Learning Style test. These results were compared with known, normative, general and medical population data and evaluated for correlation to in-service examination and postresidency board examination scores. Results The predominant learning style for pediatric residents was converging 44% (33 of 75 residents) and the predominant temperament was guardian 61% (34 of 56 residents). The learning style and temperament distribution of the residents was significantly different from published population data (P = .002 and .04, respectively). Learning styles, with one exception, were found to be unrelated to standardized test scores. Conclusions The predominant learning style and temperament of pediatric residents is significantly different than that of the populations of general and medical trainees. However, learning styles and temperament do not predict outcomes on standardized in-service and board
Fine-Tuning Cross-Battery Assessment Procedures: After Follow-Up Testing, Use All Valid Scores, Cohesive or Not

ERIC Educational Resources Information Center

Schneider, W. Joel; Roman, Zachary

2018-01-01

We used data simulations to test whether composites consisting of cohesive subtest scores are more accurate than composites consisting of divergent subtest scores. We demonstrate that when multivariate normality holds, divergent and cohesive scores are equally accurate. Furthermore, excluding divergent scores results in biased estimates of…
How Should Colleges Treat Multiple Admissions Test Scores? ACT Working Paper 2017-4

ERIC Educational Resources Information Center

Mattern, Krista; Radunzel, Justine; Bertling, Maria; Ho, Andrew

2017-01-01

The percentage of students retaking college admissions tests is rising (Harmston & Crouse, 2016). Researchers and college admissions offices currently use a variety of methods for summarizing these multiple scores. Testing companies, interested in validity evidence like correlations with college first-year grade-point averages (FYGPA), often…
EXPLORATION OF SCORE AGREEMENT ON A MODIFIED UPPER QUARTER Y-BALANCE TEST KIT AS COMPARED TO THE UPPER QUARTER Y-BALANCE TEST.

PubMed

Cramer, Josh; Quintero, Miguel; Rhinehart, Alex; Rutherford, Caitlin; Nasypany, Alan; May, James; Baker, Russell T

2017-02-01

Physical performance measures (PPMs) such as The Star Excursion Balance Test (SEBT) and the Y-Balance Test (YBT) are functional movement tests used to assess participants' dynamic balance, which can be a vital component in physical exams to identify predisposing factors for risk of injury. The YBT is a functional assessment tool for the upper and lower body. It evolved from the SEBT, which has been previously used in research as a lower body functional assessment. It is comprised of fewer movement directions, which help limit fatigue. The YBT kit is a commercialized tool, which may pose barriers for clinicians with limited budgets and/or strict approval process for purchasing capital items in their clinics, especially healthcare providers in the secondary school setting. The cost may also pose a barrier for researchers with limited budgets. A less expensive, easy-to-make kit may provide clinicians an opportunity to integrate functional testing into their evaluation or research. The purpose of this pilot study was to describe a cost efficient method to gather participants' upper quarter YBT (UQYBT) measurements and examine the inter- and intra-rater score agreement between this method and the commercial YBT measurements. A convenience sample of 20 physically active participants volunteered to participate in a comparison study of the Upper Quarter Y-Balance Test (UQYBT) using the commercialized kit and the Modified Upper Quarter Y-Balance Test kit (mUQYBT) made with three cloth tape measures, athletic tape, a goniometer and three 2x4x8 wood blocks. A Pearson Product Moment correlation and Bland-Altman analyses were used to examine the relationship between intra-rater scores comparing the UQYBT and mUQYBT. Inter-rater scores were analyzed using intraclass correlation coefficients (ICC) (2,1) and Bland-Altman analyses. All Pearson Product Moment r-values for intra-rater scores were greater than .96 and statistically significant at p<0.05. Coefficients of
[Evaluation of a rapid trehalase test for the identification of Candida glabrata].

PubMed

Kirdar, Sevin; Gültekin, Berna; Evcil, Gonca; Ozkütük, Aydan; Sener, Asli Gamze; Aydin, Neriman

2009-04-01

Candida species, which cause local infections, may also lead to fatal systemic infections. The increasing incidence of non-albicans Candida, especially fluconazole-resistant or susceptible-dose-dependent C. glabrata, has increased the importance of rapid and accurate species-level identification of Candida. Rapid and correct identification of C. glabrata is essential for the initiation of the appropriate antifungal therapy. This study was conducted to evaluate the performance of the rapid trehalase test in the diagnosis of C. glabrata isolates. A total of 173 Candida strains isolated from various clinical specimens and identified according to germ tube test, growth on cornmeal Tween 80 agar and the colony morphologies on Mast-CHROMagar Candida medium (Mast Diagnostics, UK) were included in the study. The identification of non-albicans Candida species was also confirmed by the API 20CAUX (BioMerieux, France) system. Accordingly, 86 (50%) of the isolates were identified as C. glabrata, 48 (28%) C. albicans, 17 (10%) C. krusei, 13 (8%) C. tropicalis, 5 (3%) C. parapsilosis, 3 (2%) C. kefyr and 1 (1%) C. utilis. In order to detect the presence of trehalase enzyme in Candida strains, all isolates were grown on Sabouraud dextrose agar containing 4% glucose and then one yeast colony was emulsified in 50 µl of citrate buffer containing 4% (wt/vol) trehalose for 3 h at 37°C. The presence of glucose, which emerged after the action of trehalase on trehalose, was detected by a commercial "urinary glucose detection dipstick" (Spinreacta, Spain). All C. glabrata strains yielded a positive result by the trehalase test. All non-C. glabrata isolates were negative by the trehalase test except for one strain of C. tropicalis. In this study, the trehalase test allowed identification of C. glabrata with 100% sensitivity and 98.9% specificity. It was concluded that the trehalase test is a rapid, cost-effective and simple test that can be used for the accurate identification of C. glabrata.
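The sensitivity and specificity reported in this abstract can be reproduced from the counts it gives. The sketch below assumes the 2x2 table implied by the text: all 86 C. glabrata isolates tested positive, and of the 87 other isolates (48 + 17 + 13 + 5 + 3 + 1) one C. tropicalis strain tested positive. The function name is chosen here for illustration.

```python
from typing import Tuple


def sensitivity_specificity(true_pos: int, false_neg: int,
                            true_neg: int, false_pos: int) -> Tuple[float, float]:
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    return sensitivity, specificity


# Counts inferred from the abstract (an assumption about the 2x2 table).
non_glabrata = 48 + 17 + 13 + 5 + 3 + 1   # 87 isolates of other species
sens, spec = sensitivity_specificity(true_pos=86, false_neg=0,
                                     true_neg=non_glabrata - 1, false_pos=1)
sens_percent = round(100 * sens, 1)
spec_percent = round(100 * spec, 1)
```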
Evaluation of the Biotyper MALDI-TOF MS system for identification of Staphylococcus species.

PubMed

Zhu, Wenming; Sieradzki, Krzysztof; Albrecht, Valerie; McAllister, Sigrid; Lin, Wen; Stuchlik, Olga; Limbago, Brandi; Pohl, Jan; Kamile Rasheed, J

2015-10-01

The Bruker Biotyper MALDI-TOF MS (Biotyper) system, with a modified 30 minute formic acid extraction method, was evaluated by its ability to identify 216 clinical Staphylococcus isolates from the CDC reference collection comprising 23 species previously identified by conventional biochemical tests. 16S rDNA sequence analysis was used to resolve discrepancies. Of these, 209 (96.8%) isolates were correctly identified: 177 (84.7%) isolates had scores ≥2.0, while 32 (15.3%) had scores between 1.70 and 1.99. The Biotyper identification was inconsistent with the biochemical identification for seven (3.2%) isolates, but the Biotyper identifications were confirmed by 16S rDNA analysis. The distribution of low scores was strongly species-dependent, e.g. only 5% of Staphylococcus epidermidis and 4.8% of Staphylococcus aureus isolates scored below 2.0, while 100% of Staphylococcus cohnii, 75% of Staphylococcus sciuri, and 60% of Staphylococcus caprae produced low but accurate Biotyper scores. Our results demonstrate that the Biotyper can reliably identify Staphylococcus species with greater accuracy than conventional biochemicals. Broadening of the reference database by inclusion of additional examples of under-represented species could further optimize Biotyper results. Published by Elsevier B.V.
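The percentages in this abstract use two different denominators, which a short calculation makes explicit: correct and inconsistent identifications are expressed relative to all 216 isolates, while the two score bands are expressed relative to the 209 correctly identified isolates. The helper name `percent` is chosen here for illustration.

```python
def percent(part: int, whole: int) -> float:
    """Percentage of `whole` represented by `part`, rounded to one decimal."""
    return round(100 * part / whole, 1)


total_isolates = 216
correct = 209
high_score = 177   # Biotyper scores >= 2.0
low_score = 32     # Biotyper scores between 1.70 and 1.99
inconsistent = 7   # disagreed with the biochemical identification

correct_pct = percent(correct, total_isolates)
inconsistent_pct = percent(inconsistent, total_isolates)
high_pct = percent(high_score, correct)   # denominator is 209, not 216
low_pct = percent(low_score, correct)
```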
Student Neighborhoods, Schools, and Test Score Growth: Evidence from Milwaukee, Wisconsin

ERIC Educational Resources Information Center

Carlson, Deven; Cowen, Joshua M.

2015-01-01

Schools and neighborhoods are thought to be two of the most important contextual influences on student academic outcomes. Drawing on a unique data set that permits simultaneous estimation of neighborhood and school contributions to student test score gains, we analyze the distributions of these contributions to consider the relative importance of…
Teachers' Perceptions and Expectations and the Black-White Test Score Gap.

ERIC Educational Resources Information Center

Ferguson, Ronald F.

2003-01-01

Evaluates how schools can positively affect the test score gap between black and white students by examining two potential sources for this difference: teachers and students. Offers evidence for the proposition that teachers' perceptions, expectations, and behaviors interact with students' beliefs, behaviors, and work habits in ways that help to…
The Effect of Stakes on Accountability Test Scores and Pass Rates

ERIC Educational Resources Information Center

Steedle, Jeffrey T.; Grochowalski, Joseph

2017-01-01

Students may not fully demonstrate their knowledge and skills on accountability tests if there are no stakes attached to individual performance. In that case, assessment results may not accurately reflect student achievement, so the validity of score interpretations and uses suffers. For this study, matched samples of students taking state…
Effects of Analytical and Holistic Scoring Patterns on Scorer Reliability in Biology Essay Tests

ERIC Educational Resources Information Center

Ebuoh, Casmir N.

2018-01-01

The literature revealed that the patterns/methods of scoring essay tests had been criticized for not being reliable, and this unreliability is likely to be greater in internal examinations than in external examinations. The purpose of this study is to find out the effects of analytical and holistic scoring patterns on scorer reliability in…
A comparative overview of modal testing and system identification for control of structures

NASA Technical Reports Server (NTRS)

Juang, J.-N.; Pappa, R. S.

1988-01-01

A comparative overview is presented of the disciplines of modal testing used in structural engineering and system identification used in control theory. A list of representative references from both areas is given, and the basic methods are described briefly. Recent progress on the interaction of modal testing and control disciplines is discussed. It is concluded that combined efforts of researchers in both disciplines are required for unification of modal testing and system identification methods for control of flexible structures.
Impact of a standardized test package on exit examination scores and NCLEX-RN outcomes.

PubMed

Homard, Catherine M

2013-03-01

The purpose of this ex post facto correlational study was to compare exit examination scores and NCLEX-RN® pass rates of baccalaureate nursing students who differed in level of participation in a standardized test package. Three cohort groups emerged as a standardized test package was introduced: (a) students who did not participate in a standardized test package; (b) students with two semesters of a standardized test package; and (c) students with four semesters of a standardized test package. Benner's novice-to-expert theory framed the study in the belief that students best acquire knowledge and skills through practice and reflection. Students participating in four semesters of a standardized test package demonstrated higher exit examination scores and NCLEX-RN pass rates compared with students who did not participate in this package. This study's results could inform nurse educators about strategies to facilitate nursing student success on exit examinations and the NCLEX-RN. Copyright 2013, SLACK Incorporated.
Accuracy of four commonly used color vision tests in the identification of cone disorders.

PubMed

Thiadens, Alberta A H J; Hoyng, Carel B; Polling, Jan Roelof; Bernaerts-Biskop, Riet; van den Born, L Ingeborgh; Klaver, Caroline C W

2013-04-01

To determine which color vision test is most appropriate for the identification of cone disorders. In a clinic-based study, four commonly used color vision tests were compared between patients with cone dystrophy (n = 37), controls with normal visual acuity (n = 35), and controls with low vision (n = 39) and legal blindness (n = 11). Mean outcome measures were specificity, sensitivity, positive predictive value and discriminative accuracy of the Ishihara test, Hardy-Rand-Rittler (HRR) test, and the Lanthony and Farnsworth Panel D-15 tests. In the comparison between cone dystrophy and all controls, sensitivity, specificity and predictive value were highest for the HRR and Ishihara tests. When patients were compared to controls with normal vision, discriminative accuracy was highest for the HRR test (c-statistic for PD-axes 1, for T-axis 0.851). When compared to controls with poor vision, discriminative accuracy was again highest for the HRR test (c-statistic for PD-axes 0.900, for T-axis 0.766), followed by the Lanthony Panel D-15 test (c-statistic for PD-axes 0.880, for T-axis 0.500) and Ishihara test (c-statistic 0.886). Discriminative accuracies of all tests did not further decrease when patients were compared to controls who were legally blind. The HRR, Lanthony Panel D-15 and Ishihara all have a high discriminative accuracy to identify cone disorders, but the highest scores were for the HRR test. Poor visual acuity slightly decreased the accuracy of all tests. Our advice is to use the HRR test since this test also allows for evaluation of all three color axes and quantification of color defects.
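The accuracy measures named in this abstract can be illustrated with a minimal Python sketch. The counts and scores used here are invented for illustration and are not the study's data, and the function names are hypothetical. The c-statistic is computed as the fraction of patient/control pairs in which the patient has the higher error score, with ties counting one half, which is the usual rank-based reading of the area under the ROC curve.

```python
def sensitivity(tp, fn):
    """Share of true patients that the test flags as abnormal."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Share of controls that the test correctly passes."""
    return tn / (tn + fp)


def positive_predictive_value(tp, fp):
    """Share of abnormal test results that belong to true patients."""
    return tp / (tp + fp)


def c_statistic(patient_scores, control_scores):
    """Probability that a random patient scores higher than a random control.

    Ties count as one half. A value of 1.0 means perfect separation,
    0.5 means the test discriminates no better than chance.
    """
    wins = 0.0
    for p in patient_scores:
        for c in control_scores:
            if p > c:
                wins += 1.0
            elif p == c:
                wins += 0.5
    return wins / (len(patient_scores) * len(control_scores))


if __name__ == "__main__":
    # Hypothetical 2x2 counts: 37 patients and 35 controls (invented split).
    tp, fn, tn, fp = 33, 4, 30, 5
    print(f"sensitivity = {sensitivity(tp, fn):.3f}")
    print(f"specificity = {specificity(tn, fp):.3f}")
    print(f"PPV         = {positive_predictive_value(tp, fp):.3f}")
    # Hypothetical error scores on one color axis.
    print(f"c-statistic = {c_statistic([6, 8, 9], [0, 1, 6]):.3f}")
```

A c-statistic of 1, as reported for the HRR protan/deutan axes against controls with normal vision, corresponds to every patient scoring worse than every control in this pairwise sense.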
Physical Function Does Not Predict Care Assessment Need Score in Older Veterans.

PubMed

Serra, Monica C; Addison, Odessa; Giffuni, Jamie; Paden, Lydia; Morey, Miriam C; Katzel, Leslie

2017-01-01

The Veterans Health Administration's Care Assessment Need (CAN) score is a statistical model intended to identify high-risk patients. We were interested in determining if a relationship existed between physical function and CAN scores. Seventy-four older (71 ± 1 years) male Veterans underwent assessment of CAN score and subjective (Short Form-36 [SF-36]) and objective (self-selected walking speed, four square step test, short physical performance battery) assessment of physical function. Approximately 25% of participants self-reported limitations performing lower intensity activities, while 70% to 90% reported limitations with more strenuous activities. When compared with cut points indicative of functional limitations, 35% to 65% of participants had limitations for each of the objective measures. No measure of subjective or objective physical function predicted CAN score. These data indicate that the addition of a physical function assessment may complement the CAN score in the identification of high-risk patients.
[Relationship between unipedal stance test score and center of pressure velocity in elderly].

PubMed

Rodrigo Antonio, Guzmán; Rony, Silvestre; Francisco Aniceto, Rodríguez; David Andrés, Arriagada; Pablo Andrés, Ortega

2011-01-01

Frequent falls are one of the most important health problems in the elderly population. The unipedal stance test (UPST) assesses postural stability and is used in fall risk measures. Despite this, there is little information about its relationship with the posturographic parameters (PP) that characterize postural stability. Center of pressure velocity (CoPV) is one of the best PP for describing postural stability. The aim of this study was to analyze the relation between UPST score and CoPV in an elderly population. A sample of 38 healthy elderly subjects was divided into two groups according to their UPST score: low performance (LP, n=11) and high performance (HP, n=27). The correlation between UPST score and COP mean velocity (CoPmV), recorded from a posturographic test, was analyzed in both groups. An inverse correlation between UPST score and CoPmV was found in both groups. However, this was stronger in the LP group (r=-0.69, P=.02) than in the HP group (r=-0.39, P=.04). Based on the results of this investigation, it may be concluded that the achievement on UPST has an inverse relationship with CoPmV, especially in subjects with low performance in the UPST. Copyright © 2010 SEGG. Published by Elsevier Espana. All rights reserved.
Linkage analysis in nuclear families. 2: Relationship between affected sib-pair tests and lod score analysis.

PubMed

Knapp, M; Seuchter, S A; Baur, M P

1994-01-01

It is believed that the main advantage of affected sib-pair tests is that their application requires no information about the underlying genetic mechanism of the disease. However, here it is proved that the mean test, which can be considered the most prominent of the affected sib-pair tests, is equivalent to lod score analysis for an assumed recessive mode of inheritance, irrespective of the true mode of the disease. Further relationships of certain sib-pair tests and lod score analysis under specific assumed genetic modes are investigated.
Consonant and Vowel Identification in Cochlear Implant Users Measured by Nonsense Words: A Systematic Review and Meta-Analysis.

PubMed

Rødvik, Arne Kirkhorn; von Koss Torkildsen, Janne; Wie, Ona Bø; Storaker, Marit Aarvaag; Silvola, Juha Tapio

2018-04-17

The purpose of this systematic review and meta-analysis was to establish a baseline of the vowel and consonant identification scores in prelingually and postlingually deaf users of multichannel cochlear implants (CIs) tested with consonant-vowel-consonant and vowel-consonant-vowel nonsense syllables. Six electronic databases were searched for peer-reviewed articles reporting consonant and vowel identification scores in CI users measured by nonsense words. Relevant studies were independently assessed and screened by 2 reviewers. Consonant and vowel identification scores were presented in forest plots and compared between studies in a meta-analysis. Forty-seven articles with 50 studies, including 647 participants, thereof 581 postlingually deaf and 66 prelingually deaf, met the inclusion criteria of this study. The mean performance on vowel identification tasks for the postlingually deaf CI users was 76.8% (N = 5), which was higher than the mean performance for the prelingually deaf CI users (67.7%; N = 1). The mean performance on consonant identification tasks for the postlingually deaf CI users was higher (58.4%; N = 44) than for the prelingually deaf CI users (46.7%; N = 6). The most common consonant confusions were found between those with same manner of articulation (/k/ as /t/, /m/ as /n/, and /p/ as /t/). The mean performance on consonant identification tasks for the prelingually and postlingually deaf CI users was found. There were no statistically significant differences between the scores for prelingually and postlingually deaf CI users. The consonants that were incorrectly identified were typically confused with other consonants with the same acoustic properties, namely, voicing, duration, nasality, and silent gaps. A univariate metaregression model, although not statistically significant, indicated that duration of implant use in postlingually deaf adults predicts a substantial portion of their consonant identification ability. As there is no ceiling
Association of Health Sciences Reasoning Test scores with academic and experiential performance.

PubMed

Cox, Wendy C; McLaughlin, Jacqueline E

2014-05-15

To assess the association of scores on the Health Sciences Reasoning Test (HSRT) with academic and experiential performance in a doctor of pharmacy (PharmD) curriculum. The HSRT was administered to 329 first-year (P1) PharmD students. Performance on the HSRT and its subscales was compared with academic performance in 29 courses throughout the curriculum and with performance in advanced pharmacy practice experiences (APPEs). Significant positive correlations were found between course grades in 8 courses and HSRT overall scores. All significant correlations were accounted for by pharmaceutical care laboratory courses, therapeutics courses, and a law and ethics course. There was a lack of moderate to strong correlation between HSRT scores and academic and experiential performance. The usefulness of the HSRT as a tool for predicting student success may be limited.
Consistency of SAT® I: Reasoning Test Score Conversions. Research Report. ETS RR-08-67

ERIC Educational Resources Information Center

Haberman, Shelby J.; Guo, Hongwen; Liu, Jinghua; Dorans, Neil J.

2008-01-01

This study uses historical data to explore the consistency of SAT® I: Reasoning Test score conversions and to examine trends in scaled score means. During the period from April 1995 to December 2003, both Verbal (V) and Math (M) means display substantial seasonality, and a slight increasing trend for both is observed. SAT Math means increase more…
Do Standardized Tests Penalize Deep-Thinking, Creative, or Conscientious Students?: Some Personality Correlates of Graduate Record Examinations Test Scores

ERIC Educational Resources Information Center

Powers, Donald E.; Kaufman, James C.

2004-01-01

The objective of the study reported here was to explore the relationship of Graduate Record Examinations (GRE) General Test scores to selected personality traits--conscientiousness, rationality, ingenuity, quickness, creativity, and depth. A sample of 342 GRE test takers completed short personality inventory scales for each trait. Analyses…

Demographically Adjusted Groups for Equating Test Scores. Research Report. ETS RR-14-30

ERIC Educational Resources Information Center

Livingston, Samuel A.

2014-01-01

In this study, I investigated 2 procedures intended to create test-taker groups of equal ability by poststratifying on a composite variable created from demographic information. In one procedure, the stratifying variable was the composite variable that best predicted the test score. In the other procedure, the stratifying variable was the…
Evaluating Gifted Identification Practice: Aptitude Testing and Linguistically Diverse Learners

ERIC Educational Resources Information Center

Matthews, Michael S.; Kirsch, Lauri

2011-01-01

The authors examined individually administered IQ scores from an entire K-5 population (N = 432) of Limited English Proficient students referred for gifted program eligibility determination in a single large urban district in the southeastern United States. Of 8 IQ tests compared, only 1, the Stanford-Binet V, had scores appreciably lower than…
The Phoneme Identification Test for Assessment of Spectral and Temporal Discrimination Skills in Children: Development, Normative Data, and Test-Retest Reliability Studies.

PubMed

Cameron, Sharon; Chong-White, Nicky; Mealings, Kiri; Beechey, Tim; Dillon, Harvey; Young, Taegan

2018-02-01

Previous research suggests that a proportion of children experiencing reading and listening difficulties may have an underlying primary deficit in the way that the central auditory nervous system analyses the perceptually important, rapidly varying, formant frequency components of speech. The Phoneme Identification Test (PIT) was developed to investigate the ability of children to use spectro-temporal cues to perceptually categorize speech sounds based on their rapidly changing formant frequencies. The PIT uses an adaptive two-alternative forced-choice procedure whereby the participant identifies a synthesized consonant-vowel (CV) (/ba/ or /da/) syllable. CV syllables differed only in the second formant (F2) frequency along an 11-step continuum (between 0% and 100%-representing an ideal /ba/ and /da/, respectively). The CV syllables were presented in either quiet (PIT Q) or noise at a 0 dB signal-to-noise ratio (PIT N). Development of the PIT stimuli and test protocols, and collection of normative and test-retest reliability data. Twelve adults (aged 23 yr 10 mo to 50 yr 9 mo, mean 32 yr 5 mo) and 137 typically developing, primary-school children (aged 6 yr 0 mo to 12 yr 4 mo, mean 9 yr 3 mo). There were 73 males and 76 females. Data were collected using a touchscreen computer. Psychometric functions were automatically fit to individual data by the PIT software. Performance was determined by the width of the continuum for which responses were neither clearly /ba/ nor /da/ (referred to as the uncertainty region [UR]). A shallower psychometric function slope reflected greater uncertainty. Age effects were determined based on raw scores. Z scores were calculated to account for the effect of age on performance. Outliers, and individual data for which the confidence interval of the UR exceeded a maximum allowable value, were removed. Nonparametric tests were used as the data were skewed toward negative performance. Across participants, the median value of the F2 range
CaPTHUS scoring model in primary hyperparathyroidism: can it eliminate the need for ioPTH testing?

PubMed

Elfenbein, Dawn M; Weber, Sara; Schneider, David F; Sippel, Rebecca S; Chen, Herbert

2015-04-01

The CaPTHUS model was reported to have a positive predictive value of 100 % to correctly predict single-gland disease in patients with primary hyperparathyroidism, thus obviating the need for intraoperative parathyroid hormone (ioPTH) testing. We sought to apply the CaPTHUS scoring model in our patient population and assess its utility in predicting long-term biochemical cure. We retrospectively reviewed all parathyroidectomies for primary hyperparathyroidism performed at our university hospital from 2003 to 2012. We routinely perform ioPTH testing. Biochemical cure was defined as a normal calcium level at 6 months. A total of 1,421 patients met the inclusion criteria: 78 % of patients had a single adenoma at the time of surgery, 98 % had a normal serum calcium at 1 week postoperatively, and 96 % had a normal serum calcium level 6 months postoperatively. Using the CaPTHUS scoring model, 307 patients (22.5 %) had a score of ≥ 3, with a positive predictive value of 91 % for single adenoma. A CaPTHUS score of ≥ 3 had a positive predictive value of 98 % for biochemical cure at 1 week as well as at 6 months. In our population, where ioPTH testing is used routinely to guide use of bilateral exploration, patients with a preoperative CaPTHUS score of ≥ 3 had good long-term biochemical cure rates. However, the model only predicted adenoma in 91 % of cases. If minimally invasive parathyroidectomy without ioPTH testing had been done for these patients, the cure rate would have dropped from 98 % to an unacceptable 89 %. Even in these patients with high CaPTHUS scores, multigland disease is present in almost 10 %, and ioPTH testing is necessary.
ADAMTS13 test and/or PLASMIC clinical score in management of acquired thrombotic thrombocytopenic purpura: a cost-effective analysis.

PubMed

Kim, Chong H; Simmons, Sierra C; Williams, Lance A; Staley, Elizabeth M; Zheng, X Long; Pham, Huy P

2017-11-01

The ADAMTS13 test distinguishes thrombotic thrombocytopenic purpura (TTP) from other thrombotic microangiopathies (TMAs). The PLASMIC score helps determine the pretest probability of ADAMTS13 deficiency. Due to inherent limitations of both tests, and potential adverse effects and cost of unnecessary treatments, we performed a cost-effectiveness analysis (CEA) investigating the benefits of incorporating an in-hospital ADAMTS13 test and/or PLASMIC score into our clinical practice. A CEA model was created to compare four scenarios for patients with TMAs, utilizing either an in-house or a send-out ADAMTS13 assay with or without prior risk stratification using PLASMIC scoring. Model variables, including probabilities and costs, were gathered from the medical literature, except for the ADAMTS13 send-out and in-house tests, which were obtained from our institutional data. If only the cost is considered, in-house ADAMTS13 test for patients with intermediate- to high-risk PLASMIC score is the least expensive option ($4,732/patient). If effectiveness is assessed as measured by the number of averted deaths, send-out ADAMTS13 test is the most effective. Considering the cost/effectiveness ratio, the in-house ADAMTS13 test in patients with intermediate- to high-risk PLASMIC score is the best option, followed by the in-house ADAMTS13 test without the PLASMIC score. In patients with clinical presentations of TMAs, having an in-hospital ADAMTS13 test to promptly establish the diagnosis of TTP appears to be cost-effective. Utilizing the PLASMIC score further increases the cost-effectiveness of the in-house ADAMTS13 test. Our findings indicate the benefit of having a rapid and reliable in-house ADAMTS13 test, especially in the tertiary medical center. © 2017 AABB.
Monitoring scale scores over time via quality control charts, model-based approaches, and time series techniques.

PubMed

Lee, Yi-Hsuan; von Davier, Alina A

2013-07-01

Maintaining a stable score scale over time is critical for all standardized educational assessments. Traditional quality control tools and approaches for assessing scale drift either require special equating designs, or may be too time-consuming to be considered on a regular basis with an operational test that has a short time window between an administration and its score reporting. Thus, the traditional methods are not sufficient to catch unusual testing outcomes in a timely manner. This paper presents a new approach for score monitoring and assessment of scale drift. It involves quality control charts, model-based approaches, and time series techniques to accommodate the following needs of monitoring scale scores: continuous monitoring, adjustment of customary variations, identification of abrupt shifts, and assessment of autocorrelation. Performance of the methodologies is evaluated using manipulated data based on real responses from 71 administrations of a large-scale high-stakes language assessment.
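One of the quality control tools named in this abstract, the control chart, can be sketched as follows. This is a minimal Shewhart-style illustration in Python and not the authors' procedure: the abstract does not say which chart types were used, the mean scale scores are invented, and the function name `shewhart_flags` and the 3-sigma limit are assumptions.

```python
from statistics import mean, stdev


def shewhart_flags(baseline, new_means, k=3.0):
    """Return indices of new administrations outside the control limits.

    baseline  : mean scale scores from administrations taken as in control
    new_means : mean scale scores from the administrations being monitored
    k         : width of the control band in baseline standard deviations
    """
    center = mean(baseline)
    sigma = stdev(baseline)  # sample SD of the baseline administration means
    lower, upper = center - k * sigma, center + k * sigma
    return [i for i, m in enumerate(new_means) if m < lower or m > upper]


if __name__ == "__main__":
    # Hypothetical mean scale scores per administration.
    baseline = [500, 502, 498, 501, 499]
    incoming = [500.5, 503, 506, 499]
    print(shewhart_flags(baseline, incoming))  # index 2 lies above the upper limit
```

A chart of this kind reacts to abrupt shifts but ignores seasonality and autocorrelation, which is presumably why the abstract pairs it with model-based and time series techniques.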
Changes of olfactory abilities in relation to age: odor identification in more than 1400 people aged 4 to 80 years.

PubMed

Sorokowska, A; Schriever, V A; Gudziol, V; Hummel, C; Hähner, A; Iannilli, E; Sinding, C; Aziz, M; Seo, H S; Negoias, S; Hummel, T

2015-08-01

The currently presented large dataset (n = 1,422) consists of results that have been assembled over the last 8 years at science fairs using the 16-item odor identification part of the "Sniffin' Sticks". In this context, the focus was on olfactory function in children; in addition before testing, we asked participants to rate their olfactory abilities and the patency of the nasal airways. We reinvestigated some simple questions, e.g., differences in olfactory odor identification abilities in relation to age, sex, self-ratings of olfactory function and nasal patency. Three major results evolved: first, consistent with previously published reports, we found that identification scores of the youngest and the oldest participants were lower than the scores obtained by people aged 20-60. Second, we observed an age-related increase in the olfactory abilities of children. Moreover, the self-assessed olfactory abilities were related to actual performance in the smell test, but only in adults, and self-assessed nasal patency was not related to the "Sniffin' Sticks" identification score.
The Effects of Using Selected Metacognitive Strategies on ACT Mathematics Sub-Test Scores

ERIC Educational Resources Information Center

LeMay, Jeffrey W.

2016-01-01

This quasi-experimental post-test only control group designed quantitative study examined whether or not members of an experimental group of participants who utilized two metacognitive strategy training regimens experienced a significant increase in their ACT mathematics sub-test scores compared to a group of students who did not utilize either of…
Interpreting the "g" Loadings of Intelligence Test Composite Scores in Light of Spearman's Law of Diminishing Returns

ERIC Educational Resources Information Center

Reynolds, Matthew R.

2013-01-01

The linear loadings of intelligence test composite scores on a general factor ("g") have been investigated recently in factor analytic studies. Spearman's law of diminishing returns (SLODR), however, implies that the "g" loadings of test scores likely decrease in magnitude as g increases, or they are nonlinear. The purpose of…
School Readiness and the Draw-a-Man Test: An Empirically Derived Alternative to Harris' Scoring System.

ERIC Educational Resources Information Center

Simner, Marvin L.

1985-01-01

An abbreviated scoring system for the Goodenough-Harris Draw-A-Man Test found that three items had the same overall potential for correctly identifying at-risk kindergarteners as more time-consuming scoring methods. (CL)
Economic impact of 21-gene recurrence score testing on early-stage breast cancer in Ireland.

PubMed

Smyth, Lillian; Watson, Geoff; Walsh, Elaine M; Kelly, Catherine M; Keane, Maccon; Kennedy, M John; Grogan, Liam; Hennessy, Bryan T; O'Reilly, Seamus; Coate, Linda E; O'Connor, Miriam; Quinn, Cecily; Verleger, Katharina; Schoeman, Olaf; O'Reilly, Susan; Walshe, Janice M

2015-10-01

The 21-gene test is a validated multi-gene diagnostic test that predicts chemotherapy (CT) benefit in oestrogen receptor positive (ER+), lymph node-negative (N0) breast cancer (BC) patients (pts). Ireland was the first public health care system to reimburse this test in Europe. Study objectives were to assess the impact of this test on decision-making and to analyse the economic impact of testing. Between October 2011 and February 2013, a national, retrospective, cross-sectional observational study of ER+, N0 BC pts tested with the 21-gene test was conducted. Surveyed breast medical oncologists provided the assumption for the decision impact analysis that grade (G) 1 pts would not have received CT before testing and G2/3 pts would have received CT before testing. Descriptive statistical analyses were performed. 592 pts were identified; low, intermediate and high recurrence scores were identified in 53, 36 and 10 % of pts, respectively. 384 (70 %) pts had G2, 129 (22 %) G3 and 76 (13 %) G1 tumours. Post testing, 345 pts (59 %) experienced a change in CT decision; 339 changed to hormone therapy alone and 6 were advised to receive CT. 172 (30 %) pts received CT, 12 (3.9 %) of pts with low scores, 108 (50.9 %) of intermediate risk and 50 (90.9 %) of pts with high risk scores. Net reduction in CT use was 58 % and net savings achieved were €793,565. Since public reimbursement, the introduction of the 21-gene test has resulted in a significant reduction in chemotherapy administration and cost savings for the Irish public healthcare system.
Association between the Medical College Admission Test scores and Alpha Omega Alpha Medical Honors Society membership.

PubMed

Gauer, Jacqueline L; Jackson, J Brooks

2017-01-01

Medical schools worldwide are faced with the challenge of selecting from among many qualified applicants. One factor that might help admissions committees identify future exceptional medical students is scores on standardized entrance exams. The purpose of this study was to determine the association between scores on the most commonly used standardized medical school entrance exam in the USA, the Medical College Admission Test (MCAT), and election to the US medical honors society, Alpha Omega Alpha (AOA). MCAT scores and AOA membership data were analyzed for all the students pursuing Doctor of Medicine degrees at the University of Minnesota Medical School and who graduated between 2012-2016 (n=1,309). An independent-samples t-test found a significant difference (t=6.132, p<0.001) in MCAT scores between those who were elected to AOA (n=179) and those who were not (n=1,130). On average, students who were elected to AOA had composite MCAT scores of 1.65 points higher than those who were not. Percentages of students elected to AOA gradually but inconsistently increased with MCAT score. No student who scored <27 on the MCAT was elected to AOA. Among students with MCAT scores at the 99th percentile or above (scores of ≥38), 13 of 48 (27.1%) were elected to AOA. Election to AOA during medical school was significantly associated with higher MCAT scores. Admissions committees should carefully consider the role of standardized entrance exam scores, in the context of a holistic review, when selecting for exceptional medical students.
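The independent-samples t-test mentioned in this abstract can be sketched in a few lines of Python. This is an illustration only: the abstract does not report group standard deviations or state whether equal variances were assumed, so the pooled-variance (Student) form, the function name `pooled_t` and the toy scores below are all assumptions.

```python
from math import sqrt
from statistics import mean, variance


def pooled_t(group_a, group_b):
    """Student's two-sample t statistic with pooled variance.

    Returns (t, degrees_of_freedom). Assumes both groups share a common
    variance, which is the classical independent-samples t-test.
    """
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)) / df
    standard_error = sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return (mean(group_a) - mean(group_b)) / standard_error, df


if __name__ == "__main__":
    # Invented composite scores, not the study's data.
    elected = [34, 36, 33, 37, 35]
    not_elected = [31, 33, 32, 34, 30]
    t, df = pooled_t(elected, not_elected)
    print(f"t = {t:.3f} on {df} degrees of freedom")
    # The reported proportion can be checked directly: 13 of 48 students.
    print(f"{13 / 48:.1%}")
```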
Association between the Medical College Admission Test scores and Alpha Omega Alpha Medical Honors Society membership

PubMed Central

Gauer, Jacqueline L; Jackson, J Brooks

2017-01-01

Introduction Medical schools worldwide are faced with the challenge of selecting from among many qualified applicants. One factor that might help admissions committees identify future exceptional medical students is scores on standardized entrance exams. The purpose of this study was to determine the association between scores on the most commonly used standardized medical school entrance exam in the USA, the Medical College Admission Test (MCAT), and election to the US medical honors society, Alpha Omega Alpha (AOA). Method MCAT scores and AOA membership data were analyzed for all the students pursuing Doctor of Medicine degrees at the University of Minnesota Medical School and who graduated between 2012–2016 (n=1,309). Results An independent-samples t-test found a significant difference (t=6.132, p<0.001) in MCAT scores between those who were elected to AOA (n=179) and those who were not (n=1,130). On average, students who were elected to AOA had composite MCAT scores of 1.65 points higher than those who were not. Percentages of students elected to AOA gradually but inconsistently increased with MCAT score. No student who scored <27 on the MCAT was elected to AOA. Among students with MCAT scores at the 99th percentile or above (scores of ≥38), 13 of 48 (27.1%) were elected to AOA. Discussion Election to AOA during medical school was significantly associated with higher MCAT scores. Admissions committees should carefully consider the role of standardized entrance exam scores, in the context of a holistic review, when selecting for exceptional medical students. PMID:28979178
Two for One: Using QAR to Increase Reading Comprehension and Improve Test Scores

ERIC Educational Resources Information Center

Green, Susan

2016-01-01

This teaching tip describes an intervention used in a third-grade classroom implemented to help students pass an end-of-grade reading comprehension test. Low scores on a practice end-of-grade comprehension test prompted a re-examination of classroom reading instruction and a plan for intervention. This teaching tip describes the phases implemented…
Estimating Teacher Effectiveness from Two-Year Changes in Students' Test Scores

ERIC Educational Resources Information Center

Leigh, Andrew

2010-01-01

Using a dataset covering over 10,000 Australian school teachers and over 90,000 pupils, I estimate how effective teachers are in raising students' test scores. Since the exams are biennial, it is necessary to take account of the teacher's work in the intervening year. Even adjusting for measurement error, the teacher fixed effects are widely…
Can Tracking Raise the Test Scores of High-Ability Minority Students?

ERIC Educational Resources Information Center

Card, David; Giuliano, Laura

2016-01-01

We evaluate a tracking program in a large urban district where schools with at least one gifted fourth grader create a separate "gifted/high achiever" classroom. Most seats are filled by non-gifted high achievers, ranked by previous-year test scores. We study the program's effects on the high achievers using (1) a rank-based regression…
Test-retest reliability and minimal detectable change scores for the timed "up & go" test, the six-minute walk test, and gait speed in people with Alzheimer disease.

PubMed

Ries, Julie D; Echternach, John L; Nof, Leah; Gagnon Blodgett, Michelle

2009-06-01

With the increasing incidence of Alzheimer disease (AD), determining the validity and reliability of outcome measures for people with this disease is necessary. The goals of this study were to assess test-retest reliability of data for the Timed "Up & Go" Test (TUG), the Six-Minute Walk Test (6MWT), and gait speed and to calculate minimal detectable change (MDC) scores for each outcome measure. Performance differences between groups with mild to moderate AD and moderately severe to severe AD (as determined by the Functional Assessment Staging [FAST] scale) were studied. This was a prospective, nonexperimental, descriptive methodological study. Background data collected for 51 people with AD included: use of an assistive device, Mini-Mental Status Examination scores, and FAST scale scores. Each participant engaged in 2 test sessions, separated by a 30- to 60-minute rest period, which included 2 TUG trials, 1 6MWT trial, and 2 gait speed trials using a computerized gait assessment system. A specific cuing protocol was followed to achieve optimal performance during test sessions. Test-retest reliability values for the TUG, the 6MWT, and gait speed were high for all participants together and for the mild to moderate AD and moderately severe to severe AD groups separately (intraclass correlation coefficients ≥ .973); however, individual variability of performance also was high. Calculated MDC scores at the 90% confidence interval were: TUG=4.09 seconds, 6MWT=33.5 m (110 ft), and gait speed=9.4 cm/s. The 2 groups were significantly different in performance of clinical tests, with the participants who were more cognitively impaired being more physically and functionally impaired. A single researcher for data collection limited sample numbers and prohibited blinding to dementia level. The TUG, the 6MWT, and gait speed are reliable outcome measures for use with people with AD, recognizing that individual variability of performance is high. Minimal detectable change
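The link between a reliability coefficient and a minimal detectable change score can be illustrated with the formula commonly used in the rehabilitation literature: SEM = SD × √(1 − ICC) and MDC90 = 1.645 × SEM × √2. The abstract does not state that this exact formula was applied, and the standard deviation below is invented, so the following Python sketch is an assumption-laden illustration rather than a reproduction of the reported values.

```python
from math import sqrt


def standard_error_of_measurement(sd, icc):
    """SEM from the between-subject SD and the test-retest ICC."""
    return sd * sqrt(1.0 - icc)


def minimal_detectable_change(sd, icc, z=1.645):
    """MDC at the confidence level implied by z (1.645 for 90%).

    The factor sqrt(2) accounts for measurement error being present in
    both the first and the second measurement.
    """
    return z * standard_error_of_measurement(sd, icc) * sqrt(2.0)


if __name__ == "__main__":
    # Hypothetical SD of 10 s for TUG times; ICC of .973 is the lower
    # bound quoted in the abstract.
    print(f"MDC90 = {minimal_detectable_change(10.0, 0.973):.2f} s")
```

The sketch shows why a very high ICC can still coexist with a sizeable MDC: when between-subject variability (SD) is large, even a small unreliable fraction translates into several seconds of measurement noise.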
Estimation and test for linkage between markers: a comparison of lod score and χ² test in a linkage study of maritime pine (Pinus pinaster Ait.).

PubMed

Gerber, S; Rodolphe, F

1994-06-01

The first step in the construction of a linkage map involves the estimation and test for linkage between all possible pairs of markers. The lod score method is used in many linkage studies for the latter purpose. In contrast with classical statistical tests, this method does not rely on the choice of a first-type error level. We thus provide a comparison between the lod score and a χ² test on linkage data from a gymnosperm, the maritime pine. The lod score appears to be a very conservative test with the usual thresholds. Its severity depends on the type of data used.
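The contrast drawn in this abstract can be made concrete with a minimal Python sketch for the simplest case: n informative, phase-known meioses of which r are recombinant. The data type analysed in the paper may differ, and the counts, function names and the conventional threshold of 3 used here are assumptions for illustration.

```python
from math import log10


def lod_score(n, r):
    """Maximum lod score for r recombinants among n phase-known meioses.

    lod = log10( L(theta_hat) / L(0.5) ), with theta_hat = r / n capped at 0.5.
    """
    theta = min(r / n, 0.5)
    lod = n * log10(2.0)  # equals -log10 of the null likelihood 0.5 ** n
    if r > 0:
        lod += r * log10(theta)
    if n - r > 0:
        lod += (n - r) * log10(1.0 - theta)
    return lod


def chi_square_linkage(n, r):
    """Chi-square (1 df) for a 1:1 ratio of recombinants to non-recombinants."""
    return (n - 2 * r) ** 2 / n


if __name__ == "__main__":
    n, r = 40, 10  # hypothetical counts
    print(f"lod  = {lod_score(n, r):.3f}  (usual threshold: 3)")
    print(f"chi2 = {chi_square_linkage(n, r):.1f} (5% critical value, 1 df: 3.84)")
```

With these invented counts the χ² statistic of 10.0 rejects independence comfortably at the 5% level, whereas the lod score of about 2.27 stays below 3. Because a likelihood-ratio statistic equals 2 ln(10) ≈ 4.6 times the lod score, a threshold of 3 corresponds to a statistic of about 13.8, which is consistent with the abstract's description of the lod score as very conservative.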
Validation of undergraduate medical student script concordance test (SCT) scores on the clinical assessment of the acute abdomen.

PubMed

Goos, Matthias; Schubach, Fabian; Seifert, Gabriel; Boeker, Martin

2016-08-17

Health professionals often manage medical problems in critical situations under time pressure and on the basis of vague information. In recent years, dual process theory has provided a framework of cognitive processes to assist students in developing clinical reasoning skills critical especially in surgery due to the high workload and the elevated stress levels. However, clinical reasoning skills can be observed only indirectly and the corresponding constructs are difficult to measure in order to assess student performance. The script concordance test has been established in this field. A number of studies suggest that the test delivers a valid assessment of clinical reasoning. However, different scoring methods have been suggested. They reflect different interpretations of the underlying construct. In this work we want to shed light on the theoretical framework of script theory and give an idea of script concordance testing. We constructed a script concordance test in the clinical context of "acute abdomen" and compared previously proposed scores with regard to their validity. A test comprising 52 items in 18 clinical scenarios was developed, revised along the guidelines and administered to 56 4th and 5th year medical students at the end of a blended-learning seminar. We scored the answers using five different scoring methods (distance (2×), aggregate (2×), single best answer) and compared the scoring keys, the resulting final scores and Cronbach's α after normalization of the raw scores. All scores except the single best answers calculation achieved acceptable reliability scores (>= 0.75), as measured by Cronbach's α. Students were clearly distinguishable from the experts, whose results were set to a mean of 80 and SD of 5 by the normalization process. With the two aggregate scoring methods, the students' mean values were between 62.5 (AGGPEN) and 63.9 (AGG) equivalent to about three expert SD below the experts' mean value (Cronbach's α : 0.76 (AGGPEN
Testing contamination source identification methods for water distribution networks

DOE PAGES

Seth, Arpan; Klise, Katherine A.; Siirola, John D.; ...

2016-04-01

In the event of contamination in a water distribution network (WDN), source identification (SI) methods that analyze sensor data can be used to identify the source location(s). Knowledge of the source location and characteristics is important to inform contamination control and cleanup operations. Various SI strategies that have been developed by researchers differ in their underlying assumptions and solution techniques. The following manuscript presents a systematic procedure for testing and evaluating SI methods. The performance of these SI methods is affected by various factors including the size of WDN model, measurement error, modeling error, time and number of contaminant injections, and time and number of measurements. This paper includes test cases that vary these factors and evaluates three SI methods on the basis of accuracy and specificity. The tests are used to review and compare these different SI methods, highlighting their strengths in handling various identification scenarios. These SI methods and a testing framework that includes the test cases and analysis tools presented in this paper have been integrated into EPA’s Water Security Toolkit (WST), a suite of software tools to help researchers and others in the water industry evaluate and plan various response strategies in case of a contamination incident. Lastly, a set of recommendations is made for users to consider when working with different categories of SI methods.

The Effect of School Poverty on Racial Gaps in Test Scores: The Case of the Minnesota Basic Standards Tests

ERIC Educational Resources Information Center

Myers, Samuel L.; Kim, Hyeoneui; Mandala, Cheryl

2004-01-01

Data from the 1996, 1998, and 1999 Minnesota comprehensive statewide testing of eighth graders are used to analyze whether African American students perform worse than the white students who attend the poverty schools. The analyses conclude that the African American-White test score gap is attributable more to racial discrimination and racial treatment…
What's in a Teacher Test? Assessing the Relationship between Teacher Licensure Test Scores and Student STEM Achievement and Course-Taking. CEDR Working Paper. WP #2016-11

ERIC Educational Resources Information Center

Goldhaber, Dan; Gratz, Trevor; Theobald, Roddy

2016-01-01

We investigate the relationship between teacher licensure test scores and student test achievement and high school course-taking. We focus on three subject/grade combinations-- middle school math, ninth-grade algebra and geometry, and ninth-grade biology--and find evidence that a teacher's basic skills test scores are modestly predictive of…
Optimal Multi-Type Sensor Placement for Structural Identification by Static-Load Testing

PubMed Central

Papadopoulou, Maria; Vernay, Didier; Smith, Ian F. C.

2017-01-01

Assessing ageing infrastructure is a critical challenge for civil engineers due to the difficulty in the estimation and integration of uncertainties in structural models. Field measurements are increasingly used to improve knowledge of the real behavior of a structure; this activity is called structural identification. Error-domain model falsification (EDMF) is an easy-to-use model-based structural-identification methodology which robustly accommodates systematic uncertainties originating from sources such as boundary conditions, numerical modelling and model fidelity, as well as aleatory uncertainties from sources such as measurement error and material parameter-value estimations. In most practical applications of structural identification, sensors are placed using engineering judgment and experience. However, since sensor placement is fundamental to the success of structural identification, a more rational and systematic method is justified. This study presents a measurement system design methodology to identify the best sensor locations and sensor types using information from static-load tests. More specifically, three static-load tests were studied for the sensor system design using three types of sensors for a performance evaluation of a full-scale bridge in Singapore. Several sensor placement strategies are compared using joint entropy as an information-gain metric. A modified version of the hierarchical algorithm for sensor placement is proposed to take into account mutual information between load tests. It is shown that a carefully-configured measurement strategy that includes multiple sensor types and several load tests maximizes information gain. PMID:29240684
Cognitive test scores in male adolescent cigarette smokers compared to non-smokers: a population-based study.

PubMed

Weiser, Mark; Zarka, Salman; Werbeloff, Nomi; Kravitz, Efrat; Lubin, Gad

2010-02-01

Although previous studies indicate that people with lower intelligence quotient (IQ) scores are more likely to become cigarette smokers, IQ scores of siblings discordant for smoking and of adolescents who began smoking between ages 18-21 years have not been studied systematically. Each year a random sample of Israeli military recruits complete a smoking questionnaire. Cognitive functioning is assessed by the military using standardized tests equivalent to IQ. Of 20 221 18-year-old males, 28.5% reported smoking at least one cigarette a day (smokers). An unadjusted comparison found that smokers scored 0.41 effect sizes (ES, P < 0.001) lower than non-smokers; adjusted analyses remained significant (adjusted ES = 0.27, P < 0.001). Adolescents smoking one to five, six to 10, 11-20 and 21+ cigarettes/day had cognitive test scores 0.14, 0.22, 0.33 and 0.5 adjusted ES poorer than those of non-smokers (P < 0.001). Adolescents who did not smoke by age 18, and then began to smoke between ages 18-21 had lower cognitive test scores compared to never-smokers (adjusted ES = 0.14, P < 0.001). An analysis of brothers discordant for smoking found that smoking brothers had lower cognitive scores than non-smoking brothers (adjusted ES = 0.27; P = 0.014). Controlled analyses from this large population-based cohort of male adolescents indicate that IQ scores are lower in male adolescents who smoke compared to non-smokers and in brothers who smoke compared to their non-smoking brothers. The IQs of adolescents who began smoking between ages 18-21 are lower than those of non-smokers. Adolescents with poorer IQ scores might be targeted for programmes designed to prevent smoking.
PINS Testing and Modification for Explosive Identification

DOE Office of Scientific and Technical Information (OSTI.GOV)

E.H. Seabury; A.J. Caffrey

2011-09-01

The INL's Portable Isotopic Neutron Spectroscopy System (PINS) non-intrusively identifies the chemical fill of munitions and sealed containers. PINS is used routinely by the U.S. Army, the Defense Threat Reduction Agency, and foreign military units to determine the contents of munitions and other containers suspected to contain explosives, smoke-generating chemicals, and chemical warfare agents such as mustard and nerve gas. The objects assayed with PINS range from softball-sized M139 chemical bomblets to 200 gallon DOT 500X ton containers. INL had previously examined the feasibility of using a similar system for the identification of explosives, and based on this proof-of-principle test, the development of a dedicated system for the identification of explosives in an improvised nuclear device appears entirely feasible. INL has been tasked by NNSA NA-42 Render Safe Research and Development with the development of such a system.
Toward a Nonspeech Test of Auditory Cognition: Semantic Context Effects in Environmental Sound Identification in Adults of Varying Age and Hearing Abilities

PubMed Central

Sheft, Stanley; Norris, Molly; Spanos, George; Radasevich, Katherine; Formsma, Paige; Gygi, Brian

2016-01-01

Objective: Sounds in everyday environments tend to follow one another as events unfold over time. The tacit knowledge of contextual relationships among environmental sounds can influence their perception. We examined the effect of semantic context on the identification of sequences of environmental sounds by adults of varying age and hearing abilities, with an aim to develop a nonspeech test of auditory cognition. Method: The familiar environmental sound test (FEST) consisted of 25 individual sounds arranged into ten five-sound sequences: five contextually coherent and five incoherent. After hearing each sequence, listeners identified each sound and arranged them in the presentation order. FEST was administered to young normal-hearing, middle-to-older normal-hearing, and middle-to-older hearing-impaired adults (Experiment 1), and to postlingual cochlear-implant users and young normal-hearing adults tested through vocoder-simulated implants (Experiment 2). Results: FEST scores revealed a strong positive effect of semantic context in all listener groups, with young normal-hearing listeners outperforming other groups. FEST scores also correlated with other measures of cognitive ability, and for CI users, with the intelligibility of speech-in-noise. Conclusions: Being sensitive to semantic context effects, FEST can serve as a nonspeech test of auditory cognition for diverse listener populations to assess and potentially improve everyday listening skills. PMID:27893791
Stability of the alcohol use disorders identification test in practical service settings

PubMed Central

Sahker, Ethan; Lancianese, Donna A; Arndt, Stephan

2017-01-01

Objective: The purpose of the present study is to explore the stability of the Alcohol Use Disorders Identification Test (AUDIT) in a clinical setting by comparing prescreening heavy drinking questions and AUDIT scores over time. Because instrument stability is equal to test–retest reliability at worst, investigating the stability of the AUDIT would help better understand patient behavior change in context and the appropriateness of the AUDIT in a clinical setting. Methods: This was a retrospective exploratory analysis of Visit 1 to Visit 2 AUDIT stability (n=1,099; male [75.4%], female [24.6%]) from all patients with first-time and second-time records in the Iowa Screening, Brief Intervention, and Referral to Treatment project, October 2012 to July 7, 2015 (N=17,699; male [40.6%], female [59.4%]). Results: The AUDIT demonstrated moderate stability (intraclass correlation=0.56, 95% confidence interval: 0.52–0.60). In a multiple regression predicting the (absolute) difference between the two AUDIT scores, the participants’ age was highly significant, t(1,092)=6.23, p<0.001. Younger participants clearly showed less stability than their older counterparts. Results are limited/biased by the observational nature of the study design and the use of clinical service data. Conclusion: The present findings contribute to the literature by demonstrating that the AUDIT changes are moderately dependable from Visit 1 to Visit 2 while taking into account patient drinking behavior variability. It is important to know the stability of the AUDIT for continued use in Screening, Brief Intervention, and Referral to Treatment programming. PMID:28392719
Investigating Score Dependability in English/Chinese Interpreter Certification Performance Testing: A Generalizability Theory Approach

ERIC Educational Resources Information Center

Han, Chao

2016-01-01

As a property of test scores, reliability/dependability constitutes an important psychometric consideration, and it underpins the validity of measurement results. A review of interpreter certification performance tests (ICPTs) reveals that (a) although reliability/dependability checking has been recognized as an important concern, its theoretical…
Early Market Site Identification Data

DOE Data Explorer

Levi Kilcher

2016-04-01

This data was compiled for the 'Early Market Opportunity Hot Spot Identification' project. The data and scripts included were used in the 'MHK Energy Site Identification and Ranking Methodology' Reports (Part I: Wave, NREL Report #66038; Part II: Tidal, NREL Report #66079). The Python scripts will generate a set of results, based on the Excel data files, some of which were described in the reports. The scripts depend on the 'score_site' package, and the 'score_site' package depends on a number of standard Python libraries (see the score_site install instructions).
The reliability and validity of qualitative scores for the Controlled Oral Word Association Test.

PubMed

Ross, Thomas P; Calhoun, Emily; Cox, Tara; Wenner, Carolyn; Kono, Whitney; Pleasant, Morgan

2007-05-01

The reliability and validity of two qualitative scoring systems for the Controlled Oral Word Association Test [Benton, A. L., Hamsher, de S. K., & Sivan, A. B. (1983). Multilingual aphasia examination (2nd ed.). Iowa City, IA: AJA Associates] were examined in 108 healthy young adults. The scoring systems developed by Troyer et al. [Troyer, A. K., Moscovitch, M., & Winocur, G. (1997). Clustering and switching as two components of verbal fluency: Evidence from younger and older healthy adults. Neuropsychology, 11, 138-146] and by Abwender et al. [Abwender, D. A., Swan, J. G., Bowerman, J. T., & Connolly, S. W. (2001a). Qualitative analysis of verbal fluency output: Review and comparison of several scoring methods. Assessment, 8, 323-336] each demonstrated excellent interrater reliability (all indices at or above r(icc)=.9). Consistent with previous research [e.g., Ross, T. P. (2003). The reliability of cluster and switch scores for the COWAT. Archives of Clinical Neuropsychology, 18, 153-164], test-retest reliability coefficients (N=53; M interval 44.6 days) for the qualitative scores were modest to poor (r(icc)=.6 to .4 range). Correlations among COWAT scores, measures of executive functioning, verbal learning, working memory, and vocabulary were examined. The idea that qualitative scores represent distinct executive functions such as cognitive flexibility or strategy utilization was not supported. We offer the interpretation that COWAT performance may require the ability to retrieve words in a non-routine manner while suppressing habitual responses and associated processing interference, presumably due to a spread of activation across semantic or lexical networks. This interpretation, though speculative at present, implies that clustering and switching on the COWAT may not be entirely deliberate, but rather an artifact of a passive (i.e., state-dependent) process. Ideas for future research, most noticeably experimental studies using cognitive methods (e.g., priming), are
30 CFR 18.14 - Identification of tested noncertified explosion-proof enclosures.

Code of Federal Regulations, 2010 CFR

2010-07-01

... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Identification of tested noncertified explosion-proof enclosures. 18.14 Section 18.14 Mineral Resources MINE SAFETY AND HEALTH ADMINISTRATION, DEPARTMENT OF LABOR TESTING, EVALUATION, AND APPROVAL OF MINING PRODUCTS ELECTRIC MOTOR-DRIVEN MINE EQUIPMENT...
Student Test Scores: How the Sausage Is Made and Why You Should Care. Evidence Speaks Reports, Vol 1, #25

ERIC Educational Resources Information Center

Jacob, Brian A.

2016-01-01

Contrary to popular belief, modern cognitive assessments--including the new Common Core tests--produce test scores based on sophisticated statistical models rather than the simple percent of items a student answers correctly. While there are good reasons for this, it means that reported test scores depend on many decisions made by test designers,…
Exploring Validity of Computer-Based Test Scores with Examinees' Response Behaviors and Response Times

ERIC Educational Resources Information Center

Sahin, Füsun

2017-01-01

Examining the testing processes, as well as the scores, is needed for a complete understanding of validity and fairness of computer-based assessments. Examinees' rapid-guessing and insufficient familiarity with computers have been found to be major issues that weaken the validity arguments of scores. This study has three goals: (a) improving…
The effect of instructional methodology on high school students natural sciences standardized tests scores

NASA Astrophysics Data System (ADS)

Powell, P. E.

Educators have recently come to consider inquiry-based instruction as a more effective method of instruction than didactic instruction. Experience-based learning theory suggests that student performance is linked to teaching method. However, research is limited on inquiry teaching and its effectiveness in preparing students to perform well on standardized tests. The purpose of the study was to investigate whether one of these two teaching methodologies was more effective in increasing student performance on standardized science tests. The quasi-experimental quantitative study comprised two stages. Stage 1 used a survey to identify teaching methods of a convenience sample of 57 teacher participants and determined level of inquiry used in instruction to place participants into instructional groups (the independent variable). Stage 2 used analysis of covariance (ANCOVA) to compare posttest scores on a standardized exam by teaching method. Additional analyses were conducted to examine the differences in science achievement by ethnicity, gender, and socioeconomic status by teaching methodology. Results demonstrated a statistically significant gain in test scores when taught using inquiry-based instruction. Subpopulation analyses indicated all groups showed improved mean standardized test scores except African American students. The findings benefit teachers and students by presenting data supporting a method of content delivery that increases teacher efficacy and produces students with a greater cognition of science content that meets the school's mission and goals.
The validity of ACT-PEP test scores for predicting academic performance of registered nurses in BSN programs.

PubMed

Yang, J C; Noble, J

1990-01-01

This study investigated the validity of three American College Testing-Proficiency Examination Program (ACT-PEP) tests (Maternal and Child Nursing, Psychiatric/Mental Health Nursing, Adult Nursing) for predicting the academic performance of registered nurses (RNs) enrolled in bachelor's degree BSN programs nationwide. This study also examined RN students' performance on the ACT-PEP tests by their demographic characteristics: student's age, sex, race, student status (full- or part-time), and employment status (full- or part-time). The total sample for the three tests comprised 2,600 students from eight institutions nationwide. The median correlation coefficients between the three ACT-PEP tests and the semester grade point averages ranged from .36 to .56. Median correlation coefficients increased over time, supporting the stability of ACT-PEP test scores for predicting academic performance over time. The relative importance of selected independent variables for predicting academic performance was also examined; the most important variable for predicting academic performance was typically the ACT-PEP test score. Across the institutions, student demographic characteristics did not contribute significantly to explaining academic performance, over and above ACT-PEP scores.
Test Scores, Class Rank and College Performance: Lessons for Broadening Access and Promoting Success.

PubMed

Niu, Sunny X; Tienda, Marta

2012-04-01

Using administrative data for five Texas universities that differ in selectivity, this study evaluates the relative influence of two key indicators for college success: high school class rank and standardized tests. Empirical results show that class rank is the superior predictor of college performance and that test score advantages do not insulate lower ranked students from academic underperformance. Using the UT-Austin campus as a test case, we conduct a simulation to evaluate the consequences of capping students admitted automatically using both achievement metrics. We find that using class rank to cap the number of students eligible for automatic admission would have roughly uniform impacts across high schools, but imposing a minimum test score threshold on all students would have highly unequal consequences by greatly reducing the admission eligibility of the highest performing students who attend poor high schools while not jeopardizing admissibility of students who attend affluent high schools. We discuss the implications of the Texas admissions experiment for higher education in Europe.
An Index to Objectively Score Supraglottic Abnormalities in Refractory Asthma

PubMed Central

Good, James T.; Rollins, Donald R.; Curran-Everett, Douglas; Lommatzsch, Steven E.; Carolan, Brendan J.; Stubenrauch, Peter C.

2014-01-01

Background: Patients with refractory asthma frequently have elements of laryngopharyngeal reflux (LPR) with potential aspiration contributing to their poor control. We previously reported on a supraglottic index (SGI) scoring system that helps in the evaluation of LPR with potential aspiration. However, to further the usefulness of this SGI scoring system for bronchoscopists, a teaching system was developed that included both interobserver and intraobserver reproducibility. Methods: Five pulmonologists with expertise in fiber-optic bronchoscopy but novice to the SGI participated. A training system was developed that could be used via Internet interaction to make this learning technique widely available. Results: By the final testing, there was excellent interreader agreement (κ of at least 0.81), thus documenting reproducibility in scoring the SGI. For the measure of intrareader consistency, one reader was arbitrarily selected to rescore the final test 4 weeks later and had a κ value of 0.93, with a 95% CI of 0.79 to 1.00. Conclusions: In this study, we demonstrate that with an organized educational approach, bronchoscopists can develop skills to have highly reproducible assessment and scoring of supraglottic abnormalities. The SGI can be used to determine which patients need additional intervention to determine causes of LPR and gastroesophageal reflux. Identification of this problem in patients with refractory asthma allows for personal, individual directed therapy to improve asthma control. PMID:24202552
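The record above reports agreement as κ values. As a rough illustration of what such a figure measures, the sketch below computes Cohen's kappa for two sets of categorical ratings. This is an assumption for illustration only: the abstract does not state which kappa variant was used for the five readers, and the ratings in the example are invented.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa for two equal-length sequences of categorical ratings.

    kappa = (p_observed - p_expected) / (1 - p_expected), where p_expected is
    the chance agreement implied by each rater's marginal category frequencies.
    """
    if len(ratings_a) != len(ratings_b) or len(ratings_a) == 0:
        raise ValueError("ratings must be non-empty and of equal length")
    n = len(ratings_a)
    p_observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    counts_a = Counter(ratings_a)
    counts_b = Counter(ratings_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in counts_a) / (n * n)
    if p_expected == 1.0:
        # Both raters used one identical category throughout; agreement is total.
        return 1.0
    return (p_observed - p_expected) / (1 - p_expected)


# Invented ratings on a 0-2 scale for ten images, scored twice.
first_pass = [0, 0, 1, 1, 2, 2, 1, 0, 2, 1]
second_pass = [0, 0, 1, 1, 2, 2, 1, 0, 2, 2]
kappa = cohens_kappa(first_pass, second_pass)
```

Here the two passes agree on 9 of 10 items, chance agreement is 0.33, and kappa comes out near 0.85. Because kappa corrects raw agreement for chance, it is always lower than the raw agreement rate when agreement is imperfect.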
Maximal exercise testing variables and 10-year survival: fitness risk score derivation from the FIT Project.

PubMed

Ahmed, Haitham M; Al-Mallah, Mouaz H; McEvoy, John W; Nasir, Khurram; Blumenthal, Roger S; Jones, Steven R; Brawner, Clinton A; Keteyian, Steven J; Blaha, Michael J

2015-03-01

To determine which routinely collected exercise test variables most strongly correlate with survival and to derive a fitness risk score that can be used to predict 10-year survival. This was a retrospective cohort study of 58,020 adults aged 18 to 96 years who were free of established heart disease and were referred for an exercise stress test from January 1, 1991, through May 31, 2009. Demographic, clinical, exercise, and mortality data were collected on all patients as part of the Henry Ford ExercIse Testing (FIT) Project. Cox proportional hazards models were used to identify exercise test variables most predictive of survival. A "FIT Treadmill Score" was then derived from the β coefficients of the model with the highest survival discrimination. The median age of the 58,020 participants was 53 years (interquartile range, 45-62 years), and 28,201 (49%) were female. Over a median of 10 years (interquartile range, 8-14 years), 6456 patients (11%) died. After age and sex, peak metabolic equivalents of task and percentage of maximum predicted heart rate achieved were most highly predictive of survival (P<.001). Subsequent addition of baseline blood pressure and heart rate, change in vital signs, double product, and risk factor data did not further improve survival discrimination. The FIT Treadmill Score, calculated as [percentage of maximum predicted heart rate + 12(metabolic equivalents of task) - 4(age) + 43 if female], ranged from -200 to 200 across the cohort, was near normally distributed, and was found to be highly predictive of 10-year survival (Harrell C statistic, 0.811). The FIT Treadmill Score is easily attainable from any standard exercise test and translates basic treadmill performance measures into a fitness-related mortality risk score. The FIT Treadmill Score should be validated in external populations. Copyright © 2015 Mayo Foundation for Medical Education and Research. Published by Elsevier Inc. All rights reserved.
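The scoring formula quoted in the record above can be written out directly. The sketch below takes the percentage of maximum predicted heart rate as an input, because the abstract does not say how the predicted maximum is obtained; the function and parameter names are illustrative, not taken from the study.

```python
def fit_treadmill_score(pct_max_predicted_hr, mets, age_years, female):
    """FIT Treadmill Score as stated in the abstract.

    score = %MPHR + 12 * METs - 4 * age + 43 (if female)

    pct_max_predicted_hr: percentage of maximum predicted heart rate achieved
                          (e.g. 90 for 90%), supplied by the caller.
    mets:                 peak metabolic equivalents of task.
    age_years:            age in years.
    female:               True adds the 43-point term.
    """
    score = pct_max_predicted_hr + 12 * mets - 4 * age_years
    if female:
        score += 43
    return score


# Hypothetical patients: 50 years old, 90% of predicted maximum, 10 METs.
score_male = fit_treadmill_score(90, 10, 50, female=False)   # 90 + 120 - 200
score_female = fit_treadmill_score(90, 10, 50, female=True)  # adds 43
```

For these invented inputs the scores are 10 and 53, both inside the -200 to 200 range the abstract reports for the cohort.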
Relationships between the handball-specific complex test, non-specific field tests and the match performance score in elite professional handball players.

PubMed

Hermassi, Souhail; Chelly, Mohamed-Souhaiel; Wollny, Rainer; Hoffmeyer, Birgit; Fieseler, Georg; Schulze, Stephan; Irlenbusch, Lars; Delank, Karl-Stefan; Shephard, Roy J; Bartels, Thomas; Schwesig, René

2018-06-01

This study assessed the validity of the handball-specific complex test (HBCT) and two non-specific field tests in professional elite handball athletes, using the match performance score (MPS) as the gold standard of performance. Thirteen elite male handball players (age: 27.4±4.8 years; premier German league) performed the HBCT, the Yo-Yo Intermittent Recovery (YYIR) test and a repeated shuttle sprint ability (RSA) test at the beginning of pre-season training. The RSA results were evaluated in terms of best time, total time, and fatigue decrement. Heart rates (HR) were assessed at selected times throughout all tests; the recovery HR was measured immediately post-test and 10 minutes later. The match performance score was based on various handball specific parameters (e.g., field goals, assists, steals, blocks, and technical mistakes) as seen during all matches of the immediately subsequent season (2015/2016). The parameters of run 1, run 2, and HR recovery at minutes 6 and 10 of the RSA test all showed a variance of more than 10% (range: 11-15%). However, the variance of scores for the YYIR test was much smaller (range: 1-7%). The resting HR (r²=0.18), HR recovery at minute 10 (r²=0.10), lactate concentration at rest (r²=0.17), recovery of heart rate from 0 to 10 minutes (r²=0.15), and velocity of second throw at first trial (r²=0.37) were the most valid HBCT parameters. Much effort is necessary to assess MPS and to develop valid tests. Speed and the rate of functional recovery seem the best predictors of competitive performance for elite handball players.
Automated Essay Scoring versus Human Scoring: A Comparative Study

ERIC Educational Resources Information Center

Wang, Jinhao; Brown, Michelle Stallone

2007-01-01

The current research was conducted to investigate the validity of automated essay scoring (AES) by comparing group mean scores assigned by an AES tool, IntelliMetric [TM] and human raters. Data collection included administering the Texas version of the WriterPlacer "Plus" test and obtaining scores assigned by IntelliMetric [TM] and by…

College Math Assessment: SAT Scores vs. College Math Placement Scores

ERIC Educational Resources Information Center

Foley-Peres, Kathleen; Poirier, Dawn

2008-01-01

Many colleges and universities use SAT math scores or math placement tests to place students in the appropriate math course. This study compares the use of math placement scores and SAT scores for 188 freshman students. The students' grades and faculty observations were analyzed to determine if the SAT scores and/or college math assessment scores…
A Comparison of Scores on the WISC-R and Lorge-Thorndike Intelligence Test for Disadvantaged Black Elementary School Children

ERIC Educational Resources Information Center

Lowe, James D.; Karnes, Frances A.

1976-01-01

It is indicated that, although the scores [obtained on both tests] are significantly correlated, the tests yield significantly different scores with the Lorge-Thorndike consistently overestimating the WISC-R full scale I.Q. (Author)
The Effect of Four Intervention Programs on Standardized Test Scores by Gender

ERIC Educational Resources Information Center

Cryder, Rebecca E.

2012-01-01

This quantitative correlational study involved the analysis, by gender, of the effect of four intervention programs at an Arizona middle school as seen on Arizona's Instrument to Measure Standards (AIMS) test scores. These four intervention programs included: Advancement Via Individual Determination (AVID), a planner stamping system, a World…
International Test Score Comparisons and Educational Policy: A Review of the Critiques

ERIC Educational Resources Information Center

Carnoy, Martin

2015-01-01

Stanford education professor Martin Carnoy examines four main critiques of how international test results are used in policymaking. Of particular interest are critiques of the policy analyses published by the Program for International Student Assessment (PISA). Using average PISA scores as a comparative measure of student achievement is misleading…
Evaluation of Verigene Blood Culture Test Systems for Rapid Identification of Positive Blood Cultures.

PubMed

Kim, Jae-Seok; Kang, Go-Eun; Kim, Han-Sung; Kim, Hyun Soo; Song, Wonkeun; Lee, Kyu Man

2016-01-01

The performance of molecular tests using the Verigene Gram-Positive and Gram-Negative Blood Culture nucleic acid tests (BC-GP and BC-GN, resp.; Nanosphere, Northbrook, IL, USA) was evaluated for the identification of microorganisms detected from blood cultures. Ninety-nine blood cultures containing Gram-positive bacteria and 150 containing Gram-negative bacteria were analyzed using the BC-GP and BC-GN assays, respectively. Blood cultures were performed using the Bactec blood culture system (BD Diagnostic Systems, Franklin Lakes, NJ, USA) and conventional identification and antibiotic-susceptibility tests were performed using a MicroScan system (Siemens, West Sacramento, CA, USA). When a single strain of bacteria was isolated from the blood culture, Verigene assays correctly identified 97.9% (94/96) of Gram-positive bacteria and 93.8% (137/146) of Gram-negative bacteria. Resistance genes mecA and vanA were correctly detected by the BC-GP assay, while the extended-spectrum β-lactamase CTX-M and the carbapenemase OXA resistance gene were detected from 30 cultures by the BC-GN assay. The BC-GP and BC-GN assays showed high agreement with conventional identification and susceptibility tests. These tests are useful for rapid identification of microorganisms and the detection of clinically important resistance genes from positive Bactec blood cultures.
Using the EZ-Diffusion Model to Score a Single-Category Implicit Association Test of Physical Activity

PubMed Central

Rebar, Amanda L.; Ram, Nilam; Conroy, David E.

2014-01-01

Objective: The Single-Category Implicit Association Test (SC-IAT) has been used as a method for assessing automatic evaluations of physical activity, but measurement artifact or consciously-held attitudes could be confounding the outcome scores of these measures. The objective of these two studies was to address these measurement concerns by testing the validity of a novel SC-IAT scoring technique. Design: Study 1 was a cross-sectional study, and study 2 was a prospective study. Method: In study 1, undergraduate students (N = 104) completed SC-IATs for physical activity, flowers, and sedentary behavior. In study 2, undergraduate students (N = 91) completed a SC-IAT for physical activity, self-reported affective and instrumental attitudes toward physical activity, physical activity intentions, and wore an accelerometer for two weeks. The EZ-diffusion model was used to decompose the SC-IAT into three process component scores including the information processing efficiency score. Results: In study 1, a series of structural equation model comparisons revealed that the information processing score did not share variability across distinct SC-IATs, suggesting it does not represent systematic measurement artifact. In study 2, the information processing efficiency score was shown to be unrelated to self-reported affective and instrumental attitudes toward physical activity, and positively related to physical activity behavior, above and beyond the traditional D-score of the SC-IAT. Conclusions: The information processing efficiency score is a valid measure of automatic evaluations of physical activity. PMID:25484621
The Apgar score has survived the test of time.

PubMed

Finster, Mieczyslaw; Wood, Margaret

2005-04-01

In 1953, Virginia Apgar, M.D. published her proposal for a new method of evaluation of the newborn infant. The avowed purpose of this paper was to establish a simple and clear classification of newborn infants which can be used to compare the results of obstetric practices, types of maternal pain relief and the results of resuscitation. Having considered several objective signs pertaining to the condition of the infant at birth she selected five that could be evaluated and taught to the delivery room personnel without difficulty. These signs were heart rate, respiratory effort, reflex irritability, muscle tone and color. Sixty seconds after the complete birth of the baby a rating of zero, one or two was given to each sign, depending on whether it was absent or present. Virginia Apgar reviewed anesthesia records of 1025 infants born alive at Columbia Presbyterian Medical Center during the period of this report. All had been rated by her method. Infants in poor condition scored 0-2, infants in fair condition scored 3-7, while scores 8-10 were achieved by infants in good condition. The most favorable score 1 min after birth was obtained by infants delivered vaginally with the occiput the presenting part (average 8.4). Newborns delivered by version and breech extraction had the lowest score (average 6.3). Infants delivered by cesarean section were more vigorous (average score 8.0) when spinal was the method of anesthesia versus an average score of 5.0 when general anesthesia was used. Correlating the 60 s score with neonatal mortality, Virginia found that mature infants receiving 0, 1 or 2 scores had a neonatal death rate of 14%; those scoring 3, 4, 5, 6 or 7 had a death rate of 1.1%; and those in the 8-10 score group had a death rate of 0.13%. She concluded that the prognosis of an infant is excellent if he receives one of the upper three scores, and poor if one of the lowest three scores.
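The scoring rule summarized in this abstract (five signs, each rated 0, 1 or 2, summed and grouped into poor 0-2, fair 3-7 and good 8-10) can be illustrated with a minimal sketch. The function and key names below are hypothetical and chosen only for illustration; the sketch is not a clinical tool.

```python
# Minimal sketch of the scoring rule described in the abstract above.
# All identifiers are hypothetical; only the five signs, the 0/1/2 ratings
# and the 0-2 / 3-7 / 8-10 groupings come from the abstract.
SIGNS = (
    "heart_rate",
    "respiratory_effort",
    "reflex_irritability",
    "muscle_tone",
    "color",
)


def apgar_total(ratings: dict) -> int:
    """Sum the five sign ratings; each rating must be 0, 1 or 2."""
    if set(ratings) != set(SIGNS):
        raise ValueError("exactly the five signs must be rated")
    for sign, value in ratings.items():
        if value not in (0, 1, 2):
            raise ValueError(f"rating for {sign} must be 0, 1 or 2")
    return sum(ratings.values())


def condition(total: int) -> str:
    """Map a total score to the condition groups named in the abstract."""
    if not 0 <= total <= 10:
        raise ValueError("total must be between 0 and 10")
    if total <= 2:
        return "poor"
    if total <= 7:
        return "fair"
    return "good"


if __name__ == "__main__":
    example = dict.fromkeys(SIGNS, 2)
    example["color"] = 1
    total = apgar_total(example)
    print(total, condition(total))
```

Keeping the rating check separate from the grouping makes it easy to apply the same grouping to totals recorded elsewhere.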
Bird on Your Smartphone: How to make identification faster?

NASA Astrophysics Data System (ADS)

Hidayat, T.; Kurniawan, I. S.; Tapilow, F. S.

2018-01-01

Students need identification skills for the field activities of an animal ecology course. Good identification skills help students understand traits and determine differences and similarities in order to name bird species. This study aims to describe the identification skills of students using smartphone applications designed to support field activities. The research method was a quasi-experiment involving 60 students divided into two groups: one group that used smartphone applications (SA) and another that used a guidebook (GB). The study was carried out in the classroom and outside (in the field). The instruments used included tests and a questionnaire. Identification skills were measured by tests and indicated by an average score (AS). The results showed that the identification skills of SA students were higher (AS = 3.12) than those of GB students (AS = 2.91). These results are consistent with the students' responses. Most students (90.08%) reported that using smartphone applications to identify birds is helpful, more effective, and more convenient, and makes identification faster. For further implementation, however, the performance of the smartphone used here needs to be enhanced to improve the identification skills of students and to allow wider use.
Methods for Improving Test Scores: The Good, the Bad, and the Ugly

ERIC Educational Resources Information Center

Wright, Robert J.

2009-01-01

The No Child Left Behind Act (NCLB 2001) has the faculties of every public and charter school scrambling to drive test scores of seven identified groups of children (African-American children, Anglo-White children, children with disabilities, Hispanic children, children of poverty, children with English language limitations, and Native-American…
An approach to analyzing a single subject's scores obtained in a standardized test with application to the Aachen Aphasia Test (AAT).

PubMed

Willmes, K

1985-08-01

Methods for the analysis of a single subject's test profile(s) proposed by Huber (1973) are applied to the Aachen Aphasia Test (AAT). The procedures are based on the classical test theory model (Lord & Novick, 1968) and are suited for any (achievement) test with standard norms from a large standardization sample and satisfactory reliability estimates. Two test profiles of a Wernicke's aphasic, obtained before and after a 3-month period of speech therapy, are analyzed using inferential comparisons between (groups of) subtest scores on one test application and between two test administrations for single (groups of) subtests. For each of these comparisons, the two aspects of (i) significant (reliable) differences in performance beyond measurement error and (ii) the diagnostic validity of that difference in the reference population of aphasic patients are assessed. Significant differences between standardized subtest scores and a remarkably better preserved reading and writing ability could be found for both test administrations using the multiple test procedure of Holm (1979). Comparison of both profiles revealed an overall increase in performance for each subtest as well as changes in level of performance relations between pairs of subtests.
Department of Defense (DOD) Automated Biometric Identification System (ABIS) Version 1.2: Initial Operational Test and Evaluation Report

DTIC Science & Technology

2015-05-01

Director, Operational Test and Evaluation Department of Defense (DOD) Automated Biometric Identification System (ABIS) Version 1.2 Initial...Operational Test and Evaluation Report May 2015 This report on the Department of Defense (DOD) Automated Biometric Identification System...
Reducing patient identification errors related to glucose point-of-care testing.

PubMed

Alreja, Gaurav; Setia, Namrata; Nichols, James; Pantanowitz, Liron

2011-01-01

Patient identification (ID) errors in point-of-care testing (POCT) can cause test results to be transferred to the wrong patient's chart or prevent results from being transmitted and reported. Despite the implementation of patient barcoding and ongoing operator training at our institution, patient ID errors still occur with glucose POCT. The aim of this study was to develop a solution to reduce identification errors with POCT. Glucose POCT was performed by approximately 2,400 clinical operators throughout our health system. Patients are identified by scanning in wristband barcodes or by manual data entry using portable glucose meters. Meters are docked to upload data to a database server which then transmits data to any medical record matching the financial number of the test result. With a new model, meters connect to an interface manager where the patient ID (a nine-digit account number) is checked against patient registration data from admission, discharge, and transfer (ADT) feeds and only matched results are transferred to the patient's electronic medical record. With the new process, the patient ID is checked prior to testing, and testing is prevented until ID errors are resolved. When averaged over a period of a month, ID errors were reduced to 3 errors/month (0.015%) in comparison with 61.5 errors/month (0.319%) before implementing the new meters. Patient ID errors may occur with glucose POCT despite patient barcoding. The verification of patient identification should ideally take place at the bedside before testing occurs so that the errors can be addressed in real time. The introduction of an ADT feed directly to glucose meters reduced patient ID errors in POCT.
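The check described in this abstract (a nine-digit patient ID validated against registration data from ADT feeds before testing is allowed) can be sketched roughly as follows. The class name, method names and event labels are hypothetical and are not taken from any vendor's interface manager; the sketch only illustrates the gating idea under those assumptions.

```python
# Rough sketch of a pre-test patient ID gate, assuming the ADT feed can be
# reduced to (event, patient_id) pairs. All identifiers are hypothetical.
from dataclasses import dataclass, field


@dataclass
class InterfaceManager:
    # IDs of currently registered patients, maintained from ADT events.
    registered_ids: set = field(default_factory=set)

    def update_from_adt(self, event: str, patient_id: str) -> None:
        """Apply one admission, discharge or transfer event."""
        if event in ("admit", "transfer"):
            self.registered_ids.add(patient_id)
        elif event == "discharge":
            self.registered_ids.discard(patient_id)
        else:
            raise ValueError(f"unknown ADT event: {event}")

    def may_test(self, patient_id: str) -> bool:
        """Allow testing only for a well-formed, currently registered ID."""
        well_formed = len(patient_id) == 9 and patient_id.isdigit()
        return well_formed and patient_id in self.registered_ids


if __name__ == "__main__":
    manager = InterfaceManager()
    manager.update_from_adt("admit", "123456789")
    for scanned in ("123456789", "12345678", "987654321"):
        status = "test allowed" if manager.may_test(scanned) else "blocked"
        print(scanned, status)
```

Blocking at the meter, rather than filtering results afterwards, mirrors the abstract's point that verification should happen at the bedside before testing occurs.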
From Test Scores to Language Use: Emergent Bilinguals Using English to Accomplish Academic Tasks

ERIC Educational Resources Information Center

Rodriguez-Mojica, Claudia

2018-01-01

Prominent discourses about emergent bilinguals' academic abilities tend to focus on performance as measured by test scores and perpetuate the message that emergent bilinguals trail far behind their peers. When we remove the constraints of formal testing situations, what can emergent bilinguals do in English as they engage in naturally occurring…
Comparison of Standardized Test Scores from Traditional Classrooms and Those Using Problem-Based Learning

ERIC Educational Resources Information Center

Needham, Martha Elaine

2010-01-01

This research compares differences between standardized test scores in problem-based learning (PBL) classrooms and a traditional classroom for 6th grade students using a mixed-method, quasi-experimental and qualitative design. The research shows that problem-based learning is as effective as traditional teaching methods on standardized tests. The…
Co-Educational Tutorial Classes and Their Significance on Gendered Test Scores of Wollo University Students: A Before-After Analyses

ERIC Educational Resources Information Center

Gidey, Mu'uz

2015-01-01

This action research is carried out in a practical class room setting to devise an innovative way of administering tutorial classes to improve students' learning competence with particular reference to gendered test scores. A before-after test score analyses of mean and standard deviations along with t-statistical tests of hypotheses of second…
An Approach to Defensible Nondiscriminatory Identification Model for the Gifted.

ERIC Educational Resources Information Center

Long, Robert R.

To develop an approach for a nondiscriminatory identification model for gifted students in Rome (GA) City Schools, mean IQ scores on the Otis-Lennon Mental Ability test were compared for fourth, fifth, and tenth grade students divided into four groups: White advantaged, White disadvantaged, Black advantaged, and Black disadvantaged. A significant…
Effects of septoplasty on olfactory function evaluated by the Brief Smell Identification Test: A study of 116 patients.

PubMed

Haytoğlu, Süheyl; Dengiz, Ramazan; Muluk, Nuray Bayar; Kuran, Gökhan; Arikan, Osman Kursat

2017-01-01

We conducted a prospective study of 116 patients (61 men and 55 women, aged 17 to 64 years; mean: 26.4) to investigate the effects of septoplasty on olfactory function in patients with septal deviation (SD). The Mladina classification system was used to define SD types, and olfactory function was assessed with the Brief Smell Identification Test (BSIT). The BSIT, which includes 12 odorants, was administered preoperatively and at postoperative months 1 and 3. The most common SD types were types 2 (20.7% of patients) and 1 (19.0%), followed by types 3 and 5 (both 16.4%). At postoperative month 1, the mean BSIT score was significantly higher in men than in women. For patients with types 1 and 2 SD, BSIT scores at 1 month were significantly lower than the scores preoperatively and 3 months postoperatively. For types 3 and 4, BSIT values were significantly higher at 3 months than preoperatively or at 1 month. For type 3 SD, the preoperative mean score was significantly lower than those for types 1, 4, 5, 6, and 7; for type 2 SD, the BSIT score was significantly lower than those of types 5 and 6 only. At 1 month, the scores for types 2 and 3 were significantly lower than those for types 4, 5, 6, and 7. At 3 months, the BSIT score for type 2 was significantly lower than those of types 1, 3, 4, 5, and 6; the type 3 SD score at 3 months was significantly higher than those for types 1, 2, 5, 6, and 7. We conclude that septoplasty surgery for patients with a type 3 SD may improve olfactory function. In contrast, we found that olfactory function in patients with a type 2 SD did not improve to a satisfactory degree, even when good nasal patency was achieved with a corrected septum and an enlarged intranasal volume. Our findings should be investigated further in future studies.
Diagnostic value of three-dimensional magnetic resonance imaging of inner ear after intratympanic gadolinium injection, and clinical application of magnetic resonance imaging scoring system in patients with delayed endolymphatic hydrops.

PubMed

Gu, X; Fang, Z-M; Liu, Y; Lin, S-L; Han, B; Zhang, R; Chen, X

2014-01-01

Three-dimensional fluid-attenuated inversion recovery magnetic resonance imaging of the inner ear after intratympanic injection of gadolinium, together with magnetic resonance imaging scoring of the perilymphatic space, were used to investigate the positive identification rate of hydrops and determine the technique's diagnostic value for delayed endolymphatic hydrops. Twenty-five patients with delayed endolymphatic hydrops underwent pure tone audiometry, bithermal caloric testing, vestibular-evoked myogenic potential testing and three-dimensional magnetic resonance imaging of the inner ear after bilateral intratympanic injection of gadolinium. The perilymphatic space of the scanned images was analysed to investigate the positive identification rate of endolymphatic hydrops. According to the magnetic resonance imaging scoring of the perilymphatic space and the diagnostic standard, 84 per cent of the patients examined had endolymphatic hydrops. In comparison, the positive identification rates for vestibular-evoked myogenic potential and bithermal caloric testing were 52 per cent and 72 per cent respectively. Three-dimensional magnetic resonance imaging after intratympanic injection of gadolinium is valuable in the diagnosis of delayed endolymphatic hydrops and its classification. The perilymphatic space scoring system improved the diagnostic accuracy of magnetic resonance imaging.
CK-MM Polymorphism is Associated With Physical Fitness Test Scores in Military Recruits.

PubMed

Sprouse, Courtney; Tosi, Laura L; Gordish-Dressman, Heather; Abdel-Ghani, Mai S; Panchapakesan, Karuna; Niederberger, Brenda; Devaney, Joseph M; Kelly, Karen R

2015-09-01

Muscle-specific creatine kinase is thought to play an integral role in maintaining energy homeostasis by providing a supply of creatine phosphate. The genetic variant rs8111989 contributes to individual differences in physical performance, and thus the purpose of this study was to determine whether the rs8111989 variant is predictive of Physical Fitness Test (PFT) scores in male military infantry recruits. DNA was extracted from whole blood, and genotyping was performed in 176 Marines. Relationships between PFT measures (run, sit-ups, and pull-ups) and genotype were determined. Participants with 2 copies of the T allele for the rs8111989 variant had higher PFT scores for run time, pull-ups, and total PFT score. Specifically, participants with the TT genotype (n = 97) demonstrated a higher total PFT score (TT: 250 ± 31 vs. 238 ± 31; p = 0.02), run score (TT: 82 ± 10 vs. 78 ± 11; p = 0.04), and pull-up score (TT: 78 ± 11 vs. 65 ± 21; p = 0.04) than those with at least one copy of the C allele (CC/CT genotype; n = 79). These results demonstrate an association between physical performance measures and genetic variation in the muscle-specific creatine kinase gene (rs8111989). Reprint & Copyright © 2015 Association of Military Surgeons of the U.S.
[Measures to prevent patient identification errors in blood collection/physiological function testing utilizing a laboratory information system].

PubMed

Shimazu, Chisato; Hoshino, Satoshi; Furukawa, Taiji

2013-08-01

We constructed an integrated personal identification workflow chart using both bar code reading and an all-in-one laboratory information system. The information system handles not only test data but also the information needed for patient guidance in the laboratory department. The reception terminals at the entrance, displays for patient guidance, and patient identification tools at blood-sampling booths are all controlled by the information system. The number of patient identification errors was greatly reduced by the system. However, identification errors have not been eliminated in the ultrasound department. After re-evaluating the patient identification process in this department, we recognized that the major cause of the errors was an excessive identification workflow. Ordinarily, an ultrasound test requires patient identification 3 times, because 3 different systems are used during the entire test process, i.e., the ultrasound modality system, the laboratory information system, and a system for producing reports. We are trying to connect the 3 systems to develop a one-time identification workflow, but it is not a simple task and has not been completed yet. Utilization of the laboratory information system is effective but not yet perfect for patient identification. Even today, the most fundamental procedure for patient identification is to ask the person's name. Daily checks in the ordinary workflow and everyone's participation in safety-management activities are important for the prevention of patient identification errors.

Linking U.S. School District Test Score Distributions to a Common Scale. CEPA Working Paper No. 16-09

ERIC Educational Resources Information Center

Reardon, Sean F.; Kalogrides, Demetra; Ho, Andrew D.

2017-01-01

There is no comprehensive database of U.S. district-level test scores that is comparable across states. We describe and evaluate a method for constructing such a database. First, we estimate linear, reliability-adjusted linking transformations from state test score scales to the scale of the National Assessment of Educational Progress (NAEP). We…
The Disaggregation of Value-Added Test Scores to Assess Learning Outcomes in Economics Courses

ERIC Educational Resources Information Center

Walstad, William B.; Wagner, Jamie

2016-01-01

This study disaggregates posttest, pretest, and value-added or difference scores in economics into four types of economic learning: positive, retained, negative, and zero. The types are derived from patterns of student responses to individual items on a multiple-choice test. The micro and macro data from the "Test of Understanding in College…
[Effectiveness of enneagram group counseling for self-identification and depression in nursing college students].

PubMed

Lee, Jeong Seop; Yoon, Jeong Ah; Do, Keong Jin

2013-10-01

The purpose of this study was to examine the effects of an enneagram group counseling program on self-identification and depression in nursing college students. Three groups, categorized by how the students resolve conflicts, were selected to identify changes from the program. A quasi-experimental study with a non-equivalent control group and pretest-posttest design was used. Participants were assigned to the experimental group (n=30) or control group (n=33). The experimental group participated in the enneagram group counseling program for 38 hours over eight sessions covering four different topics. Collected data were analyzed using the Chi-square test, Fisher's exact test, t-test, and Wilcoxon signed rank test. The total self-identity score for the experimental group was significantly higher than that for the control group. However, there was no significant difference between the two groups in depression scores. The Assertive and Compliant groups demonstrated significant change in self-identification, while the Withdrawn group did not reveal any change. Results indicate that the enneagram group counseling program is very effective in establishing positive self-identification for nursing college students who face developmental crises and stressful situations. It is also expected that this program would be useful for enhancing the students' confidence through a deeper understanding and acceptance of themselves.
Comparison of System Identification Techniques for the Hydraulic Manipulator Test Bed (HMTB)

NASA Technical Reports Server (NTRS)

Morris, A. Terry

1996-01-01

In this thesis linear, dynamic, multivariable state-space models for three joints of the ground-based Hydraulic Manipulator Test Bed (HMTB) are identified. HMTB, housed at the NASA Langley Research Center, is a ground-based version of the Dexterous Orbital Servicing System (DOSS), a representative space station manipulator. The dynamic models of the HMTB manipulator will first be estimated by applying nonparametric identification methods to determine each joint's response characteristics using various input excitations. These excitations include sum of sinusoids, pseudorandom binary sequences (PRBS), bipolar ramping pulses, and chirp input signals. Next, two different parametric system identification techniques will be applied to identify the best dynamical description of the joints. The manipulator is localized about a representative space station orbital replacement unit (ORU) task allowing the use of linear system identification methods. Comparisons, observations, and results of both parametric system identification techniques are discussed. The thesis concludes by proposing a model reference control system to aid in astronaut ground tests. This approach would allow the identified models to mimic on-orbit dynamic characteristics of the actual flight manipulator thus providing astronauts with realistic on-orbit responses to perform space station tasks in a ground-based environment.
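One of the input excitations listed in this abstract, the pseudorandom binary sequence (PRBS), can be illustrated with a short sketch. The generator below is the common textbook PRBS7 linear-feedback shift register (polynomial x^7 + x^6 + 1, period 127); the thesis abstract does not state which PRBS order or amplitude was used, so the register length, seed and bipolar mapping here are assumptions made purely for illustration.

```python
# Illustrative PRBS7 generator (polynomial x^7 + x^6 + 1, period 127).
# The PRBS order, seed and amplitude are assumptions; the abstract above
# only states that PRBS excitations were among the inputs used.
def prbs7(n_bits: int, seed: int = 0x02) -> list:
    """Return n_bits of a PRBS7 sequence as 0/1 integers."""
    if not 0 < seed < 128:
        raise ValueError("seed must be a nonzero 7-bit value")
    state = seed
    bits = []
    for _ in range(n_bits):
        # Feedback is the XOR of the two highest bits of the 7-bit register.
        new_bit = ((state >> 6) ^ (state >> 5)) & 1
        state = ((state << 1) | new_bit) & 0x7F
        bits.append(new_bit)
    return bits


def to_bipolar(bits: list, amplitude: float = 1.0) -> list:
    """Map 0/1 bits to a -amplitude/+amplitude excitation signal."""
    return [amplitude if bit else -amplitude for bit in bits]


if __name__ == "__main__":
    sequence = prbs7(127)
    signal = to_bipolar(sequence, amplitude=0.5)
    print(len(signal), sum(sequence))
```

A two-level signal of this kind is often chosen for identification experiments because it excites a broad band of frequencies while keeping the input amplitude bounded.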
A Guide for Setting the Cut-Scores to Minimize Weighted Classification Errors in Test Batteries

ERIC Educational Resources Information Center

Grabovsky, Irina; Wainer, Howard

2017-01-01

In this article, we extend the methodology of the Cut-Score Operating Function that we introduced previously and apply it to a testing scenario with multiple independent components and different testing policies. We derive analytically the overall classification error rate for a test battery under the policy when several retakes are allowed for…
Probabilistic consensus scoring improves tandem mass spectrometry peptide identification.

PubMed

Nahnsen, Sven; Bertsch, Andreas; Rahnenführer, Jörg; Nordheim, Alfred; Kohlbacher, Oliver

2011-08-05

Database search is a standard technique for identifying peptides from their tandem mass spectra. To increase the number of correctly identified peptides, we suggest a probabilistic framework that allows the combination of scores from different search engines into a joint consensus score. Central to the approach is a novel method to estimate scores for peptides not found by an individual search engine. This approach allows the estimation of p-values for each candidate peptide and their combination across all search engines. The consensus approach works better than any single search engine across all different instrument types considered in this study. Improvements vary strongly from platform to platform and from search engine to search engine. Compared to the industry standard MASCOT, our approach can identify up to 60% more peptides. The software for consensus predictions is implemented in C++ as part of OpenMS, a software framework for mass spectrometry. The source code is available in the current development version of OpenMS and can easily be used as a command line application or via a graphical pipeline designer TOPPAS.
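The idea of combining per-engine p-values into one consensus value can be illustrated with Fisher's method, a textbook technique. This is explicitly not the probabilistic framework of the paper above, which also estimates scores for peptides missing from an engine's results; the sketch simply assumes each search engine reports an independent p-value for the same candidate peptide. Function names are hypothetical.

```python
# Illustration only: Fisher's method for combining independent p-values.
# This is a swapped-in textbook technique, not the consensus scoring
# framework of the paper summarized above.
import math


def chi2_sf_even_dof(x: float, dof: int) -> float:
    """Chi-square survival function, closed form valid for even dof."""
    if dof <= 0 or dof % 2 != 0:
        raise ValueError("dof must be a positive even integer")
    half = x / 2.0
    term = 1.0
    total = 1.0
    # P(X > x) = exp(-x/2) * sum_{i=0}^{dof/2 - 1} (x/2)^i / i!
    for i in range(1, dof // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total


def fisher_combine(p_values: list) -> float:
    """Combine independent p-values with Fisher's method."""
    if not p_values or any(not 0.0 < p <= 1.0 for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    statistic = -2.0 * sum(math.log(p) for p in p_values)
    return chi2_sf_even_dof(statistic, 2 * len(p_values))


if __name__ == "__main__":
    # Hypothetical p-values from three search engines for one peptide.
    print(fisher_combine([0.04, 0.10, 0.03]))
```

The independence assumption is a strong one for search engines scoring the same spectrum, which is one reason a dedicated framework such as the one described in the abstract may be preferable in practice.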
Participation in a coteaching classroom and students' end-of-course test scores

NASA Astrophysics Data System (ADS)

Debro, Ava

General education students consistently perform poorly on standardized science tests. Coteaching is an instructional strategy that improves the achievement of students with disabilities, but very little research exists that examines the effect of coteaching classrooms on the performance of general education students. The purpose of this study was to examine the effect of coteaching classrooms on the performance of general education students. The constructivist theoretical framework provided the foundation for this research. The research question examined the effect that coteaching classrooms had on the performance of general education biology students. In this experimental design utilizing a posttest-only control group, coteaching instructional strategy was the treatment, and student performance was measured using the scores obtained from the biology end-of-course test. Data for this study was analyzed using an independent t-test. The results of this study revealed that there was not a statistically significant difference in student performance on the biology end-of-course test between treatment and control groups. More than half of the general education biology students enrolled in coteaching classrooms failed the end-of-course test. Researchers may use this study as a catalyst to examine other instructional practices that may improve student performance in science courses. The results of this study may be used to persuade coteachers of the importance of attending frequent professional development opportunities that examine a variety of coteaching instructional strategies. Improving the performance of general education students in science may improve standardized test scores, afford more students the opportunity to attend college, and ensure that students are able to compete on a global level.
Raise Test Scores without Selling Your Soul: An Interview with Scott Mandel

ERIC Educational Resources Information Center

Curriculum Review, 2006

2006-01-01

With his 10th book, Improving Test Scores: A Practical Approach for Teachers and Administrators, Scott Mandel outlines steps educators can take to boost achievement on standardized exams while maintaining the integrity of their day-to-day teaching. Mandel, who holds a Ph.D. in curriculum and instruction from USC, teaches history and English at…
Linear score tests for variance components in linear mixed models and applications to genetic association studies.

PubMed

Qu, Long; Guennel, Tobias; Marshall, Scott L

2013-12-01

Following the rapid development of genome-scale genotyping technologies, genetic association mapping has become a popular tool to detect genomic regions responsible for certain (disease) phenotypes, especially in early-phase pharmacogenomic studies with limited sample size. In response to such applications, a good association test needs to be (1) applicable to a wide range of possible genetic models, including, but not limited to, the presence of gene-by-environment or gene-by-gene interactions and non-linearity of a group of marker effects, (2) accurate in small samples, fast to compute on the genomic scale, and amenable to large scale multiple testing corrections, and (3) reasonably powerful to locate causal genomic regions. The kernel machine method represented in linear mixed models provides a viable solution by transforming the problem into testing the nullity of variance components. In this study, we consider score-based tests by choosing a statistic linear in the score function. When the model under the null hypothesis has only one error variance parameter, our test is exact in finite samples. When the null model has more than one variance parameter, we develop a new moment-based approximation that performs well in simulations. Through simulations and analysis of real data, we demonstrate that the new test possesses most of the aforementioned characteristics, especially when compared to existing quadratic score tests or restricted likelihood ratio tests. © 2013, The International Biometric Society.
Maintenance of Wakefulness Test scores and driving performance in sleep disorder patients and controls.

PubMed

Philip, Pierre; Chaufton, Cyril; Taillard, Jacques; Sagaspe, Patricia; Léger, Damien; Raimondi, Monika; Vakulin, Andrew; Capelli, Aurore

2013-08-01

Sleepiness at the wheel is a risk factor for traffic accidents. Past studies have demonstrated the validity of the Maintenance of Wakefulness Test (MWT) scores as a predictor of driving impairment in untreated patients with obstructive sleep apnea syndrome (OSAS), but there is limited information on the validity of the MWT in predicting driving impairment in patients with hypersomnias of central origin (narcolepsy or idiopathic hypersomnia). The aim of this study was to compare the MWT scores with driving performance in sleep disorder patients and controls. 19 patients suffering from hypersomnias of central origin (9 with narcolepsy and 10 with idiopathic hypersomnia), 17 OSAS patients and 14 healthy controls performed an MWT (4×40-minute trials) and a 40-minute driving session on a real-car driving simulator. Participants were divided into 4 groups defined by their MWT sleep latency scores. The groups were pathological (sleep latency 0-19 min), intermediate (20-33 min), alert (34-40 min) and control (>34 min). The main driving performance outcome was the number of inappropriate line crossings (ILCs) during the 40-minute driving test. Patients with pathological MWT sleep latency scores (0-19 min) displayed statistically significantly more ILCs than patients from the intermediate, alert and control groups (F (3, 46)=7.47, p<0.001). Pathological sleep latencies on the MWT predicted driving impairment in patients suffering from hypersomnias of central origin as well as in OSAS patients. MWT is an objective measure of daytime sleepiness that appears to be useful in estimating the driving performance in sleepy patients. Copyright © 2013 Elsevier B.V. All rights reserved.
Self Adapted Testing as Formative Assessment: Effects of Feedback and Scoring on Engagement and Performance

ERIC Educational Resources Information Center

Arieli-Attali, Meirav

2016-01-01

This dissertation investigated the feasibility of self-adapted testing (SAT) as a formative assessment tool with the focus on learning. Under two different orientation goals--to excel on a test (performance goal) or to learn from the test (learning goal)--I examined the effect of different scoring rules provided as interactive feedback, on test…
HAZARD IDENTIFICATION: EFFICIENCY OF SHORT-TERM TESTS IN IDENTIFYING GERM CELL MUTAGENS AND PUTATIVE NONGENOTOXIC CARCINOGENS

EPA Science Inventory

For more than a decade, mutagenicity tests have had a clearly defined role in the identification of potential human mutagens and an ancillary role in the identification of potential human carcinogens. The efficiency of short-term tests in identifying germ cell mutagens has been ex...
Application of prognostic scores in the STOPAH trial: Discriminant function is no longer the optimal scoring system in alcoholic hepatitis.

PubMed

Forrest, Ewan H; Atkinson, Stephen R; Richardson, Paul; Masson, Steven; Ryder, Stephen; Thursz, Mark R; Allison, Michael

2018-03-01

'Static' prognostic models in alcoholic hepatitis, using data from a single time point, include the discriminant function (DF), Glasgow alcoholic hepatitis score (GAHS), the age, serum bilirubin, international normalized ratio and serum creatinine (ABIC) score and the model of end-stage liver disease (MELD). 'Dynamic' scores, incorporating evolution of bilirubin at seven days, include the Lille score. The aim of this study was to assess these scores' performance in patients from the STOPAH trial. Predictive performance of scores was assessed by area under the receiver operating curve (AUC). The effect of different therapeutic strategies upon survival was assessed by Kaplan-Meier analysis and tested using the log-rank test. A total of 1,068 patients were studied. The AUCs for the DF were significantly lower than for MELD, ABIC and GAHS for both 28- and 90-day outcomes: 90-day values were 0.670, 0.704, 0.726 and 0.713, respectively. 'Dynamic' scores and change in 'static' scores by Day 7 had similar AUCs. Patients with consistently low 'static' scores had low 28-day mortalities that were not improved with prednisolone (MELD <25: 8.6%; ABIC <6.71: 6.6%; GAHS <9: 5.9%). In patients with high 'static' scores without gastrointestinal bleeding or sepsis, prednisolone reduced 28-day mortality (MELD: 22.2% vs. 28.9%, p = 0.13; ABIC 14.6% vs. 21%, p = 0.02; GAHS 21% vs. 29.3%, p = 0.04). Overall mortality from treating all patients with a DF ≥32 and Lille assessment (90-day mortality 26.8%) was greater than combining newer 'static' and 'dynamic' scores (90-day mortality: MELD/Lille 21.8%; ABIC/Lille 23.7%; GAHS/Lille 20.6%). MELD, ABIC and GAHS are superior to the DF in alcoholic hepatitis. Consistently low scores have a favourable outcome not improved with prednisolone. Combined baseline 'static' and Day 7 scores reduce the number of patients exposed to corticosteroids and improve 90-day outcome. Alcoholic hepatitis is a life-threatening condition. Several
Do Neurocognitive SCAT3 Baseline Test Scores Differ Between Footballers (Soccer) Living With and Without Disability? A Cross-Sectional Study.

PubMed

Weiler, Richard; van Mechelen, Willem; Fuller, Colin; Ahmed, Osman Hassan; Verhagen, Evert

2018-01-01

To determine if baseline Sport Concussion Assessment Tool, Third Edition (SCAT3) scores differ between athletes with and without disability. Cross-sectional comparison of preseason baseline SCAT3 scores for a range of England international footballers. Team doctors and physiotherapists supporting England football teams recorded players' SCAT3 baseline tests from August 1, 2013 to July 31, 2014. A convenience sample of 249 England footballers, of whom 185 were players without disability (male: 119; female: 66) and 64 were players with disability (male learning disability: 17; male cerebral palsy: 28; male blind: 10; female deaf: 9). Between-group comparisons of median SCAT3 total and section scores were made using the nonparametric Mann-Whitney-Wilcoxon rank-sum test. All footballers with disability scored higher symptom severity scores compared with male players without disability. Male footballers with learning disability demonstrated no significant difference in the total number of symptoms, but recorded significantly lower scores on immediate memory and delayed recall compared with male players without disability. Male blind footballers scored significantly higher for total concentration and delayed recall, and male footballers with cerebral palsy scored significantly higher on balance testing and immediate memory, when compared with male players without disability. Female footballers with deafness scored significantly higher for total concentration and balance testing than female footballers without disability. This study suggests that significant differences exist between SCAT3 baseline section scores for footballers with and without disability. Concussion consensus guidelines should recognize these differences and produce guidelines that are specific for the growing number of athletes living with disability.
See It, Be It, Write It: Using Performing Arts to Improve Writing Skills and Test Scores

ERIC Educational Resources Information Center

Blecher-Sass, Hope Sara; Moffitt, Maryellen

2010-01-01

Improve students' writing skills and boost their assessment scores while adding arts education, creativity, and fun to your writing curriculum. With this vibrant resource, improving writing skills goes hand-in-hand with improving test scores. Students learn how to use acting and visualization as prewriting activities to help them connect writing…
Improving Personality Facet Scores with Multidimensional Computer Adaptive Testing: An Illustration with the Neo Pi-R

ERIC Educational Resources Information Center

Makransky, Guido; Mortensen, Erik Lykke; Glas, Cees A. W.

2013-01-01

Narrowly defined personality facet scores are commonly reported and used for making decisions in clinical and organizational settings. Although these facets are typically related, scoring is usually carried out for a single facet at a time. This method can be ineffective and time consuming when personality tests contain many highly correlated…
Reducing patient identification errors related to glucose point-of-care testing

PubMed Central

Alreja, Gaurav; Setia, Namrata; Nichols, James; Pantanowitz, Liron

2011-01-01

Background: Patient identification (ID) errors in point-of-care testing (POCT) can cause test results to be transferred to the wrong patient's chart or prevent results from being transmitted and reported. Despite the implementation of patient barcoding and ongoing operator training at our institution, patient ID errors still occur with glucose POCT. The aim of this study was to develop a solution to reduce identification errors with POCT. Materials and Methods: Glucose POCT was performed by approximately 2,400 clinical operators throughout our health system. Patients are identified by scanning in wristband barcodes or by manual data entry using portable glucose meters. Meters are docked to upload data to a database server which then transmits data to any medical record matching the financial number of the test result. With a new model, meters connect to an interface manager where the patient ID (a nine-digit account number) is checked against patient registration data from admission, discharge, and transfer (ADT) feeds and only matched results are transferred to the patient's electronic medical record. With the new process, the patient ID is checked prior to testing, and testing is prevented until ID errors are resolved. Results: When averaged over a period of a month, ID errors were reduced to 3 errors/month (0.015%) in comparison with 61.5 errors/month (0.319%) before implementing the new meters. Conclusion: Patient ID errors may occur with glucose POCT despite patient barcoding. The verification of patient identification should ideally take place at the bedside before testing occurs so that the errors can be addressed in real time. The introduction of an ADT feed directly to glucose meters reduced patient ID errors in POCT. PMID:21633490
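The pre-test check described in this abstract can be sketched in a few lines of Python. This is an illustration only: `ACTIVE_ACCOUNTS`, `check_patient_id`, and `implied_monthly_tests` are hypothetical names, the account numbers are invented, and a plain set stands in for the registration data that the real system receives from ADT feeds. The second function back-calculates a rough monthly test volume from the reported error counts and rounded percentages.

```python
import re

# Hypothetical registration data standing in for an ADT feed.
ACTIVE_ACCOUNTS = {"123456789", "987654321"}


def check_patient_id(patient_id, active_accounts):
    """Return (ok, reason); testing would be blocked whenever ok is False."""
    if not re.fullmatch(r"\d{9}", patient_id):
        return False, "not a nine-digit account number"
    if patient_id not in active_accounts:
        return False, "no matching registration"
    return True, "match"


def implied_monthly_tests(errors_per_month, error_rate_percent):
    """Approximate test volume implied by an error count and an error rate."""
    return errors_per_month / (error_rate_percent / 100.0)


print(check_patient_id("123456789", ACTIVE_ACCOUNTS))
print(check_patient_id("12345678", ACTIVE_ACCOUNTS))
print(round(implied_monthly_tests(3, 0.015)))
print(round(implied_monthly_tests(61.5, 0.319)))
```

Both reported rates imply a volume of roughly 19,000 to 20,000 glucose tests per month, so the before and after figures appear to refer to comparable workloads. The volumes are estimates derived from rounded percentages, not numbers stated in the abstract.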
Rapid Identification and Susceptibility Testing of Candida spp. from Positive Blood Cultures by Combination of Direct MALDI-TOF Mass Spectrometry and Direct Inoculation of Vitek 2

PubMed Central

Idelevich, Evgeny A.; Grunewald, Camilla M.; Wüllenweber, Jörg; Becker, Karsten

2014-01-01

Fungaemia is associated with high mortality rates and early appropriate antifungal therapy is essential for patient management. However, classical diagnostic workflow takes up to several days due to the slow growth of yeasts. Therefore, an approach for direct species identification and direct antifungal susceptibility testing (AFST) without prior time-consuming sub-culturing of yeasts from positive blood cultures (BCs) is urgently needed. Yeast cell pellets prepared using Sepsityper kit were used for direct identification by MALDI-TOF mass spectrometry (MS) and for direct inoculation of Vitek 2 AST-YS07 card for AFST. For comparison, MALDI-TOF MS and Vitek 2 testing were performed from yeast subculture. A total of twenty four positive BCs including twelve C. glabrata, nine C. albicans, two C. dubliniensis and one C. krusei isolate were processed. Applying modified thresholds for species identification (score ≥1.5 with two identical consecutive propositions), 62.5% of BCs were identified by direct MALDI-TOF MS. AFST results were generated for 72.7% of BCs directly tested by Vitek 2 and for 100% of standardized suspensions from 24 h cultures. Thus, AFST comparison was possible for 70 isolate-antifungal combinations. Essential agreement (minimum inhibitory concentration difference ≤1 double dilution step) was 88.6%. Very major errors (VMEs) (false-susceptibility), major errors (false-resistance) and minor errors (false categorization involving intermediate result) amounted to 33.3% (of resistant isolates), 1.9% (of susceptible isolates) and 1.4% providing 90.0% categorical agreement. All VMEs were due to fluconazole or voriconazole. This direct method saved on average 23.5 h for identification and 15.1 h for AFST, compared to routine procedures. However, performance for azole susceptibility testing was suboptimal and testing from subculture remains indispensable to validate the direct finding. PMID:25489741
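Two criteria in this abstract lend themselves to a short Python sketch: the modified identification threshold (score ≥1.5 with two identical consecutive propositions) and essential agreement (MIC difference of at most one doubling dilution). The function names and the input layout are assumptions for illustration. In particular, the sketch reads "two identical consecutive propositions" as the two top-ranked proposals naming the same species, with only the top score compared against the threshold; the abstract does not spell this out.

```python
import math


def accept_direct_id(proposals, min_score=1.5):
    """proposals: list of (species, score) tuples, ranked best first.

    Returns the accepted species name, or None if the rule is not met.
    """
    if len(proposals) < 2:
        return None
    (first_species, first_score), (second_species, _) = proposals[0], proposals[1]
    if first_score >= min_score and first_species == second_species:
        return first_species
    return None


def essential_agreement(mic_direct, mic_reference):
    """True if two MICs (same units) differ by at most one doubling dilution."""
    return abs(math.log2(mic_direct) - math.log2(mic_reference)) <= 1


# Invented example values, not data from the study.
print(accept_direct_id([("Candida glabrata", 1.62), ("Candida glabrata", 1.48)]))
print(accept_direct_id([("Candida albicans", 1.71), ("Candida dubliniensis", 1.55)]))
print(essential_agreement(0.5, 0.25))
print(essential_agreement(1.0, 4.0))
```

Working on a log2 scale makes the dilution rule symmetric: 0.5 versus 0.25 mg/L is one step apart and agrees, whereas 1 versus 4 mg/L is two steps apart and does not.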
Rapid identification and susceptibility testing of Candida spp. from positive blood cultures by combination of direct MALDI-TOF mass spectrometry and direct inoculation of Vitek 2.

PubMed

Idelevich, Evgeny A; Grunewald, Camilla M; Wüllenweber, Jörg; Becker, Karsten

2014-01-01

Fungaemia is associated with high mortality rates and early appropriate antifungal therapy is essential for patient management. However, classical diagnostic workflow takes up to several days due to the slow growth of yeasts. Therefore, an approach for direct species identification and direct antifungal susceptibility testing (AFST) without prior time-consuming sub-culturing of yeasts from positive blood cultures (BCs) is urgently needed. Yeast cell pellets prepared using Sepsityper kit were used for direct identification by MALDI-TOF mass spectrometry (MS) and for direct inoculation of Vitek 2 AST-YS07 card for AFST. For comparison, MALDI-TOF MS and Vitek 2 testing were performed from yeast subculture. A total of twenty four positive BCs including twelve C. glabrata, nine C. albicans, two C. dubliniensis and one C. krusei isolate were processed. Applying modified thresholds for species identification (score ≥ 1.5 with two identical consecutive propositions), 62.5% of BCs were identified by direct MALDI-TOF MS. AFST results were generated for 72.7% of BCs directly tested by Vitek 2 and for 100% of standardized suspensions from 24 h cultures. Thus, AFST comparison was possible for 70 isolate-antifungal combinations. Essential agreement (minimum inhibitory concentration difference ≤ 1 double dilution step) was 88.6%. Very major errors (VMEs) (false-susceptibility), major errors (false-resistance) and minor errors (false categorization involving intermediate result) amounted to 33.3% (of resistant isolates), 1.9% (of susceptible isolates) and 1.4% providing 90.0% categorical agreement. All VMEs were due to fluconazole or voriconazole. This direct method saved on average 23.5 h for identification and 15.1 h for AFST, compared to routine procedures. However, performance for azole susceptibility testing was suboptimal and testing from subculture remains indispensable to validate the direct finding.
Are students' impressions of improved learning through active learning methods reflected by improved test scores?

PubMed

Everly, Marcee C

2013-02-01

To report the transformation from lecture to more active learning methods in a maternity nursing course and to evaluate whether student perception of improved learning through active-learning methods is supported by improved test scores. The process of transforming a course into an active-learning model of teaching is described. A voluntary mid-semester survey for student acceptance of the new teaching method was conducted. Course examination results, from both a standardized exam and a cumulative final exam, among students who received lecture in the classroom and students who had active learning activities in the classroom were compared. Active learning activities were very acceptable to students. The majority of students reported learning more from having active-learning activities in the classroom rather than lecture-only and this belief was supported by improved test scores. Students who had active learning activities in the classroom scored significantly higher on a standardized assessment test than students who received lecture only. The findings support the use of student reflection to evaluate the effectiveness of active-learning methods and help validate the use of student reflection of improved learning in other research projects. Copyright © 2011 Elsevier Ltd. All rights reserved.

Higher blood harmane (1-methyl-9H-pyrido[3,4-b]indole) concentrations correlate with lower olfactory scores in essential tremor.

PubMed

Louis, Elan D; Rios, Eileen; Pellegrino, Kathryn M; Jiang, Wendy; Factor-Litvak, Pam; Zheng, Wei

2008-05-01

Harmane (1-methyl-9H-pyrido[3,4-b]indole), a neurotoxin, may be an environmental risk factor for essential tremor (ET). Harmane and related chemicals are toxic to the cerebellum. Whether it is through this mechanism (cerebellar toxicity) that harmane leads to ET is unknown. Impaired olfaction may be a feature of cerebellar disease. To determine whether blood harmane concentrations correlate with olfactory test scores in patients with ET. Blood harmane concentrations were quantified using high performance liquid chromatography. Odor identification testing was performed with the University of Pennsylvania Smell Identification Test (UPSIT). In 83 ET cases, higher log blood harmane concentration was correlated with lower UPSIT score (rho=-0.46, p<0.001). 25/40 (62.5%) cases with high log blood harmane concentration (based on a median split) had low UPSIT scores (based on a median split) vs. 12/43 (27.9%) ET cases with low log blood harmane concentration (adjusted odds ratio (OR) 4.04, 95% confidence interval (CI) 1.42-11.50, p=0.009). When compared with the low log blood harmane tertile, the odds of olfactory dysfunction were 2.64 times higher in cases in the middle tertile and 10.95 times higher in cases in the high tertile. In 69 control subjects, higher log blood harmane concentration was not correlated with lower UPSIT score (rho=0.12, p=0.32). Blood harmane concentrations were correlated with UPSIT scores in ET cases but not controls. These analyses set the stage for postmortem studies to further explore the role of harmane as a cerebellar toxin in ET.
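As a worked check on the counts above, the crude (unadjusted) odds ratio can be computed directly from the 2×2 table. The Python sketch below uses a hypothetical helper named `odds_ratio`; it does not reproduce the adjusted estimate of 4.04, which depends on covariates that the abstract does not list.

```python
def odds_ratio(exposed_events, exposed_total, unexposed_events, unexposed_total):
    """Crude odds ratio from a 2x2 table given events and group sizes."""
    a = exposed_events
    b = exposed_total - exposed_events
    c = unexposed_events
    d = unexposed_total - unexposed_events
    return (a * d) / (b * c)


# High harmane: 25 of 40 with low UPSIT; low harmane: 12 of 43 with low UPSIT.
crude_or = odds_ratio(25, 40, 12, 43)
print(round(100 * 25 / 40, 1), round(100 * 12 / 43, 1))
print(round(crude_or, 2))
```

The proportions reproduce the reported 62.5% and 27.9%, and the crude odds ratio is about 4.31, slightly above the adjusted value reported by the authors.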
Changes in Student Populations and Average Test Scores of Dutch Primary Schools

ERIC Educational Resources Information Center

Luyten, Hans; de Wolf, Inge

2011-01-01

This article focuses on the relation between student population characteristics and average test scores per school in the final grade of primary education from a dynamic perspective. Aggregated data of over 5,000 Dutch primary schools covering a 6-year period were used to study the relation between changes in school populations and shifts in mean…
States Eyeing Expense of Hand-Scored Tests in Light of NCLB Rules

ERIC Educational Resources Information Center

Archer, Jeff

2005-01-01

When students put down their pencils at the end of Connecticut's testing each year, another intensive process begins. Hundreds of trained evaluators work day and night for about a month to score the written responses. Although expensive, the use of open-ended questions drives the kind of instruction that state leaders say they want in their…
Updating prognosis of cirrhosis by Cox's regression model using Child-Pugh score and aminopyrine breath test as time-dependent covariates.

PubMed

Merkel, C; Morabito, A; Sacerdoti, D; Bolognesi, M; Angeli, P; Gatta, A

1998-06-01

The determination of aminopyrine breath test on entry into the study was recently shown to improve the accuracy of prediction of death based on the Child-Pugh classification, but the possible usefulness of serial determinations of both parameters has not been assessed. In the present study, we aimed at evaluating whether serial determinations of aminopyrine breath test and Child-Pugh score improve prognostic accuracy in patients with cirrhosis, compared with determinations obtained only on admission. In 74 patients with liver cirrhosis aminopyrine breath test and Child-Pugh score were obtained upon entry into the study. Patients were followed with sequential aminopyrine breath tests and assessments of the Child-Pugh score every 4-6 months. A total number of 232 determinations were obtained. During follow-up 45 patients died, on average after 12 months of follow-up. Child-Pugh score improved in the beginning of follow-up, and then remained fairly constant; aminopyrine breath test showed no improvement in the beginning of follow-up, but rather a slowly progressive decline. In patients who died, both the Child-Pugh score and the metabolism of aminopyrine were significantly more impaired in the last year preceding death (p < 0.05). Applying Cox's regression model with time-dependent covariates, Child-Pugh score and aminopyrine breath test were independent significant predictors of survival. The model with time-dependent covariates explained the observed survival much better than the model with time-fixed covariates (chi-sq. explained by regression = 31.45 vs 11.97; d.f. = 2; p = 0.0000001 vs 0.003). These data suggest that serial determinations of Child-Pugh score and aminopyrine breath test can be used to efficiently update prognosis of cirrhosis.
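The two p values quoted for the regression chi-square statistics can be checked by hand, because the upper-tail probability of a chi-square distribution with 2 degrees of freedom has the closed form exp(-x/2). The Python sketch below applies that identity; the function name `chi2_sf_df2` is a label chosen for illustration.

```python
import math


def chi2_sf_df2(x):
    """Upper-tail probability of a chi-square variable with 2 degrees of freedom."""
    if x < 0:
        raise ValueError("chi-square statistics are non-negative")
    return math.exp(-x / 2.0)


p_time_dependent = chi2_sf_df2(31.45)   # model with time-dependent covariates
p_time_fixed = chi2_sf_df2(11.97)       # model with time-fixed covariates
print(f"{p_time_dependent:.1e}")
print(f"{p_time_fixed:.4f}")
```

The results, about 1.5 × 10^-7 and 0.0025, agree with the reported values of 0.0000001 and 0.003 after rounding.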
Assessing Growth in Young Children: A Comparison of Raw, Age-Equivalent, and Standard Scores Using the Peabody Picture Vocabulary Test

ERIC Educational Resources Information Center

Sullivan, Jeremy R.; Winter, Suzanne M.; Sass, Daniel A.; Svenkerud, Nicole

2014-01-01

Many tests provide users with several different types of scores to facilitate interpretation and description of students' performance. Common examples include raw scores, age- and grade-equivalent scores, and standard scores. However, when used within the context of assessing growth among young children, these scores should not be interchangeable…
The Impact of the 2004 Hurricanes on Florida Comprehensive Assessment Test Scores: Implications for School Counselors

ERIC Educational Resources Information Center

Baggerly, Jennifer; Ferretti, Larissa K.

2008-01-01

What is the impact of natural disasters on students' statewide assessment scores? To answer this question, Florida Comprehensive Assessment Test (FCAT) scores of 55,881 students in grades 4 through 10 were analyzed to determine if there were significant decreases after the 2004 hurricanes. Results reveal that there was statistical but no practical…
Lower Quarter Y-Balance Test Scores and Lower Extremity Injury in NCAA Division I Athletes.

PubMed

Lai, Wilson C; Wang, Dean; Chen, James B; Vail, Jeremy; Rugg, Caitlin M; Hame, Sharon L

2017-08-01

Functional movement tests that are predictive of injury risk in National Collegiate Athletic Association (NCAA) athletes are useful tools for sports medicine professionals. The Lower Quarter Y-Balance Test (YBT-LQ) measures single-leg balance and reach distances in 3 directions. To assess whether the YBT-LQ predicts the laterality and risk of sports-related lower extremity (LE) injury in NCAA athletes. Case-control study; Level of evidence, 3. The YBT-LQ was administered to 294 NCAA Division I athletes from 21 sports during preparticipation physical examinations at a single institution. Athletes were followed prospectively over the course of the corresponding season. Correlation analysis was performed between the laterality of reach asymmetry and composite scores (CS) versus the laterality of injury. Receiver operating characteristic (ROC) analysis was used to determine the optimal asymmetry cutoff score for YBT-LQ. A multivariate regression analysis adjusting for sex, sport type, body mass index, and history of prior LE surgery was performed to assess predictors of earlier and higher rates of injury. Neither the laterality of reach asymmetry nor the CS correlated with the laterality of injury. ROC analysis found optimal cutoff scores of 2, 9, and 3 cm for anterior, posteromedial, and posterolateral reach, respectively. All of these potential cutoff scores, along with a cutoff score of 4 cm used in the majority of prior studies, were associated with poor sensitivity and specificity. Furthermore, none of the asymmetric cutoff scores were associated with earlier or increased rate of injury in the multivariate analyses. YBT-LQ scores alone do not predict LE injury in this collegiate athlete population. Sports medicine professionals should be cautioned against using the YBT-LQ alone to screen for injury risk in collegiate athletes.
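A common way to pick an "optimal" cutoff from an ROC analysis is to maximize Youden's J (sensitivity + specificity - 1). The abstract does not state which criterion the authors used, so the Python sketch below is only an assumption about the general approach; the function name `best_cutoff` and the data are invented for illustration and are not taken from the study.

```python
def best_cutoff(asymmetry_cm, injured):
    """Return (cutoff, J, sensitivity, specificity) maximizing Youden's J.

    A test is called positive when asymmetry >= cutoff.
    """
    n_pos = sum(1 for y in injured if y)
    n_neg = len(injured) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need at least one injured and one uninjured athlete")
    best = None
    for cutoff in sorted(set(asymmetry_cm)):
        tp = sum(1 for a, y in zip(asymmetry_cm, injured) if a >= cutoff and y)
        tn = sum(1 for a, y in zip(asymmetry_cm, injured) if a < cutoff and not y)
        sensitivity = tp / n_pos
        specificity = tn / n_neg
        j = sensitivity + specificity - 1
        if best is None or j > best[1]:
            best = (cutoff, j, sensitivity, specificity)
    return best


# Invented reach asymmetries (cm) and injury outcomes (1 = injured).
asymmetry = [1, 2, 3, 4, 5, 6, 7, 8]
injured = [0, 0, 0, 1, 1, 0, 1, 1]
cutoff, j, sens, spec = best_cutoff(asymmetry, injured)
print(cutoff, j, sens, spec)
```

A cutoff chosen this way is optimal only relative to the other candidate thresholds. As the study found, the best available cutoff can still have poor sensitivity and specificity.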
Principles for valid histopathologic scoring in research

PubMed Central

Gibson-Corley, Katherine N.; Olivier, Alicia K.; Meyerholz, David K.

2013-01-01

Histopathologic scoring is a tool by which semi-quantitative data can be obtained from tissues. Initially, a thorough understanding of the experimental design, study objectives and methods are required to allow the pathologist to appropriately examine tissues and develop lesion scoring approaches. Many principles go into the development of a scoring system such as tissue examination, lesion identification, scoring definitions and consistency in interpretation. Masking (a.k.a. “blinding”) of the pathologist to experimental groups is often necessary to constrain bias and multiple mechanisms are available. Development of a tissue scoring system requires appreciation of the attributes and limitations of the data (e.g. nominal, ordinal, interval and ratio data) to be evaluated. Incidence, ordinal and rank methods of tissue scoring are demonstrated along with key principles for statistical analyses and reporting. Validation of a scoring system occurs through two principal measures: 1) validation of repeatability and 2) validation of tissue pathobiology. Understanding key principles of tissue scoring can help in the development and/or optimization of scoring systems so as to consistently yield meaningful and valid scoring data. PMID:23558974
Working memory and the identification of facial expression in patients with left frontal glioma.

PubMed

Mu, Yong-Gao; Huang, Ling-Juan; Li, Shi-Yun; Ke, Chao; Chen, Yu; Jin, Yu; Chen, Zhong-Ping

2012-09-01

Patients with brain tumors may have cognitive dysfunctions, including deterioration of memory such as working memory, that affect quality of life. This study aimed to explore the presence of defects in working memory and the identification of facial expressions in patients with left frontal glioma. This case-control study recruited 11 matched pairs of patients and healthy control subjects (mean age ± standard deviation, 37.00 ± 10.96 years vs 36.73 ± 11.20 years; 7 male and 4 female) from March through December 2011. The psychological tests contained tests that estimate verbal/visual-spatial working memory, executive function, and the identification of facial expressions. According to the paired samples analysis, there were no differences in the anxiety and depression scores or in the intelligence quotients between the 2 groups (P > .05). All indices of the Digit Span Test were significantly worse in patients than in control subjects (P < .05), but the Tapping Test scores did not differ between patient and control groups. Of all 7 Wisconsin Card Sorting Test (WCST) indexes, only the Perseverative Response was significantly different between patients and control subjects (P < .05). Patients were significantly less accurate in detecting angry facial expressions than were control subjects (30.3% vs 57.6%; P < .05) but showed no deficits in the identification of other expressions. The backward indexes of the Digit Span Test were associated with emotion scores and tumor size and grade (P < .05). Patients with left frontal glioma had deficits in verbal working memory and the ability to identify anger. These may have resulted from damage to functional frontal cortex regions, in which roles in these 2 capabilities have not been confirmed. However, verbal working memory performance might be affected by emotional and tumor-related factors.
Large Modal Survey Testing Using the Ibrahim Time Domain Identification Technique

NASA Technical Reports Server (NTRS)

Ibrahim, S. R.; Pappa, R. S.

1985-01-01

The ability of the ITD identification algorithm to identify a complete set of structural modal parameters using a large number of free-response time histories simultaneously in one analysis, assuming a math model with a high number of degrees-of-freedom, has been studied. Identification results using simulated free responses of a uniform rectangular plate, with 225 measurement stations, and experimental responses from a ground vibration test of the Long Duration Exposure Facility (LDEF) Space Shuttle payload, with 142 measurement stations, are presented. As many as 300 degrees-of-freedom were allowed in analyzing these data. In general, the use of a significantly oversized math model in the identification process was found to maintain or increase identification accuracy and to identify modes of low response level that are not identified with smaller math model sizes. The concept of a Mode Shape Correlation Constant is introduced for use when more than one identification analysis of the same structure is conducted. This constant quantifies the degree of correlation between any two sets of complex mode shapes identified using different excitation conditions, different user-selectable algorithm constants, or overlapping sets of measurements.
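The abstract does not give the formula for the Mode Shape Correlation Constant. The Python sketch below assumes a normalized squared inner product of the two complex mode shape vectors, the form widely known as the modal assurance criterion; treat both the formula and the function name `mode_shape_correlation` as assumptions rather than the authors' definition.

```python
def mode_shape_correlation(phi_a, phi_b):
    """Normalized squared inner product of two complex mode shapes (0 to 1).

    Assumed form: |sum(a * conj(b))|^2 / (sum|a|^2 * sum|b|^2).
    """
    if len(phi_a) != len(phi_b):
        raise ValueError("mode shapes must share the same measurement stations")
    cross = sum(a * b.conjugate() for a, b in zip(phi_a, phi_b))
    norm_a = sum(abs(a) ** 2 for a in phi_a)
    norm_b = sum(abs(b) ** 2 for b in phi_b)
    return abs(cross) ** 2 / (norm_a * norm_b)


# Invented three-station mode shape and a rescaled, phase-rotated copy of it.
shape = [1 + 1j, 2 - 1j, 0.5j]
rescaled = [(2 - 3j) * x for x in shape]
print(mode_shape_correlation(shape, rescaled))
print(mode_shape_correlation([1 + 0j, 0j], [0j, 1 + 0j]))
```

Under this assumed definition the value is insensitive to the arbitrary complex scaling of an identified mode, so two analyses that recover the same shape score close to 1, while unrelated shapes score near 0.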
Wavelet-based identification of rotor blades in passage-through-resonance tests

NASA Astrophysics Data System (ADS)

Carassale, Luigi; Marrè-Brunenghi, Michela; Patrone, Stefano

2018-01-01

Turbine blades are critical components in turbo engines and their design process usually includes experimental tests in order to validate and/or update numerical models. These tests are generally carried out on full-scale rotors having some blades instrumented with strain gauges and usually involve a run-up or a run-down phase. The quantification of damping in these conditions is rather challenging for several reasons. In this work, we show through numerical simulations that the usual identification procedures lead to a systematic overestimation of damping due both to the finite sweep velocity, as well as to the variation of the blade natural frequencies with the rotation speed. To overcome these problems, an identification procedure based on the continuous wavelet transform is proposed and validated through numerical simulation.
Improvement in intelligence test scores from 6 to 10 years in children of teenage mothers.

PubMed

Cornelius, Marie D; Goldschmidt, Lidush; De Genna, Natacha M; Richardson, Gale A; Leech, Sharon L; Day, Richard

2010-06-01

This study investigates change in IQ scores among 290 children born to teenage mothers and identifies social, economic, and environmental variables that may be associated with change in intelligence test performance. The children of 290 teenage mothers (72% African-American and 28% European American) were assessed with the Stanford-Binet Intelligence Scale-4th Edition at ages 6 and 10. The mean composite score at age 6 was 84.8 and 91.2 at age 10, an improvement of 6.4 points. Significant cross-sectional predictors at both ages 6 and 10 of higher Stanford-Binet Intelligence Scale scores were maternal cognitive ability, school grade, white ethnicity, and caregiver education. Having more children in the household significantly predicted lower Stanford-Binet Intelligence Scale scores at age 6. Higher satisfaction with maternal social support predicted higher Stanford-Binet Intelligence Scale scores at age 10. Change in IQ scores was not related to maternal socioeconomic status, social support, home environment, ethnicity, or family interactions. Custodial stability was associated with an improvement in IQ scores, whereas increase in caregiver depression was related to decline in IQ scores. Our findings suggest that improvement in IQ scores of offspring of teenage mothers may be related to stability of maternal custody. More research is needed to determine the impact of the maturation of adolescent mothers' parenting and the role of early education on improvement in cognitive abilities.
Variables affecting results of sodium chloride tolerance test for identification of rapidly growing mycobacteria.

PubMed

Conville, P S; Witebsky, F G

1998-06-01

The sodium chloride tolerance test is often used in the identification of rapidly growing mycobacteria, particularly for distinguishing between Mycobacterium abscessus and Mycobacterium chelonae. This test, however, is frequently unreliable for the identification of some species. In this study we examined the following variables: medium manufacturer, inoculum concentration, and atmosphere and temperature of incubation. Results show that reliability is improved if the test and control slants are inoculated with an organism suspension spectrophotometrically equal to a 1 McFarland standard. Slants should be incubated at 35°C in ambient air and checked weekly for 4 weeks. Growth on control slants should be critically evaluated to determine the adequacy of the inoculum; colonies should number greater than 50. Salt-containing media should be examined carefully to detect pinpoint or tiny colonies, and colonies should number greater than 50 for a positive reaction. Concurrent use of a citrate slant may be helpful for distinguishing between M. abscessus and M. chelonae. Molecular methodologies are probably the most reliable means for the identification of rapidly growing mycobacteria and should be used, if possible, when unequivocal species identification is of particular importance.
Variables Affecting Results of Sodium Chloride Tolerance Test for Identification of Rapidly Growing Mycobacteria

PubMed Central

Conville, Patricia S.; Witebsky, Frank G.

1998-01-01

The sodium chloride tolerance test is often used in the identification of rapidly growing mycobacteria, particularly for distinguishing between Mycobacterium abscessus and Mycobacterium chelonae. This test, however, is frequently unreliable for the identification of some species. In this study we examined the following variables: medium manufacturer, inoculum concentration, and atmosphere and temperature of incubation. Results show that reliability is improved if the test and control slants are inoculated with an organism suspension spectrophotometrically equal to a 1 McFarland standard. Slants should be incubated at 35°C in ambient air and checked weekly for 4 weeks. Growth on control slants should be critically evaluated to determine the adequacy of the inoculum; colonies should number greater than 50. Salt-containing media should be examined carefully to detect pinpoint or tiny colonies, and colonies should number greater than 50 for a positive reaction. Concurrent use of a citrate slant may be helpful for distinguishing between M. abscessus and M. chelonae. Molecular methodologies are probably the most reliable means for the identification of rapidly growing mycobacteria and should be used, if possible, when unequivocal species identification is of particular importance. PMID:9620376
Development and Validation of a Food-Associated Olfactory Test (FAOT).

PubMed

Denzer-Lippmann, Melanie Yvonne; Beauchamp, Jonathan; Freiherr, Jessica; Thuerauf, Norbert; Kornhuber, Johannes; Buettner, Andrea

2017-01-01

Olfactory tests are an important tool in human nutritional research for studying food preferences, yet comprehensive tests dedicated solely to food odors are currently lacking. Therefore, within this study, an innovative food-associated olfactory test (FAOT) system was developed. The FAOT comprises 16 odorant pens that contain representative food odors relating to different macronutrient classes. The test underwent a sensory validation based on identification rate, intensity, hedonic value, and food association scores. The accuracy of the test was further compared to the accuracy of the established Sniffin' Sticks identification test. The identification rates and intensities of this new FAOT were found to be comparable to the Sniffin' Sticks olfactory identification test. The odorant pens were also assessed chemo-analytically and were found to be chemically stable for at least 24 weeks. Overall, this new identification test for use in assessing olfaction in a food-associated context is valid both in terms of its use in sensory perception studies and its chemical stability. The FAOT is particularly suited to examinations of the sense of smell regarding food odors. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Optimizing identification of clinically relevant Gram-positive organisms by use of the Bruker Biotyper matrix-assisted laser desorption ionization-time of flight mass spectrometry system.

PubMed

McElvania Tekippe, Erin; Shuey, Sunni; Winkler, David W; Butler, Meghan A; Burnham, Carey-Ann D

2013-05-01

Matrix-assisted laser desorption ionization-time of flight mass spectrometry (MALDI-TOF MS) can be used as a method for the rapid identification of microorganisms. This study evaluated the Bruker Biotyper (MALDI-TOF MS) system for the identification of clinically relevant Gram-positive organisms. We tested 239 aerobic Gram-positive organisms isolated from clinical specimens. We evaluated 4 direct-smear methods, including "heavy" (H) and "light" (L) smears, with and without a 1-μl direct formic acid (FA) overlay. The quality measure assigned to a MALDI-TOF MS identification is a numerical value or "score." We found that a heavy smear with a formic acid overlay (H+FA) produced optimal MALDI-TOF MS identification scores and the highest percentage of correctly identified organisms. Using a score of ≥2.0, we identified 183 of the 239 isolates (76.6%) to the genus level, and of the 181 isolates resolved to the species level, 141 isolates (77.9%) were correctly identified. To maximize the number of correct identifications while minimizing misidentifications, the data were analyzed using a score of ≥1.7 for genus- and species-level identification. Using this score, 220 of the 239 isolates (92.1%) were identified to the genus level, and of the 181 isolates resolved to the species level, 167 isolates (92.2%) could be assigned an accurate species identification. We also evaluated a subset of isolates for preanalytic factors that might influence MALDI-TOF MS identification. Frequent subcultures increased the number of unidentified isolates. Incubation temperatures and subcultures of the media did not alter the rate of identification. These data define the ideal bacterial preparation, identification score, and medium conditions for optimal identification of Gram-positive bacteria by use of MALDI-TOF MS.
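The effect of relaxing the score cutoff from ≥2.0 to ≥1.7 can be illustrated with a minimal Python sketch. The function name `identification_rate` and the ten example scores are hypothetical and are not data from the study; the sketch only shows how the fraction of accepted identifications changes with the cutoff.

```python
def identification_rate(scores, cutoff):
    """Return the fraction of isolates whose score meets or exceeds the cutoff."""
    if not scores:
        raise ValueError("scores must not be empty")
    accepted = sum(1 for s in scores if s >= cutoff)
    return accepted / len(scores)


# Hypothetical identification scores for ten isolates (illustration only).
example_scores = [2.31, 2.05, 1.92, 1.75, 1.68, 2.10, 1.71, 1.40, 2.00, 1.85]

rate_strict = identification_rate(example_scores, 2.0)   # cutoff of >= 2.0
rate_relaxed = identification_rate(example_scores, 1.7)  # cutoff of >= 1.7
print(f"cutoff 2.0: {rate_strict:.0%}, cutoff 1.7: {rate_relaxed:.0%}")
```

A lower cutoff accepts more identifications, so any real evaluation would also need to count how many of the additionally accepted identifications are wrong, as the study did.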
Optimizing Identification of Clinically Relevant Gram-Positive Organisms by Use of the Bruker Biotyper Matrix-Assisted Laser Desorption Ionization–Time of Flight Mass Spectrometry System

PubMed Central

McElvania TeKippe, Erin; Shuey, Sunni; Winkler, David W.; Butler, Meghan A.

2013-01-01

Matrix-assisted laser desorption ionization–time of flight mass spectrometry (MALDI-TOF MS) can be used as a method for the rapid identification of microorganisms. This study evaluated the Bruker Biotyper (MALDI-TOF MS) system for the identification of clinically relevant Gram-positive organisms. We tested 239 aerobic Gram-positive organisms isolated from clinical specimens. We evaluated 4 direct-smear methods, including “heavy” (H) and “light” (L) smears, with and without a 1-μl direct formic acid (FA) overlay. The quality measure assigned to a MALDI-TOF MS identification is a numerical value or “score.” We found that a heavy smear with a formic acid overlay (H+FA) produced optimal MALDI-TOF MS identification scores and the highest percentage of correctly identified organisms. Using a score of ≥2.0, we identified 183 of the 239 isolates (76.6%) to the genus level, and of the 181 isolates resolved to the species level, 141 isolates (77.9%) were correctly identified. To maximize the number of correct identifications while minimizing misidentifications, the data were analyzed using a score of ≥1.7 for genus- and species-level identification. Using this score, 220 of the 239 isolates (92.1%) were identified to the genus level, and of the 181 isolates resolved to the species level, 167 isolates (92.2%) could be assigned an accurate species identification. We also evaluated a subset of isolates for preanalytic factors that might influence MALDI-TOF MS identification. Frequent subcultures increased the number of unidentified isolates. Incubation temperatures and subcultures of the media did not alter the rate of identification. These data define the ideal bacterial preparation, identification score, and medium conditions for optimal identification of Gram-positive bacteria by use of MALDI-TOF MS. PMID:23426925
Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores.

PubMed

Hicks, Andrew L; Handcock, Mark S; Sastry, Narayan; Pebley, Anne R

2018-02-01

Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure.
Sequential Neighborhood Effects: The Effect of Long-Term Exposure to Concentrated Disadvantage on Children's Reading and Math Test Scores

PubMed Central

Hicks, Andrew L.; Handcock, Mark S.; Sastry, Narayan

2018-01-01

Prior research has suggested that children living in a disadvantaged neighborhood have lower achievement test scores, but these studies typically have not estimated causal effects that account for neighborhood choice. Recent studies used propensity score methods to account for the endogeneity of neighborhood exposures, comparing disadvantaged and nondisadvantaged neighborhoods. We develop an alternative propensity function approach in which cumulative neighborhood effects are modeled as a continuous treatment variable. This approach offers several advantages. We use our approach to examine the cumulative effects of neighborhood disadvantage on reading and math test scores in Los Angeles. Our substantive results indicate that recency of exposure to disadvantaged neighborhoods may be more important than average exposure for children's test scores. We conclude that studies of child development should consider both average cumulative neighborhood exposure and the timing of this exposure. PMID:29192386
Refining Ovarian Cancer Test accuracy Scores (ROCkeTS): protocol for a prospective longitudinal test accuracy study to validate new risk scores in women with symptoms of suspected ovarian cancer

PubMed Central

Sundar, Sudha; Rick, Caroline; Dowling, Francis; Au, Pui; Rai, Nirmala; Champaneria, Rita; Stobart, Hilary; Neal, Richard; Davenport, Clare; Mallett, Susan; Sutton, Andrew; Kehoe, Sean; Timmerman, Dirk; Bourne, Tom; Van Calster, Ben; Gentry-Maharaj, Aleksandra; Deeks, Jon

2016-01-01

Introduction: Ovarian cancer (OC) is associated with non-specific symptoms such as bloating, making accurate diagnosis challenging: only 1 in 3 women with OC presents through primary care referral. National Institute for Health and Care Excellence guidelines recommend sequential testing with CA125 and routine ultrasound in primary care. However, these diagnostic tests have limited sensitivity or specificity. Improving accurate triage in women with vague symptoms is likely to improve mortality by streamlining referral and care pathways. The Refining Ovarian Cancer Test Accuracy Scores (ROCkeTS; HTA 13/13/01) project will derive and validate new tests/risk prediction models that estimate the probability of having OC in women with symptoms. This protocol refers to the prospective study only (phase III). Methods and analysis: ROCkeTS comprises four parallel phases. The full ROCkeTS protocol can be found at http://www.birmingham.ac.uk/ROCKETS. Phase III is a prospective test accuracy study. The study will recruit 2450 patients from 15 UK sites. Recruited patients complete symptom and anxiety questionnaires, donate a serum sample and undergo ultrasound scored as per International Ovarian Tumour Analysis (IOTA) criteria. Recruitment is at rapid access clinics, emergency departments and elective clinics. Models to be evaluated include those based on ultrasound derived by the IOTA group and novel models derived from analysis of existing data sets. Estimates of sensitivity, specificity, c-statistic (area under receiver operating curve), positive predictive value and negative predictive value of diagnostic tests are evaluated and a calibration plot for models will be presented. ROCkeTS has received ethical approval from the NHS West Midlands REC (14/WM/1241) and is registered on the controlled trials website (ISRCTN17160843) and the National Institute of Health Research Cancer and Reproductive Health portfolios. PMID:27507231

Toward the Automated Scoring of Written Arguments: Developing an Innovative Approach for Annotation. Research Report. ETS RR-17-11

ERIC Educational Resources Information Center

Song, Yi; Deane, Paul; Beigman Klebanov, Beata

2017-01-01

This project focuses on laying the foundations for automated analysis of argumentation schemes, supporting identification and classification of the arguments being made in a text, for the purpose of scoring the quality of written analyses of arguments. We developed annotation protocols for 20 argument prompts from a college-level test under the…
Generalizing Terwilliger's likelihood approach: a new score statistic to test for genetic association.

PubMed

el Galta, Rachid; Uitte de Willige, Shirley; de Visser, Marieke C H; Helmer, Quinta; Hsu, Li; Houwing-Duistermaat, Jeanine J

2007-09-24

In this paper, we propose a one degree of freedom test for association between a candidate gene and a binary trait. This method is a generalization of Terwilliger's likelihood ratio statistic and is especially powerful for the situation of one associated haplotype. As an alternative to the likelihood ratio statistic, we derive a score statistic, which has a tractable expression. For haplotype analysis, we assume that phase is known. By means of a simulation study, we compare the performance of the score statistic to Pearson's chi-square statistic and the likelihood ratio statistic proposed by Terwilliger. We illustrate the method on three candidate genes studied in the Leiden Thrombophilia Study. We conclude that the statistic follows a chi square distribution under the null hypothesis and that the score statistic is more powerful than Terwilliger's likelihood ratio statistic when the associated haplotype has frequency between 0.1 and 0.4 and has a small impact on the studied disorder. With regard to Pearson's chi-square statistic, the score statistic has more power when the associated haplotype has frequency above 0.2 and the number of variants is above five.
PepArML: A Meta-Search Peptide Identification Platform

PubMed Central

Edwards, Nathan J.

2014-01-01

The PepArML meta-search peptide identification platform provides a unified search interface to seven search engines; a robust cluster, grid, and cloud computing scheduler for large-scale searches; and an unsupervised, model-free, machine-learning-based result combiner, which selects the best peptide identification for each spectrum, estimates false-discovery rates, and outputs pepXML format identifications. The meta-search platform supports Mascot; Tandem with native, k-score, and s-score scoring; OMSSA; MyriMatch; and InsPecT with MS-GF spectral probability scores — reformatting spectral data and constructing search configurations for each search engine on the fly. The combiner selects the best peptide identification for each spectrum based on search engine results and features that model enzymatic digestion, retention time, precursor isotope clusters, mass accuracy, and proteotypic peptide properties, requiring no prior knowledge of feature utility or weighting. The PepArML meta-search peptide identification platform often identifies 2–3 times more spectra than individual search engines at 10% FDR. PMID:25663956
An Evaluation of Three Approximate Item Response Theory Models for Equating Test Scores.

ERIC Educational Resources Information Center

Marco, Gary L.; And Others

Three item response models were evaluated for estimating item parameters and equating test scores. The models, which approximated the traditional three-parameter model, included: (1) the Rasch one-parameter model, operationalized in the BICAL computer program; (2) an approximate three-parameter logistic model based on coarse group data divided…
Using College Admission Test Scores to Clarify High School Placement. Leading Indicator Spotlight

ERIC Educational Resources Information Center

Flug, Susanna

2010-01-01

In "Beyond Test Scores: Leading Indicators for Education," Foley and colleagues (2008) define leading indicators as those that "provide early signals of progress toward academic achievement" (p. 1) and stress that educators "need leading indicators to help them see the direction their efforts are going in and to take…
Differences of wells scores accuracy, caprini scores and padua scores in deep vein thrombosis diagnosis

NASA Astrophysics Data System (ADS)

Gatot, D.; Mardia, A. I.

2018-03-01

Deep vein thrombosis (DVT) is a venous thrombus in the lower limbs. Diagnosis is by venography or compression ultrasound. However, these examinations are not yet available in some health facilities. Therefore, many scoring systems have been developed for the diagnosis of DVT. The scoring method is practical and safe to use, in addition to its efficacy and effectiveness in terms of treatment and costs. The existing scoring systems are the Wells, Caprini and Padua scores. There have been many studies comparing the accuracy of these scores, but not in Medan. Therefore, we are interested in comparative research of the Wells, Caprini and Padua scores in Medan. An observational, analytical, case-control study was conducted to perform diagnostic tests on the Wells, Caprini and Padua scores to predict the risk of DVT. The study was conducted at H. Adam Malik Hospital in Medan. Of a total of 72 subjects, 39 (54.2%) were men, and the mean age was 53.14 years. The Wells, Caprini and Padua scores had sensitivities of 80.6%, 61.1% and 50%, respectively; specificities of 80.6%, 66.7% and 75%, respectively; and accuracies of 87.5%, 64.3% and 65.7%, respectively. The Wells score has better sensitivity, specificity and accuracy than the Caprini and Padua scores in diagnosing DVT.
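For readers less familiar with the three measures reported above, the following Python sketch shows how sensitivity, specificity and accuracy are conventionally derived from a 2x2 table. The function name `diagnostic_metrics` and the counts are hypothetical; they are not the study's data.

```python
def diagnostic_metrics(tp, fp, fn, tn):
    """Compute sensitivity, specificity and accuracy from 2x2 table counts.

    tp/fn: diseased subjects with a positive/negative score result
    tn/fp: non-diseased subjects with a negative/positive score result
    """
    total = tp + fp + fn + tn
    return {
        "sensitivity": tp / (tp + fn),   # true positives among the diseased
        "specificity": tn / (tn + fp),   # true negatives among the non-diseased
        "accuracy": (tp + tn) / total,   # all correct calls among all subjects
    }


# Hypothetical counts for illustration only (not taken from the study).
metrics = diagnostic_metrics(tp=40, fp=5, fn=10, tn=45)
for name, value in metrics.items():
    print(f"{name}: {value:.1%}")
```

Accuracy computed this way is a weighted average of sensitivity and specificity, with weights given by the proportions of diseased and non-diseased subjects.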
Correcting Two-Sample "z" and "t" Tests for Correlation: An Alternative to One-Sample Tests on Difference Scores

ERIC Educational Resources Information Center

Zimmerman, Donald W.

2012-01-01

In order to circumvent the influence of correlation in paired-samples and repeated measures experimental designs, researchers typically perform a one-sample Student "t" test on difference scores. That procedure entails some loss of power, because it employs N - 1 degrees of freedom instead of the 2N - 2 degrees of freedom of the…
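The contrast in degrees of freedom mentioned in this abstract can be sketched in Python as follows. This is only an illustration of the two standard tests (a one-sample t test on difference scores and a pooled two-sample t test with equal group sizes); it does not implement the correction proposed in the article, and the function names and data are hypothetical.

```python
import math
from statistics import mean, stdev, variance


def paired_t(x, y):
    """One-sample t test on difference scores: returns (t, df) with df = N - 1."""
    diffs = [a - b for a, b in zip(x, y)]
    n = len(diffs)
    t = mean(diffs) / (stdev(diffs) / math.sqrt(n))
    return t, n - 1


def pooled_t(x, y):
    """Two-sample pooled-variance t test for equal group sizes: df = 2N - 2."""
    n = len(x)
    pooled_var = (variance(x) + variance(y)) / 2
    t = (mean(x) - mean(y)) / math.sqrt(pooled_var * 2 / n)
    return t, 2 * n - 2


# Hypothetical before/after scores for five subjects (illustration only).
before = [10, 12, 9, 14, 11]
after = [12, 15, 10, 17, 13]

t_paired, df_paired = paired_t(after, before)
t_pooled, df_pooled = pooled_t(after, before)
print(f"paired: t = {t_paired:.3f}, df = {df_paired}")
print(f"pooled: t = {t_pooled:.3f}, df = {df_pooled}")
```

With positively correlated scores such as these, the pooled test ignores the correlation and yields a smaller t statistic, although it has more degrees of freedom.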
Loanwords and Vocabulary Size Test Scores: A Case of Different Estimates for Different L1 Learners

ERIC Educational Resources Information Center

Laufer, Batia; McLean, Stuart

2016-01-01

The article investigated how the inclusion of loanwords in vocabulary size tests affected the test scores of two L1 groups of EFL learners: Hebrew and Japanese. New BNC- and COCA-based vocabulary size tests were constructed in three modalities: word form recall, word form recognition, and word meaning recall. Depending on the test modality, the…
Performance and cost analysis of matrix-assisted laser desorption ionization-time of flight mass spectrometry for routine identification of yeast.

PubMed

Dhiman, Neelam; Hall, Leslie; Wohlfiel, Sherri L; Buckwalter, Seanne P; Wengenack, Nancy L

2011-04-01

Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry was compared to phenotypic testing for yeast identification. MALDI-TOF mass spectrometry yielded 96.3% and 84.5% accurate species level identifications (spectral scores, ≥ 1.8) for 138 common and 103 archived strains of yeast. MALDI-TOF mass spectrometry is accurate, rapid (5.1 min of hands-on time/identification), and cost-effective ($0.50/sample) for yeast identification in the clinical laboratory.
Insights into Using "TOEIC"® Test Scores to Inform Human Resource Management Decisions. Research Report. ETS RR-17-48

ERIC Educational Resources Information Center

Oliveri, María Elena; Tannenbaum, Richard J.

2017-01-01

This report explores the ways in which human resource (HR) managers use "TOEIC"® scores to inform hiring, promotion, and training decisions in an international workplace. Two data sources were used (a) previously collected test users' testimonials that described managers' use of TOEIC scores to inform HR decisions and (b) test-use…
Using Multivariate Base Rates to Interpret Low Scores on an Abbreviated Battery of the Delis-Kaplan Executive Function System.

PubMed

Karr, Justin E; Garcia-Barrera, Mauricio A; Holdnack, James A; Iverson, Grant L

2017-05-01

Executive function consists of multiple cognitive processes that operate as an interactive system to produce volitional goal-oriented behavior, governed in large part by frontal microstructural and physiological networks. Identification of deficits in executive function in those with neurological or psychiatric conditions can be difficult because the normal variation in executive function test scores, in healthy adults when multiple tests are used, is largely unknown. This study addresses that gap in the literature by examining the prevalence of low scores on a brief battery of executive function tests. The sample consisted of 1,050 healthy individuals (ages 16-89) from the standardization sample for the Delis-Kaplan Executive Function System (D-KEFS). Seven individual test scores from the Trail Making Test, Color-Word Interference Test, and Verbal Fluency Test were analyzed. Low test scores, as defined by commonly used clinical cut-offs (i.e., ≤25th, 16th, 9th, 5th, and 2nd percentiles), occurred commonly among the adult portion of the D-KEFS normative sample (e.g., 62.8% of the sample had one or more scores ≤16th percentile, 36.1% had one or more scores ≤5th percentile), and the prevalence of low scores increased with lower intelligence and fewer years of education. The multivariate base rates (BR) in this article allow clinicians to understand the normal frequency of low scores in the general population. By use of these BRs, clinicians and researchers can improve the accuracy with which they identify executive dysfunction in clinical groups, such as those with traumatic brain injury or neurodegenerative diseases. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com
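Why low scores become common when several tests are interpreted together can be seen from a simple reference calculation. The Python sketch below assumes, purely for illustration, that the test scores are statistically independent; real test scores are correlated, so the empirically derived base rates reported in the article differ from this idealized figure. The function name `prob_at_least_one_low` is hypothetical.

```python
def prob_at_least_one_low(p_low, n_scores):
    """Probability of at least one low score among n_scores independent scores.

    p_low: probability that any single score falls at or below the cutoff
    n_scores: number of scores interpreted together
    """
    if not 0.0 <= p_low <= 1.0:
        raise ValueError("p_low must be between 0 and 1")
    # Complement of "no score is low" under the independence assumption.
    return 1.0 - (1.0 - p_low) ** n_scores


# Seven scores with a 16th-percentile cutoff, assuming independence.
p = prob_at_least_one_low(0.16, 7)
print(f"P(at least one low score) = {p:.1%}")
```

The point of the sketch is only that the chance of at least one low score grows quickly with the number of scores considered, which is the motivation for multivariate base rates.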
Identification of Streptococcus pyogenes - Phenotypic Tests vs Molecular Assay (spy1258PCR): A Comparative Study.

PubMed

Abraham, Tintu; Sistla, Sujatha

2016-07-01

Traditionally Group A Streptococcus pyogenes (GAS) is differentiated from other beta haemolytic streptococci (BHS) by certain presumptive tests such as bacitracin sensitivity and production of Pyrollidonyl Aryl Sulfatase (PYR). The phenotypic and genotypic confirmatory tests are Lancefield grouping for cell wall carbohydrate antigen and PCR for spy1258 gene respectively. Reliance on presumptive tests alone may lead to misidentification of isolates. To compare the predictive values of routine phenotypic tests with spy1258 PCR for the identification of Streptococcus pyogenes. This comparative analytical study was carried out in the Department of Microbiology, JIPMER, Puducherry, over a period of 18 months (1st November 2013 to 30th April 2015). Two hundred and six consecutive BHS isolates from various clinical samples were subjected to phenotypic tests such as bacitracin sensitivity, PYR test and Lancefield grouping. The results were compared with spy1258 PCR, which was considered the confirmatory test for identification. The sensitivity and specificity of the phenotypic tests were as follows: susceptibility to bacitracin, 95.42% and 70.96%; PYR test, 95.42% and 77.41%; Lancefield grouping, 97.71% and 80.64%. Clinical laboratories should not depend on bacitracin sensitivity as a single presumptive test for the routine identification of GAS but should use supplemental tests such as PYR test or latex agglutination test and for best results use spy1258 PCR.
Electromagnetic Test-Facility characterization: an identification approach

DOE Office of Scientific and Technical Information (OSTI.GOV)

Zicker, J.E.; Candy, J.V.

The response of an object subjected to high energy, transient electromagnetic (EM) fields sometimes called electromagnetic pulses (EMP), is an important issue in the survivability of electronic systems (e.g., aircraft), especially when the field has been generated by a high altitude nuclear burst. The characterization of transient response information is a matter of national concern. In this report we discuss techniques to: (1) improve signal processing at a test facility; and (2) parameterize a particular object response. First, we discuss the application of identification-based signal processing techniques to improve signal levels at the Lawrence Livermore National Laboratory (LLNL) EM Transient Test Facility. We identify models of test equipment and then use these models to deconvolve the input/output sequences for the object under test. A parametric model of the object is identified from this data. The model can be used to extrapolate the response to these threat level EMP. Also discussed is the development of a facility simulator (EMSIM) useful for experimental design and calibration and a deconvolution algorithm (DECONV) useful for removing probe effects from the measured data.
Exploration of Analysis Methods for Diagnostic Imaging Tests: Problems with ROC AUC and Confidence Scores in CT Colonography

PubMed Central

Mallett, Susan; Halligan, Steve; Collins, Gary S.; Altman, Doug G.

2014-01-01

Background: Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. Methods: In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Results: Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. Conclusions: The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests. PMID:25353643
Exploration of analysis methods for diagnostic imaging tests: problems with ROC AUC and confidence scores in CT colonography.

PubMed

Mallett, Susan; Halligan, Steve; Collins, Gary S; Altman, Doug G

2014-01-01

Different methods of evaluating diagnostic performance when comparing diagnostic tests may lead to different results. We compared two such approaches, sensitivity and specificity with area under the Receiver Operating Characteristic Curve (ROC AUC) for the evaluation of CT colonography for the detection of polyps, either with or without computer assisted detection. In a multireader multicase study of 10 readers and 107 cases we compared sensitivity and specificity, using radiological reporting of the presence or absence of polyps, to ROC AUC calculated from confidence scores concerning the presence of polyps. Both methods were assessed against a reference standard. Here we focus on five readers, selected to illustrate issues in design and analysis. We compared diagnostic measures within readers, showing that differences in results are due to statistical methods. Reader performance varied widely depending on whether sensitivity and specificity or ROC AUC was used. There were problems using confidence scores; in assigning scores to all cases; in use of zero scores when no polyps were identified; the bimodal non-normal distribution of scores; fitting ROC curves due to extrapolation beyond the study data; and the undue influence of a few false positive results. Variation due to use of different ROC methods exceeded differences between test results for ROC AUC. The confidence scores recorded in our study violated many assumptions of ROC AUC methods, rendering these methods inappropriate. The problems we identified will apply to other detection studies using confidence scores. We found sensitivity and specificity were a more reliable and clinically appropriate method to compare diagnostic tests.
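The two kinds of measure compared in this study can be sketched in Python as follows. The sketch computes sensitivity and specificity from binary calls, and a nonparametric (Mann-Whitney) estimate of ROC AUC from confidence scores, counting ties as one half. This is a generic illustration, not the analysis used by the authors; the function names and the eight-case data set, in which a score of zero stands for "no polyp identified", are hypothetical.

```python
def sensitivity_specificity(calls, truth):
    """Sensitivity and specificity from binary calls against a reference standard."""
    tp = sum(1 for c, t in zip(calls, truth) if c and t)
    fn = sum(1 for c, t in zip(calls, truth) if not c and t)
    tn = sum(1 for c, t in zip(calls, truth) if not c and not t)
    fp = sum(1 for c, t in zip(calls, truth) if c and not t)
    return tp / (tp + fn), tn / (tn + fp)


def roc_auc(scores, truth):
    """Nonparametric ROC AUC: P(positive score > negative score), ties count 0.5."""
    pos = [s for s, t in zip(scores, truth) if t]
    neg = [s for s, t in zip(scores, truth) if not t]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical data: 4 cases with polyps, 4 without; a score of 0 means no polyp seen.
truth = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [90, 80, 0, 0, 0, 0, 0, 60]
calls = [s > 0 for s in scores]  # a polyp was reported whenever a score was assigned

sens, spec = sensitivity_specificity(calls, truth)
auc = roc_auc(scores, truth)
print(f"sensitivity = {sens:.2f}, specificity = {spec:.2f}, AUC = {auc:.4f}")
```

In this toy data set most of the positive-negative pairs involving missed polyps are ties at zero, and a single high-confidence false positive outranks both missed polyps, which loosely mirrors two of the issues listed in the abstract.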
Stinging insect identification: Are the allergy specialists any better than their patients?

PubMed

Baker, Troy W; Forester, Joseph P; Johnson, Monica L; Sikora, Jeremy M; Stolfi, Adrienne; Stahl, Mark C

2016-05-01

It has been reported that the general population is not skillful at identifying stinging insects with the exception of the honeybee. No information is available to evaluate allergy physicians' accuracy with stinging insect identification. To measure the accuracy of allergists' ability to identify stinging insects and assess their common practices for evaluating individuals with suspected insect hypersensitivity. A picture-based survey and a dried specimen insect box were constructed to determine allergists' and nonallergists' accuracy in identifying insects. Allergists attending the 2013 American College of Allergy, Asthma, and Immunology meeting were invited to participate in the study. Common practice approaches for evaluating individuals with stinging insect hypersensitivity were also investigated using a brief questionnaire. Allergy physicians are collectively better at insect identification than nonallergists. Overall, the mean (SD) number of correct responses for nonallergists was 5.4 (2.0) of a total of 10. This score was significantly lower than the score for allergists (6.1 [2.0]; P = .01) who participated in the study. Most allergists (78.5%) test for all stinging insects and use skin testing (69.5%) as the initial test of choice for evaluating individuals with insect hypersensitivity. Overall, allergists are more skilled at Hymenoptera identification. Most allergy specialists reported testing for all stinging insects when evaluating insect hypersensitivity, and skin testing was the preferred testing method in nearly 70% of allergists. These data support the practice parameter's recommendation to consider testing for all flying Hymenoptera insects during venom evaluation, which most of the participating allergists surveyed incorporate into their clinical practice. Copyright © 2016 American College of Allergy, Asthma & Immunology. Published by Elsevier Inc. All rights reserved.
Comprehensive School Reform and Standardized Test Scores in Illinois Elementary and Middle Schools

ERIC Educational Resources Information Center

McEnroe, James D.

2010-01-01

The study examined the effects of the federally funded Comprehensive School Reform (CSR) program on student performance on mandated standardized tests. The study focused on the mathematics and reading scores of Illinois public elementary and middle and junior high school students. The federal CSR program provided Illinois schools with an annual…
Relationship of Friends, Physical Education, and State Test Scores: Implications for School Counselors

ERIC Educational Resources Information Center

Hollingsworth, Mary Ann

2010-01-01

This study examined the relationship between dimensions of wellness and academic performance for 634 third through fifth grade students in Title One schools in rural Mississippi, using composites of the Five Factor Wellness Inventory for Elementary Children and Reading, Language, and Math Scores of the Mississippi Curriculum Test (a state level…
A Picture-Identification Test for Hearing-Impaired Children. Final Report.

ERIC Educational Resources Information Center

Ross, Mark; Lerman, Jay

The Word Intelligibility by Picture Identification Test (WIPI) was developed to measure speech discrimination ability in hearing impaired children. In the first phase of development, the word stimuli were evaluated to determine whether they were within the recognition vocabulary of 15 hearing impaired children (aged 6 to 12) and whether the…
An evaluation of automatic coronary artery calcium scoring methods with cardiac CT using the orCaScore framework.

PubMed

Wolterink, Jelmer M; Leiner, Tim; de Vos, Bob D; Coatrieux, Jean-Louis; Kelm, B Michael; Kondo, Satoshi; Salgado, Rodrigo A; Shahzad, Rahil; Shu, Huazhong; Snoeren, Miranda; Takx, Richard A P; van Vliet, Lucas J; van Walsum, Theo; Willems, Tineke P; Yang, Guanyu; Zheng, Yefeng; Viergever, Max A; Išgum, Ivana

2016-05-01

The amount of coronary artery calcification (CAC) is a strong and independent predictor of cardiovascular disease (CVD) events. In clinical practice, CAC is manually identified and automatically quantified in cardiac CT using commercially available software. This is a tedious and time-consuming process in large-scale studies. Therefore, a number of automatic methods that require no interaction and semiautomatic methods that require very limited interaction for the identification of CAC in cardiac CT have been proposed. Thus far, a comparison of their performance has been lacking. The objective of this study was to perform an independent evaluation of (semi)automatic methods for CAC scoring in cardiac CT using a publicly available standardized framework. Cardiac CT exams of 72 patients distributed over four CVD risk categories were provided for (semi)automatic CAC scoring. Each exam consisted of a noncontrast-enhanced calcium scoring CT (CSCT) and a corresponding coronary CT angiography (CCTA) scan. The exams were acquired in four different hospitals using state-of-the-art equipment from four major CT scanner vendors. The data were divided into 32 training exams and 40 test exams. A reference standard for CAC in CSCT was defined by consensus of two experts following a clinical protocol. The framework organizers evaluated the performance of (semi)automatic methods on test CSCT scans, per lesion, artery, and patient. Five (semi)automatic methods were evaluated. Four methods used both CSCT and CCTA to identify CAC, and one method used only CSCT. The evaluated methods correctly detected between 52% and 94% of CAC lesions with positive predictive values between 65% and 96%. Lesions in distal coronary arteries were most commonly missed and aortic calcifications close to the coronary ostia were the most common false positive errors. The majority (between 88% and 98%) of correctly identified CAC lesions were assigned to the correct artery. Linearly weighted Cohen's kappa

Identification of medically relevant Nocardia species with an abbreviated battery of tests.

PubMed

Kiska, Deanna L; Hicks, Karen; Pettit, David J

2002-04-01

Identification of Nocardia to the species level is useful for predicting antimicrobial susceptibility patterns and defining the pathogenicity and geographic distribution of these organisms. We sought to develop an identification method which was accurate, timely, and employed tests which would be readily available in most clinical laboratories. We evaluated the API 20C AUX yeast identification system as well as several biochemical tests and Kirby-Bauer susceptibility patterns for the identification of 75 isolates encompassing the 8 medically relevant Nocardia species. There were few biochemical reactions that were sufficiently unique for species identification; of note, N. nova were positive for arylsulfatase, N. farcinica were positive for opacification of Middlebrook 7H11 agar, and N. brasiliensis and N. pseudobrasiliensis were the only species capable of liquefying gelatin. API 20C sugar assimilation patterns were unique for N. transvalensis, N. asteroides IV, and N. brevicatena. There was overlap among the assimilation patterns for the other species. Species-specific patterns of susceptibility to gentamicin, tobramycin, amikacin, and erythromycin were obtained for N. nova, N. farcinica, and N. brevicatena, while there was overlap among the susceptibility patterns for the other isolates. No single method could identify all Nocardia isolates to the species level; therefore, a combination of methods was necessary. An algorithm utilizing antibiotic susceptibility patterns, citrate utilization, acetamide utilization, and assimilation of inositol and adonitol accurately identified all isolates. The algorithm was expanded to include infrequent drug susceptibility patterns which have been reported in the literature but which were not seen in this study.
Robust wafer identification recognition based on asterisk-shape filter and high-low score comparison method.

PubMed

Hsu, Wei-Chih; Yu, Tsan-Ying; Chen, Kuan-Liang

2009-12-10

Wafer identifications (wafer IDs) can be used to distinguish wafers from each other so that wafer processing can be traced easily. Wafer ID recognition is an optical character recognition problem. The process used to recognize wafer IDs is similar to that used in recognizing car license-plate characters. However, because of some unique characteristics, such as the irregular space between two characters and the discontinuous strokes of a wafer ID, directly applying the approaches used in car license-plate character recognition does not give good results. Wafer ID scratches are engraved by a laser scribe almost entirely along four fixed directions: horizontal, vertical, plus 45 degrees, and minus 45 degrees. The closer to the center line of a wafer ID scratch, the higher the gray level will be. These and other characteristics increase the difficulty of recognizing the wafer ID. In this paper a wafer ID recognition scheme based on an asterisk-shape filter and a high-low score comparison method is proposed to cope with the serious influence of uneven luminance and make recognition more efficient. Our proposed approach consists of several processing stages. In the final recognition stage, a template-matching method combined with stroke analysis is used as the recognition scheme. This is because wafer IDs are composed of Semiconductor Equipment and Materials International (SEMI) standard Arabic numerals and English letters, and thus the template ID images are easy to obtain. Furthermore, unlike approaches that require prior training, such as a support vector machine, which often needs a large number of training image samples, our approach requires no prior training. The testing results show that our proposed scheme can efficiently and correctly segment out and recognize the wafer ID with high performance.
A Mixture Rasch Model-Based Computerized Adaptive Test for Latent Class Identification

ERIC Educational Resources Information Center

Jiao, Hong; Macready, George; Liu, Junhui; Cho, Youngmi

2012-01-01

This study explored a computerized adaptive test delivery algorithm for latent class identification based on the mixture Rasch model. Four item selection methods based on the Kullback-Leibler (KL) information were proposed and compared with the reversed and the adaptive KL information under simulated testing conditions. When item separation was…
Depressive status explains a significant amount of the variance in COPD assessment test (CAT) scores

PubMed Central

Miravitlles, Marc; Molina, Jesús; Quintano, José Antonio; Campuzano, Anna; Pérez, Joselín; Roncero, Carlos

2018-01-01

Background: COPD assessment test (CAT) is a short, easy-to-complete health status tool that has been incorporated into the multidimensional assessment of COPD in order to guide therapy; therefore, it is important to understand the factors determining CAT scores. Methods: This is a post hoc analysis of a cross-sectional, observational study conducted in respiratory medicine departments and primary care centers in Spain with the aim of identifying the factors determining CAT scores, focusing particularly on the cognitive status measured by the Mini-Mental State Examination (MMSE) and levels of depression measured by the short Beck Depression Inventory (BDI). Results: A total of 684 COPD patients were analyzed; 84.1% were men, the mean age of patients was 68.7 years, and the mean forced expiratory volume in 1 second (%) was 55.1%. Mean CAT score was 21.8. CAT scores correlated with the MMSE score (Pearson’s coefficient r=−0.371) and the BDI (r=0.620), both p<0.001. In the multivariate analysis, the usual COPD severity variables (age, dyspnea, lung function, and comorbidity) together with MMSE and BDI scores were significantly associated with CAT scores and explained 45% of the variability. However, a model including only MMSE and BDI scores explained up to 40% and BDI alone explained 38% of the CAT variance. Conclusion: CAT scores are associated with clinical variables of severity of COPD. However, cognitive status and, in particular, the level of depression explain a larger percentage of the variance in the CAT scores than the usual COPD clinical severity variables. PMID:29563782
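As a rough consistency check on the figures in this abstract: in a simple linear regression with one predictor, the share of variance explained equals the squared Pearson correlation. The sketch below applies that identity to the two reported coefficients. The function name is illustrative and does not come from the study, and the study's own models may have been fitted differently.

```python
def variance_explained(r: float) -> float:
    """Share of variance explained by a single predictor (r squared)."""
    return r * r


# Pearson correlations with the CAT score, as reported in the abstract.
r_bdi = 0.620    # short Beck Depression Inventory
r_mmse = -0.371  # Mini-Mental State Examination

bdi_share = variance_explained(r_bdi)
mmse_share = variance_explained(r_mmse)

print(f"BDI alone:  {bdi_share:.1%}")
print(f"MMSE alone: {mmse_share:.1%}")
```

The BDI value rounds to the 38% reported for BDI alone. The two single-predictor shares cannot simply be added to obtain the figure for the two-predictor model unless MMSE and BDI scores are uncorrelated with each other.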
Depressive status explains a significant amount of the variance in COPD assessment test (CAT) scores.

PubMed

Miravitlles, Marc; Molina, Jesús; Quintano, José Antonio; Campuzano, Anna; Pérez, Joselín; Roncero, Carlos

2018-01-01

COPD assessment test (CAT) is a short, easy-to-complete health status tool that has been incorporated into the multidimensional assessment of COPD in order to guide therapy; therefore, it is important to understand the factors determining CAT scores. This is a post hoc analysis of a cross-sectional, observational study conducted in respiratory medicine departments and primary care centers in Spain with the aim of identifying the factors determining CAT scores, focusing particularly on the cognitive status measured by the Mini-Mental State Examination (MMSE) and levels of depression measured by the short Beck Depression Inventory (BDI). A total of 684 COPD patients were analyzed; 84.1% were men, the mean age of patients was 68.7 years, and the mean forced expiratory volume in 1 second (%) was 55.1%. Mean CAT score was 21.8. CAT scores correlated with the MMSE score (Pearson's coefficient r = −0.371) and the BDI (r = 0.620), both p < 0.001. In the multivariate analysis, the usual COPD severity variables (age, dyspnea, lung function, and comorbidity) together with MMSE and BDI scores were significantly associated with CAT scores and explained 45% of the variability. However, a model including only MMSE and BDI scores explained up to 40% and BDI alone explained 38% of the CAT variance. CAT scores are associated with clinical variables of severity of COPD. However, cognitive status and, in particular, the level of depression explain a larger percentage of the variance in the CAT scores than the usual COPD clinical severity variables.
Meta-Analyses of the Relationship of Creative Achievement to both IQ and Divergent Thinking Test Scores

ERIC Educational Resources Information Center

Kim, Kyung Hee

2008-01-01

There is disagreement among researchers about whether IQ tests or divergent thinking (DT) tests are better predictors of creative achievement. Resolving this dispute is complicated by the fact that some research has shown a relationship between IQ and DT test scores (e.g., Runco & Albert, 1986; Wallach, 1970). The present study conducted…
Specific algorithm method of scoring the Clock Drawing Test applied in cognitively normal elderly

PubMed Central

Mendes-Santos, Liana Chaves; Mograbi, Daniel; Spenciere, Bárbara; Charchat-Fichman, Helenice

2015-01-01

The Clock Drawing Test (CDT) is an inexpensive, fast and easily administered measure of cognitive function, especially in the elderly. This instrument is a popular clinical tool widely used in screening for cognitive disorders and dementia. The CDT can be applied in different ways and scoring procedures also vary. Objective: The aims of this study were to analyze the performance of elderly on the CDT and evaluate inter-rater reliability of the CDT scored by using a specific algorithm method adapted from Sunderland et al. (1989). Methods: We analyzed the CDT of 100 cognitively normal elderly aged 60 years or older. The CDT ("free-drawn") and Mini-Mental State Examination (MMSE) were administered to all participants. Six independent examiners scored the CDT of 30 participants to evaluate inter-rater reliability. Results and Conclusion: A score of 5 on the proposed algorithm ("Numbers in reverse order or concentrated"), equivalent to 5 points on the original Sunderland scale, was the most frequent (53.5%). The CDT specific algorithm method used had high inter-rater reliability (p<0.01), and mean score ranged from 5.06 to 5.96. The high frequency of an overall score of 5 points may suggest the need to create more nuanced evaluation criteria, which are sensitive to differences in levels of impairment in visuoconstructive and executive abilities during aging. PMID:29213954
Higher Blood Harmane (1-Methyl-9h-Pyrido[3,4-B]Indole) Concentrations Correlate With Lower Olfactory Scores In Essential Tremor

PubMed Central

Louis, Elan D.; Rios, Eileen; Pellegrino, Kathryn M.; Jiang, Wendy; Factor-Litvak, Pam; Zheng, Wei

2008-01-01

Background: Harmane (1-methyl-9H-pyrido[3,4-b]indole), a neurotoxin, may be an environmental risk factor for essential tremor (ET). Harmane and related chemicals are toxic to the cerebellum. Whether it is through this mechanism (cerebellar toxicity) that harmane leads to ET is unknown. Impaired olfaction may be a feature of cerebellar disease. Objective: To determine whether blood harmane concentrations correlate with olfactory test scores in patients with ET. Methods: Blood harmane concentrations were quantified using high performance liquid chromatography. Odor identification testing was performed with the University of Pennsylvania Smell Identification Test (UPSIT). Results: In 83 ET cases, higher log blood harmane concentration was correlated with lower UPSIT score (rho = −0.46, p < 0.001). 25/40 (62.5%) cases with high log blood harmane concentration (based on a median split) had low UPSIT scores (based on a median split) vs. 12/43 (27.9%) ET cases with low log blood harmane concentration (adjusted OR 4.04, 95% CI 1.42 – 11.50, p = 0.009). When compared with the low log blood harmane tertile, the odds of olfactory dysfunction were 2.64 times higher in cases in the middle tertile and 10.95 times higher in cases in the high tertile. In 69 control subjects, higher log blood harmane concentration was not correlated with lower UPSIT score (rho = 0.12, p = 0.32). Conclusions: Blood harmane concentrations were correlated with UPSIT scores in ET cases but not controls. These analyses set the stage for postmortem studies to further explore the role of harmane as a cerebellar toxin in ET. PMID:18417221
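The median-split counts in this abstract are enough to recompute the two percentages and a crude (unadjusted) odds ratio. The sketch below does that arithmetic. The function name is illustrative, and the crude value is not expected to match the reported adjusted OR of 4.04, because the abstract does not state which covariates were adjusted for.

```python
def crude_odds_ratio(exposed_cases: int, exposed_total: int,
                     unexposed_cases: int, unexposed_total: int) -> float:
    """Unadjusted odds ratio from case counts and group totals of a 2x2 table."""
    exposed_odds = exposed_cases / (exposed_total - exposed_cases)
    unexposed_odds = unexposed_cases / (unexposed_total - unexposed_cases)
    return exposed_odds / unexposed_odds


# Counts reported in the abstract: ET cases with low UPSIT scores,
# split by high vs. low log blood harmane concentration.
high_share = 25 / 40
low_share = 12 / 43
crude_or = crude_odds_ratio(25, 40, 12, 43)

print(f"Low UPSIT, high harmane: {high_share:.1%}")
print(f"Low UPSIT, low harmane:  {low_share:.1%}")
print(f"Crude odds ratio:        {crude_or:.2f}")
```

The crude ratio comes out somewhat above the adjusted estimate, which is the direction one would expect if part of the raw association is accounted for by the adjustment variables.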
An Investigation of Calculator Use on Employment Tests of Mathematical Ability: Effects on Reliability, Validity, Test Scores, and Speed of Completion

ERIC Educational Resources Information Center

Bing, Mark N.; Stewart, Susan M.; Davison, H. Kristl

2009-01-01

Handheld calculators have been used on the job for more than 30 years, yet the degree to which these devices can affect performance on employment tests of mathematical ability has not been thoroughly examined. This study used a within-subjects research design (N = 167) to investigate the effects of calculator use on test score reliability, test…
Developing Local Oral Reading Fluency Cut Scores for Predicting High-Stakes Test Performance

ERIC Educational Resources Information Center

Grapin, Sally L.; Kranzler, John H.; Waldron, Nancy; Joyce-Beaulieu, Diana; Algina, James

2017-01-01

This study evaluated the classification accuracy of a second grade oral reading fluency curriculum-based measure (R-CBM) in predicting third grade state test performance. It also compared the long-term classification accuracy of local and publisher-recommended R-CBM cut scores. Participants were 266 students who were divided into a calibration…
Understanding the Role of "SES," Ethnicity, and Discipline Infractions in Students' Standardized Test Scores

ERIC Educational Resources Information Center

Koca, Fatih

2017-01-01

The goal of the current study is to examine the impact of students' socioeconomic status, ethnicity, and discipline infractions on their standardized test scores in Indiana, USA. Data for this study were extracted from the Indiana Department of Education. ISTEP is a criterion-referenced standardized test. It consists of items that assess a student's…
Physiologic Dysfunction Scores and Cognitive Function Test Performance in United States Adults

PubMed Central

Kobrosly, Roni W; Seplaki, Christopher L; Jones, Courtney M; van Wijngaarden, Edwin

2013-01-01

Objective: To investigate the relationship between a measure of cumulative physiologic dysfunction and specific domains of cognitive function. Methods: We examined a summary score measuring physiological dysfunction, a multisystem measure of the body’s ability to effectively adapt to physical and psychological demands, in relation to cognitive function deficits in a population of 4511 adults aged 20 to 59 who participated in the third National Health and Nutrition Examination Survey (1988–1994). Measures of cognitive function comprised three domains: working memory, visuomotor speed, and perceptual-motor speed. ‘Physiologic dysfunction’ scores summarizing measures of cardiovascular, immunologic, kidney, and liver function were explored. We used multiple linear regression models to estimate associations between cognitive function measures and physiological dysfunction scores, adjusting for socioeconomic factors, test conditions, and self-reported health factors. Results: We noted a dose-response relationship between physiologic dysfunction and working memory (coefficient = 0.207, 95% CI = (0.066, 0.348), p < 0.0001) that persisted after adjustment for all covariates (p = 0.03). We did not observe any significant relationships between dysfunction scores and visuomotor (p = 0.37) or perceptual-motor ability (p = 0.33). Conclusions: Our findings suggest that multisystem physiologic dysfunction is associated with working memory. Future longitudinal studies are needed to clarify the underlying mechanisms and explore the persistency of this association into later life. We suggest that such studies should incorporate physiologic data, neuroendocrine parameters, and a wide range of specific cognitive domains. PMID:22155941
Rapid Identification of Bacteria in Positive Blood Culture Broths by Matrix-Assisted Laser Desorption Ionization-Time of Flight Mass Spectrometry

PubMed Central

Stevenson, Lindsay G.; Drake, Steven K.; Murray, Patrick R.

2010-01-01

Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry is a rapid, accurate method for identifying bacteria and fungi recovered on agar culture media. We report herein a method for the direct identification of bacteria in positive blood culture broths by MALDI-TOF mass spectrometry. A total of 212 positive cultures were examined, representing 32 genera and 60 species or groups. The identification of bacterial isolates by MALDI-TOF mass spectrometry was compared with biochemical testing, and discrepancies were resolved by gene sequencing. No identification (spectral score of <1.7) was obtained for 42 (19.8%) of the isolates, due most commonly to insufficient numbers of bacteria in the blood culture broth. Of the bacteria with a spectral score of ≥1.7, 162 (95.3%) of 170 isolates were correctly identified. All 8 isolates of Streptococcus mitis were misidentified as being Streptococcus pneumoniae isolates. This method provides a rapid, accurate, definitive identification of bacteria within 1 h of detection in positive blood cultures with the caveat that the identification of S. pneumoniae would have to be confirmed by an alternative test. PMID:19955282
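The proportions in this abstract follow directly from the reported counts and the spectral score cutoff of 1.7. The sketch below recomputes them. The function and variable names are illustrative only, and the last figure (correct identifications as a share of all 212 cultures) is derived here from the reported counts rather than stated in the abstract.

```python
SCORE_CUTOFF = 1.7  # spectral score below which no identification is reported


def has_identification(spectral_score: float) -> bool:
    """True when a spectral score meets the cutoff described in the abstract."""
    return spectral_score >= SCORE_CUTOFF


# Counts reported in the abstract.
total_cultures = 212
no_identification = 42   # spectral score < 1.7
correctly_identified = 162

scored = total_cultures - no_identification       # isolates with score >= 1.7
no_id_rate = no_identification / total_cultures
accuracy_among_scored = correctly_identified / scored
overall_yield = correctly_identified / total_cultures  # derived, not reported

print(f"Isolates scored >= {SCORE_CUTOFF}: {scored}")
print(f"No identification:      {no_id_rate:.1%}")
print(f"Correct among scored:   {accuracy_among_scored:.1%}")
print(f"Correct among all 212:  {overall_yield:.1%}")
```

Separating accuracy among scored isolates from the overall yield makes clear that the 95.3% figure applies only to the 170 isolates that cleared the cutoff.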
Rapid identification of bacteria in positive blood culture broths by matrix-assisted laser desorption ionization-time of flight mass spectrometry.

PubMed

Stevenson, Lindsay G; Drake, Steven K; Murray, Patrick R

2010-02-01

Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) mass spectrometry is a rapid, accurate method for identifying bacteria and fungi recovered on agar culture media. We report herein a method for the direct identification of bacteria in positive blood culture broths by MALDI-TOF mass spectrometry. A total of 212 positive cultures were examined, representing 32 genera and 60 species or groups. The identification of bacterial isolates by MALDI-TOF mass spectrometry was compared with biochemical testing, and discrepancies were resolved by gene sequencing. No identification (spectral score of <1.7) was obtained for 42 (19.8%) of the isolates, due most commonly to insufficient numbers of bacteria in the blood culture broth. Of the bacteria with a spectral score of ≥1.7, 162 (95.3%) of 170 isolates were correctly identified. All 8 isolates of Streptococcus mitis were misidentified as being Streptococcus pneumoniae isolates. This method provides a rapid, accurate, definitive identification of bacteria within 1 h of detection in positive blood cultures with the caveat that the identification of S. pneumoniae would have to be confirmed by an alternative test.
Automated Essay Scoring versus Human Scoring: A Correlational Study

ERIC Educational Resources Information Center

Wang, Jinhao; Brown, Michelle Stallone

2008-01-01

The purpose of the current study was to analyze the relationship between automated essay scoring (AES) and human scoring in order to determine the validity and usefulness of AES for large-scale placement tests. Specifically, a correlational research design was used to examine the correlations between AES performance and human raters' performance.…
Academic Identification as a Mediator of the Relationship between Parental Socialization and Academic Achievement

ERIC Educational Resources Information Center

Strambler, Michael J.; Linke, Lance H.; Ward, Nadia L.

2013-01-01

This study examines whether academic identification, or one's psychological and emotional investment in academics, mediates the association between child-reported parental educational socialization and standardized achievement test scores among a predominantly ethnic minority sample of 367 urban middle school students. We predicted that academic…
Turkish version of the modified Constant-Murley score and standardized test protocol: reliability and validity.

PubMed

Çelik, Derya

2016-01-01

The Constant-Murley score (CMS) is widely used to evaluate disabilities associated with shoulder injuries, but it has been criticized for relying on imprecise terminology and a lack of standardized methodology. A modified guideline, therefore, was published in 2008 with several recommendations. This new version has not yet been translated or culturally adapted for Turkish-speaking populations. The purpose of this study was to translate and cross-culturally adapt the modified CMS and its test protocol, as well as define and measure its reliability and validity. The modified CMS was translated into Turkish, consistent with published methodological guidelines. The measurement properties of the Turkish version of the modified CMS were tested in 30 patients (12 males, 18 females; mean age: 59.5±13.5 years) with a variety of shoulder pathologies. Intraclass correlation coefficients (ICC) were used to estimate test-retest reliability. Construct validity was analyzed with the Turkish version of the American Shoulder and Elbow Surgeons (ASES) Standardized Shoulder Assessment Form and Short-Form Health Survey (SF-12). No difficulties were found in the translation process. The Turkish version of the modified CMS showed excellent test-retest reliability (ICC=0.86). The correlation coefficients between the Turkish version of the modified CMS and the ASES, SF-12-physical component score, and SF-12 mental component scores were found to be 0.48, 0.35, and 0.05, respectively. No floor or ceiling effects were found. The translation and cultural adaptation of the modified CMS and its standardized test protocol into Turkish were successful. The Turkish version of the modified CMS has sufficient reliability and validity to measure a variety of shoulder disorders for Turkish-speaking individuals.
Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

ERIC Educational Resources Information Center

Yao, Lihua

2012-01-01

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

ERIC Educational Resources Information Center

Almond, Russell G.

2014-01-01

Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…
Development and Validation of Scores from an Instrument Measuring Student Test-Taking Motivation

ERIC Educational Resources Information Center

Eklof, Hanna

2006-01-01

Using the expectancy-value model of achievement motivation as a basis, this study's purpose is to develop, apply, and validate scores from a self-report instrument measuring student test-taking motivation. Sampled evidence of construct validity for the present sample indicates that a number of the items in the instrument could be used as an…

The Fight's Not Always Fixed: Using Literary Response to Transcend Standardized Test Scores

ERIC Educational Resources Information Center

Avila, JuliAnna

2012-01-01

In 2004, the National Endowment for the Arts (NEA) concluded that "literature reading is fading as a meaningful activity, especially among younger people." How can educators continue to teach students about the power of literary response when the priority is for them to achieve proficiency on standardized tests, whose scores can only be narrowly…
Testing a potential alternative to traditional identification procedures: Reaction time-based concealed information test does not work for lineups with cooperative witnesses.

PubMed

Sauerland, Melanie; Wolfs, Andrea C F; Crans, Samantha; Verschuere, Bruno

2017-11-27

Direct eyewitness identification is widely used, but prone to error. We tested the validity of indirect eyewitness identification decisions using the reaction time-based concealed information test (CIT) for assessing cooperative eyewitnesses' face memory as an alternative to traditional lineup procedures. In a series of five experiments, a total of 401 mock eyewitnesses watched one of 11 different stimulus events that depicted a breach of law. Eyewitness identifications in the CIT were derived from longer reaction times as compared to well-matched foil faces not encountered before. Across the five experiments, the weighted mean effect size d was 0.14 (95% CI 0.08-0.19). The reaction time-based CIT seems unsuited for testing cooperative eyewitnesses' memory for faces. The careful matching of the faces required for a fair lineup or the lack of intent to deceive may have hampered the diagnosticity of the reaction time-based CIT.
Implications of Deployed and Nondeployed Fathers on Seventh Graders' California Achievement Test Scores during a Military Crisis.

ERIC Educational Resources Information Center

Pisano, Mark C.

The differences in California Achievement Test (CAT) scores from 1990 to 1991 in seventh graders, currently enrolled in Albritton Junior High School in the Fort Bragg Schools, of deployed and nondeployed fathers were analyzed. CAT percentile scores from 1990 and 1991 (1991 being the year of "Desert Storm") were obtained in reading, math…
Benchmarks for the Dichotic Sentence Identification test in Brazilian Portuguese for ear and age.

PubMed

Andrade, Adriana Neves de; Gil, Daniela; Iorio, Maria Cecilia Martinelli

2015-01-01

Dichotic listening tests should be used in local languages and adapted for the population. The objective was to standardize the Brazilian Portuguese version of the Dichotic Sentence Identification test in normal listeners, comparing performance by age and ear. This prospective study included 200 normal listeners divided into four groups according to age: 13-19 years (GI), 20-29 years (GII), 30-39 years (GIII), and 40-49 years (GIV). The Dichotic Sentence Identification test was applied in four stages: training, binaural integration and directed sound from right and left. Better results for the right ear were observed in the stages of binaural integration in all assessed groups. There was a negative correlation between age and percentage of correct responses in both ears for free report and training. The worst performance in all stages of the test was observed for the age group 40-49 years old. Reference values for the Brazilian Portuguese version of the Dichotic Sentence Identification test in normal listeners aged 13-49 years were established according to age, ear, and test stage; they should be used as benchmarks when evaluating individuals with these characteristics. Copyright © 2015 Associação Brasileira de Otorrinolaringologia e Cirurgia Cérvico-Facial. Published by Elsevier Editora Ltda. All rights reserved.
Using a Concept-Grounded, Curriculum-Based Measure in Mathematics To Predict Statewide Test Scores for Middle School Students with LD.

ERIC Educational Resources Information Center

Helwig, Robert; Anderson, Lisbeth; Tindal, Gerald

2002-01-01

An 11-item math concept curriculum-based measure (CBM) was administered to 171 eighth grade students. Scores were correlated with scores from a computer adaptive test designed in conjunction with the state to approximate the official statewide mathematics achievement tests. Correlations for general education students and students with learning…
Zertifikat Deutsch als Fremdsprache and the Oral Proficiency Interview: A Comparison of Test Scores and Examinations.

ERIC Educational Resources Information Center

Lalande, John F.; Schweckendiek, Jurgen

1986-01-01

Investigates what correlations might exist between an individual's score on the Zertifikat Deutsch als Fremdsprache and on the Oral Proficiency Interview. The tests themselves are briefly described. Results indicate that the two tests appear to correlate well in their evaluation of speaking skills. (SED)
Linking Composite Scores: Effects of Anchor Test Length and Content Representativeness. Research Report. ETS RR-16-36

ERIC Educational Resources Information Center

Lin, Peng; Dorans, Neil; Weeks, Jonathan

2016-01-01

The nonequivalent groups with anchor test (NEAT) design is frequently used in test score equating or linking. One important assumption of the NEAT design is that the anchor test is a miniversion of the 2 tests to be equated/linked. When the content of the 2 tests is different, it is not possible for the anchor test to be adequately representative…
Validation of scores of use of inhalation devices: valoration of errors

PubMed Central

Zambelli-Simões, Letícia; Martins, Maria Cleusa; Possari, Juliana Carneiro da Cunha; Carvalho, Greice Borges; Coelho, Ana Carla Carvalho; Cipriano, Sonia Lucena; de Carvalho-Pinto, Regina Maria; Cukier, Alberto; Stelmach, Rafael

2015-01-01

Objective: To validate two scores quantifying the ability of patients to use metered dose inhalers (MDIs) or dry powder inhalers (DPIs); to identify the most common errors made during their use; and to identify the patients in need of an educational program for the use of these devices. Methods: This study was conducted in three phases: validation of the reliability of the inhaler technique scores; validation of the contents of the two scores using a convenience sample; and testing for criterion validation and discriminant validation of these instruments in patients who met the inclusion criteria. Results: The convenience sample comprised 16 patients. Interobserver disagreement was found in 19% and 25% of the DPI and MDI scores, respectively. After expert analysis on the subject, the scores were modified and were applied in 72 patients. The most relevant difficulty encountered during the use of both types of devices was the maintenance of total lung capacity after a deep inhalation. The degree of correlation of the scores by observer was 0.97 (p < 0.0001). There was good interobserver agreement in the classification of patients as able/not able to use a DPI (50%/50% and 52%/58%; p < 0.01) and an MDI (49%/51% and 54%/46%; p < 0.05). Conclusions: The validated scores allow the identification and correction of inhaler technique errors during consultations and, as a result, improvement in the management of inhalation devices. PMID:26398751
A sup-score test for the cure fraction in mixture models for long-term survivors.

PubMed

Hsu, Wei-Wen; Todem, David; Kim, KyungMann

2016-12-01

The evaluation of cure fractions in oncology research under the well known cure rate model has attracted considerable attention in the literature, but most of the existing testing procedures have relied on restrictive assumptions. A common assumption has been to restrict the cure fraction to a constant under alternatives to homogeneity, thereby neglecting any information from covariates. This article extends the literature by developing a score-based statistic that incorporates covariate information to detect cure fractions, with the existing testing procedure serving as a special case. A complication of this extension, however, is that the implied hypotheses are not typical and standard regularity conditions to conduct the test may not even hold. Using empirical processes arguments, we construct a sup-score test statistic for cure fractions and establish its limiting null distribution as a functional of mixtures of chi-square processes. In practice, we suggest a simple resampling procedure to approximate this limiting distribution. Our simulation results show that the proposed test can greatly improve efficiency over tests that neglect the heterogeneity of the cure fraction under the alternative. The practical utility of the methodology is illustrated using ovarian cancer survival data with long-term follow-up from the surveillance, epidemiology, and end results registry. © 2016, The International Biometric Society.
Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease.

PubMed

Moradi, Elaheh; Hallikainen, Ilona; Hänninen, Tuomo; Tohka, Jussi

2017-01-01

Rey's Auditory Verbal Learning Test (RAVLT) is a powerful neuropsychological tool for testing episodic memory, which is widely used for cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflects well the underlying pathology caused by Alzheimer's disease (AD), thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting) and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI) data using machine learning approaches, as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via an elastic net penalized linear regression model. The proposed approach provided a highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50) and RAVLT Percent Forgetting (R = 0.43) in a dataset consisting of 806 AD, mild cognitive impairment (MCI) or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD within 3 years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.
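The modelling idea in this record, an elastic net penalized linear regression scored by the cross-validated correlation between estimated and observed values, can be sketched in a few lines. What follows is only an illustration under stated assumptions: the data are synthetic (not MRI or RAVLT data), the coordinate-descent solver, the penalty settings, and all function names are hypothetical choices made here, and nothing below reproduces the authors' pipeline.

```python
import math
import random


def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the L1 part of the penalty)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def fit_elastic_net(X, y, alpha=0.1, l1_ratio=0.5, n_iter=100):
    """Coordinate descent for (1/2n)||y - Xw||^2 + alpha*l1_ratio*||w||_1
    + 0.5*alpha*(1 - l1_ratio)*||w||^2. No intercept: data assumed centered."""
    n, p = len(X), len(X[0])
    w = [0.0] * p
    residual = list(y)  # equals y - Xw while w is all zeros
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_iter):
        for j in range(p):
            if col_sq[j] == 0.0:
                continue
            # Covariance of feature j with the residual that excludes feature j.
            rho = sum(X[i][j] * (residual[i] + X[i][j] * w[j]) for i in range(n)) / n
            new_w = soft_threshold(rho, alpha * l1_ratio) / (col_sq[j] + alpha * (1 - l1_ratio))
            delta = new_w - w[j]
            if delta != 0.0:
                for i in range(n):
                    residual[i] -= X[i][j] * delta
                w[j] = new_w
    return w


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (v - mb) for x, v in zip(a, b))
    var_a = sum((x - ma) ** 2 for x in a)
    var_b = sum((v - mb) ** 2 for v in b)
    return cov / math.sqrt(var_a * var_b)


def cross_validated_correlation(X, y, k=5, **fit_kwargs):
    """Predict every subject from a model fitted without that subject's fold."""
    n = len(X)
    predictions = [0.0] * n
    for fold in range(k):
        train = [i for i in range(n) if i % k != fold]
        test = [i for i in range(n) if i % k == fold]
        w = fit_elastic_net([X[i] for i in train], [y[i] for i in train], **fit_kwargs)
        for i in test:
            predictions[i] = sum(wj * xj for wj, xj in zip(w, X[i]))
    return pearson(predictions, y)


# Synthetic stand-in data: 120 "subjects", 20 "features", 3 of them informative.
rng = random.Random(0)
n_subjects, n_features = 120, 20
true_w = [1.5, -1.0, 0.5] + [0.0] * (n_features - 3)
X = [[rng.gauss(0, 1) for _ in range(n_features)] for _ in range(n_subjects)]
y = [sum(w * x for w, x in zip(true_w, row)) + rng.gauss(0, 1) for row in X]

r = cross_validated_correlation(X, y, k=5, alpha=0.1, l1_ratio=0.5)
print(f"cross-validated R = {r:.2f}")
```

Because each prediction comes from a model that never saw that subject, the resulting correlation is an out-of-sample estimate, which is the role the cross-validated R values play in the abstract.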
Changing abilities vs. changing tasks: Examining validity degradation with test scores and college performance criteria both assessed longitudinally.

PubMed

Dahlke, Jeffrey A; Kostal, Jack W; Sackett, Paul R; Kuncel, Nathan R

2018-05-03

We explore potential explanations for validity degradation using a unique predictive validation data set containing up to four consecutive years of high school students' cognitive test scores and four complete years of those students' college grades. This data set permits analyses that disentangle the effects of predictor-score age and timing of criterion measurements on validity degradation. We investigate the extent to which validity degradation is explained by criterion dynamism versus the limited shelf-life of ability scores. We also explore whether validity degradation is attributable to fluctuations in criterion variability over time and/or GPA contamination from individual differences in course-taking patterns. Analyses of multiyear predictor data suggest that changes to the determinants of performance over time have much stronger effects on validity degradation than does the shelf-life of cognitive test scores. The age of predictor scores had only a modest relationship with criterion-related validity when the criterion measurement occasion was held constant. Practical implications and recommendations for future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Clinical score and rapid antigen detection test to guide antibiotic use for sore throats: randomised controlled trial of PRISM (primary care streptococcal management).

PubMed

Little, Paul; Hobbs, F D Richard; Moore, Michael; Mant, David; Williamson, Ian; McNulty, Cliodna; Cheng, Ying Edith; Leydon, Geraldine; McManus, Richard; Kelly, Joanne; Barnett, Jane; Glasziou, Paul; Mullee, Mark

2013-10-10

To determine the effect of clinical scores that predict streptococcal infection or rapid streptococcal antigen detection tests compared with delayed antibiotic prescribing. Open adaptive pragmatic parallel group randomised controlled trial. Primary care in United Kingdom. Patients aged ≥ 3 with acute sore throat. An internet programme randomised patients to targeted antibiotic use according to: delayed antibiotics (the comparator group for analyses), clinical score, or antigen test used according to clinical score. During the trial a preliminary streptococcal score (score 1, n=1129) was replaced by a more consistent score (score 2, n=631; features: fever during previous 24 hours; purulence; attends rapidly (within three days after onset of symptoms); inflamed tonsils; no cough/coryza (acronym FeverPAIN)). Symptom severity reported by patients on a 7 point Likert scale (mean severity of sore throat/difficulty swallowing for days two to four after the consultation (primary outcome)), duration of symptoms, use of antibiotics. For score 1 there were no significant differences between groups. For score 2, symptom severity was documented in 80% (168/207 (81%) in delayed antibiotics group; 168/211 (80%) in clinical score group; 166/213 (78%) in antigen test group). Reported severity of symptoms was lower in the clinical score group (-0.33, 95% confidence interval -0.64 to -0.02; P=0.04), equivalent to one in three rating sore throat a slight versus moderate problem, with a similar reduction for the antigen test group (-0.30, -0.61 to -0.00; P=0.05). Symptoms rated moderately bad or worse resolved significantly faster in the clinical score group (hazard ratio 1.30, 95% confidence interval 1.03 to 1.63) but not the antigen test group (1.11, 0.88 to 1.40). In the delayed antibiotics group, 75/164 (46%) used antibiotics. Use of antibiotics in the clinical score group (60/161) was 29% lower (adjusted risk ratio 0.71, 95% confidence interval 0.50 to 0.95; P=0.02) and in the
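The five features listed for score 2 (FeverPAIN) in the record above lend themselves to a simple tally. The sketch below assumes one point per feature present, giving a total between 0 and 5; the abstract lists the features but does not state the weighting, so that weighting, along with the class and field names, is an assumption made here for illustration. No treatment thresholds are encoded, since the record does not give any.

```python
from dataclasses import dataclass


@dataclass
class SoreThroatFindings:
    """Hypothetical container for the five FeverPAIN features named in the abstract."""
    fever_in_previous_24h: bool
    purulence: bool
    attended_within_3_days: bool  # "attends rapidly" after onset of symptoms
    inflamed_tonsils: bool
    cough_or_coryza: bool  # the score counts the ABSENCE of cough/coryza


def feverpain_score(findings: SoreThroatFindings) -> int:
    """Count the features present, assuming one point each (range 0 to 5)."""
    return sum([
        findings.fever_in_previous_24h,
        findings.purulence,
        findings.attended_within_3_days,
        findings.inflamed_tonsils,
        not findings.cough_or_coryza,
    ])


# Illustrative patient, not trial data: fever, inflamed tonsils, and no cough.
example = SoreThroatFindings(
    fever_in_previous_24h=True,
    purulence=False,
    attended_within_3_days=False,
    inflamed_tonsils=True,
    cough_or_coryza=False,
)
print(feverpain_score(example))
```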
Direct-to-consumer genetic testing for predicting sports performance and talent identification: Consensus statement

PubMed Central

Webborn, Nick; Williams, Alun; McNamee, Mike; Bouchard, Claude; Pitsiladis, Yannis; Ahmetov, Ildus; Ashley, Euan; Byrne, Nuala; Camporesi, Silvia; Collins, Malcolm; Dijkstra, Paul; Eynon, Nir; Fuku, Noriyuki; Garton, Fleur C; Hoppe, Nils; Holm, Søren; Kaye, Jane; Klissouras, Vassilis; Lucia, Alejandro; Maase, Kamiel; Moran, Colin; North, Kathryn N; Pigozzi, Fabio; Wang, Guan

2015-01-01

The general consensus among sport and exercise genetics researchers is that genetic tests have no role to play in talent identification or the individualised prescription of training to maximise performance. Despite the lack of evidence, recent years have witnessed the rise of an emerging market of direct-to-consumer marketing (DTC) tests that claim to be able to identify children's athletic talents. Targeted consumers include mainly coaches and parents. There is concern among the scientific community that the current level of knowledge is being misrepresented for commercial purposes. There remains a lack of universally accepted guidelines and legislation for DTC testing in relation to all forms of genetic testing and not just for talent identification. There is concern over the lack of clarity of information over which specific genes or variants are being tested and the almost universal lack of appropriate genetic counselling for the interpretation of the genetic data to consumers. Furthermore, independent studies have identified issues relating to quality control by DTC laboratories, with different results being reported from samples from the same individual. Consequently, in the current state of knowledge, no child or young athlete should be exposed to DTC genetic testing to define or alter training or for talent identification aimed at selecting gifted children or adolescents. Large-scale collaborative projects may help to develop a stronger scientific foundation on these issues in the future. PMID:26582191
Towards reporting standards for neuropsychological study results: A proposal to minimize communication errors with standardized qualitative descriptors for normalized test scores.

PubMed

Schoenberg, Mike R; Rum, Ruba S

2017-11-01

Rapid, clear and efficient communication of neuropsychological results is essential to benefit patient care. Errors in communication are a leading cause of medical errors; nevertheless, there remains a lack of consistency in how neuropsychological scores are communicated. A major limitation in the communication of neuropsychological results is the inconsistent use of qualitative descriptors for standardized test scores and the use of vague terminology. A PubMed search from 1 Jan 2007 to 1 Aug 2016 was conducted to identify guidelines or consensus statements for the description and reporting of qualitative terms to communicate neuropsychological test scores. The review found the use of confusing and overlapping terms to describe various ranges of percentile standardized test scores. In response, we propose a simplified set of qualitative descriptors for normalized test scores (Q-Simple) as a means to reduce errors in communicating test results. The Q-Simple qualitative terms are: 'very superior', 'superior', 'high average', 'average', 'low average', 'borderline' and 'abnormal/impaired'. A case example illustrates the proposed Q-Simple qualitative classification system to communicate neuropsychological results for neurosurgical planning. The Q-Simple qualitative descriptor system is intended as a means to improve and standardize communication of standardized neuropsychological test scores. Research is needed to further evaluate neuropsychological communication errors. Conveying the clinical implications of neuropsychological results in a manner that minimizes risk for communication errors is a quintessential component of evidence-based practice. Copyright © 2017 Elsevier B.V. All rights reserved.
Sensitivity and Specificity of the Coma Recovery Scale--Revised Total Score in Detection of Conscious Awareness.

PubMed

Bodien, Yelena G; Carlowicz, Cecilia A; Chatelle, Camille; Giacino, Joseph T

2016-03-01

To describe the sensitivity and specificity of Coma Recovery Scale-Revised (CRS-R) total scores in detecting conscious awareness. Data were retrospectively extracted from the medical records of patients enrolled in a specialized disorders of consciousness (DOC) program. Sensitivity and specificity analyses were completed using CRS-R-derived diagnoses of minimally conscious state (MCS) or emerged from minimally conscious state (EMCS) as the reference standard for conscious awareness and the total CRS-R score as the test criterion. A receiver operating characteristic curve was constructed to demonstrate the optimal CRS-R total cutoff score for maximizing sensitivity and specificity. Specialized DOC program. Patients enrolled in the DOC program (N=252, 157 men; mean age, 49y; mean time from injury, 48d; traumatic etiology, n=127; nontraumatic etiology, n=125; diagnosis of coma or vegetative state, n=70; diagnosis of MCS or EMCS, n=182). Not applicable. Sensitivity and specificity of CRS-R total scores in detecting conscious awareness. A CRS-R total score of 10 or higher yielded a sensitivity of .78 for correct identification of patients in MCS or EMCS, and a specificity of 1.00 for correct identification of patients who did not meet criteria for either of these diagnoses (ie, were diagnosed with vegetative state or coma). The area under the curve in the receiver operating characteristic curve analysis is .98. A total CRS-R score of 10 or higher provides strong evidence of conscious awareness but resulted in a false-negative diagnostic error in 22% of patients who demonstrated conscious awareness based on CRS-R diagnostic criteria. A cutoff score of 8 provides the best balance between sensitivity and specificity, accurately classifying 93% of cases. The optimal total score cutoff will vary depending on the user's objective. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
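The cutoff analysis described in this record rests on two ratios: sensitivity is the share of truly conscious patients whose total score reaches the cutoff, and specificity is the share of the remaining patients whose score stays below it. A minimal sketch follows; the scores and reference diagnoses are invented toy values, not CRS-R data from the study, and the function name is hypothetical.

```python
def sensitivity_specificity(scores, is_conscious, cutoff):
    """Classify score >= cutoff as 'conscious' and compare with the reference diagnosis."""
    tp = fn = tn = fp = 0
    for score, truth in zip(scores, is_conscious):
        predicted = score >= cutoff
        if truth and predicted:
            tp += 1  # conscious patient correctly flagged
        elif truth and not predicted:
            fn += 1  # conscious patient missed (false negative)
        elif not truth and not predicted:
            tn += 1  # non-conscious patient correctly not flagged
        else:
            fp += 1  # non-conscious patient wrongly flagged
    return tp / (tp + fn), tn / (tn + fp)


# Toy data (hypothetical): total scores paired with a reference diagnosis.
scores = [2, 3, 5, 7, 8, 6, 8, 9, 11, 14, 17, 20]
is_conscious = [False] * 5 + [True] * 7

for cutoff in (8, 10):
    sens, spec = sensitivity_specificity(scores, is_conscious, cutoff)
    print(f"cutoff {cutoff}: sensitivity {sens:.2f}, specificity {spec:.2f}")
```

Raising the cutoff in the toy data removes the false positive at the cost of more missed cases, the same trade-off the abstract reports between cutoffs of 8 and 10.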
Effect of Mindfulness Meditation on Perceived Stress Scores and Autonomic Function Tests of Pregnant Indian Women.

PubMed

Muthukrishnan, Shobitha; Jain, Reena; Kohli, Sangeeta; Batra, Swaraj

2016-04-01

Various pregnancy complications such as hypertension and preeclampsia have been strongly correlated with maternal stress. One of the connecting links between pregnancy complications and maternal stress is mind-body intervention, which can be part of Complementary and Alternative Medicine (CAM). Biologic measures of stress during pregnancy may be reduced by such interventions. To evaluate the effect of Mindfulness meditation on perceived stress scores and autonomic function tests of pregnant Indian women. Pregnant Indian women of 12 weeks gestation were randomised to two treatment groups: Test group with Mindfulness meditation and control group with their usual obstetric care. The effect of Mindfulness meditation on perceived stress scores and cardiac sympathetic functions and parasympathetic functions (Heart rate variation with respiration, lying to standing ratio, standing to lying ratio and respiratory rate) were evaluated on pregnant Indian women. There was a significant decrease in perceived stress scores, a significant decrease of blood pressure response to cold pressor test and a significant increase in heart rate variability in the test group (p < 0.05, significant), which indicates that mindfulness meditation is a powerful modulator of the sympathetic nervous system and can thereby reduce the day-to-day perceived stress in pregnant women. The results of this study suggest that mindfulness meditation improves parasympathetic functions in pregnant women and is a powerful modulator of the sympathetic nervous system during pregnancy.
Factor Structure of Child Behavior Scale Scores in Peruvian Preschoolers

ERIC Educational Resources Information Center

Meyer, Erin L.; Schaefer, Barbara A.; Soto, Cesar Merino; Simmons, Crystal S.; Anguiano, Rebecca; Brett, Jeremy; Holman, Alea; Martin, Justin F.; Hata, Heidi K.; Roberts, Kimberly J.; Mello, Zena R.; Worrell, Frank C.

2011-01-01

Behavior rating scales aid in the identification of problem behaviors, as well as the development of interventions to reduce such behavior. Although scores on many behavior rating scales have been validated in the United States, there have been few such studies in other cultural contexts. In this study, the structural validity of scores on a…
Segregation and the Black-White Test Score Gap. NBER Working Paper No. 12988

ERIC Educational Resources Information Center

Vigdor, Jacob; Ludwig, Jens

2007-01-01

The mid-1980s witnessed breaks in two important trends related to race and schooling. School segregation, which had been declining, began a period of relative stasis. Black-white test score gaps, which had also been declining, also stagnated. The notion that these two phenomena may be related is also supported by basic cross-sectional evidence. We…
Adults with poor reading skills: How lexical knowledge interacts with scores on standardized reading comprehension tests

PubMed Central

McKoon, Gail; Ratcliff, Roger

2016-01-01

Millions of adults in the United States lack the necessary literacy skills for most living wage jobs. For students from adult learning classes, we used a lexical decision task to measure their knowledge of words and we used a decision-making model (Ratcliff’s, 1978, diffusion model) to abstract the mechanisms underlying their performance from their RTs and accuracy. We also collected scores for each participant on standardized IQ tests and standardized reading tests used commonly in the education literature. We found significant correlations between the model’s estimates of the strengths with which words are represented in memory and scores for some of the standardized tests but not others. The findings point to the feasibility and utility of combining a test of word knowledge, lexical decision, that is well-established in psycholinguistic research, a decision-making model that supplies information about underlying mechanisms, and standardized tests. The goal for future research is to use this combination of approaches to understand better how basic processes relate to standardized tests with the eventual aim of understanding what these tests are measuring and what the specific difficulties are for individual, low-literacy adults. PMID:26550803
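The diffusion model named in this record treats each lexical decision as noisy evidence accumulating from a starting point toward one of two boundaries; the boundary reached gives the response and the time taken, plus a non-decision component, gives the RT. The simulation below is only a sketch of that mechanism. Every parameter value (drift rates, boundary separation, starting point, noise, non-decision time) and every function name is a hypothetical choice for illustration, and the code simulates the process rather than fitting it to data as the authors did.

```python
import random


def simulate_trial(drift, rng, boundary=0.12, start=0.06, noise_sd=0.1,
                   non_decision=0.3, dt=0.001, max_time=5.0):
    """Simulate one decision: evidence drifts with noise until it leaves (0, boundary).

    Returns (correct, rt_seconds); reaching the upper boundary counts as correct.
    """
    evidence, elapsed = start, 0.0
    step_sd = noise_sd * dt ** 0.5  # noise scales with the square root of the time step
    while 0.0 < evidence < boundary and elapsed < max_time:
        evidence += drift * dt + rng.gauss(0.0, step_sd)
        elapsed += dt
    return evidence >= boundary, elapsed + non_decision


def summarize(drift, n_trials=500, seed=1):
    """Accuracy and mean RT over simulated trials for one drift rate."""
    rng = random.Random(seed)
    trials = [simulate_trial(drift, rng) for _ in range(n_trials)]
    accuracy = sum(correct for correct, _ in trials) / n_trials
    mean_rt = sum(rt for _, rt in trials) / n_trials
    return accuracy, mean_rt


# A larger drift rate stands in for a word that is strongly represented in memory.
acc_strong, rt_strong = summarize(drift=0.3)
acc_weak, rt_weak = summarize(drift=0.05)
print(f"strong representation: accuracy {acc_strong:.2f}, mean RT {rt_strong:.2f} s")
print(f"weak representation:   accuracy {acc_weak:.2f}, mean RT {rt_weak:.2f} s")
```

In this framing the drift rate is the quantity that would correspond to the strength with which a word is represented in memory, which is why a single parameter can account for both accuracy and RT.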
Adults with poor reading skills: How lexical knowledge interacts with scores on standardized reading comprehension tests.

PubMed

McKoon, Gail; Ratcliff, Roger

2016-01-01

Millions of adults in the United States lack the necessary literacy skills for most living wage jobs. For students from adult learning classes, we used a lexical decision task to measure their knowledge of words and we used a decision-making model (Ratcliff's, 1978, diffusion model) to abstract the mechanisms underlying their performance from their RTs and accuracy. We also collected scores for each participant on standardized IQ tests and standardized reading tests used commonly in the education literature. We found significant correlations between the model's estimates of the strengths with which words are represented in memory and scores for some of the standardized tests but not others. The findings point to the feasibility and utility of combining a test of word knowledge, lexical decision, that is well-established in psycholinguistic research, a decision-making model that supplies information about underlying mechanisms, and standardized tests. The goal for future research is to use this combination of approaches to understand better how basic processes relate to standardized tests with the eventual aim of understanding what these tests are measuring and what the specific difficulties are for individual, low-literacy adults. Copyright © 2015. Published by Elsevier B.V.
