mixture rasch model: Topics by Science.gov

Sample records for mixture rasch model

Rasch Mixture Models for DIF Detection

PubMed Central

Strobl, Carolin; Zeileis, Achim

2014-01-01

Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch mixture models is sensitive to the specification of the ability distribution even when the conditional maximum likelihood approach is used. It is demonstrated in a simulation study how differences in ability can influence the latent classes of a Rasch mixture model. If the aim is only DIF detection, it is not of interest to uncover such ability differences as one is only interested in a latent group structure regarding the item difficulties. To avoid any confounding effect of ability differences (or impact), a new score distribution for the Rasch mixture model is introduced here. It ensures the estimation of the Rasch mixture model to be independent of the ability distribution and thus restricts the mixture to be sensitive to latent structure in the item difficulties only. Its usefulness is demonstrated in a simulation study, and its application is illustrated in a study of verbal aggression. PMID:29795819
Rasch Mixture Models for DIF Detection: A Comparison of Old and New Score Specifications

ERIC Educational Resources Information Center

Frick, Hannah; Strobl, Carolin; Zeileis, Achim

2015-01-01

Rasch mixture models can be a useful tool when checking the assumption of measurement invariance for a single Rasch model. They provide advantages compared to manifest differential item functioning (DIF) tests when the DIF groups are only weakly correlated with the manifest covariates available. Unlike in single Rasch models, estimation of Rasch…
Mixture Rasch Models with Joint Maximum Likelihood Estimation

ERIC Educational Resources Information Center

Willse, John T.

2011-01-01

This research provides a demonstration of the utility of mixture Rasch models. Specifically, a model capable of estimating a mixture partial credit model using joint maximum likelihood is presented. Like the partial credit model, the mixture partial credit model has the beneficial feature of being appropriate for analysis of assessment data…
Spurious Latent Classes in the Mixture Rasch Model

ERIC Educational Resources Information Center

Alexeev, Natalia; Templin, Jonathan; Cohen, Allan S.

2011-01-01

Mixture Rasch models have been used to study a number of psychometric issues such as goodness of fit, response strategy differences, strategy shifts, and multidimensionality. Although these models offer the potential for improving understanding of the latent variables being measured, under some conditions overextraction of latent classes may…
The Impact of Various Class-Distinction Features on Model Selection in the Mixture Rasch Model

ERIC Educational Resources Information Center

Choi, In-Hee; Paek, Insu; Cho, Sun-Joo

2017-01-01

The purpose of the current study is to examine the performance of four information criteria (Akaike's information criterion [AIC], corrected AIC [AICC] Bayesian information criterion [BIC], sample-size adjusted BIC [SABIC]) for detecting the correct number of latent classes in the mixture Rasch model through simulations. The simulation study…
A Mixture Rasch Model-Based Computerized Adaptive Test for Latent Class Identification

ERIC Educational Resources Information Center

Jiao, Hong; Macready, George; Liu, Junhui; Cho, Youngmi

2012-01-01

This study explored a computerized adaptive test delivery algorithm for latent class identification based on the mixture Rasch model. Four item selection methods based on the Kullback-Leibler (KL) information were proposed and compared with the reversed and the adaptive KL information under simulated testing conditions. When item separation was…
Using the Mixture Rasch Model to Explore Knowledge Resources Students Invoke in Mathematic and Science Assessments

ERIC Educational Resources Information Center

Zhang, Danhui; Orrill, Chandra; Campbell, Todd

2015-01-01

The purpose of this study was to investigate whether mixture Rasch models followed by qualitative item-by-item analysis of selected Programme for International Student Assessment (PISA) mathematics and science items offered insight into knowledge students invoke in mathematics and science separately and combined. The researchers administered an…
Fitting a Mixture Rasch Model to English as a Foreign Language Listening Tests: The Role of Cognitive and Background Variables in Explaining Latent Differential Item Functioning

ERIC Educational Resources Information Center

Aryadoust, Vahid

2015-01-01

The present study uses a mixture Rasch model to examine latent differential item functioning in English as a foreign language listening tests. Participants (n = 250) took a listening and lexico-grammatical test and completed the metacognitive awareness listening questionnaire comprising problem solving (PS), planning and evaluation (PE), mental…
Mixture Rasch model for guessing group identification

NASA Astrophysics Data System (ADS)

Siow, Hoo Leong; Mahdi, Rasidah; Siew, Eng Ling

2013-04-01

Several alternative dichotomous Item Response Theory (IRT) models have been introduced to account for guessing effect in multiple-choice assessment. The guessing effect in these models has been considered to be itemrelated. In the most classic case, pseudo-guessing in the three-parameter logistic IRT model is modeled to be the same for all the subjects but may vary across items. This is not realistic because subjects can guess worse or better than the pseudo-guessing. Derivation from the three-parameter logistic IRT model improves the situation by incorporating ability in guessing. However, it does not model non-monotone function. This paper proposes to study guessing from a subject-related aspect which is guessing test-taking behavior. Mixture Rasch model is employed to detect latent groups. A hybrid of mixture Rasch and 3-parameter logistic IRT model is proposed to model the behavior based guessing from the subjects' ways of responding the items. The subjects are assumed to simply choose a response at random. An information criterion is proposed to identify the behavior based guessing group. Results show that the proposed model selection criterion provides a promising method to identify the guessing group modeled by the hybrid model.
Reweighting Data in the Spirit of Tukey: Using Bayesian Posterior Probabilities as Rasch Residuals for Studying Misfit

ERIC Educational Resources Information Center

Dardick, William R.; Mislevy, Robert J.

2016-01-01

A new variant of the iterative "data = fit + residual" data-analytical approach described by Mosteller and Tukey is proposed and implemented in the context of item response theory psychometric models. Posterior probabilities from a Bayesian mixture model of a Rasch item response theory model and an unscalable latent class are expressed…
Different Approaches to Covariate Inclusion in the Mixture Rasch Model

ERIC Educational Resources Information Center

Li, Tongyun; Jiao, Hong; Macready, George B.

2016-01-01

The present study investigates different approaches to adding covariates and the impact in fitting mixture item response theory models. Mixture item response theory models serve as an important methodology for tackling several psychometric issues in test development, including the detection of latent differential item functioning. A Monte Carlo…
Latent Transition Analysis with a Mixture Item Response Theory Measurement Model

ERIC Educational Resources Information Center

Cho, Sun-Joo; Cohen, Allan S.; Kim, Seock-Ho; Bottge, Brian

2010-01-01

A latent transition analysis (LTA) model was described with a mixture Rasch model (MRM) as the measurement model. Unlike the LTA, which was developed with a latent class measurement model, the LTA-MRM permits within-class variability on the latent variable, making it more useful for measuring treatment effects within latent classes. A simulation…
A Mixture Rasch Model with a Covariate: A Simulation Study via Bayesian Markov Chain Monte Carlo Estimation

ERIC Educational Resources Information Center

Dai, Yunyun

2013-01-01

Mixtures of item response theory (IRT) models have been proposed as a technique to explore response patterns in test data related to cognitive strategies, instructional sensitivity, and differential item functioning (DIF). Estimation proves challenging due to difficulties in identification and questions of effect size needed to recover underlying…
Partially Observed Mixtures of IRT Models: An Extension of the Generalized Partial-Credit Model

ERIC Educational Resources Information Center

Von Davier, Matthias; Yamamoto, Kentaro

2004-01-01

The generalized partial-credit model (GPCM) is used frequently in educational testing and in large-scale assessments for analyzing polytomous data. Special cases of the generalized partial-credit model are the partial-credit model--or Rasch model for ordinal data--and the two parameter logistic (2PL) model. This article extends the GPCM to the…
A Semi-Parametric Bayesian Mixture Modeling Approach for the Analysis of Judge Mediated Data

ERIC Educational Resources Information Center

Muckle, Timothy Joseph

2010-01-01

Existing methods for the analysis of ordinal-level data arising from judge ratings, such as the Multi-Facet Rasch model (MFRM, or the so-called Facets model) have been widely used in assessment in order to render fair examinee ability estimates in situations where the judges vary in their behavior or severity. However, this model makes certain…
Person Heterogeneity of the BDI-II-C and Its Effects on Dimensionality and Construct Validity: Using Mixture Item Response Models

ERIC Educational Resources Information Center

Wu, Pei-Chen; Huang, Tsai-Wei

2010-01-01

This study was to apply the mixed Rasch model to investigate person heterogeneity of Beck Depression Inventory-II-Chinese version (BDI-II-C) and its effects on dimensionality and construct validity. Person heterogeneity was reflected by two latent classes that differ qualitatively. Additionally, person heterogeneity adversely affected the…
Measuring Middle Grades Teachers' Understanding of Rational Numbers with the Mixture Rasch Model

ERIC Educational Resources Information Center

Izsak, Andrew; Orrill, Chandra Hawley; Cohen, Allan S.; Brown, Rachael Eriksen

2010-01-01

We report the development of a multiple-choice instrument that measures the mathematical knowledge needed for teaching arithmetic with fractions, decimals, and proportions. In particular, the instrument emphasizes the knowledge needed to reason about such arithmetic when numbers are embedded in problem situations. We administered our instrument to…
On Local Homogeneity and Stochastically Ordered Mixed Rasch Models

ERIC Educational Resources Information Center

Kreiner, Svend; Hansen, Mogens; Hansen, Carsten Rosenberg

2006-01-01

Mixed Rasch models add latent classes to conventional Rasch models, assuming that the Rasch model applies within each class and that relative difficulties of items are different in two or more latent classes. This article considers a family of stochastically ordered mixed Rasch models, with ordinal latent classes characterized by increasing total…
Understanding Rasch Measurement: Rasch Models Overview.

ERIC Educational Resources Information Center

Wright, Benjamin D.; Mok, Magdalena

2000-01-01

Presents an overview of Rasch measurement models that begins with a conceptualization of continuous experiences often captured as discrete observations. Discusses the mathematical properties of the Rasch family of models that allow the transformation of discrete deterministic counts into continuous probabilistic abstractions. Also discusses six of…
Likelihood Ratio Tests for Special Rasch Models

ERIC Educational Resources Information Center

Hessen, David J.

2010-01-01

In this article, a general class of special Rasch models for dichotomous item scores is considered. Although Andersen's likelihood ratio test can be used to test whether a Rasch model fits to the data, the test does not differentiate between special Rasch models. Therefore, in this article, new likelihood ratio tests are proposed for testing…

Polytomous Rasch Models in Counseling Assessment

ERIC Educational Resources Information Center

Willse, John T.

2017-01-01

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.
Rasch-family models are more valuable than score-based approaches for analysing longitudinal patient-reported outcomes with missing data.

PubMed

de Bock, Élodie; Hardouin, Jean-Benoit; Blanchin, Myriam; Le Neel, Tanguy; Kubis, Gildas; Bonnaud-Antignac, Angélique; Dantan, Étienne; Sébille, Véronique

2016-10-01

The objective was to compare classical test theory and Rasch-family models derived from item response theory for the analysis of longitudinal patient-reported outcomes data with possibly informative intermittent missing items. A simulation study was performed in order to assess and compare the performance of classical test theory and Rasch model in terms of bias, control of the type I error and power of the test of time effect. The type I error was controlled for classical test theory and Rasch model whether data were complete or some items were missing. Both methods were unbiased and displayed similar power with complete data. When items were missing, Rasch model remained unbiased and displayed higher power than classical test theory. Rasch model performed better than the classical test theory approach regarding the analysis of longitudinal patient-reported outcomes with possibly informative intermittent missing items mainly for power. This study highlights the interest of Rasch-based models in clinical research and epidemiology for the analysis of incomplete patient-reported outcomes data. © The Author(s) 2013.
Predicting responses from Rasch measures.

PubMed

Linacre, John M

2010-01-01

There is a growing family of Rasch models for polytomous observations. Selecting a suitable model for an existing dataset, estimating its parameters and evaluating its fit is now routine. Problems arise when the model parameters are to be estimated from the current data, but used to predict future data. In particular, ambiguities in the nature of the current data, or overfit of the model to the current dataset, may mean that better fit to the current data may lead to worse fit to future data. The predictive power of several Rasch and Rasch-related models are discussed in the context of the Netflix Prize. Rasch-related models are proposed based on Singular Value Decomposition (SVD) and Boltzmann Machines.
Some Improved Diagnostics for Failure of The Rasch Model.

ERIC Educational Resources Information Center

Molenaar, Ivo W.

1983-01-01

Goodness of fit tests for the Rasch model are typically large-sample, global measures. This paper offers suggestions for small-sample exploratory techniques for examining the fit of item data to the Rasch model. (Author/JKS)
Rasch validation of the Arabic version of the lower extremity functional scale.

PubMed

Alnahdi, Ali H

2018-02-01

The purpose of this study was to examine the internal construct validity of the Arabic version of the Lower Extremity Functional Scale (20-item Arabic LEFS) using Rasch analysis. Patients (n = 170) with lower extremity musculoskeletal dysfunction were recruited. Rasch analysis of 20-item Arabic LEFS was performed. Once the initial Rasch analysis indicated that the 20-item Arabic LEFS did not fit the Rasch model, follow-up analyses were conducted to improve the fit of the scale to the Rasch measurement model. These modifications included removing misfitting individuals, changing item scoring structure, removing misfitting items, addressing bias caused by response dependency between items and differential item functioning (DIF). Initial analysis indicated deviation of the 20-item Arabic LEFS from the Rasch model. Disordered thresholds in eight items and response dependency between six items were detected with the scale as a whole did not meet the requirement of unidimensionality. Refinements led to a 15-item Arabic LEFS that demonstrated excellent internal consistency (person separation index [PSI] = 0.92) and satisfied all the requirement of the Rasch model. Rasch analysis did not support the 20-item Arabic LEFS as a unidimensional measure of lower extremity function. The refined 15-item Arabic LEFS met all the requirement of the Rasch model and hence is a valid objective measure of lower extremity function. The Rasch-validated 15-item Arabic LEFS needs to be further tested in an independent sample to confirm its fit to the Rasch measurement model. Implications for Rehabilitation The validity of the 20-item Arabic Lower Extremity Functional Scale to measure lower extremity function is not supported. The 15-item Arabic version of the LEFS is a valid measure of lower extremity function and can be used to quantify lower extremity function in patients with lower extremity musculoskeletal disorders.
Rasch analysis indicates that the Simple Shoulder Test is robust, but minor item modifications and attention to gender differences should be considered.

PubMed

Raman, Jayaprakash; MacDermid, Joy C; Walton, David; Athwal, George S

Repeated cross-sectional study. Multiple studies have evaluated the psychometric properties of the Simple Shoulder Test (SST) through traditional methods supporting it as valid and reliable. Since the evidentiary pool supporting the use of the SST has only partially addressed key measurement properties and the development of SST pre-dates the common use of Rasch model, validation of SST has become a necessity to establish as a reliable and valid PRO for shoulder conditions. To date, no study has analysed SST through Rasch, a modern method for analyzing properties of measurement tools. The purpose of this study was to perform a Rasch analysis of the SST to assess the overall fit to the Rasch model, individual item fit, gender-based DIF, local dependency of items and the unidimensionality of the scale. A secondary purpose was to determine the stability of fit to the Rasch model when captured pre-operatively or post-operatively. Patients completed SST before surgery and between 6 months and 1 year after surgery. Rasch analysis was performed to analyse the carious properties of SST through the Rasch model. SST appears to be robust when tested against the Rasch model. Rasch analysis has highlighted potential areas for to improve in the SST questionnaire. The potential areas to improve are to consider questions that measure the ability of a person to lift the arm above shoulder level and to consider gender differences when measuring the ability to carry weights with the affected arm. This study adds to previous body of empirical evidence arising classical measurement approaches that have suggested that the SST has robust measurement properties, by providing evidence of adequate fit to the Rasch model after minor adjustments. The results of this study should provide confidence to clinicians on SST who wish to use a brief shoulder-specific measure in their practice. The SST appears to be robust when tested against the Rasch model despite some potential areas for improvement. The potential areas that should be explored in future Rasch analyses are the questions that measure the ability of a person to lift the arm above shoulder level and the potential for gender differences when measuring the ability to carry weights with the affected arm. Copyright © 2017 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
A gentle introduction to Rasch measurement models for metrologists

NASA Astrophysics Data System (ADS)

Mari, Luca; Wilson, Mark

2013-09-01

The talk introduces the basics of Rasch models by systematically interpreting them in the conceptual and lexical framework of the International Vocabulary of Metrology, third edition (VIM3). An admittedly simple example of physical measurement highlights the analogies between physical transducers and tests, as they can be understood as measuring instruments of Rasch models and psychometrics in general. From the talk natural scientists and engineers might learn something of Rasch models, as a specifically relevant case of social measurement, and social scientists might re-interpret something of their knowledge of measurement in the light of the current physical measurement models.
A Note on Item-Restscore Association in Rasch Models

ERIC Educational Resources Information Center

Kreiner, Svend

2011-01-01

To rule out the need for a two-parameter item response theory (IRT) model during item analysis by Rasch models, it is important to check the Rasch model's assumption that all items have the same item discrimination. Biserial and polyserial correlation coefficients measuring the association between items and restscores are often used in an informal…
A Comparison of Uniform DIF Effect Size Estimators under the MIMIC and Rasch Models

ERIC Educational Resources Information Center

Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon; Penfield, Randall D.

2013-01-01

The Rasch model, a member of a larger group of models within item response theory, is widely used in empirical studies. Detection of uniform differential item functioning (DIF) within the Rasch model typically employs null hypothesis testing with a concomitant consideration of effect size (e.g., signed area [SA]). Parametric equivalence between…
Psychometric Properties on Lecturers' Beliefs on Teaching Function: Rasch Model Analysis

ERIC Educational Resources Information Center

Mofreh, Samah Ali Mohsen; Ghafar, Mohammed Najib Abdul; Omar, Abdul Hafiz Hj; Mosaku, Monsurat; Ma'ruf, Amar

2014-01-01

This paper focuses on the psychometric analysis of lecturers' beliefs on teaching function (LBTF) survey using Rasch Model analysis. The sample comprised 34 Community Colleges' lecturers. The Rasch Model is applied to produce specific measurements on the lecturers' beliefs on teaching function in order to generalize results and inferential…
On Applications of Rasch Models in International Comparative Large-Scale Assessments: A Historical Review

ERIC Educational Resources Information Center

Wendt, Heike; Bos, Wilfried; Goy, Martin

2011-01-01

Several current international comparative large-scale assessments of educational achievement (ICLSA) make use of "Rasch models", to address functions essential for valid cross-cultural comparisons. From a historical perspective, ICLSA and Georg Rasch's "models for measurement" emerged at about the same time, half a century ago. However, the…
Understanding Rasch Measurement: Partial Credit Model and Pivot Anchoring.

ERIC Educational Resources Information Center

Bode, Rita K.

2001-01-01

Describes the Rasch measurement partial credit model, what it is, how it differs from other Rasch models, and when and how to use it. Also describes the calibration of instruments with increasingly complex items. Explains pivot anchoring and illustrates its use and describes the effect of pivot anchoring on step calibrations, item hierarchy, and…
Sample Size and Statistical Conclusions from Tests of Fit to the Rasch Model According to the Rasch Unidimensional Measurement Model (Rumm) Program in Health Outcome Measurement.

PubMed

Hagell, Peter; Westergren, Albert

Sample size is a major factor in statistical null hypothesis testing, which is the basis for many approaches to testing Rasch model fit. Few sample size recommendations for testing fit to the Rasch model concern the Rasch Unidimensional Measurement Models (RUMM) software, which features chi-square and ANOVA/F-ratio based fit statistics, including Bonferroni and algebraic sample size adjustments. This paper explores the occurrence of Type I errors with RUMM fit statistics, and the effects of algebraic sample size adjustments. Data with simulated Rasch model fitting 25-item dichotomous scales and sample sizes ranging from N = 50 to N = 2500 were analysed with and without algebraically adjusted sample sizes. Results suggest the occurrence of Type I errors with N less then or equal to 500, and that Bonferroni correction as well as downward algebraic sample size adjustment are useful to avoid such errors, whereas upward adjustment of smaller samples falsely signal misfit. Our observations suggest that sample sizes around N = 250 to N = 500 may provide a good balance for the statistical interpretation of the RUMM fit statistics studied here with respect to Type I errors and under the assumption of Rasch model fit within the examined frame of reference (i.e., about 25 item parameters well targeted to the sample).
Detecting Aberrant Response Patterns in the Rasch Model. Rapport 87-3.

ERIC Educational Resources Information Center

Kogut, Jan

In this paper, the detection of response patterns aberrant from the Rasch model is considered. For this purpose, a new person fit index, recently developed by I. W. Molenaar (1987) and an iterative estimation procedure are used in a simulation study of Rasch model data mixed with aberrant data. Three kinds of aberrant response behavior are…
Person-Fit and the Rasch Model, with an Application to Knowledge of Logical Quantors.

ERIC Educational Resources Information Center

Molenaar, Ivo W.; Hoijtink, Herbert

1996-01-01

Some specific person-fit results for the Rasch model are presented, followed by an application to a test measuring knowledge of reasoning with logical quantors. Some issues are relevant to all attempts to use person-fit statistics in research, but the special role of the Rasch model is highlighted. (SLD)
Rasch Measurement and Item Banking: Theory and Practice.

ERIC Educational Resources Information Center

Nakamura, Yuji

The Rasch Model is an item response theory, one parameter model developed that states that the probability of a correct response on a test is a function of the difficulty of the item and the ability of the candidate. Item banking is useful for language testing. The Rasch Model provides estimates of item difficulties that are meaningful,…
Rasch analysis of the UK Functional Assessment Measure in patients with complex disability after stroke.

PubMed

Medvedev, Oleg N; Turner-Stokes, Lynne; Ashford, Stephen; Siegert, Richard J

2018-02-28

To determine whether the UK Functional Assessment Measure (UK FIM+FAM) fits the Rasch model in stroke patients with complex disability and, if so, to derive a conversion table of Rasch-transformed interval level scores. The sample included a UK multicentre cohort of 1,318 patients admitted for specialist rehabilitation following a stroke. Rasch analysis was conducted for the 30-item scale including 3 domains of items measuring physical, communication and psychosocial functions. The fit of items to the Rasch model was examined using 3 different analytical approaches referred to as "pathways". The best fit was achieved in the pathway where responses from motor, communication and psychosocial domains were summarized into 3 super-items and where some items were split because of differential item functioning (DIF) relative to left and right hemisphere location (χ2 (10) = 14.48, p = 0.15). Re-scoring of items showing disordered thresholds did not significantly improve the overall model fit. The UK FIM+FAM with domain super-items satisfies expectations of the unidimensional Rasch model without the need for re-scoring. A conversion table was produced to convert the total scale scores into interval-level data based on person estimates of the Rasch model. The clinical benefits of interval-transformed scores require further evaluation.
Rasch analysis of the Edmonton Symptom Assessment System and research implications.

PubMed

Cheifetz, O; Packham, T L; Macdermid, J C

2014-04-01

Reliable and valid assessment of the disease burden across all forms of cancer is critical to the evaluation of treatment effectiveness and patient progress. The Edmonton Symptom Assessment System (esas) is used for routine evaluation of people attending for cancer care. In the present study, we used Rasch analysis to explore the measurement properties of the esas and to determine the effect of using Rasch-proposed interval-level esas scoring compared with traditional scoring when evaluating the effects of an exercise program for cancer survivors. Polytomous Rasch analysis (Andrich's rating-scale model) was applied to data from 26,645 esas questionnaires completed at the Juravinski Cancer Centre. The fit of the esas to the polytomous Rasch model was investigated, including evaluations of differential item functioning for sex, age, and disease group. The research implication was investigated by comparing the results of an observational research study previously analysed using a traditional approach with the results obtained by Rasch-proposed interval-level esas scoring. The Rasch reliability index was 0.73, falling short of the desired 0.80-0.90 level. However, the esas was found to fit the Rasch model, including the criteria for uni-dimensional data. The analysis suggests that the current esas scoring system of 0-10 could be collapsed to a 6-point scale. Use of the Rasch-proposed interval-level scoring yielded results that were different from those calculated using summarized ordinal-level esas scores. Differential item functioning was not found for sex, age, or diagnosis groups. The esas is a moderately reliable uni-dimensional measure of cancer disease burden and can provide interval-level scaling with Rasch-based scoring. Further, our study indicates that, compared with the traditional scoring metric, Rasch-based scoring could result in substantive changes to conclusions.
Exact Tests for the Rasch Model via Sequential Importance Sampling

ERIC Educational Resources Information Center

Chen, Yuguo; Small, Dylan

2005-01-01

Rasch proposed an exact conditional inference approach to testing his model but never implemented it because it involves the calculation of a complicated probability. This paper furthers Rasch's approach by (1) providing an efficient Monte Carlo methodology for accurately approximating the required probability and (2) illustrating the usefulness…
Development and Validation of a Teacher Success Questionnaire Using the Rasch Model

ERIC Educational Resources Information Center

Tabatabaee-Yazdi, Mona; Motallebzadeh, Khalil; Ashraf, Hamid; Baghaei, Purya

2018-01-01

An increased enthusiasm on teacher accountability, in recent times, has led policy makers and teachers to a significant care over evaluating teachers' success. To this aim, a 40-item Teacher Success questionnaire was developed and validated by the application of the Rasch model. The Rasch model is used to decide whether the scores of an instrument…

Application of the Rasch Model to the Measurement of Creativity: The Creative Achievement Questionnaire

ERIC Educational Resources Information Center

Wang, Chia-Chi; Ho, Hsiao-Chi; Cheng, Chih-Ling; Cheng, Ying-Yao

2014-01-01

This study was designed to provide multiple sources of evidence of the validity of the Creative Achievement Questionnaire (CAQ) and to clarify the hierarchy of creative achievement using Rasch analyses. A total of 905 Taiwanese participants (345 men and 558 women) completed the CAQ online. The Rasch model was used to assess model-data fit. A…
Latent Trait Theory in the Affective Domain--Applications of the Rasch Model.

ERIC Educational Resources Information Center

Curry, Allen R.; Riegel, N. Blyth

The Rasch model of test theory is described in general terms, compared with latent trait theory, and shown to have interesting applications for the measurement of affective as well as cognitive traits. Three assumption of the Rasch model are stated to support the conclusion that calibration of the items and tests is independent of the examinee…
The Rasch Wars: The Emergence of Rasch Measurement in Language Testing

ERIC Educational Resources Information Center

McNamara, Tim; Knoch, Ute

2012-01-01

This paper examines the uptake of Rasch measurement in language testing through a consideration of research published in language testing research journals in the period 1984 to 2009. Following the publication of the first papers on this topic, exploring the potential of the simple Rasch model for the analysis of dichotomous language test data, a…
Obtaining Content Weights for Test Specifications from Job Analysis Task Surveys: An Application of the Many-Facets Rasch Model

ERIC Educational Resources Information Center

Wang, Ning; Stahl, John

2012-01-01

This article discusses the use of the Many-Facets Rasch Model, via the FACETS computer program (Linacre, 2006a), to scale job/practice analysis survey data as well as to combine multiple rating scales into single composite weights representing the tasks' relative importance. Results from the Many-Facets Rasch Model are compared with those…
Formulating the Rasch Differential Item Functioning Model under the Marginal Maximum Likelihood Estimation Context and Its Comparison with Mantel-Haenszel Procedure in Short Test and Small Sample Conditions

ERIC Educational Resources Information Center

Paek, Insu; Wilson, Mark

2011-01-01

This study elaborates the Rasch differential item functioning (DIF) model formulation under the marginal maximum likelihood estimation context. Also, the Rasch DIF model performance was examined and compared with the Mantel-Haenszel (MH) procedure in small sample and short test length conditions through simulations. The theoretically known…
An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

ERIC Educational Resources Information Center

Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

2013-01-01

Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Causal Rasch models.

PubMed

Stenner, A Jackson; Fisher, William P; Stone, Mark H; Burdick, Donald S

2013-01-01

Rasch's unidimensional models for measurement show how to connect object measures (e.g., reader abilities), measurement mechanisms (e.g., machine-generated cloze reading items), and observational outcomes (e.g., counts correct on reading instruments). Substantive theory shows what interventions or manipulations to the measurement mechanism can be traded off against a change to the object measure to hold the observed outcome constant. A Rasch model integrated with a substantive theory dictates the form and substance of permissible interventions. Rasch analysis, absent construct theory and an associated specification equation, is a black box in which understanding may be more illusory than not. Finally, the quantitative hypothesis can be tested by comparing theory-based trade-off relations with observed trade-off relations. Only quantitative variables (as measured) support such trade-offs. Note that to test the quantitative hypothesis requires more than manipulation of the algebraic equivalencies in the Rasch model or descriptively fitting data to the model. A causal Rasch model involves experimental intervention/manipulation on either reader ability or text complexity or a conjoint intervention on both simultaneously to yield a successful prediction of the resultant observed outcome (count correct). We conjecture that when this type of manipulation is introduced for individual reader text encounters and model predictions are consistent with observations, the quantitative hypothesis is sustained.
Causal Rasch models

PubMed Central

Stenner, A. Jackson; Fisher, William P.; Stone, Mark H.; Burdick, Donald S.

2013-01-01

Rasch's unidimensional models for measurement show how to connect object measures (e.g., reader abilities), measurement mechanisms (e.g., machine-generated cloze reading items), and observational outcomes (e.g., counts correct on reading instruments). Substantive theory shows what interventions or manipulations to the measurement mechanism can be traded off against a change to the object measure to hold the observed outcome constant. A Rasch model integrated with a substantive theory dictates the form and substance of permissible interventions. Rasch analysis, absent construct theory and an associated specification equation, is a black box in which understanding may be more illusory than not. Finally, the quantitative hypothesis can be tested by comparing theory-based trade-off relations with observed trade-off relations. Only quantitative variables (as measured) support such trade-offs. Note that to test the quantitative hypothesis requires more than manipulation of the algebraic equivalencies in the Rasch model or descriptively fitting data to the model. A causal Rasch model involves experimental intervention/manipulation on either reader ability or text complexity or a conjoint intervention on both simultaneously to yield a successful prediction of the resultant observed outcome (count correct). We conjecture that when this type of manipulation is introduced for individual reader text encounters and model predictions are consistent with observations, the quantitative hypothesis is sustained. PMID:23986726
Scale construction utilising the Rasch unidimensional measurement model: A measurement of adolescent attitudes towards abortion.

PubMed

Hendriks, Jacqueline; Fyfe, Sue; Styles, Irene; Skinner, S Rachel; Merriman, Gareth

2012-01-01

Measurement scales seeking to quantify latent traits like attitudes, are often developed using traditional psychometric approaches. Application of the Rasch unidimensional measurement model may complement or replace these techniques, as the model can be used to construct scales and check their psychometric properties. If data fit the model, then a scale with invariant measurement properties, including interval-level scores, will have been developed. This paper highlights the unique properties of the Rasch model. Items developed to measure adolescent attitudes towards abortion are used to exemplify the process. Ten attitude and intention items relating to abortion were answered by 406 adolescents aged 12 to 19 years, as part of the "Teen Relationships Study". The sampling framework captured a range of sexual and pregnancy experiences. Items were assessed for fit to the Rasch model including checks for Differential Item Functioning (DIF) by gender, sexual experience or pregnancy experience. Rasch analysis of the original dataset initially demonstrated that some items did not fit the model. Rescoring of one item (B5) and removal of another (L31) resulted in fit, as shown by a non-significant item-trait interaction total chi-square and a mean log residual fit statistic for items of -0.05 (SD=1.43). No DIF existed for the revised scale. However, items did not distinguish as well amongst persons with the most intense attitudes as they did for other persons. A person separation index of 0.82 indicated good reliability. Application of the Rasch model produced a valid and reliable scale measuring adolescent attitudes towards abortion, with stable measurement properties. The Rasch process provided an extensive range of diagnostic information concerning item and person fit, enabling changes to be made to scale items. This example shows the value of the Rasch model in developing scales for both social science and health disciplines.
Measuring Disability: Application of the Rasch Model to Activities of Daily Living (ADL/IADL).

ERIC Educational Resources Information Center

Sheehan, T. Joseph; DeChello, Laurie M.; Garcia, Ramon; Fifield, Judith; Rothfield, Naomi; Reisine, Susan

2001-01-01

Performed a comparative analysis of Activities of Daily Living (ADL) items administered to 4,430 older adults and Instrumental Activities of Daily Living administered to 605 people with rheumatoid arthritis scoring both with Likert and Rasch measurement models. Findings show the superiority of the Rasch approach over the Likert method. (SLD)
Rasch measurement: the Arm Activity measure (ArmA) passive function sub-scale.

PubMed

Ashford, Stephen; Siegert, Richard J; Alexandrescu, Roxana

2016-01-01

To evaluate the conformity of the Arm Activity measure (ArmA) passive function sub-scale to the Rasch model. A consecutive cohort of patients (n = 92) undergoing rehabilitation, including upper limb rehabilitation and spasticity management, at two specialist rehabilitation units were included. Rasch analysis was used to examine scaling and conformity to the model. Responses were analysed using Rasch unidimensional measurement models (RUMM 2030). The following aspects were considered: overall model and individual item fit statistics and fit residuals, internal reliability, item response threshold ordering, item bias, local dependency and unidimensionality. ArmA contains both active and passive function sub-scales, but in this analysis only the passive function sub-scale was considered. Four of the seven items in the ArmA passive function sub-scale initially had disordered thresholds. These items were rescored to four response options, which resulted in ordered thresholds for all items. Once the items with disordered thresholds had been rescored, item bias was not identified for age, global disability level or diagnosis, but with a small difference in difficulty between males and females for one item of the scale. Local dependency was not observed and the unidimensionality of the sub-scale was supported and good fit to the Rasch model was identified. The person separation index (PSI) was 0.95 indicating that the scale is able to reliably differentiate at least two groups of patients. The ArmA passive function sub-scale was shown in this evaluation to conform to the Rasch model once disordered thresholds had been addressed. Using the logit scores produced by the Rasch model it was possible to convert this back to the original scale range. Implications for Rehabilitation The ArmA passive function sub-scale was shown, in this evaluation, to conform to the Rasch model once disordered thresholds had been addressed and therefore to be a clinically applicable and potentially useful hierarchical measure. Using Rasch logit scores it has be possible to convert back to the original ordinal scale range and provide an indication of real change to enable evaluation of clinical outcome of importance to patients and clinicians.
Rasch analysis of the Edmonton Symptom Assessment System and research implications

PubMed Central

Cheifetz, O.; Packham, T.L.; MacDermid, J.C.

2014-01-01

Background Reliable and valid assessment of the disease burden across all forms of cancer is critical to the evaluation of treatment effectiveness and patient progress. The Edmonton Symptom Assessment System (esas) is used for routine evaluation of people attending for cancer care. In the present study, we used Rasch analysis to explore the measurement properties of the esas and to determine the effect of using Rasch-proposed interval-level esas scoring compared with traditional scoring when evaluating the effects of an exercise program for cancer survivors. Methods Polytomous Rasch analysis (Andrich’s rating-scale model) was applied to data from 26,645 esas questionnaires completed at the Juravinski Cancer Centre. The fit of the esas to the polytomous Rasch model was investigated, including evaluations of differential item functioning for sex, age, and disease group. The research implication was investigated by comparing the results of an observational research study previously analysed using a traditional approach with the results obtained by Rasch-proposed interval-level esas scoring. Results The Rasch reliability index was 0.73, falling short of the desired 0.80–0.90 level. However, the esas was found to fit the Rasch model, including the criteria for uni-dimensional data. The analysis suggests that the current esas scoring system of 0–10 could be collapsed to a 6-point scale. Use of the Rasch-proposed interval-level scoring yielded results that were different from those calculated using summarized ordinal-level esas scores. Differential item functioning was not found for sex, age, or diagnosis groups. Conclusions The esas is a moderately reliable uni-dimensional measure of cancer disease burden and can provide interval-level scaling with Rasch-based scoring. Further, our study indicates that, compared with the traditional scoring metric, Rasch-based scoring could result in substantive changes to conclusions. PMID:24764703
Spurious Latent Class Problem in the Mixed Rasch Model: A Comparison of Three Maximum Likelihood Estimation Methods under Different Ability Distributions

ERIC Educational Resources Information Center

Sen, Sedat

2018-01-01

Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…
Rasch analysis on OSCE data : An illustrative example.

PubMed

Tor, E; Steketee, C

2011-01-01

The Objective Structured Clinical Examination (OSCE) is a widely used tool for the assessment of clinical competence in health professional education. The goal of the OSCE is to make reproducible decisions on pass/fail status as well as students' levels of clinical competence according to their demonstrated abilities based on the scores. This paper explores the use of the polytomous Rasch model in evaluating the psychometric properties of OSCE scores through a case study. The authors analysed an OSCE data set (comprised of 11 stations) for 80 fourth year medical students based on the polytomous Rasch model in an effort to answer two research questions: (1) Do the clinical tasks assessed in the 11 OSCE stations map on to a common underlying construct, namely clinical competence? (2) What other insights can Rasch analysis offer in terms of scaling, item analysis and instrument validation over and above the conventional analysis based on classical test theory? The OSCE data set has demonstrated a sufficient degree of fit to the Rasch model (Χ(2) = 17.060, DF=22, p=0.76) indicating that the 11 OSCE station scores have sufficient psychometric properties to form a measure for a common underlying construct, i.e. clinical competence. Individual OSCE station scores with good fit to the Rasch model (p > 0.1 for all Χ(2) statistics) further corroborated the characteristic of unidimensionality of the OSCE scale for clinical competence. A Person Separation Index (PSI) of 0.704 indicates sufficient level of reliability for the OSCE scores. Other useful findings from the Rasch analysis that provide insights, over and above the analysis based on classical test theory, are also exemplified using the data set. The polytomous Rasch model provides a useful and supplementary approach to the calibration and analysis of OSCE examination data.
Career Decision Self-Efficacy Scale-Short Form: A Rasch Analysis of the Portuguese Version

ERIC Educational Resources Information Center

Miguel, Jose P.; Silva, Jose T.; Prieto, Gerardo

2013-01-01

The present study analyzes the psychometric properties of the Career Decision Self-Efficacy Scale-Short Form (CDSE-SF) in a sample of Portuguese secondary education students using the Rasch model. The results indicate that the 25 items of the CDSE-SF are well fitted to a latent unidimensional structure, as required by Rasch modeling. The response…
Rasch Model Analysis Gives New Insights Into the Structural Validity of the QuickDASH in Patients With Musculoskeletal Shoulder Pain.

PubMed

Jerosch-Herold, Christina; Chester, Rachel; Shepstone, Lee

2017-09-01

Study Design Cross-sectional secondary analysis of a prospective cohort study. Background The shortened version of the Disabilities of the Arm, Shoulder and Hand questionnaire (QuickDASH) is a widely used outcome measure that has been extensively evaluated using classical test theory. Rasch model analysis can identify strengths and weaknesses of rating scales and goes beyond classical test theory approaches. It uses a mathematical model to test the fit between the observed data and expected responses and converts ordinal-level scores into interval-level measurement. Objective To test the structural validity of the QuickDASH using Rasch analysis. Methods A prospective cohort study of 1030 patients with shoulder pain provided baseline data. Rasch analysis was conducted to (1) assess how the QuickDASH fits the Rasch model, (2) identify sources of misfit, and (3) explore potential solutions to these. Results There was evidence of multidimensionality and significant misfit to the Rasch model (χ 2 = 331.09, P<.001). Two items had disordered threshold responses with strong floor effects. Response bias was detected in most items for age and sex. Rescoring resulted in ordered thresholds; however, the 11-item scale still did not meet the expectations of the Rasch model. Conclusion Rasch model analysis on the QuickDASH has identified a number of problems that cannot be easily detected using traditional analyses. While revisions to the QuickDASH resulted in better fit, a "shoulder-specific" version is not advocated at present. Caution needs to be exercised when interpreting results of the QuickDASH outcome measure, as it does not meet the criteria for interval-level measurement and shows significant response bias by age and sex. J Orthop Sports Phys Ther 2017;47(9):664-672. Epub 13 Jul 2017. doi:10.2519/jospt.2017.7288.
A critique of Rasch residual fit statistics.

PubMed

Karabatsos, G

2000-01-01

In test analysis involving the Rasch model, a large degree of importance is placed on the "objective" measurement of individual abilities and item difficulties. The degree to which the objectivity properties are attained, of course, depends on the degree to which the data fit the Rasch model. It is therefore important to utilize fit statistics that accurately and reliably detect the person-item response inconsistencies that threaten the measurement objectivity of persons and items. Given this argument, it is somewhat surprising that there is far more emphasis placed in the objective measurement of person and items than there is in the measurement quality of Rasch fit statistics. This paper provides a critical analysis of the residual fit statistics of the Rasch model, arguably the most often used fit statistics, in an effort to illustrate that the task of Rasch fit analysis is not as simple and straightforward as it appears to be. The faulty statistical properties of the residual fit statistics do not allow either a convenient or a straightforward approach to Rasch fit analysis. For instance, given a residual fit statistic, the use of a single minimum critical value for misfit diagnosis across different testing situations, where the situations vary in sample and test properties, leads to both the overdetection and underdetection of misfit. To improve this situation, it is argued that psychometricians need to implement residual-free Rasch fit statistics that are based on the number of Guttman response errors, or use indices that are statistically optimal in detecting measurement disturbances.
Using the GLIMMIX Procedure in SAS 9.3 to Fit a Standard Dichotomous Rasch and Hierarchical 1-PL IRT Model

ERIC Educational Resources Information Center

Black, Ryan A.; Butler, Stephen F.

2012-01-01

Although Rasch models have been shown to be a sound methodological approach to develop and validate measures of psychological constructs for more than 50 years, they remain underutilized in psychology and other social sciences. Until recently, one reason for this underutilization was the lack of syntactically simple procedures to fit Rasch and…
Examining the Association between Patient-Reported Symptoms of Attention and Memory Dysfunction with Objective Cognitive Performance: A Latent Regression Rasch Model Approach.

PubMed

Li, Yuelin; Root, James C; Atkinson, Thomas M; Ahles, Tim A

2016-06-01

Patient-reported cognition generally exhibits poor concordance with objectively assessed cognitive performance. In this article, we introduce latent regression Rasch modeling and provide a step-by-step tutorial for applying Rasch methods as an alternative to traditional correlation to better clarify the relationship of self-report and objective cognitive performance. An example analysis using these methods is also included. Introduction to latent regression Rasch modeling is provided together with a tutorial on implementing it using the JAGS programming language for the Bayesian posterior parameter estimates. In an example analysis, data from a longitudinal neurocognitive outcomes study of 132 breast cancer patients and 45 non-cancer matched controls that included self-report and objective performance measures pre- and post-treatment were analyzed using both conventional and latent regression Rasch model approaches. Consistent with previous research, conventional analysis and correlations between neurocognitive decline and self-reported problems were generally near zero. In contrast, application of latent regression Rasch modeling found statistically reliable associations between objective attention and processing speed measures with self-reported Attention and Memory scores. Latent regression Rasch modeling, together with correlation of specific self-reported cognitive domains with neurocognitive measures, helps to clarify the relationship of self-report with objective performance. While the majority of patients attribute their cognitive difficulties to memory decline, the Rash modeling suggests the importance of processing speed and initial learning. To encourage the use of this method, a step-by-step guide and programming language for implementation is provided. Implications of this method in cognitive outcomes research are discussed. © The Author 2016. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Accounting for standard errors of vision-specific latent trait in regression models.

PubMed

Wong, Wan Ling; Li, Xiang; Li, Jialiang; Wong, Tien Yin; Cheng, Ching-Yu; Lamoureux, Ecosse L

2014-07-11

To demonstrate the effectiveness of Hierarchical Bayesian (HB) approach in a modeling framework for association effects that accounts for SEs of vision-specific latent traits assessed using Rasch analysis. A systematic literature review was conducted in four major ophthalmic journals to evaluate Rasch analysis performed on vision-specific instruments. The HB approach was used to synthesize the Rasch model and multiple linear regression model for the assessment of the association effects related to vision-specific latent traits. The effectiveness of this novel HB one-stage "joint-analysis" approach allows all model parameters to be estimated simultaneously and was compared with the frequently used two-stage "separate-analysis" approach in our simulation study (Rasch analysis followed by traditional statistical analyses without adjustment for SE of latent trait). Sixty-six reviewed articles performed evaluation and validation of vision-specific instruments using Rasch analysis, and 86.4% (n = 57) performed further statistical analyses on the Rasch-scaled data using traditional statistical methods; none took into consideration SEs of the estimated Rasch-scaled scores. The two models on real data differed for effect size estimations and the identification of "independent risk factors." Simulation results showed that our proposed HB one-stage "joint-analysis" approach produces greater accuracy (average of 5-fold decrease in bias) with comparable power and precision in estimation of associations when compared with the frequently used two-stage "separate-analysis" procedure despite accounting for greater uncertainty due to the latent trait. Patient-reported data, using Rasch analysis techniques, do not take into account the SE of latent trait in association analyses. The HB one-stage "joint-analysis" is a better approach, producing accurate effect size estimations and information about the independent association of exposure variables with vision-specific latent traits. Copyright 2014 The Association for Research in Vision and Ophthalmology, Inc.

Rasch analysis of the Chedoke-McMaster Attitudes towards Children with Handicaps scale.

PubMed

Armstrong, Megan; Morris, Christopher; Tarrant, Mark; Abraham, Charles; Horton, Mike C

2017-02-01

Aim To assess whether the Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) 36-item total scale and subscales fit the unidimensional Rasch model. Method The CATCH was administered to 1881 children, aged 7-16 years in a cross-sectional survey. Data were used from a random sample of 416 for the initial Rasch analysis. The analysis was performed on the 36-item scale and then separately for each subscale. The analysis explored fit to the Rasch model in terms of overall scale fit, individual item fit, item response categories, and unidimensionality. Item bias for gender and school level was also assessed. Revised scales were then tested on an independent second random sample of 415 children. Results Analyses indicated that the 36-item overall scale was not unidimensional and did not fit the Rasch model. Two scales of affective attitudes and behavioural intention were retained after four items were removed from each due to misfit to the Rasch model. Additionally, the scaling was improved when the two most negative response categories were aggregated. There was no item bias by gender or school level on the revised scales. Items assessing cognitive attitudes did not fit the Rasch model and had low internal consistency as a scale. Conclusion Affective attitudes and behavioural intention CATCH sub-scales should be treated separately. Caution should be exercised when using the cognitive subscale. Implications for Rehabilitation The 36-item Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) scale as a whole did not fit the Rasch model; thus indicating a multi-dimensional scale. Researchers should use two revised eight-item subscales of affective attitudes and behavioural intentions when exploring interventions aiming to improve children's attitudes towards disabled people or factors associated with those attitudes. Researchers should use the cognitive subscale with caution, as it did not create a unidimensional and internally consistent scale. Therefore, conclusions drawn from this scale may not accurately reflect children's attitudes.
A Psychometric Revision of the European American Values Scale for Asian Americans Using the Rasch Model

ERIC Educational Resources Information Center

Hong, Sehee; Kim, Bryan S. K.; Wolfe, Maren M.

2005-01-01

In this study, the 18-item European American Values Scale for Asian Americans (M. M. Wolfe, P. H. Yang, E. C. Wong, & D. R. Atkinson, 2001) was revised on the basis of results from a psychometric analysis using the Rasch Model (G. Rasch, 1960). The results led to the establishment of the 25-item European American Values Scale for Asian…
Rasch Analysis of the Edmonton Symptom Assessment System.

PubMed

Sprague, Emma; Siegert, Richard J; Medvedev, Oleg; Roberts, Margaret H

2018-05-01

The Edmonton Symptom Assessment System (ESAS) is a widely used multisymptom assessment tool in cancer and palliative care settings, but its psychometric properties have not been widely tested using modern psychometric methods such as Rasch analysis. To apply Rasch analysis to the ESAS in a community palliative care setting and determine its suitability for assessing symptom burden in this group. ESAS data collected from 229 patients enrolled in a community hospice service were evaluated using a partial credit Rasch model with RUMM2030 software (RUMM Laboratory Pty, Ltd., Duncraig, WA). Where disordered thresholds were discovered, item rescoring was undertaken. Rasch model fit and differential item functioning were evaluated after each iterative phase. Uniform rescoring was necessary for all 12 items to display ordered thresholds. The best model fit was achieved after item rescoring and combining three pairs of locally dependent items into three superitems (χ 2 = 29.56 [27]; P = 0.33) that permitted ordinal-to-interval conversion. The ESAS satisfied unidimensional Rasch model expectations in a 12-item format after minor modifications. This included uniform rescoring of the disordered response categories and creating superitems to improve model fit and clinical utility. The accuracy of the ESAS scores can be improved by using ordinal-to-interval conversion tables published in the article. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.
Psychometric properties of the pain stages of change questionnaire as evaluated by rasch analysis in patients with chronic musculoskeletal pain

PubMed Central

2014-01-01

Background Our objective was to evaluate the measurement properties of the Pain Stages of Change Questionnaire (PSOCQ) and its four subscales Precontemplation, Contemplation, Action and Maintenance. Methods A total of 231 patients, median age 42 years, with chronic musculoskeletal pain responded to the 30 items in PSOCQ. Thresholds for item scores, and unidimensionality and invariance of the PSOCQ and its four subscales were evaluated by Rasch analysis, partial credit model. Results The items had disordered threshold and needed to be rescored. The 30 items in the PSOCQ did not fit the Rasch model Chi- square item trait statistics. All subscales fitted the Rasch models. The associations to pain (11 point numeric rating scale), emotional distress (Hopkins symptom check list v 25) and self-efficacy (Arthritis Self-Efficacy Scale) were highest for the Precontemplation subscale. Conclusion The present analysis revealed that all four subscales in PSOCQ fitted the Rasch model. No common construct for all subscales were identified, but the Action and Maintenance subscales were closely related. PMID:24646065
FIM measurement properties and Rasch model details.

PubMed

Wright, B D; Linacre, J M; Smith, R M; Heinemann, A W; Granger, C V

1997-12-01

To summarize, we take issue with the criticisms of Dickson & Köhler for two main reasons: 1. Rasch analysis provides a model from which to approach the analysis of the FIM, an ordinal scale, as an interval scale. The existence of examples of items or individuals which do not fit the model does not disprove the overall efficacy of the model; and 2. the principal components analysis of FIM motor items as presented by Dickson & Köhler tends to undermine rather than support their argument. Their own analyses produce a single major factor explaining between 58.5 and 67.1% of the variance, depending upon the sample, with secondary factors explaining much less variance. Finally, analysis of item response, or latent trait, is a powerful method for understanding the meaning of a measure. However, it presumes that item scores are accurate. Another concern is that Dickson & Köhler do not address the issue of reliability of scoring the FIM items on which they report, a critical point in comparing results. The Uniform Data System for Medical Rehabilitation (UDSMRSM) expends extensive effort in the training of clinicians of subscribing facilities to score items accurately. This is followed up with a credentialing process. Phase 1 involves the testing of individual clinicians who are submitting data to determine if they have achieved mastery over the use of the FIM instrument. Phase 2 involves examining the data for outlying values. When Dickson & Köhler investigate more carefully the application of the Rasch model to their FIM data, they will discover that the results presented in their paper support rather than contradict their application of the Rasch model! This paper is typical of supposed refutations of Rasch model applications. Dickson & Köhler will find that idiosyncrasies in their data and misunderstandings of the Rasch model are the only basis for a claim to have disproven the relevance of the model to FIM data. The Rasch model is a mathematical theorem (like Pythagoras') and so cannot be disproven by empirical data once it has been deduced on theoretical grounds. Sometimes empirical data are not suitable for construction of a measure. When this happens, the routine fit statistics indicate the unsuitable segments of the data. Most FIM data do conform closely enough to the Rasch model to support generalizable linear measures. Science can advance!
Using Rasch Analysis to Identify Uncharacteristic Responses to Undergraduate Assessments

ERIC Educational Resources Information Center

Edwards, Antony; Alcock, Lara

2010-01-01

Rasch Analysis is a statistical technique that is commonly used to analyse both test data and Likert survey data, to construct and evaluate question item banks, and to evaluate change in longitudinal studies. In this article, we introduce the dichotomous Rasch model, briefly discussing its assumptions. Then, using data collected in an…
A Cross-Cultural Validation of Stage Development: A Rasch Re-Analysis of Longitudinal Socio-Moral Reasoning Data

ERIC Educational Resources Information Center

Boom, Jan; Wouters, Hans; Keller, Monika

2007-01-01

Kohlberg's characterization of moral development as displaying an invariant hierarchical order of structurally consistent stages is losing ground. However, by applying Rasch analysis, Dawson recently gave new interpretation and support to his characterization of stage development. Using Rasch models, we replicated and strengthened her findings in…
Comparison of CTT and Rasch-based approaches for the analysis of longitudinal Patient Reported Outcomes.

PubMed

Blanchin, Myriam; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Blanchard, Claire; Mirallié, Eric; Sébille, Véronique

2011-04-15

Health sciences frequently deal with Patient Reported Outcomes (PRO) data for the evaluation of concepts, in particular health-related quality of life, which cannot be directly measured and are often called latent variables. Two approaches are commonly used for the analysis of such data: Classical Test Theory (CTT) and Item Response Theory (IRT). Longitudinal data are often collected to analyze the evolution of an outcome over time. The most adequate strategy to analyze longitudinal latent variables, which can be either based on CTT or IRT models, remains to be identified. This strategy must take into account the latent characteristic of what PROs are intended to measure as well as the specificity of longitudinal designs. A simple and widely used IRT model is the Rasch model. The purpose of our study was to compare CTT and Rasch-based approaches to analyze longitudinal PRO data regarding type I error, power, and time effect estimation bias. Four methods were compared: the Score and Mixed models (SM) method based on the CTT approach, the Rasch and Mixed models (RM), the Plausible Values (PV), and the Longitudinal Rasch model (LRM) methods all based on the Rasch model. All methods have shown comparable results in terms of type I error, all close to 5 per cent. LRM and SM methods presented comparable power and unbiased time effect estimations, whereas RM and PV methods showed low power and biased time effect estimations. This suggests that RM and PV methods should be avoided to analyze longitudinal latent variables. Copyright © 2010 John Wiley & Sons, Ltd.
A systematic literature review on the application of Rasch analysis in musculoskeletal disease -- a special interest group report of OMERACT 11.

PubMed

Leung, Ying-Ying; Png, May-Ee; Conaghan, Philip; Tennant, Alan

2014-01-01

The Rasch measurement model provides robust analysis of the internal construct validity of outcome measures. We reviewed the application of Rasch analysis in musculoskeletal medicine as part of the work leading to discussion in a Special Interest Group in Rasch Analysis at Outcome Measures in Rheumatology 11. A systematic literature review of SCOPUS and MEDLINE was performed (January 1, 1985, to February 29, 2012. Original research reports in English using "Rasch" or "Item Response Theory" in musculoskeletal diseases were assessed by 2 independent reviewers. The topics of focus and analysis methodology details were recorded. Of 212 articles reviewed, 114 were included. The number of publications rose from 1 in 1991-1992 to 23 in 2011-February 2012. Disease areas included rheumatoid arthritis (28%), osteoarthritis (16.6%), and general musculoskeletal disorders (43%). Sixty-six reports (57.9%) evaluated psychometric properties of existing scales and 35 (30.7%) involved development of new scales. Nine articles (7.9%) were on methodology illustration. Four articles were on item banking and computer adaptive testing. A majority of the articles reported fit statistics, while the basic Rasch model assumption (i.e., unidimensionality) was examined in only 57.2% of the articles. An improvement in reporting qualities with Rasch articles was noted over time. In addition, only 11.4% of the articles provided a transformation table for interval scale measurement in clinical practice. The Rasch model has been increasingly used in rheumatology over the last 2 decades in a wide range of applications. The majority of the articles demonstrated reasonable quality of reporting. Improvements in quality of reporting over time were revealed.
Rasch analysis of the Patient Rated Elbow Evaluation questionnaire.

PubMed

Vincent, Joshua I; MacDermid, Joy C; King, Graham J W; Grewal, Ruby

2015-06-20

The Patient Rated Elbow Evaluation (PREE) was developed as an elbow joint specific measure of pain and disability and validated with classical psychometric methods. More recently, Rasch analysis has contributed new methods for analyzing the clinical measurement properties of self-report outcome measures. The objective of the study was to determine aspects of validity of the PREE using the Rasch model to assess the overall fit of the PREE data, the response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 236 patients (Age range 21-79 years; M: F- 97:139) with elbow disorders were recruited from the Roth│McFarlane Hand and Upper Limb Centre, London, Ontario, Canada. The baseline scores of the PREE were used. Rasch analysis was conducted using RUMM 2030 software on the 3 sub scales of the PREE separately. The 3 sub scales showed misfit initially with disordered thresholds on17 out of 20 items), uniform DIF was observed for two items ("Carrying a 10lbs object" from specific activities subscale for age group; and "household work" from the usual activities subscale for gender); multidimensionality and local dependency. The Pain subscale satisfied Rasch expectations when item 2 "Pain - At rest" was split for age group, while the usual activities subscale readily stood up to Rasch requirements when the item 2 "household work" was split for gender. The specific activities subscale demonstrated fit to the Rasch model when sub test analysis accounted for local dependency. All three subscales of the PREE were well targeted and had high reliability (PSI >0.80). The three subscales of the PREE appear to be robust when tested against the Rasch model when subject to a few alterations. The value of changing the 0-10 format is questionable given its widespread use; further Rasch-based analysis of whether these findings are stable in other samples is warranted.
Logit Models for the Analysis of Two-Way Categorical Data

ERIC Educational Resources Information Center

Draxler, Clemens

2011-01-01

This article discusses the application of logit models for the analyses of 2-way categorical observations. The models described are generalized linear models using the logit link function. One of the models is the Rasch model (Rasch, 1960). The objective is to test hypotheses of marginal and conditional independence between explanatory quantities…
An evaluation of the structural validity of the shoulder pain and disability index (SPADI) using the Rasch model.

PubMed

Jerosch-Herold, Christina; Chester, Rachel; Shepstone, Lee; Vincent, Joshua I; MacDermid, Joy C

2018-02-01

The shoulder pain and disability index (SPADI) has been extensively evaluated for its psychometric properties using classical test theory (CTT). The purpose of this study was to evaluate its structural validity using Rasch model analysis. Responses to the SPADI from 1030 patients referred for physiotherapy with shoulder pain and enrolled in a prospective cohort study were available for Rasch model analysis. Overall fit, individual person and item fit, response format, dependence, unidimensionality, targeting, reliability and differential item functioning (DIF) were examined. The SPADI pain subscale initially demonstrated a misfit due to DIF by age and gender. After iterative analysis it showed good fit to the Rasch model with acceptable targeting and unidimensionality (overall fit Chi-square statistic 57.2, p = 0.1; mean item fit residual 0.19 (1.5) and mean person fit residual 0.44 (1.1); person separation index (PSI) of 0.83. The disability subscale however shows significant misfit due to uniform DIF even after iterative analyses were used to explore different solutions to the sources of misfit (overall fit (Chi-square statistic 57.2, p = 0.1); mean item fit residual 0.54 (1.26) and mean person fit residual 0.38 (1.0); PSI 0.84). Rasch Model analysis of the SPADI has identified some strengths and limitations not previously observed using CTT methods. The SPADI should be treated as two separate subscales. The SPADI is a widely used outcome measure in clinical practice and research; however, the scores derived from it must be interpreted with caution. The pain subscale fits the Rasch model expectations well. The disability subscale does not fit the Rasch model and its current format does not meet the criteria for true interval-level measurement required for use as a primary endpoint in clinical trials. Clinicians should therefore exercise caution when interpreting score changes on the disability subscale and attempt to compare their scores to age- and sex-stratified data.
Measurement of change in health status with Rasch models.

PubMed

Anselmi, Pasquale; Vidotto, Giulio; Bettinardi, Ornella; Bertolotti, Giorgio

2015-02-07

The traditional approach to the measurement of change presents important drawbacks (no information at individual level, ordinal scores, variance of the measurement instrument across time points), which Rasch models overcome. The article aims to illustrate the features of the measurement of change with Rasch models. To illustrate the measurement of change using Rasch models, the quantitative data of a longitudinal study of heart-surgery patients (N = 98) were used. The scale "Perception of Positive Change" was used as an example of measurement instrument. All patients underwent cardiac rehabilitation, individual psychological intervention, and educational intervention. Nineteen patients also attended progressive muscle relaxation group trainings. The scale was administered before and after the interventions. Three Rasch approaches were used. Two separate analyses were run on the data from the two time points to test the invariance of the instrument. An analysis was run on the stacked data from both time points to measure change in a common frame of reference. Results of the latter analysis were compared with those of an analysis that removed the influence of local dependency on patient measures. Statistics t, χ(2) and F were used for comparing the patient and item measures estimated in the Rasch analyses (a-priori α = .05). Infit, Outfit, R and item Strata were used for investigating Rasch model fit, reliability, and validity of the instrument. Data of all 98 patients were included in the analyses. The instrument was reliable, valid, and substantively unidimensional (Infit, Outfit < 2 for all items, R = .84, item Strata range = 3.93-6.07). Changes in the functioning of the instrument occurred across the two time, which prevented the use of the two separate analyses to unambiguously measure change. Local dependency had a negligible effect on patient measures (p ≥ .8674). Thirteen patients improved, whereas 3 worsened. The patients who attended the relaxation group trainings did not report greater improvement than those who did not (p = .1007). Rasch models represent a valid framework for the measurement of change and a useful complement to traditional approaches.
Setting, Evaluating, and Maintaining Certification Standards with the Rasch Model.

ERIC Educational Resources Information Center

Grosse, Martin E.; Wright, Benjamin D.

1986-01-01

Based on the standard setting procedures or the American Board of Preventive Medicine for their Core Test, this article describes how Rasch measurement can facilitate using test content judgments in setting a standard. Rasch measurement can then be used to evaluate and improve the precision of the standard and to hold it constant across time.…
Linear Logistic Test Modeling with R

ERIC Educational Resources Information Center

Baghaei, Purya; Kubinger, Klaus D.

2015-01-01

The present paper gives a general introduction to the linear logistic test model (Fischer, 1973), an extension of the Rasch model with linear constraints on item parameters, along with eRm (an R package to estimate different types of Rasch models; Mair, Hatzinger, & Mair, 2014) functions to estimate the model and interpret its parameters. The…
Rasch Measurement: A Response to Payanides, Robinson and Tymms

ERIC Educational Resources Information Center

Goldstein, Harvey

2015-01-01

A response is made to a paper that urges the use of the Rasch model for educational assessment. This paper argues that the model is inadequate and that claims for its efficacy are exaggerated and technically weak.
Measuring coping in parents of children with disabilities: a rasch model approach.

PubMed

Gothwal, Vijaya K; Bharani, Seelam; Reddy, Shailaja P

2015-01-01

Parents of a child with disability must cope with greater demands than those living with a healthy child. Coping refers to a person's cognitive or behavioral efforts to manage the demands of a stressful situation. The Coping Health Inventory for Parents (CHIP) is a well-recognized measure of coping among parents of chronically ill children and assesses different coping patterns using its three subscales. The purpose of this study was to provide further insights into the psychometric properties of the CHIP subscales in a sample of parents of children with disabilities. In this cross-sectional study, 220 parents (mean age, 33.4 years; 85% mothers) caring for a child with disability enrolled in special schools as well as in mainstream schools completed the 45-item CHIP. Rasch analysis was applied to the CHIP data and the psychometric performance of each of the three subscales was tested. Subscale revision was performed in the context of Rasch analysis statistics. Response categories were not used as intended, necessitating combining categories, thereby reducing the number from 4 to 3. The subscale - 'maintaining social support' satisfied all the Rasch model expectations. Four item misfit the Rasch model in the subscale -maintaining family integration', but their deletion resulted in a 15-item scale with items that fit the Rasch model well. The remaining subscale - 'understanding the healthcare situation' lacked adequate measurement precision (<2.0 logits). The current Rasch analyses add to the evidence of measurement properties of the CHIP and show that the two of its subscales (one original and the other revised) have good psychometric properties and work well to measure coping patterns in parents of children with disabilities. However the third subscale is limited by its inadequate measurement precision and requires more items.
Psychometric evaluation of the revised Illness Perception Questionnaire (IPQ-R) in cancer patients: confirmatory factor analysis and Rasch analysis.

PubMed

Ashley, Laura; Smith, Adam B; Keding, Ada; Jones, Helen; Velikova, Galina; Wright, Penny

2013-12-01

To provide new insights into the psychometrics of the revised Illness Perception Questionnaire (IPQ-R) in cancer patients. To undertake, for the first time using data from breast, colorectal and prostate cancer patients, a confirmatory factor analysis (CFA) to assess the validity of the IPQ-R's core seven-factor structure. Also, for the first time in any illness group, to undertake Rasch analysis to explore the extent to which the IPQ-R factors form unidimensional scales, with linear measurement properties and no Differential Item Functioning (DIF). Patients with potentially curable breast, colorectal or prostate cancer, within 6months post-diagnosis, completed the IPQ-R online (N=531). CFA was conducted, including multi-sample analysis, and for each IPQ-R factor fit to the Rasch model was assessed by examining, amongst other things, item fit, DIF and unidimensionality. The CFA showed a moderate fit of the data to the IPQ-R model, and stability across diagnosis, although fit was significantly improved following the removal of selected items. All seven factors achieved fit to the Rasch model, and exhibited unidimensionality and minimal DIF, although in most cases this was after some item rescoring and/or deletion. In both analyses, IPQ-R items 12, 18 and 24 were indicated as misfitting and removed. Given the rigorous standard of Rasch measurement, and the generic nature of the IPQ-R, it stood up well to the demands of the Rasch model in this study. Importantly, the results show that with some relatively minor, pragmatic modifications the IPQ-R could possess Rasch-standard measurement in cancer patients. © 2013.
Conceptualization and measurement of celebrity worship.

PubMed

McCutcheon, Lynn E; Lange, Rense; Houran, James

2002-02-01

Celebrity worship has been conceptualized as having pathological and nonpathological forms. To avoid problems associated with item-level factor analysis, 'top-down purification' was used to test the validity of this conceptualization. The respondents (N = 249) completed items modelled after existing celebrity worship questionnaires. A subset of 17 unidimensional and Rasch scalable items was discovered (the local reliability ranged from.71 to.96), which showed no biases related to age and gender. This subset was dubbed the Celebrity Worship Scale (CWS). The items also showed no celebrity bias, indicating that CWS applies equally to acting, music, sports, and 'other' celebrities. The Rasch nature of the items defines celebrity worship as consisting of three qualitatively different stages. Low worship involves individualistic behaviours such as watching and reading about a celebrity. At slightly higher levels, celebrity worship takes on a social character. Lastly, the highest levels are characterized by a mixture of empathy with the celebrity's successes and failures, over-identification with the celebrity, compulsive behaviours, as well as obsession with details of the celebrity's life. Based on these findings, the authors propose a model of celebrity worship based on psychological absorption (leading to delusions of actual relationships with celebrities) and addiction (fostering the need for progressively stronger involvement to feel connected with the celebrity).
The Equivalence of Two Methods of Parameter Estimation for the Rasch Model.

ERIC Educational Resources Information Center

Blackwood, Larry G.; Bradley, Edwin L.

1989-01-01

Two methods of estimating parameters in the Rasch model are compared. The equivalence of likelihood estimations from the model of G. J. Mellenbergh and P. Vijn (1981) and from usual unconditional maximum likelihood (UML) estimation is demonstrated. Mellenbergh and Vijn's model is a convenient method of calculating UML estimates. (SLD)

Some Empirical Evidence for Latent Trait Model Selection.

ERIC Educational Resources Information Center

Hutten, Leah R.

The results of this study suggest that for purposes of estimating ability by latent trait methods, the Rasch model compares favorably with the three-parameter logistic model. Using estimated parameters to make predictions about 25 actual number-correct score distributions with samples of 1,000 cases each, those predicted by the Rasch model fit the…
Rasch analysis of SF-Qualiveen in multiple sclerosis.

PubMed

Milinis, Kristijonas; Tennant, Alan; A Young, Carolyn

2017-04-01

A 30-item Qualiveen questionnaire was developed to measure the impact of urinary problems on everyday living in spinal cord injury, and subsequently an 8-item SF-Qualiveen was developed for those with multiple sclerosis (MS). The validity of this short form has not been previously examined using modern psychometric techniques, such as the Rasch measurement model. The aim of this study is to test if the short form meets the requirements of the Rasch model. A total of 401 patients with clinically definite MS were given the questionnaire at three neuroscience centres in the UK. A total of 258 patients (64.3% response) completed the questionnaire. The original scale failed to meet the expectations of the Rasch model. A two-testlet solution was sought to account for local dependence, differential item functioning and disordered thresholds. After the modifications were made the scale fitted the model (χ 2 = 5.93 P = 0.4305), had high internal consistency (α = 0.88) and was unidimensional. SF-Qualiveen is a simple and valid measure of the impact of urinary problems in multiple sclerosis, which meets the requirements of the Rasch measurement model. Summed ordinal scores can be converted to interval-level using the transformation table provided. © 2016 Wiley Periodicals, Inc.
A Note on the Computation of the Second-Order Derivatives of the Elementary Symmetric Functions in the Rasch Model.

ERIC Educational Resources Information Center

Formann, Anton K.

1986-01-01

It is shown that for equal parameters explicit formulas exist, facilitating the application of the Newton-Raphson procedure to estimate the parameters in the Rasch model and related models according to the conditional maximum likelihood principle. (Author/LMO)
Rasch analysis of the hospital anxiety and depression scale among Chinese cataract patients.

PubMed

Lin, Xianchai; Chen, Ziyan; Jin, Ling; Gao, Wuyou; Qu, Bo; Zuo, Yajing; Liu, Rongjiao; Yu, Minbin

2017-01-01

To analyze the validity of the Hospital Anxiety and Depression Scale (HADS) among Chinese cataract population. A total of 275 participants with unilateral or bilateral cataract were recruited to complete the Chinese version of HADS. The patients' demographic and ophthalmic characteristics were documented. Rasch analysis was conducted to examine the model fit statistics, the thresholds ordering of the polytomous items, targeting, person separation index and reliability, local dependency, unidimentionality, differential item functioning (DIF) and construct validity of the HADS individual and summary measures. Rasch analysis was performed on anxiety and depression subscales as well as HADS-Total score respectively. The items of original HADS-Anxiety, HADS-Depression and HADS-Total demonstrated evidence of misfit of the Rasch model. Removing items A7 for anxiety subscale and rescoring items D14 for depression subscale significantly improved Rasch model fit. A 12-item higher order total scale with further removal of D12 was found to fit the Rasch model. The modified items had ordered response thresholds. No uniform DIF was detected, whereas notable non-uniform DIF in high-ability group was found. The revised cut-off points were given for the modified anxiety and depression subscales. The modified version of HADS with HADS-A and HADS-D as subscale and HADS-T as a higher-order measure is a reliable and valid instrument that may be useful for assessing anxiety and depression states in Chinese cataract population.
The Rasch Rating Model and the Disordered Threshold Controversy

ERIC Educational Resources Information Center

Adams, Raymond J.; Wu, Margaret L.; Wilson, Mark

2012-01-01

The Rasch rating (or partial credit) model is a widely applied item response model that is used to model ordinal observed variables that are assumed to collectively reflect a common latent variable. In the application of the model there is considerable controversy surrounding the assessment of fit. This controversy is most notable when the set of…
Rasch Analysis of the General Self-Efficacy Scale in Workers with Traumatic Limb Injuries.

PubMed

Wu, Tzu-Yi; Yu, Wan-Hui; Huang, Chien-Yu; Hou, Wen-Hsuan; Hsieh, Ching-Lin

2016-09-01

Purpose The purpose of this study was to apply Rasch analysis to examine the unidimensionality and reliability of the General Self-Efficacy Scale (GSE) in workers with traumatic limb injuries. Furthermore, if the items of the GSE fitted the Rasch model's assumptions, we transformed the raw sum ordinal scores of the GSE into Rasch interval scores. Methods A total of 1076 participants completed the GSE at 1 month post injury. Rasch analysis was used to examine the unidimensionality and person reliability of the GSE. The unidimensionality of the GSE was verified by determining whether the items fit the Rasch model's assumptions: (1) item fit indices: infit and outfit mean square (MNSQ) ranged from 0.6 to 1.4; and (2) the eigenvalue of the first factor extracted from principal component analysis (PCA) for residuals was <2. Person reliability was calculated. Results The unidimensionality of the 10-item GSE was supported in terms of good item fit statistics (infit and outfit MNSQ ranging from 0.92 to 1.32) and acceptable eigenvalues (1.6) of the first factor of the PCA, with person reliability = 0.89. Consequently, the raw sum scores of the GSE were transformed into Rasch scores. Conclusions The results indicated that the items of GSE are unidimensional and have acceptable person reliability in workers with traumatic limb injuries. Additionally, the raw sum scores of the GSE can be transformed into Rasch interval scores for prospective users to quantify workers' levels of self-efficacy and to conduct further statistical analyses.
Rasch analysis of Stamps's Index of Work Satisfaction in nursing population.

PubMed

Ahmad, Nora; Oranye, Nelson Ositadimma; Danilov, Alyona

2017-01-01

One of the most commonly used tools for measuring job satisfaction in nursing is the Stamps Index of Work Satisfaction. Several studies have reported on the reliability of the Stamps' tool based on traditional statistical model. The aim of this study was to apply the Rasch model to examine the adequacy of Stamps's Index of Work Satisfaction for measuring nurses' job satisfaction cross-culturally and to determine the validity and reliability of the instrument using the Rasch criteria. A secondary data analysis was conducted on a sample of 556 registered nurses from two countries. The RUMM 2030 software was used to analyse the psychometric properties of the Index of Work Satisfaction. The persons mean location of -0.018 approximated the items mean of 0.00, suggesting a good alignment of the measure and the traits being measured. However, at the items level, some items were misfiting to the Rasch model.
Exploring the measurement properties of the osteopathy clinical teaching questionnaire using Rasch analysis.

PubMed

Vaughan, Brett

2018-01-01

Clinical teaching evaluations are common in health profession education programs to ensure students are receiving a quality clinical education experience. Questionnaires students use to evaluate their clinical teachers have been developed in professions such as medicine and nursing. The development of a questionnaire that is specifically for the osteopathy on-campus, student-led clinic environment is warranted. Previous work developed the 30-item Osteopathy Clinical Teaching Questionnaire. The current study utilised Rasch analysis to investigate the construct validity of the Osteopathy Clinical Teaching Questionnaire and provide evidence for the validity argument through fit to the Rasch model. Senior osteopathy students at four institutions in Australia, New Zealand and the United Kingdom rated their clinical teachers using the Osteopathy Clinical Teaching Questionnaire. Three hundred and ninety-nine valid responses were received and the data were evaluated for fit to the Rasch model. Reliability estimations (Cronbach's alpha and McDonald's omega) were also evaluated for the final model. The initial analysis demonstrated the data did not fit the Rasch model. Accordingly, modifications to the questionnaire were made including removing items, removing person responses, and rescoring one item. The final model contained 12 items and fit to the Rasch model was adequate. Support for unidimensionality was demonstrated through both the Principal Components Analysis/t-test, and the Cronbach's alpha and McDonald's omega reliability estimates. Analysis of the questionnaire using McDonald's omega hierarchical supported a general factor (quality of clinical teaching in osteopathy). The evidence for unidimensionality and the presence of a general factor support the calculation of a total score for the questionnaire as a sufficient statistic. Further work is now required to investigate the reliability of the 12-item Osteopathy Clinical Teaching Questionnaire to provide evidence for the validity argument.
Rasch Modeling of Revised Token Test Performance: Validity and Sensitivity to Change

ERIC Educational Resources Information Center

Hula, William; Doyle, Patrick J.; McNeil, Malcolm R.; Mikolic, Joseph M.

2006-01-01

The purpose of this research was to examine the validity of the 55-item Revised Token Test (RTT) and to compare traditional and Rasch-based scores in their ability to detect group differences and change over time. The 55-item RTT was administered to 108 left- and right-hemisphere stroke survivors, and the data were submitted to Rasch analysis.…
ESEA Title I Linking Project. Final Report.

ERIC Educational Resources Information Center

Holmes, Susan E.

The Rasch model for test score equating was compared with three other equating procedures as methods for implementing the norm referenced method (RMC Model A) of evaluating ESEA Title I projects. The Rasch model and its theoretical limitations were described. The three other equating methods used were: linear observed score equating, linear true…
The Nature of Objectivity with the Rasch Model.

ERIC Educational Resources Information Center

Whitely, Susan E.; Dawis, Rene V.

Although it has been claimed that the Rasch model leads to a higher degree of objectivity in measurement than has been previously possible, this model has had little impact on test development. Population-invariant item and ability calibrations along with the statistical equivalency of any two item subsets are supposedly possible if the item pool…
A Note on the Usefulness of the Behavioural Rasch Selection Model for Causal Inference in the Social Sciences

NASA Astrophysics Data System (ADS)

Rabbitt, Matthew P.

2016-11-01

Social scientists are often interested in examining causal relationships where the outcome of interest is represented by an intangible concept, such as an individual's well-being or ability. Estimating causal relationships in this scenario is particularly challenging because the social scientist must rely on measurement models to measure individual's properties or attributes and then address issues related to survey data, such as omitted variables. In this paper, the usefulness of the recently proposed behavioural Rasch selection model is explored using a series of Monte Carlo experiments. The behavioural Rasch selection model is particularly useful for these types of applications because it is capable of estimating the causal effect of a binary treatment effect on an outcome that is represented by an intangible concept using cross-sectional data. Other methodology typically relies of summary measures from measurement models that require additional assumptions, some of which make these approaches less efficient. Recommendations for application of the behavioural Rasch selection model are made based on results from the Monte Carlo experiments.
A rasch analysis of the Manchester foot pain and disability index

PubMed Central

Muller, Sara; Roddy, Edward

2009-01-01

Background There is currently no interval-level measure of foot-related disability and this has hampered research in this area. The Manchester Foot Pain and Disability Index (FPDI) could potentially fill this gap. Objective To assess the fit of the three subscales (function, pain, appearance) of the FPDI to the Rasch unidimensional measurement model in order to form interval-level scores. Methods A two-stage postal survey at a general practice in the UK collected data from 149 adults aged 50 years and over with foot pain. The 17 FPDI items, in three subscales, were assessed for their fit to the Rasch model. Checks were carried out for differential item functioning by age and gender. Results The function and pain items fit the Rasch model and interval-level scores can be constructed. There were too few people without extreme scores on the appearance subscale to allow fit to the Rasch model to be tested. Conclusion The items from the FPDI function and pain subscales can be used to obtain interval level scores for these factors for use in future research studies in older adults. Further work is needed to establish the interval nature of these subscale scores in more diverse populations and to establish the measurement properties of these interval-level scores. PMID:19878536
Rasch model analysis of the Depression, Anxiety and Stress Scales (DASS)

PubMed Central

Shea, Tracey L; Tennant, Alan; Pallant, Julie F

2009-01-01

Background There is a growing awareness of the need for easily administered, psychometrically sound screening tools to identify individuals with elevated levels of psychological distress. Although support has been found for the psychometric properties of the Depression, Anxiety and Stress Scales (DASS) using classical test theory approaches it has not been subjected to Rasch analysis. The aim of this study was to use Rasch analysis to assess the psychometric properties of the DASS-21 scales, using two different administration modes. Methods The DASS-21 was administered to 420 participants with half the sample responding to a web-based version and the other half completing a traditional pencil-and-paper version. Conformity of DASS-21 scales to a Rasch partial credit model was assessed using the RUMM2020 software. Results To achieve adequate model fit it was necessary to remove one item from each of the DASS-21 subscales. The reduced scales showed adequate internal consistency reliability, unidimensionality and freedom from differential item functioning for sex, age and mode of administration. Analysis of all DASS-21 items combined did not support its use as a measure of general psychological distress. A scale combining the anxiety and stress items showed satisfactory fit to the Rasch model after removal of three items. Conclusion The results provide support for the measurement properties, internal consistency reliability, and unidimensionality of three slightly modified DASS-21 scales, across two different administration methods. The further use of Rasch analysis on the DASS-21 in larger and broader samples is recommended to confirm the findings of the current study. PMID:19426512
Construct validity of the Swedish version of the revised piper fatigue scale in an oncology sample--a Rasch analysis.

PubMed

Lundgren-Nilsson, Asa; Dencker, Anna; Jakobsson, Sofie; Taft, Charles; Tennant, Alan

2014-06-01

Fatigue is a common and distressing symptom in cancer patients due to both the disease and its treatments. The concept of fatigue is multidimensional and includes both physical and mental components. The 22-item Revised Piper Fatigue Scale (RPFS) is a multidimensional instrument developed to assess cancer-related fatigue. This study reports on the construct validity of the Swedish version of the RPFS from the perspective of Rasch measurement. The Swedish version of the RPFS was answered by 196 cancer patients fatigued after 4 to 5 weeks of curative radiation therapy. Data from the scale were fitted to the Rasch measurement model. This involved testing a series of assumptions, including the stochastic ordering of items, local response dependency, and unidimensionality. A series of fit statistics were computed, differential item functioning (DIF) was tested, and local response dependency was accommodated through testlets. The Behavioral, Affective and Sensory domains all satisfied the Rasch model expectations. No DIF was observed, and all domains were found to be unidimensional. The Mood/Cognitive scale failed to fit the model, and substantial multidimensionality was found. Splitting the scale between Mood and Cognitive items resolved fit to the Rasch model, and new domains were unidimensional without DIF. The current Rasch analyses add to the evidence of measurement properties of the scale and show that the RPFS has good psychometric properties and works well to measure fatigue. The original four-factor structure, however, was not supported. Copyright © 2014 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Rasch model analysis of the Depression, Anxiety and Stress Scales (DASS).

PubMed

Shea, Tracey L; Tennant, Alan; Pallant, Julie F

2009-05-09

There is a growing awareness of the need for easily administered, psychometrically sound screening tools to identify individuals with elevated levels of psychological distress. Although support has been found for the psychometric properties of the Depression, Anxiety and Stress Scales (DASS) using classical test theory approaches it has not been subjected to Rasch analysis. The aim of this study was to use Rasch analysis to assess the psychometric properties of the DASS-21 scales, using two different administration modes. The DASS-21 was administered to 420 participants with half the sample responding to a web-based version and the other half completing a traditional pencil-and-paper version. Conformity of DASS-21 scales to a Rasch partial credit model was assessed using the RUMM2020 software. To achieve adequate model fit it was necessary to remove one item from each of the DASS-21 subscales. The reduced scales showed adequate internal consistency reliability, unidimensionality and freedom from differential item functioning for sex, age and mode of administration. Analysis of all DASS-21 items combined did not support its use as a measure of general psychological distress. A scale combining the anxiety and stress items showed satisfactory fit to the Rasch model after removal of three items. The results provide support for the measurement properties, internal consistency reliability, and unidimensionality of three slightly modified DASS-21 scales, across two different administration methods. The further use of Rasch analysis on the DASS-21 in larger and broader samples is recommended to confirm the findings of the current study.
A Rasch-validated version of the upper extremity functional index for interval-level measurement of upper extremity function.

PubMed

Hamilton, Clayon B; Chesworth, Bert M

2013-11-01

The original 20-item Upper Extremity Functional Index (UEFI) has not undergone Rasch validation. The purpose of this study was to determine whether Rasch analysis supports the UEFI as a measure of a single construct (ie, upper extremity function) and whether a Rasch-validated UEFI has adequate reproducibility for individual-level patient evaluation. This was a secondary analysis of data from a repeated-measures study designed to evaluate the measurement properties of the UEFI over a 3-week period. Patients (n=239) with musculoskeletal upper extremity disorders were recruited from 17 physical therapy clinics across 4 Canadian provinces. Rasch analysis of the UEFI measurement properties was performed. If the UEFI did not fit the Rasch model, misfitting patients were deleted, items with poor response structure were corrected, and misfitting items and redundant items were deleted. The impact of differential item functioning on the ability estimate of patients was investigated. A 15-item modified UEFI was derived to achieve fit to the Rasch model where the total score was supported as a measure of upper extremity function only. The resultant UEFI-15 interval-level scale (0-100, worst to best state) demonstrated excellent internal consistency (person separation index=0.94) and test-retest reliability (intraclass correlation coefficient [2,1]=.95). The minimal detectable change at the 90% confidence interval was 8.1. Patients who were ambidextrous or bilaterally affected were excluded to allow for the analysis of differential item functioning due to limb involvement and arm dominance. Rasch analysis did not support the validity of the 20-item UEFI. However, the UEFI-15 was a valid and reliable interval-level measure of a single dimension: upper extremity function. Rasch analysis supports using the UEFI-15 in physical therapist practice to quantify upper extremity function in patients with musculoskeletal disorders of the upper extremity.
A Rasch-Validated Version of the Upper Extremity Functional Index for Interval-Level Measurement of Upper Extremity Function

PubMed Central

Chesworth, Bert M.

2013-01-01

Background The original 20-item Upper Extremity Functional Index (UEFI) has not undergone Rasch validation. Objective The purpose of this study was to determine whether Rasch analysis supports the UEFI as a measure of a single construct (ie, upper extremity function) and whether a Rasch-validated UEFI has adequate reproducibility for individual-level patient evaluation. Design This was a secondary analysis of data from a repeated-measures study designed to evaluate the measurement properties of the UEFI over a 3-week period. Methods Patients (n=239) with musculoskeletal upper extremity disorders were recruited from 17 physical therapy clinics across 4 Canadian provinces. Rasch analysis of the UEFI measurement properties was performed. If the UEFI did not fit the Rasch model, misfitting patients were deleted, items with poor response structure were corrected, and misfitting items and redundant items were deleted. The impact of differential item functioning on the ability estimate of patients was investigated. Results A 15-item modified UEFI was derived to achieve fit to the Rasch model where the total score was supported as a measure of upper extremity function only. The resultant UEFI-15 interval-level scale (0–100, worst to best state) demonstrated excellent internal consistency (person separation index=0.94) and test-retest reliability (intraclass correlation coefficient [2,1]=.95). The minimal detectable change at the 90% confidence interval was 8.1. Limitations Patients who were ambidextrous or bilaterally affected were excluded to allow for the analysis of differential item functioning due to limb involvement and arm dominance. Conclusion Rasch analysis did not support the validity of the 20-item UEFI. However, the UEFI-15 was a valid and reliable interval-level measure of a single dimension: upper extremity function. Rasch analysis supports using the UEFI-15 in physical therapist practice to quantify upper extremity function in patients with musculoskeletal disorders of the upper extremity. PMID:23813086
Dimensionality of the Knee Numeric-Entity Evaluation Score (KNEES-ACL): a condition-specific questionnaire.

PubMed

Comins, J D; Krogsgaard, M R; Kreiner, S; Brodersen, J

2013-10-01

The benefit of anterior cruciate ligament (ACL) reconstruction has been questioned based on patient-reported outcome measures (PROMs). Valid interpretation of such results requires confirmation of the psychometric properties of the PROM. Rasch analysis is the gold standard for validation of PROMs, yet PROMs used for ACL reconstruction have not been validated using Rasch analysis. We used Rasch analysis to investigate the psychometric properties of the Knee Numeric-Entity Evaluation Score (KNEES-ACL), a newly developed PROM for patients treated for ACL deficiency. Two-hundred forty-two patients pre- and post-ACL reconstruction completed the pilot PROM. Rasch models were used to assess the psychometric properties (e.g., unidimensionality, local response dependency, and differential item functioning). Forty-one items distributed across seven unidimensional constructs measuring impairment, functional limitations, and psychosocial consequences were confirmed to fit Rasch models. Fourteen items were removed because of statistical lack of fit and inadequate face validity. Local response dependency and differential item functioning were identified and adjusted. The KNEES-ACL is the first Rasch-validated condition-specific PROM constructed for patients with ACL deficiency and patients with ACL reconstruction. Thus, this instrument can be used for within- and between-group comparisons. © 2013 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Applying Rasch model analysis in the development of the cantonese tone identification test (CANTIT).

PubMed

Lee, Kathy Y S; Lam, Joffee H S; Chan, Kit T Y; van Hasselt, Charles Andrew; Tong, Michael C F

2017-01-01

Applying Rasch analysis to evaluate the internal structure of a lexical tone perception test known as the Cantonese Tone Identification Test (CANTIT). A 75-item pool (CANTIT-75) with pictures and sound tracks was developed. Respondents were required to make a four-alternative forced choice on each item. A short version of 30 items (CANTIT-30) was developed based on fit statistics, difficulty estimates, and content evaluation. Internal structure was evaluated by fit statistics and Rasch Factor Analysis (RFA). 200 children with normal hearing and 141 children with hearing impairment were recruited. For CANTIT-75, all infit and 97% of outfit values were < 2.0. RFA revealed 40.1% of total variance was explained by the Rasch measure. The first residual component explained 2.5% of total variance in an eigenvalue of 3.1. For CANTIT-30, all infit and outfit values were < 2.0. The Rasch measure explained 38.8% of total variance, the first residual component explained 3.9% of total variance in an eigenvalue of 1.9. The Rasch model provides excellent guidance for the development of short forms. Both CANTIT-75 and CANTIT-30 possess satisfactory internal structure as a construct validity evidence in measuring the lexical tone identification ability of the Cantonese speakers.

Item Information in the Rasch Model. Project Psychometric Aspects of Item Banking No. 34. Research Report 88-7.

ERIC Educational Resources Information Center

Engelen, Ron J. H.; And Others

Fisher's information measure for the item difficulty parameter in the Rasch model and its marginal and conditional formulations are investigated. It is shown that expected item information in the unconditional model equals information in the marginal model, provided the assumption of sampling examinees from an ability distribution is made. For the…
Applying Rasch analysis to evaluate measurement equivalence of different administration formats of the Activity Limitation scale of the Cambridge Pulmonary Hypertension Outcome Review (CAMPHOR).

PubMed

Twiss, J; McKenna, S P; Graham, J; Swetz, K; Sloan, J; Gomberg-Maitland, M

2016-04-09

Electronic formats of patient-reported outcome (PRO) measures are now routinely used in clinical research studies. When changing from a validated paper and pen to electronic administration it is necessary to establish their equivalence. This study reports on the value of Rasch analysis in this process. Three groups of US pulmonary hypertension (PH) patients participated. The first completed an electronic version of the CAMPHOR Activity Limitation scale (e-sample) and this was compared with two pen and paper administrated samples (pp1 and pp2). The three databases were combined and analysed for fit to the Rasch model. Equivalence was evaluated by differential item functioning (DIF) analyses. The three datasets were matched randomly in terms of sample size (n = 147). Mean age (years) and percentage of male respondents were as follows: e-sample (51.7, 16.0 %); pp1 (50.0, 14.0 %); pp2 (55.5, 40.4 %). The combined dataset achieved fit to the Rasch model. Two items showed evidence of borderline DIF. Further analyses showed the inclusion of these items had little impact on Rasch estimates indicating the DIF identified was unimportant. Differences between the performance of the electronic and pen and paper administrations of the CAMPHOR Activity Limitation scale were minor. The results were successful in showing how the Rasch model can be used to determine the equivalence of alternative formats of PRO measures.
Quantifying Local, Response Dependence between Two Polytomous Items Using the Rasch Model

ERIC Educational Resources Information Center

Andrich, David; Humphry, Stephen M.; Marais, Ida

2012-01-01

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…
Examining the Invariance of Rater and Project Calibrations Using a Multi-facet Rasch Model.

ERIC Educational Resources Information Center

O'Neill, Thomas R.; Lunz, Mary E.

To generalize test results beyond the particular test administration, an examinee's ability estimate must be independent of the particular items attempted, and the item difficulty calibrations must be independent of the particular sample of people attempting the items. This stability is a key concept of the Rasch model, a latent trait model of…
An Introduction to Item Response Theory and Rasch Models for Speech-Language Pathologists

ERIC Educational Resources Information Center

Baylor, Carolyn; Hula, William; Donovan, Neila J.; Doyle, Patrick J.; Kendall, Diane; Yorkston, Kathryn

2011-01-01

Purpose: To present a primarily conceptual introduction to item response theory (IRT) and Rasch models for speech-language pathologists (SLPs). Method: This tutorial introduces SLPs to basic concepts and terminology related to IRT as well as the most common IRT models. The article then continues with an overview of how instruments are developed…
Extensions of Rasch's Multiplicative Poisson Model.

ERIC Educational Resources Information Center

Jansen, Margo G. H.; van Duijn, Marijtje A. J.

1992-01-01

A model developed by G. Rasch that assumes scores on some attainment tests can be realizations of a Poisson process is explained and expanded by assuming a prior distribution, with fixed but unknown parameters, for the subject parameters. How additional between-subject and within-subject factors can be incorporated is discussed. (SLD)
Item Screening in Graphical Loglinear Rasch Models

ERIC Educational Resources Information Center

Kreiner, Svend; Christensen, Karl Bang

2011-01-01

In behavioural sciences, local dependence and DIF are common, and purification procedures that eliminate items with these weaknesses often result in short scales with poor reliability. Graphical loglinear Rasch models (Kreiner & Christensen, in "Statistical Methods for Quality of Life Studies," ed. by M. Mesbah, F.C. Cole & M.T.…
Evaluation of the Bess TRS-CA Using the Rasch Rating Scale Model

ERIC Educational Resources Information Center

DiStefano, Christine; Morgan, Grant B.

2010-01-01

This study examined the Behavioral and Emotional Screening System Teacher Rating System for Children and Adolescents (BESS TRS-CA; Kamphaus & Reynolds, 2007) screener using Rasch Rating Scale model (RSM) methodology to provide additional information about psychometric properties of items. Data from the Behavioral Assessment System for Children…
Multidimensional fatigue inventory and post-polio syndrome - a Rasch analysis.

PubMed

Dencker, Anna; Sunnerhagen, Katharina S; Taft, Charles; Lundgren-Nilsson, Åsa

2015-02-12

Fatigue is a common symptom in post-polio syndrome (PPS) and can have a substantial impact on patients. There is a need for validated questionnaires to assess fatigue in PPS for use in clinical practice and research. The aim with this study was to assess the validity and reliability of the Swedish version of Multidimensional Fatigue Inventory (MFI-20) in patients with PPS using the Rasch model. A total of 231 patients diagnosed with PPS completed the Swedish MFI-20 questionnaire at post-polio out-patient clinics in Sweden. The mean age of participants was 62 years and 61% were females. Data were tested against assumptions of the Rasch measurement model (i.e. unidimensionality of the scale, good item fit, independency of items and absence of differential item functioning). Reliability was tested with the person separation index (PSI). A transformation of the ordinal total scale scores into an interval scale for use in parametric analysis was performed. Dummy cases with minimum and maximum scoring were used for the transformation table to achieve interval scores between 20 and 100, which are comprehensive limits for the MFI-20 scale. An initial Rasch analysis of the full scale with 20 items showed misfit to the Rasch model (p < 0.001). Seven items showed slightly disordered thresholds and person estimates were not significantly improved by rescoring items. Analysis of MFI-20 scale with the 5 MFI-20 subscales as testlets showed good fit with a non-significant x (2) value (p = 0.089). PSI for the testlet solution was 0.86. Local dependency was present in all subscales and fit to the Rasch model was solved with testlets within each subscale. PSI ranged from 0.52 to 0.82 in the subscales. This study shows that the Swedish MFI-20 total scale and subscale scores yield valid and reliable measures of fatigue in persons with post-polio syndrome. The Rasch transformed total scores can be used for parametric statistical analyses in future clinical studies.
Rasch analysis of the London Handicap Scale in stroke patients: a cross-sectional study.

PubMed

Park, Eun-Young; Choi, Yoo-Im

2014-07-31

Although activity and participation are the target domains in stroke rehabilitation interventions, there is insufficient evidence available regarding the validity of participation measurement. The purpose of this study was to investigate the psychometric properties of the London Handicap Scale in community-dwelling stroke patients, using Rasch analysis. Participants were 170 community-dwelling stroke survivors. The data were analyzed using Winsteps (version 3.62) with the Rasch model to determine the unidimensionality of item fit, the distribution of item difficulty, and the reliability and suitability of the rating process for the London Handicap Scale. Data of 16 participants did not fit the Rasch model and there were no misfitting items. The person separation value was 2.42, and the reliability was .85; furthermore, the rating process for the London Handicap Scale was found to be suitable for use with stroke patients. This was the first trial to investigate the psychometric properties of the London Handicap Scale using Rasch analysis; the results supported the suitability of this scale for use with stroke patients.
Understanding Rasch Measurement: Rasch Techniques for Detecting Bias in Performance Assessments: An Example Comparing the Performance of Native and Non-native Speakers on a Test of Academic English.

ERIC Educational Resources Information Center

Elder, Catherine; McNamara, Tim; Congdon, Peter

2003-01-01

Used Rasch analytic procedures to study item bias or differential item functioning in both dichotomous and scalar items on a test of English for academic purposes. Results for 139 college students on a pilot English language test model the approach and illustrate the measurement challenges posed by a diagnostic instrument to measure English…
Psychometric assessment of HIV/STI sexual risk scale among MSM: a Rasch model approach.

PubMed

Li, Jian; Liu, Hongjie; Liu, Hui; Feng, Tiejian; Cai, Yumao

2011-10-05

Little research has assessed the degree of severity and ordering of different types of sexual behaviors for HIV/STI infection in a measurement scale. The purpose of this study was to apply the Rasch model on psychometric assessment of an HIV/STI sexual risk scale among men who have sex with men (MSM). A cross-sectional study using respondent driven sampling was conducted among 351 MSM in Shenzhen, China. The Rasch model was used to examine the psychometric properties of an HIV/STI sexual risk scale including nine types of sexual behaviors. The Rasch analysis of the nine items met the unidimensionality and local independence assumption. Although the person reliability was low at 0.35, the item reliability was high at 0.99. The fit statistics provided acceptable infit and outfit values. Item difficulty invariance analysis showed that the item estimates of the risk behavior items were invariant (within error). The findings suggest that the Rasch model can be utilized for measuring the level of sexual risk for HIV/STI infection as a single latent construct and for establishing the relative degree of severity of each type of sexual behavior in HIV/STI transmission and acquisition among MSM. The measurement scale provides a useful measurement tool to inform, design and evaluate behavioral interventions for HIV/STI infection among MSM.
Is Going Beyond Rasch Analysis Necessary to Assess the Construct Validity of a Motor Function Scale?

PubMed

Guillot, Tiffanie; Roche, Sylvain; Rippert, Pascal; Hamroun, Dalil; Iwaz, Jean; Ecochard, René; Vuillerot, Carole

2018-04-03

To examine whether a Rasch analysis is sufficient to establish the construct validity of the Motor Function Measure (MFM) and discuss whether weighting the MFM item scores would improve the MFM construct validity. Observational cross-sectional multicenter study. Twenty-three physical medicine departments, neurology departments, or reference centers for neuromuscular diseases. Patients (N=911) aged 6 to 60 years with Charcot-Marie-Tooth disease (CMT), facioscapulohumeral dystrophy (FSHD), or myotonic dystrophy type 1 (DM1). None. Comparison of the goodness-of-fit of the confirmatory factor analysis (CFA) model vs that of a modified multidimensional Rasch model on MFM item scores in each considered disease. The CFA model showed good fit to the data and significantly better goodness of fit than the modified multidimensional Rasch model regardless of the disease (P<.001). Statistically significant differences in item standardized factor loadings were found between DM1, CMT, and FSHD in only 6 of 32 items (items 6, 27, 2, 7, 9 and 17). For multidimensional scales designed to measure patient abilities in various diseases, a Rasch analysis might not be the most convenient, whereas a CFA is able to establish the scale construct validity and provide weights to adapt the item scores to a specific disease. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
A Negative Binomial Regression Model for Accuracy Tests

ERIC Educational Resources Information Center

Hung, Lai-Fa

2012-01-01

Rasch used a Poisson model to analyze errors and speed in reading tests. An important property of the Poisson distribution is that the mean and variance are equal. However, in social science research, it is very common for the variance to be greater than the mean (i.e., the data are overdispersed). This study embeds the Rasch model within an…
Validating Translation Test Items via the Many-Facet Rasch Model.

PubMed

Tseng, Wen-Ta; Su, Tzi-Ying; Nix, John-Michael L

2018-01-01

This study applied the many-facet Rasch model to assess learners' translation ability in an English as a foreign language context. Few attempts have been made in extant research to detect and calibrate rater severity in the domain of translation testing. To fill the research gap, this study documented the process of validating a test of Chinese-to-English sentence translation and modeled raters' scoring propensity defined by harshness or leniency, expert/novice effects on severity, and concomitant effects on item difficulty. Two hundred twenty-five, third-year senior high school Taiwanese students and six educators from tertiary and secondary educational institutions served as participants. The students' mean age was 17.80 years ( SD = 1.20, range 17-19). The exam consisted of 10 translation items adapted from two entrance exam tests. The results showed that this subjectively scored performance assessment exhibited robust unidimensionality, thus reliably measuring translation ability free from unmodeled disturbances. Furthermore, discrepancies in ratings between novice and expert raters were also identified and modeled by the many-facet Rasch model. The implications for applying the many-facet Rasch model in translation tests at the tertiary level were discussed.
Psychometric properties of the NEPSY-II affect recognition subtest in a preschool sample: a Rasch modeling approach.

PubMed

Yao, Shih-Ying; Bull, Rebecca; Khng, Kiat Hui; Rahim, Anisa

2018-01-01

Understanding a child's ability to decode emotion expressions is important to allow early interventions for potential difficulties in social and emotional functioning. This study applied the Rasch model to investigate the psychometric properties of the NEPSY-II Affect Recognition subtest, a U.S. normed measure for 3-16 year olds which assesses the ability to recognize facial expressions of emotion. Data were collected from 1222 children attending preschools in Singapore. We first performed the Rasch analysis with the raw item data, and examined the technical qualities and difficulty pattern of the studied items. We subsequently investigated the relation of the estimated affect recognition ability from the Rasch analysis to a teacher-reported measure of a child's behaviors, emotions, and relationships. Potential gender differences were also examined. The Rasch model fits our data well. Also, the NEPSY-II Affect Recognition subtest was found to have reasonable technical qualities, expected item difficulty pattern, and desired association with the external measure of children's behaviors, emotions, and relationships for both boys and girls. Overall, findings from this study suggest that the NEPSY-II Affect Recognition subtest is a promising measure of young children's affect recognition ability. Suggestions for future test improvement and research were discussed.
Optimal Designs for the Rasch Model

ERIC Educational Resources Information Center

Grasshoff, Ulrike; Holling, Heinz; Schwabe, Rainer

2012-01-01

In this paper, optimal designs will be derived for estimating the ability parameters of the Rasch model when difficulty parameters are known. It is well established that a design is locally D-optimal if the ability and difficulty coincide. But locally optimal designs require that the ability parameters to be estimated are known. To attenuate this…
Rasch Model Based Analysis of the Force Concept Inventory

ERIC Educational Resources Information Center

Planinic, Maja; Ivanjek, Lana; Susac, Ana

2010-01-01

The Force Concept Inventory (FCI) is an important diagnostic instrument which is widely used in the field of physics education research. It is therefore very important to evaluate and monitor its functioning using different tools for statistical analysis. One of such tools is the stochastic Rasch model, which enables construction of linear…
Guessing and the Rasch Model

ERIC Educational Resources Information Center

Holster, Trevor A.; Lake, J.

2016-01-01

Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Analysis of the Professional Choice Self-Efficacy Scale Using the Rasch-Andrich Rating Scale Model

ERIC Educational Resources Information Center

Ambiel, Rodolfo A. M.; Noronha, Ana Paula Porto; de Francisco Carvalho, Lucas

2015-01-01

The aim of this research was to analyze the psychometrics properties of the professional choice self-efficacy scale (PCSES), using the Rasch-Andrich rating scale model. The PCSES assesses four factors: self-appraisal, gathering occupational information, practical professional information search and future planning. Participants were 883 Brazilian…

Fitting the Mixed Rasch Model to a Reading Comprehension Test: Identifying Reader Types

ERIC Educational Resources Information Center

Baghaei, Purya; Carstensen, Claus H.

2013-01-01

Standard unidimensional Rasch models assume that persons with the same ability parameters are comparable. That is, the same interpretation applies to persons with identical ability estimates as regards the underlying mental processes triggered by the test. However, research in cognitive psychology shows that persons at the same trait level may…
A Monte Carlo Approach to Unidimensionality Testing in Polytomous Rasch Models

ERIC Educational Resources Information Center

Christensen, Karl Bang; Kreiner, Svend

2007-01-01

Many statistical tests are designed to test the different assumptions of the Rasch model, but only few are directed at detecting multidimensionality. The Martin-Lof test is an attractive approach, the disadvantage being that its null distribution deviates strongly from the asymptotic chi-square distribution for most realistic sample sizes. A Monte…
Evaluation of Two Teaching Programs Based on Structural Learning Principles.

ERIC Educational Resources Information Center

Haussler, Peter

1978-01-01

Structural learning theory and the Rasch model measured learning gain, retention, and transfer in 1,037 students, grades 7-10. Students learned nine functional relationships with either spontaneous or synthetic algorithms. The Rasch model gave the better description of the data. The hypothesis that the synthetic method was superior was refuted.…
Validity and Realibility of Chemistry Systemic Multiple Choices Questions (CSMCQs)

ERIC Educational Resources Information Center

Priyambodo, Erfan; Marfuatun

2016-01-01

Nowadays, Rasch model analysis is used widely in social research, moreover in educational research. In this research, Rasch model is used to determine the validation and the reliability of systemic multiple choices question in chemistry teaching and learning. There were 30 multiple choices question with systemic approach for high school student…
Estimation of the Proportion of Underachieving Students in Compulsory Secondary Education in Spain: An Application of the Rasch Model

PubMed Central

Veas, Alejandro; Gilar, Raquel; Miñano, Pablo; Castejón, Juan-Luis

2016-01-01

There are very few studies in Spain that treat underachievement rigorously, and those that do are typically related to gifted students. The present study examined the proportion of underachieving students using the Rasch measurement model. A sample of 643 first-year high school students (mean age = 12.09; SD = 0.47) from 8 schools in the province of Alicante (Spain) completed the Battery of Differential and General Skills (Badyg), and these students' General Points Average (GPAs) were recovered by teachers. Dichotomous and Partial credit Rasch models were performed. After adjusting the measurement instruments, the individual underachievement index provided a total sample of 181 underachieving students, or 28.14% of the total sample across the ability levels. This study confirms that the Rasch measurement model can accurately estimate the construct validity of both the intelligence test and the academic grades for the calculation of underachieving students. Furthermore, the present study constitutes a pioneer framework for the estimation of the prevalence of underachievement in Spain. PMID:26973586
Rasch-built Overall Disability Scale for patients with chemotherapy-induced peripheral neuropathy (CIPN-R-ODS).

PubMed

Binda, D; Vanhoutte, E K; Cavaletti, G; Cornblath, D R; Postma, T J; Frigeni, B; Alberti, P; Bruna, J; Velasco, R; Argyriou, A A; Kalofonos, H P; Psimaras, D; Ricard, D; Pace, A; Galiè, E; Briani, C; Dalla Torre, C; Lalisang, R I; Boogerd, W; Brandsma, D; Koeppen, S; Hense, J; Storey, D; Kerrigan, S; Schenone, A; Fabbri, S; Rossi, E; Valsecchi, M G; Faber, C G; Merkies, I S J; Galimberti, S; Lanzani, F; Mattavelli, L; Piatti, M L; Bidoli, P; Cazzaniga, M; Cortinovis, D; Lucchetta, M; Campagnolo, M; Bakkers, M; Brouwer, B; Boogerd, W; Grant, R; Reni, L; Piras, B; Pessino, A; Padua, L; Granata, G; Leandri, M; Ghignotti, I; Plasmati, R; Pastorelli, F; Heimans, J J; Eurelings, M; Meijer, R J; Grisold, W; Lindeck Pozza, E; Mazzeo, A; Toscano, A; Russo, M; Tomasello, C; Altavilla, G; Penas Prado, M; Dominguez Gonzalez, C; Dorsey, S G

2013-09-01

Chemotherapy-induced peripheral neuropathy (CIPN) is a common neurological side-effect of cancer treatment and may lead to declines in patients' daily functioning and quality of life. To date, there are no modern clinimetrically well-evaluated outcome measures available to assess disability in CIPN patients. The objective of the study was to develop an interval-weighted scale to capture activity limitations and participation restrictions in CIPN patients using the Rasch methodology and to determine its validity and reliability properties. A preliminary Rasch-built Overall Disability Scale (pre-R-ODS) comprising 146 items was assessed twice (interval: 2-3 weeks; test-retest reliability) in 281 CIPN patients with a stable clinical condition. The obtained data were subjected to Rasch analyses to determine whether model expectations would be met, and if necessarily, adaptations were made to obtain proper model fit (internal validity). External validity was obtained by correlating the CIPN-R-ODS with the National Cancer Institute-Common Toxicity Criteria (NCI-CTC) neuropathy scales and the Pain-Intensity Numeric-Rating-Scale (PI-NRS). The preliminary R-ODS did not meet Rasch model's expectations. Items displaying misfit statistics, disordered thresholds, item bias or local dependency were systematically removed. The final CIPN-R-ODS consisting of 28 items fulfilled all the model's expectations with proper validity and reliability, and was unidimensional. The final CIPN-R-ODS is a Rasch-built disease-specific, interval measure suitable to detect disability in CIPN patients and bypasses the shortcomings of classical test theory ordinal-based measures. Its use is recommended in future clinical trials in CIPN. Copyright © 2013 Elsevier Ltd. All rights reserved.
Using the Many-Facet Rasch Model to Evaluate Standard-Setting Judgments: Setting Performance Standards for Advanced Placement® Examinations

ERIC Educational Resources Information Center

Kaliski, Pamela; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna; Plake, Barbara; Reshetar, Rosemary

2012-01-01

The Many-Facet Rasch (MFR) Model is traditionally used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR Model by examining the quality of ratings obtained from a…
Application of the Rasch Rating Scale Model to the Assessment of Quality of Life of Persons with Intellectual Disability

ERIC Educational Resources Information Center

Gomez, Laura E.; Arias, Benito; Verdugo, Miguel Angel; Navas, Patricia

2012-01-01

Background: Most instruments that assess quality of life have been validated by means of the classical test theory (CTT). However, CTT limitations have resulted in the development of alternative models, such as the Rasch rating scale model (RSM). The main goal of this paper is testing and improving the psychometric properties of the INTEGRAL…
A Psychometric Revision of the Asian Values Scale Using the Rasch Model

ERIC Educational Resources Information Center

Kim, Bryan S. K.; Hong, Sehee

2004-01-01

In this article, the 36-item Asian Values Scale (B. S. K. Kim, D. R. Atkinson, & P. H. Yang, 1999) was revised on the basis of G. Rasch's (1960) model and data from 618 Asian Americans. The results led to the establishment of a 25-item measure named the Asian Values Scale-Revised.
An Analysis of Peer Assessment through Many Facet Rasch Model

ERIC Educational Resources Information Center

Sahin, Melek Gülsah; Teker, Gülsen Tasdelen; Güler, Nese

2016-01-01

This study analyses peer assessment through many facet Rasch model (MFRM). The research was performed with 91 undergraduate students and with lecturer teaching the course. The research data were collected with holistic rubric employed by 6 peers and the lecturer in rating the projects prepared by 85 students taking the course. This study analyses…
Assessment of Differential Item Functioning in Testlet-Based Items Using the Rasch Testlet Model

ERIC Educational Resources Information Center

Wang, Wen-Chung; Wilson, Mark

2005-01-01

This study presents a procedure for detecting differential item functioning (DIF) for dichotomous and polytomous items in testlet-based tests, whereby DIF is taken into account by adding DIF parameters into the Rasch testlet model. Simulations were conducted to assess recovery of the DIF and other parameters. Two independent variables, test type…
Georg Rasch and Benjamin Wright's Struggle with the Unidimensional Polytomous Model with Sufficient Statistics

ERIC Educational Resources Information Center

Andrich, David

2016-01-01

This article reproduces correspondence between Georg Rasch of The University of Copenhagen and Benjamin Wright of The University of Chicago in the period from January 1966 to July 1967. This correspondence reveals their struggle to operationalize a unidimensional measurement model with sufficient statistics for responses in a set of ordered…
Identifying Differential Item Functioning of Rating Scale Items with the Rasch Model: An Introduction and an Application

ERIC Educational Resources Information Center

Myers, Nicholas D.; Wolfe, Edward W.; Feltz, Deborah L.; Penfield, Randall D.

2006-01-01

This study (a) provided a conceptual introduction to differential item functioning (DIF), (b) introduced the multifaceted Rasch rating scale model (MRSM) and an associated statistical procedure for identifying DIF in rating scale items, and (c) applied this procedure to previously collected data from American coaches who responded to the coaching…
Making Meaningful Measurement in Survey Research: A Demonstration of the Utility of the Rasch Model. IR Applications. Volume 28

ERIC Educational Resources Information Center

Royal, Kenneth D.

2010-01-01

Quality measurement is essential in every form of research, including institutional research and assessment. This paper addresses the erroneous assumptions institutional researchers often make with regard to survey research and provides an alternative method to producing more valid and reliable measures. Rasch measurement models are discussed and…
Sample Size Determination for Rasch Model Tests

ERIC Educational Resources Information Center

Draxler, Clemens

2010-01-01

This paper is concerned with supplementing statistical tests for the Rasch model so that additionally to the probability of the error of the first kind (Type I probability) the probability of the error of the second kind (Type II probability) can be controlled at a predetermined level by basing the test on the appropriate number of observations.…
Bully-Victimization Scale: Using Rasch Modeling in the Analysis of a Qualitative Scale

ERIC Educational Resources Information Center

Lehto, Marybeth

2009-01-01

The primary purpose of this study was to determine whether the data from the qualitative study fit Rasch model requirements for the definition of a measure, as well as to address concern in the extant literature regarding the appropriate number of items needed in analysis to assure unidimensionality. The self-report victimization scale was…
Invariance, Artifact, and the Psychological Setting of Rasch's Model: Comments on Engelhard

ERIC Educational Resources Information Center

Michell, Joel

2008-01-01

In the following, I confine my comments mainly to the issue of invariance in relation to Rasch's model for dichotomous, ability test items. "It is senseless to seek in the logical process of mathematical elaboration a psychologically significant precision that was not present in the psychological setting of the problem." (Boring, 1920)
Analysis of High School German Textbooks through Rasch Measurement Model

ERIC Educational Resources Information Center

Batdi, Veli; Elaldi, Senel

2016-01-01

The purpose of the present study is to analyze German teacher trainers' views on high school German textbooks through the Rasch measurement model. A survey research design was employed and study group consisted of a total of 21 teacher trainers, three from each region and selected randomly from provinces which are located in seven regions and…
Rasch models suggested the satisfactory psychometric properties of the World Health Organization Quality of Life-Brief among lung cancer patients.

PubMed

Lin, Chung-Ying; Yang, Szu-Chun; Lai, Wu-Wei; Su, Wu-Chou; Wang, Jung-Der

2017-03-01

The study examined whether the items of the World Health Organization Quality of Life-Brief questionnaire can assess its four underlying domains (Physical, Psychological, Social, and Environment) in a sample of lung cancer patients. All patients ( n = 1150) were recruited from a medical center in Tainan, and each participant completed the World Health Organization Quality of Life-Brief. Several Rasch rating scale models were used to examine the data-model fit, and Rasch analyses corroborated that each domain of the World Health Organization Quality of Life-Brief could be unidimensional. Although three items were found to have a poor fit, all the other items fit the unidimensionality with ordered thresholds.
Assessing social isolation in motor neurone disease: a Rasch analysis of the MND Social Withdrawal Scale.

PubMed

Gibbons, Chris J; Thornton, Everard W; Ealing, John; Shaw, Pamela J; Talbot, Kevin; Tennant, Alan; Young, Carolyn A

2013-11-15

Social withdrawal is described as the condition in which an individual experiences a desire to make social contact, but is unable to satisfy that desire. It is an important issue for patients with motor neurone disease who are likely to experience severe physical impairment. This study aims to reassess the psychometric and scaling properties of the MND Social Withdrawal Scale (MND-SWS) domains and examine the feasibility of a summary scale, by applying scale data to the Rasch model. The MND Social Withdrawal Scale was administered to 298 patients with a diagnosis of MND, alongside the Hospital Anxiety and Depression Scale. The factor structure of the MND Social Withdrawal Scale was assessed using confirmatory factor analysis. Model fit, category threshold analysis, differential item functioning (DIF), dimensionality and local dependency were evaluated. Factor analysis confirmed the suitability of the four-factor solution suggested by the original authors. Mokken scale analysis suggested the removal of item five. Rasch analysis removed a further three items; from the Community (one item) and Emotional (two items) withdrawal subscales. Following item reduction, each scale exhibited excellent fit to the Rasch model. A 14-item Summary scale was shown to fit the Rasch model after subtesting the items into three subtests corresponding to the Community, Family and Emotional subscales, indicating that items from these three subscales could be summed together to create a total measure for social withdrawal. Removal of four items from the Social Withdrawal Scale led to a four factor solution with a 14-item hierarchical Summary scale that were all unidimensional, free for DIF and well fitted to the Rasch model. The scale is reliable and allows clinicians and researchers to measure social withdrawal in MND along a unidimensional construct. © 2013. Published by Elsevier B.V. All rights reserved.

Evaluating Instrument Quality in Science Education: Rasch-based analyses of a Nature of Science test

NASA Astrophysics Data System (ADS)

Neumann, Irene; Neumann, Knut; Nehm, Ross

2011-07-01

Given the central importance of the Nature of Science (NOS) and Scientific Inquiry (SI) in national and international science standards and science learning, empirical support for the theoretical delineation of these constructs is of considerable significance. Furthermore, tests of the effects of varying magnitudes of NOS knowledge on domain-specific science understanding and belief require the application of instruments validated in accordance with AERA, APA, and NCME assessment standards. Our study explores three interrelated aspects of a recently developed NOS instrument: (1) validity and reliability; (2) instrument dimensionality; and (3) item scales, properties, and qualities within the context of Classical Test Theory and Item Response Theory (Rasch modeling). A construct analysis revealed that the instrument did not match published operationalizations of NOS concepts. Rasch analysis of the original instrument-as well as a reduced item set-indicated that a two-dimensional Rasch model fit significantly better than a one-dimensional model in both cases. Thus, our study revealed that NOS and SI are supported as two separate dimensions, corroborating theoretical distinctions in the literature. To identify items with unacceptable fit values, item quality analyses were used. A Wright Map revealed that few items sufficiently distinguished high performers in the sample and excessive numbers of items were present at the low end of the performance scale. Overall, our study outlines an approach for how Rasch modeling may be used to evaluate and improve Likert-type instruments in science education.
Evaluation of the internal construct validity of the Personal Care Participation Assessment and Resource Tool (PC-PART) using Rasch analysis.

PubMed

Darzins, Susan; Imms, Christine; Di Stefano, Marilyn; Taylor, Nicholas F; Pallant, Julie F

2014-11-05

The Personal Care Participation Assessment and Resource Tool (PC-PART) is a 43-item, clinician-administered assessment, designed to identify patients' unmet needs (participation restrictions) in activities of daily living (ADL) required for community life. This information is important for identifying problems that need addressing to enable, for example, discharge from inpatient settings to community living. The objective of this study was to evaluate internal construct validity of the PC-PART using Rasch methods. Fit to the Rasch model was evaluated for 41 PC-PART items, assessing threshold ordering, overall model fit, individual item fit, person fit, internal consistency, Differential Item Functioning (DIF), targeting of items and dimensionality. Data used in this research were taken from admission data from a randomised controlled trial conducted at two publically funded inpatient rehabilitation units in Melbourne, Australia, with 996 participants (63% women; mean age 74 years) and with various impairment types. PC-PART items assessed as one scale, and original PC-PART domains evaluated as separate scales, demonstrated poor fit to the Rasch model. Adequate fit to the Rasch model was achieved in two newly formed PC-PART scales: Self-Care (16 items) and Domestic Life (14 items). Both scales were unidimensional, had acceptable internal consistency (PSI =0.85, 0.76, respectively) and well-targeted items. Rasch analysis did not support conventional summation of all PC-PART item scores to create a total score. However, internal construct validity of the newly formed PC-PART scales, Self-Care and Domestic Life, was supported. Their Rasch-derived scores provided interval-level measurement enabling summation of scores to form a total score on each scale. These scales may assist clinicians, managers and researchers in rehabilitation settings to assess and measure changes in ADL participation restrictions relevant to community living. Data used in this research were gathered during a registered randomised controlled trial: Australian and New Zealand Clinical Trials Registry ACTRN12609000973213. Ethics committee approval was gained for secondary analysis of data for this study.
Rasch analysis of the patient-rated wrist evaluation questionnaire.

PubMed

Esakki, Saravanan; MacDermid, Joy C; Vincent, Joshua I; Packham, Tara L; Walton, David; Grewal, Ruby

2018-01-01

The Patient-Rated Wrist Evaluation (PRWE) was developed as a wrist joint specific measure of pain and disability and evidence of sound validity has been accumulated through classical psychometric methods. Rasch analysis (RA) has been endorsed as a newer method for analyzing the clinical measurement properties of self-report outcome measures. The purpose of this study was to evaluate the PRWE using Rasch modeling. We employed the Rasch model to assess overall fit, response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 382 patients with distal radius fracture was recruited from the hand and upper limb clinic at large academic healthcare organization, London, Ontario, Canada, 6-month post-injury scores of the PRWE was used. RA was conducted on the 3 subscales (pain, specific activities, and usual activities) of the PRWE separately. The pain subscale adequately fit the Rasch model when item 4 "Pain - When it is at its worst" was deleted to eliminate non-uniform DIF by age group, and item 5 "How often do you have pain" was rescored by collapsing into 8 intervals to eliminate disordered thresholds. Uniform DIF for "Use my affected hand to push up from the chair" (by work status) and "Use bathroom tissue with my affected hand" (by injured hand) was addressed by splitting the items for analysis. After background rescoring of 2 items in pain subscale, 2 items in specific activities and 3 items in usual activities, all three subscales of the PRWE were well targeted and had high reliability (PSI = 0.86). These changes provided a unidimensional, interval-level scaled measure. Like a previous analysis of the Patient-Rated Wrist and Hand Evaluation, this study found the PRWE could be fit to the Rasch model with rescoring of multiple items. However, the modifications required to achieve fit were not the same across studies, our fit statistics also suggested one of the pain items should be deleted. This study adds to the pool of evidence supporting the PRWE, but cannot confidently provide a Rasch-based scoring algorithm.
Rasch-Transformed Total Neuropathy Score clinical version (RT-TNSc(©) ) in patients with chemotherapy-induced peripheral neuropathy.

PubMed

Binda, Davide; Cavaletti, Guido; Cornblath, David R; Merkies, Ingemar S J

2015-09-01

Composite scales such as the Total Neuropathy Score clinical version (TNSc(©) ) have been widely used to measure neurological impairment in a standardized manner but they have been criticized due to their ordinal setting having no fixed unit. This study aims to improve impairment assessment in patients with chemotherapy-induced peripheral neuropathy (CIPN) by subjecting TNSc(©) records to Rasch analyses. In particular, we wanted to investigate the influence of factors affecting the use of the TNSc(©) in clinical practice. TNSc(©) has 7 domains (sensory, motor, autonomic, pin-prick, vibration, strength, and deep tendon reflexes [DTR]) each being scored 0-4. Data obtained in 281 patients with stable CIPN were subjected to Rasch analyses to determine the fit to the model. The TNSc(©) did not meet Rasch model's expectations primarily because of misfit statistics in autonomic and DTR domains. Removing these two, acceptable model fit and uni-dimensionality were obtained. However, disordered thresholds (vibration and strength) and item bias (mainly cultural) were still seen, but these findings were kept to balance the assessment range of the Rasch-Transformed TNSc(©) (RT-TNSc(©) ). Acceptable reliability findings were also obtained. A 5-domains RT-TNSc(©) may be a more proper assessment tool in patients with CIPN. Future studies are needed to examine its responsive properties. © 2015 Peripheral Nerve Society.
Should the SCOPA-COG be modified? A Rasch analysis perspective.

PubMed

Forjaz, M J; Frades-Payo, B; Rodriguez-Blazquez, C; Ayala, A; Martinez-Martin, P

2010-02-01

The SCales for Outcomes in PArkinson's disease-Cognition (SCOPA-COG) is a specific measure of cognitive function for Parkinson's disease (PD) patients. Previous studies, under the frame of the classic test theory, indicate satisfactory psychometric properties. The Rasch model, an item response theory approach, provides new information about the scale, as well as results in a linear scale. This study aims at analysing the SCOPA-COG according to the Rasch model and, on the basis of results, suggesting modification to the SCOPA-COG. Fit to the Rasch model was analysed using a sample of 384 PD patients. A good fit was obtained after rescoring for disordered thresholds. The person separation index, a reliability measure, was 0.83. Differential item functioning was observed by age for three items and by gender for one item. The SCOPA-COG is a unidimensional measure of global cognitive function in PD patients, with good scale targeting and no empirical evidence for use of the subscale scores. Its adequate reliability and internal construct validity were supported. The SCOPA-COG, with the proposed scoring scheme, generates true linear interval scores.
Using the Many-Faceted Rasch Model to Evaluate Standard Setting Judgments: An Illustration with the Advanced Placement Environmental Science Exam

ERIC Educational Resources Information Center

Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A.

2013-01-01

The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Measurement properties of painDETECT: Rasch analysis of responses from community-dwelling adults with neuropathic pain.

PubMed

Packham, Tara L; Cappelleri, Joseph C; Sadosky, Alesia; MacDermid, Joy C; Brunner, Florian

2017-03-04

painDETECT (PD-Q) is a self-reported assessment of pain qualities developed as a screening tool for pain of neuropathic origin. Rasch analysis is a strategy for examining the measurement characteristics of a scale using a form of item response theory. We conducted a Rasch analysis to consider if the scoring and measurement properties of PD-Q would support its use as an outcome measure. Rasch analysis was conducted on PD-Q scores drawn from a cross-sectional study of the burden and costs of NeP. The analysis followed an iterative process based on recommendations in the literature, including examination of sequential scoring categories, unidimensionality, reliability and differential item function. Data from 624 persons with a diagnosis of painful diabetic polyneuropathy, small fibre neuropathy, and neuropathic pain associated with chronic low back pain, spinal cord injury, HIV-related pain, or chronic post-surgical pain was used for this analysis. PD-Q demonstrated fit to the Rasch model after adjustments of scoring categories for four items, and omission of the time course and radiating questions. The resulting seven-item scale of pain qualities demonstrated good reliability with a person-separation index of 0.79. No scoring bias (differential item functioning) was found for this version. Rasch modelling suggests the seven pain-qualities items from PD-Q may be used as an outcome measure. Further research is required to confirm validity and responsiveness in a clinical setting.
Developing a Measure of Therapist Adherence to Contingency Management: An Application of the Many-Facet Rasch Model

ERIC Educational Resources Information Center

Chapman, Jason E.; Sheidow, Ashli J.; Henggeler, Scott W.; Halliday-Boykins, Colleen A.; Cunningham, Phillippe B.

2008-01-01

A unique application of the Many-Facet Rasch Model (MFRM) is introduced as the preferred method for evaluating the psychometric properties of a measure of therapist adherence to Contingency Management (CM) treatment of adolescent substance use. The utility of psychometric methods based in Classical Test Theory was limited by complexities of the…
The Rasch Model and Missing Data, with an Emphasis on Tailoring Test Items.

ERIC Educational Resources Information Center

de Gruijter, Dato N. M.

Many applications of educational testing have a missing data aspect (MDA). This MDA is perhaps most pronounced in item banking, where each examinee responds to a different subtest of items from a large item pool and where both person and item parameter estimates are needed. The Rasch model is emphasized, and its non-parametric counterpart (the…
Developing the Impossible Figures Task to Assess Visual-Spatial Talents among Chinese Students: A Rasch Measurement Model Analysis

ERIC Educational Resources Information Center

Chan, David W.

2010-01-01

Data of item responses to the Impossible Figures Task (IFT) from 492 Chinese primary, secondary, and university students were analyzed using the dichotomous Rasch measurement model. Item difficulty estimates and person ability estimates located on the same logit scale revealed that the pooled sample of Chinese students, who were relatively highly…
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the 'Claim Evaluation Tools' database using Rasch modelling.

PubMed

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-05-25

The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Power analysis on the time effect for the longitudinal Rasch model.

PubMed

Feddag, M L; Blanchin, M; Hardouin, J B; Sebille, V

2014-01-01

Statistics literature in the social, behavioral, and biomedical sciences typically stress the importance of power analysis. Patient Reported Outcomes (PRO) such as quality of life and other perceived health measures (pain, fatigue, stress,...) are increasingly used as important health outcomes in clinical trials or in epidemiological studies. They cannot be directly observed nor measured as other clinical or biological data and they are often collected through questionnaires with binary or polytomous items. The Rasch model is the well known model in the item response theory (IRT) for binary data. The article proposes an approach to evaluate the statistical power of the time effect for the longitudinal Rasch model with two time points. The performance of this method is compared to the one obtained by simulation study. Finally, the proposed approach is illustrated on one subscale of the SF-36 questionnaire.
Calibrating Charisma: The many-facet Rasch model for leader measurement and automated coaching

NASA Astrophysics Data System (ADS)

Barney, Matt

2016-11-01

No one is a leader unless others follow. Consequently, leadership is fundamentally a social judgment construct, and may be best measured via a Many Facet Rasch Model designed for this purpose. Uniquely, the MFRM allows for objective, accurate and precise estimation of leader attributes, along with identification of rater biases and other distortions of the available information. This presentation will outline a mobile computer-adaptive measurement system that measures and develops charisma, among others. Uniquely, the approach calibrates and mass-personalizes artificially intelligent, Rasch-calibrated electronic coaching that is neither too hard nor too easy but “just right” to help each unique leader develop improved charisma.
Indonesian teacher engagement index: a rasch model analysis

NASA Astrophysics Data System (ADS)

Sasmoko; Abbas, B. S.; Indrianti, Y.; Widhoyoko, S. A.

2018-01-01

The research aimed to calibrate Indonesian Teacher Engagement Index (ITEI) using instrument with RASCH MODEL. The respondents were 672 teachers of elementary, junior high, high school and vocational school. The number of items planned was 165 items with the initial reliability of 0.98. The ITEI scale uses Likert Scale (1 to 4) which was converted from ordinal scale to Equal Interval Scale. RASCH MODEL analysis was done by selecting based on Outfit Mean Square (MNSQ) between 0.5-1.5 as a good item, and measuring Point Measure Correlation (Pt Mean Corr) with the criterion of 0.4-0.85. Moderate Outfit Z-Standard (ZSTD) was ignored because the sample was >500. Conclusions: ITEI is valid with 30 items and reliability of 0.97, and less engage teachers significantly at α <0.05.
Stroke Self-efficacy Questionnaire: a Rasch-refined measure of confidence post stroke.

PubMed

Riazi, Afsane; Aspden, Trefor; Jones, Fiona

2014-05-01

Measuring self-efficacy during rehabilitation provides an important insight into understanding recovery post stroke. A Rasch analysis of the Stroke Self-efficacy Questionnaire (SSEQ) was undertaken to establish its use as a clinically meaningful and scientifically rigorous measure. One hundred and eighteen stroke patients completed the SSEQ with the help of an interviewer. Participants were recruited from local acute stroke units and community stroke rehabilitation teams. Data were analysed with confirmatory factor analysis conducted using AMOS and Rasch analysis conducted using RUMM2030 software. Confirmatory factor analysis and Rasch analyses demonstrated the presence of two separate scales that measure stroke survivors' self-efficacy with: i) self-management and ii) functional activities. Guided by Rasch analyses, the response categories of these two scales were collapsed from an 11-point to a 4-point scale. Modified scales met the expectations of the Rasch model. Items satisfied the Rasch requirements (overall and individual item fit, local response independence, differential item functioning, unidimensionality). Furthermore, the two subscales showed evidence of good construct validity. The new SSEQ has good psychometric properties and is a clinically useful assessment of self-efficacy after stroke. The scale measures stroke survivors' self-efficacy with self-management and activities as two unidimensional constructs. It is recommended for use in clinical and research interventions, and in evaluating stroke self-management interventions.
Measurement properties of the Patient-Rated Wrist and Hand Evaluation: Rasch analysis of responses from a traumatic hand injury population.

PubMed

Packham, Tara; MacDermid, Joy C

2013-01-01

The Patient-Rated Wrist and Hand Evaluation (PRWHE) is a self-reported assessment of pain and disability to evaluate outcome after hand injuries. Rasch analysis is an alternative strategy for examining the psychometric properties of a measurement scale based in item response theory, rather than classical test theory. This study used Rasch analysis to examine the content, scoring and measurement properties of the PRWHE. PRWHE scores (n = 264) from persons with a traumatic injury or reconstructive surgery to one hand were collected from an outpatient hand rehabilitation facility. Rasch analysis was conducted to assess how the PRWHE fit the Rasch model, confirms the scaling structure of the pain and disability subscales, and identifies any areas of bias from differential item functioning. Rasch analysis of the PRWHE supports internal consistency of the scale (α = 0.96) and reliability (as measured by the person separation index) of 0.95. While gender, age, diagnosis, and duration since injury all systematically influenced how people scored the PRWHE, hand dominance and affected side did not. Rasch analysis supported a 3 subscale structure (pain, specific activities and usual activities) rather than the current divisions of pain and disability. Initial examination of the PRWHE indicates the psychometric properties of consistency, reliability and responsiveness previously tested by classical methods are further supported by Rasch analysis. It also suggests the scale structure may be best considered as 3 subscales rather than simply pain and disability. Copyright © 2013 Hanley & Belfus. Published by Elsevier Inc. All rights reserved.
Controlling Guessing Bias in the Dichotomous Rasch Model Applied to a Large-Scale, Vertically Scaled Testing Program

ERIC Educational Resources Information Center

Andrich, David; Marais, Ida; Humphry, Stephen Mark

2016-01-01

Recent research has shown how the statistical bias in Rasch model difficulty estimates induced by guessing in multiple-choice items can be eliminated. Using vertical scaling of a high-profile national reading test, it is shown that the dominant effect of removing such bias is a nonlinear change in the unit of scale across the continuum. The…
Vertical Scaling with the Rasch Model Utilizing Default and Tight Convergence Settings with WINSTEPS and BILOG-MG

ERIC Educational Resources Information Center

Custer, Michael; Omar, Md Hafidz; Pomplun, Mark

2006-01-01

This study compared vertical scaling results for the Rasch model from BILOG-MG and WINSTEPS. The item and ability parameters for the simulated vocabulary tests were scaled across 11 grades; kindergarten through 10th. Data were based on real data and were simulated under normal and skewed distribution assumptions. WINSTEPS and BILOG-MG were each…
Fitting the Mixed Rasch Model to a Reading Comprehension Test: Exploring Individual Difference Profiles in L2 Reading

ERIC Educational Resources Information Center

Aryadoust, Vahid; Zhang, Limei

2016-01-01

The present study used the mixed Rasch model (MRM) to identify subgroups of readers within a sample of students taking an EFL reading comprehension test. Six hundred and two (602) Chinese college students took a reading test and a lexico-grammatical knowledge test and completed a Metacognitive and Cognitive Strategy Use Questionnaire (MCSUQ)…
Using Distractor-Driven Standards-Based Multiple-Choice Assessments and Rasch Modeling to Investigate Hierarchies of Chemistry Misconceptions and Detect Structural Problems with Individual Items

ERIC Educational Resources Information Center

Herrmann-Abell, Cari F.; DeBoer, George E.

2011-01-01

Distractor-driven multiple-choice assessment items and Rasch modeling were used as diagnostic tools to investigate students' understanding of middle school chemistry ideas. Ninety-one items were developed according to a procedure that ensured content alignment to the targeted standards and construct validity. The items were administered to 13360…

On an Extension of the Rasch Model to the Case of Polychotomously Scored Items.

ERIC Educational Resources Information Center

Vogt, Dorothee K.

The Rasch model for the probability of a person's response to an item is extended to the case where this response depends on a set of scoring or category weights, in addition to person and item parameters. The maximum likelihood approach introduced by Wright for the dichotomous case is applicable here also, and it is shown to yield a unique…
A Computer Program for Solving a Set of Conditional Maximum Likelihood Equations Arising in the Rasch Model for Questionnaires.

ERIC Educational Resources Information Center

Andersen, Erling B.

A computer program for solving the conditional likelihood equations arising in the Rasch model for questionnaires is described. The estimation method and the computational problems involved are described in a previous research report by Andersen, but a summary of those results are given in two sections of this paper. A working example is also…
Psychometric Assessment of the Job Embeddedness Instrument: A Rasch Perspective.

PubMed

Reitz, O Ed; Smith, Everett V

2018-05-01

The aim of this study was to examine the psychometric properties of the job embeddedness instrument (JEI) using a Rasch perspective in a sample of Registered Nurses (RNs). A secondary analysis of data was conducted from a previous study examining the job embeddedness of rural and urban RNs. A Rasch analysis supported the six underlying dimensions: organizational fit, community fit, organizational links, community links, organizational sacrifice, and community sacrifice. The results of this study also demonstrate additional evidence of the validity, reliability, and generalizability of the JEI inferences with a sample of RNs. In total, 38 of 39 items of the original JEI were retained in the model. The psychometric evaluation attained through this multidimensional Rasch analysis provided support for using the JEI to assess the level of job embeddedness for RNs.
Rasch analysis of the Trypophobia Questionnaire.

PubMed

Imaizumi, Shu; Tanno, Yoshihiko

2018-02-14

This study aimed to assess Rasch-based psychometric properties of the Trypophobia Questionnaire measuring proneness to trypophobia, which refers to disgust and unpleasantness induced by the observation of clusters of objects (e.g., lotus seed pods). Rasch analysis was performed on data from 582 healthy Japanese adults. The results suggested that Trypophobia Questionnaire has a unidimensional structure with ordered response categories and sufficient person and item reliabilities, and that it does not have differential item functioning across sexes and age groups, whereas the targeting of the scale leaves room for improvements. When items that did not fit the Rasch model were removed, the shortened version showed slightly improved psychometric properties. However, results were not conclusive in determining whether the full or shortened version is better for practical use. Further assessment and validation are needed.
Rasch-built Overall Disability Scale (R-ODS) for immune-mediated peripheral neuropathies.

PubMed

van Nes, S I; Vanhoutte, E K; van Doorn, P A; Hermans, M; Bakkers, M; Kuitwaard, K; Faber, C G; Merkies, I S J

2011-01-25

To develop a patient-based, linearly weighted scale that captures activity and social participation limitations in patients with Guillain-Barré syndrome (GBS), chronic inflammatory demyelinating polyradiculoneuropathy (CIDP), and gammopathy-related polyneuropathy (MGUSP). A preliminary Rasch-built Overall Disability Scale (R-ODS) containing 146 activity and participation items was constructed, based on the WHO International Classification of Functioning, Disability and Health, literature search, and patient interviews. The preliminary R-ODS was assessed twice (interval: 2-4 weeks; test-retest reliability studies) in 294 patients who experienced GBS in the past (n = 174) or currently have stable CIDP (n = 80) or MGUSP (n = 40). Data were analyzed using the Rasch unidimensional measurement model (RUMM2020). The preliminary R-ODS did not meet the Rasch model expectations. Based on disordered thresholds, misfit statistics, item bias, and local dependency, items were systematically removed to improve the model fit, regularly controlling the class intervals and model statistics. Finally, we succeeded in constructing a 24-item scale that fulfilled all Rasch requirements. "Reading a newspaper/book" and "eating" were the 2 easiest items; "standing for hours" and "running" were the most difficult ones. Good validity and reliability were obtained. The R-ODS is a linearly weighted scale that specifically captures activity and social participation limitations in patients with GBS, CIDP, and MGUSP. Compared to the Overall Disability Sum Score, the R-ODS represents a wider range of item difficulties, thereby better targeting patients with different ability levels. If responsive, the R-ODS will be valuable for future clinical trials and follow-up studies in these conditions.
Using Rasch model to analyze the ability of pre-university students in vector

NASA Astrophysics Data System (ADS)

Ibrahim, Faridah Mohamed; Shariff, Asma Ahmad; Tahir, Rohayatimah Muhammad

2015-10-01

Evaluating students' performance only from overall examination marks does not give accurate evidence of their achievement on a particular subject. For a more detailed analysis, an instrument called Rasch Measurement Model (Rasch Model), widely used in education research, may be applied. Using the analysis map, the level of each student's ability and the level of the questions difficulty can be measured. This paper describes how the Rasch Model is used to evaluate students' achivement and performance in Vector, a subject taken by students enrolled in the Physical Science Program at the Centre for Foundation Studies in Science, University of Malaya. Usually, students' understanding of the subject and performance are assessed and examined at the end of the semester in the final examination, apart from continuous assessment done throughout the course. In order to evaluate the individual achievement and get a better and accurate evidence on the performance, 28 male and 28 female students' marks were taken randomly from the final examination results and analysed using the Rasch Model. Observation made from the map showed that more than half of the questions were categorized as difficult while the two most difficult questions could be answered correctly by 33.9% of the students. Results showed that the students performed very well and their achievement was above expectation. About 27% of the sudents could be considered as having very high ability in answering all the questions, with one student being able to answer well, obtaining perfect score. However, two students were found to be misfits since they were able to answer difficult questions but gave poor response to easy ones.
Increasing meaning in measurement: a Rasch analysis of the Child-Adolescent Teasing Scale.

PubMed

Vessey, Judith A; DiFazio, Rachel L; Strout, Tania D

2012-01-01

In today's increasingly violent society, many childhood incidents that begin as simple teasing deteriorate into persistent bullying. The Child-Adolescent Teasing Scale (CATS) was developed to measure self-perceived teasing in youths aged 11-15 years. It was validated initially using the principles of classical test theory and deemed to be a reliable and valid measure of teasing; it has been responsive to change in intervention studies. The aim of this study was to evaluate further the psychometric properties of the CATS by evaluating the degree to which the CATS items are congruent with the primary assumptions of the Rasch measurement model. A methodological study design using a Rasch Rating Scale Model was utilized to examine the psychometric properties of the 32-item CATS. The sample of the CATS consisted of 666 youths aged 11-15 years from diverse racial and socioeconomic backgrounds and geographic regions. Unidimensionality, hierarchical ordering, and stretching of the variable's responses along a continuum were examined. The current CATS subscales do not fit the criteria for the Rasch model. The subscales are not unidimensional or hierarchical and do not exist on upon a continuum upon which items can be ordered and children can be placed. The divergent results between the classical test theory and Rasch analyses, although not completely surprising, underscore the need for continued refinement of an instrument's psychometric properties to ensure it is measuring the concept of interest in the way it was intended.
X-Ray Your Data with Rasch

ERIC Educational Resources Information Center

Curtis, David D.; Boman, Peter

2007-01-01

By using the Rasch model, much detailed diagnostic information is available to developers of survey and assessment instruments and to the researchers who use them. We outline an approach to the analysis of data obtained from the administration of survey instruments that can enable researchers to recognise and diagnose difficulties with those…
Modern Psychometrics for Assessing Achievement Goal Orientation: A Rasch Analysis

ERIC Educational Resources Information Center

Muis, Krista R.; Winne, Philip H.; Edwards, Ordene V.

2009-01-01

Background: A program of research is needed that assesses the psychometric properties of instruments designed to quantify students' achievement goal orientations to clarify inconsistencies across previous studies and to provide a stronger basis for future research. Aim: We conducted traditional psychometric and modern Rasch-model analyses of the…
Differential Item Functioning Analysis Using Rasch Item Information Functions

ERIC Educational Resources Information Center

Wyse, Adam E.; Mapuranga, Raymond

2009-01-01

Differential item functioning (DIF) analysis is a statistical technique used for ensuring the equity and fairness of educational assessments. This study formulates a new DIF analysis method using the information similarity index (ISI). ISI compares item information functions when data fits the Rasch model. Through simulations and an international…
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the ‘Claim Evaluation Tools’ database using Rasch modelling

PubMed Central

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-01-01

Background The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019
The construct validity of the Major Depression Inventory: A Rasch analysis of a self-rating scale in primary care.

PubMed

Nielsen, Marie Germund; Ørnbøl, Eva; Vestergaard, Mogens; Bech, Per; Christensen, Kaj Sparle

2017-06-01

We aimed to assess the measurement properties of the ten-item Major Depression Inventory when used on clinical suspicion in general practice by performing a Rasch analysis. General practitioners asked consecutive persons to respond to the web-based Major Depression Inventory on clinical suspicion of depression. We included 22 practices and 245 persons. Rasch analysis was performed using RUMM2030 software. The Rasch model fit suggests that all items contribute to a single underlying trait (defined as internal construct validity). Mokken analysis was used to test dimensionality and scalability. Our Rasch analysis showed misfit concerning the sleep and appetite items (items 9 and 10). The response categories were disordered for eight items. After modifying the original six-point to a four-point scoring system for all items, we achieved ordered response categories for all ten items. The person separation reliability was acceptable (0.82) for the initial model. Dimensionality testing did not support combining the ten items to create a total score. The scale appeared to be well targeted to this clinical sample. No significant differential item functioning was observed for gender, age, work status and education. The Rasch and Mokken analyses revealed two dimensions, but the Major Depression Inventory showed fit to one scale if items 9 and 10 were excluded. Our study indicated scalability problems in the current version of the Major Depression Inventory. The conducted analysis revealed better statistical fit when items 9 and 10 were excluded. Copyright © 2017 Elsevier Inc. All rights reserved.
Rasch family models in e-learning: analyzing architectural sketching with a digital pen.

PubMed

Scalise, Kathleen; Cheng, Nancy Yen-Wen; Oskui, Nargas

2009-01-01

Since architecture students studying design drawing are usually assessed qualitatively on the basis of their final products, the challenges and stages of their learning have remained masked. To clarify the challenges in design drawing, we have been using the BEAR Assessment System and Rasch family models to measure levels of understanding for individuals and groups, in order to correct pedagogical assumptions and tune teaching materials. This chapter discusses the analysis of 81 drawings created by architectural students to solve a space layout problem, collected and analyzed with digital pen-and-paper technology. The approach allows us to map developmental performance criteria and perceive achievement overlaps in learning domains assumed separate, and then re-conceptualize a three-part framework to represent learning in architectural drawing. Results and measurement evidence from the assessment and Rasch modeling are discussed.
ISYQOL: a Rasch-consistent questionnaire for measuring health-related quality of life in adolescents with spinal deformities.

PubMed

Caronni, Antonio; Sciumè, Luciana; Donzelli, Sabrina; Zaina, Fabio; Negrini, Stefano

2017-09-01

Spinal deformities are commonly associated with poor health-related quality of life (HRQOL). Several questionnaires (eg, Scoliosis Research Society-24 [SRS-24] and Scoliosis Research Society-22 [SRS-22]) have been developed to evaluate HRQOL in these conditions. In adults as well as during growth, the HRQOL is considered one of the most relevant outcomes of both conservative and surgical treatments. Rasch analysis is a powerful statistical technique for developing high-quality and valid questionnaires. The SRS-24 and SRS-22 have been evaluated using the Rasch analysis but showed poor measurement properties. Thus, a proper measure of HRQOL in people with a spine condition is still missing. This study aimed to develop a new questionnaire that is totally Rasch consistent for measuring the HRQOL in young people with a spine condition. This is a cross-sectional study for developing a new HRQOL measure. A total of 402 participants with adolescent idiopathic scoliosis or Scheuermann juvenile kyphosis were included in the study. The outcome measure used was the Italian Spine Youth Quality of Life (ISYQOL) questionnaire. The study consisted of different stages: a conventional approach content analysis, an opinion poll among clinicians trained in spine deformities, and the Rasch analysis (partial credit model). The Rasch analysis showed that all items of the ISYQOL questionnaire had ordered thresholds and a good fit to the model. Differential item functioning was present for Item 1, with bracing only, and was solved with a conventional items splitting procedure. The ISYQOL item map spans an adequate range of HRQOL. The principal component analysis for Rasch residuals showed, in practical terms, the ISYQOL unidimensionality. The reliability of ISYQOL was high enough so that approximately three significantly different levels of HRQOL could be discerned. Two questionnaire versions were provided for patients with and without the brace, respectively. ISYQOL is the first HRQOL questionnaire developed according to the Rasch analysis. It was developed in a conservative treatment setting for all types of spinal deformities, including also patients with surgical curves. Validation in many languages is already under way. Copyright © 2017 Elsevier Inc. All rights reserved.
Using the Mixed Rasch Model to analyze data from the beliefs and attitudes about memory survey.

PubMed

Smith, Everett V; Ying, Yuping; Brown, Scott W

2012-01-01

In this study, we used the Mixed Rasch Model (MRM) to analyze data from the Beliefs and Attitudes About Memory Survey (BAMS; Brown, Garry, Silver, and Loftus, 1997). We used the original 5-point BAMS data to investigate the functioning of the "Neutral" category via threshold analysis under a 2-class MRM solution. The "Neutral" category was identified as not eliciting the model expected responses and observations in the "Neutral" category were subsequently treated as missing data. For the BAMS data without the "Neutral" category, exploratory MRM analyses specifying up to 5 latent classes were conducted to evaluate data-model fit using the consistent Akaike information criterion (CAIC). For each of three BAMS subscales, a two latent class solution was identified as fitting the mixed Rasch rating scale model the best. Results regarding threshold analysis, person parameters, and item fit based on the final models are presented and discussed as well as the implications of this study.
A 7-item version of the fatigue severity scale has better psychometric properties among HIV-infected adults: an application of a Rasch model.

PubMed

Lerdal, Anners; Kottorp, Anders; Gay, Caryl; Aouizerat, Bradley E; Portillo, Carmen J; Lee, Kathryn A

2011-11-01

To examine the psychometric properties of the 9-item Fatigue Severity Scale (FSS) using a Rasch model application. A convenience sample of HIV-infected adults was recruited, and a subset of the sample was assessed at 6-month intervals for 2 years. Socio-demographic, clinical, and symptom data were collected by self-report questionnaires. CD4 T-cell count and viral load measures were obtained from medical records. The Rasch analysis included 316 participants with 698 valid questionnaires. FSS item 2 did not advanced monotonically, and items 1 and 2 did not show acceptable goodness-of-fit to the Rasch model. A reduced FSS 7-item version demonstrated acceptable goodness-of-fit and explained 61.2% of the total variance in the scale. In the FSS-7 item version, no uniform Differential Item Functioning was found in relation to time of evaluation or to any of the socio-demographic or clinical variables. This study demonstrated that the FSS-7 has better psychometric properties than the FSS-9 in this HIV sample and that responses to the different items are comparable over time and unrelated to socio-demographic and clinical variables.
Analysis of the Rater Effects on the Scoring of Diagnostic Trees Prepared by Teacher Candidates with the Many-Facet Rasch Model

ERIC Educational Resources Information Center

Nalbantoglu Yilmaz, Funda

2017-01-01

In the study, it was aimed to investigate the leniency/severity, bias and halo effect of the raters which were used in the scoring of the diagnostic tree prepared by the teacher candidates with the many-facet Rasch model. The research study group constitutes 24 teacher candidates who are taking measurement and evaluation lesson from the students…
Measurement of Online Student Engagement: Utilization of Continuous Online Student Behavior Indicators as Items in a Partial Credit Rasch Model

ERIC Educational Resources Information Center

Anderson, Elizabeth

2017-01-01

Student engagement has been shown to be essential to the development of research-based best practices for K-12 education. It has been defined and measured in numerous ways. The purpose of this research study was to develop a measure of online student engagement for grades 3 through 8 using a partial credit Rasch model and validate the measure…
Is the pain visual analogue scale linear and responsive to change? An exploration using Rasch analysis.

PubMed

Kersten, Paula; White, Peter J; Tennant, Alan

2014-01-01

Pain visual analogue scales (VAS) are commonly used in clinical trials and are often treated as an interval level scale without evidence that this is appropriate. This paper examines the internal construct validity and responsiveness of the pain VAS using Rasch analysis. Patients (n = 221, mean age 67, 58% female) with chronic stable joint pain (hip 40% or knee 60%) of mechanical origin waiting for joint replacement were included. Pain was scored on seven daily VASs. Rasch analysis was used to examine fit to the Rasch model. Responsiveness (Standardized Response Means, SRM) was examined on the raw ordinal data and the interval data generated from the Rasch analysis. Baseline pain VAS scores fitted the Rasch model, although 15 aberrant cases impacted on unidimensionality. There was some local dependency between items but this did not significantly affect the person estimates of pain. Daily pain (item difficulty) was stable, suggesting that single measures can be used. Overall, the SRMs derived from ordinal data overestimated the true responsiveness by 59%. Changes over time at the lower and higher end of the scale were represented by large jumps in interval equivalent data points; in the middle of the scale the reverse was seen. The pain VAS is a valid tool for measuring pain at one point in time. However, the pain VAS does not behave linearly and SRMs vary along the trait of pain. Consequently, Minimum Clinically Important Differences using raw data, or change scores in general, are invalid as these will either under- or overestimate true change; raw pain VAS data should not be used as a primary outcome measure or to inform parametric-based Randomised Controlled Trial power calculations in research studies; and Rasch analysis should be used to convert ordinal data to interval data prior to data interpretation.
Measuring nursing competencies in the operating theatre: instrument development and psychometric analysis using Item Response Theory.

PubMed

Nicholson, Patricia; Griffin, Patrick; Gillis, Shelley; Wu, Margaret; Dunning, Trisha

2013-09-01

Concern about the process of identifying underlying competencies that contribute to effective nursing performance has been debated with a lack of consensus surrounding an approved measurement instrument for assessing clinical performance. Although a number of methodologies are noted in the development of competency-based assessment measures, these studies are not without criticism. The primary aim of the study was to develop and validate a Performance Based Scoring Rubric, which included both analytical and holistic scales. The aim included examining the validity and reliability of the rubric, which was designed to measure clinical competencies in the operating theatre. The fieldwork observations of 32 nurse educators and preceptors assessing the performance of 95 instrument nurses in the operating theatre were used in the calibration of the rubric. The Rasch model, a particular model among Item Response Models, was used in the calibration of each item in the rubric in an attempt at improving the measurement properties of the scale. This is done by establishing the 'fit' of the data to the conditions demanded by the Rasch model. Acceptable reliability estimates, specifically a high Cronbach's alpha reliability coefficient (0.940), as well as empirical support for construct and criterion validity for the rubric were achieved. Calibration of the Performance Based Scoring Rubric using Rasch model revealed that the fit statistics for most items were acceptable. The use of the Rasch model offers a number of features in developing and refining healthcare competency-based assessments, improving confidence in measuring clinical performance. The Rasch model was shown to be useful in developing and validating a competency-based assessment for measuring the competence of the instrument nurse in the operating theatre with implications for use in other areas of nursing practice. Crown Copyright © 2012. Published by Elsevier Ltd. All rights reserved.

An Application of the Rasch Model.

ERIC Educational Resources Information Center

Veitch, William R.

The one parameter latent trait theory of Georg Rasch has two assumptions: that student abilities can be measured on an equal interval scale, and that the success of a student with a given item is a function of student achievement and item difficulty. The grade four Michigan Educational Assessment Program reading test was designed to measure…
The Standardization of the Clock Drawing Test (CDT) for People with Stroke Using Rasch Analysis

PubMed Central

Yoo, Doo Han; Hong, Deok Gi; Lee, Jae Shin

2014-01-01

[Purpose] The aim of this study was to standardize the clock drawing test (CDT) for people with stroke using Rasch analysis. [Subjects and Methods] Seventeen items of the CDT identified through a literature review were performed by 159 stroke patients. The data was analyzed with Winstep version 3.57 using the Rasch model to examine the unidimensionality of the items’ fit, the distribution of the items’ difficulty, and the reliability and appropriateness of the rating scale. [Result] Ten out of the 159 participations (6.2%) were considered misfit subjects, and one item of the CDT was determined to be a misfit item based on Rasch analysis. The rating scales were judged as suitable because the observed average showed an array of vertical orders and MNSQ values < 2. The separate index and reliability of the subject (1.98, 0.80) and item (6.45, 0.97) showed relatively high values. [Conclusion] This study is the first to examine the CDT scale in stroke patients by Rasch analysis. The CDT is expected to be useful for screening stroke patients with cognitive problems. PMID:24409026
Rasch analysis of the carers quality of life questionnaire for parkinsonism.

PubMed

Pillas, Marios; Selai, Caroline; Schrag, Anette

2017-03-01

To assess the psychometric properties of the Carers Quality of Life Questionnaire for Parkinsonism using a Rasch modeling approach and determine the optimal cut-off score. We performed a Rasch analysis of the survey answers of 430 carers of patients with atypical parkinsonism. All of the scale items demonstrated acceptable goodness of fit to the Rasch model. The scale was unidimensional and no notable differential item functioning was detected in the items regarding age and disease type. Rating categories were functioning adequately in all scale items. The scale had high reliability (.95) and construct validity and a high degree of precision, distinguishing between 5 distinct groups of carers with different levels of quality of life. A cut-off score of 62 was found to have the optimal screening accuracy based on Hospital Anxiety and Depression Scale subscores. The results suggest that the Carers Quality of Life Questionnaire for Parkinsonism is a useful scale to assess carers' quality of life and allows analyses requiring interval scaling of variables. © 2016 International Parkinson and Movement Disorder Society. © 2016 International Parkinson and Movement Disorder Society.
Calibrating perceived understanding and competency in probability concepts: A diagnosis of learning difficulties based on Rasch probabilistic model

NASA Astrophysics Data System (ADS)

Mahmud, Zamalia; Porter, Anne; Salikin, Masniyati; Ghani, Nor Azura Md

2015-12-01

Students' understanding of probability concepts have been investigated from various different perspectives. Competency on the other hand is often measured separately in the form of test structure. This study was set out to show that perceived understanding and competency can be calibrated and assessed together using Rasch measurement tools. Forty-four students from the STAT131 Understanding Uncertainty and Variation course at the University of Wollongong, NSW have volunteered to participate in the study. Rasch measurement which is based on a probabilistic model is used to calibrate the responses from two survey instruments and investigate the interactions between them. Data were captured from the e-learning platform Moodle where students provided their responses through an online quiz. The study shows that majority of the students perceived little understanding about conditional and independent events prior to learning about it but tend to demonstrate a slightly higher competency level afterward. Based on the Rasch map, there is indication of some increase in learning and knowledge about some probability concepts at the end of the two weeks lessons on probability concepts.
A longitudinal evaluation of the Center for Epidemiologic Studies-Depression scale (CES-D) in a Rheumatoid Arthritis Population using Rasch Analysis

PubMed Central

Covic, Tanya; Pallant, Julie F; Conaghan, Philip G; Tennant, Alan

2007-01-01

Background The aim of this study was to test the internal validity of the total Center for Epidemiologic Studies-Depression (CES-D) scale using Rasch analysis in a rheumatoid arthritis (RA) population. Methods CES-D was administered to 157 patients with RA over three time points within a 12 month period. Rasch analysis was applied using RUMM2020 software to assess the overall fit of the model, the response scale used, individual item fit, differential item functioning (DIF) and person separation. Results Pooled data across three time points was shown to fit the Rasch model with removal of seven items from the original 20-item CES-D scale. It was necessary to rescore the response format from four to three categories in order to improve the scale's fit. Two items demonstrated some DIF for age and gender but were retained within the 13-item CES-D scale. A new cut point for depression score of 9 was found to correspond to the original cut point score of 16 in the full CES-D scale. Conclusion This Rasch analysis of the CES-D in a longstanding RA cohort resulted in the construction of a modified 13-item scale with good internal validity. Further validation of the modified scale is recommended particularly in relation to the new cut point for depression. PMID:17629902
Rasch-built Overall Disability Scale for Multifocal motor neuropathy (MMN-RODS(©) ).

PubMed

Vanhoutte, Els K; Faber, Catharina G; van Nes, Sonja I; Cats, Elisabeth A; Van der Pol, W-Ludo; Gorson, Kenneth C; van Doorn, Pieter A; Cornblath, David R; van den Berg, Leonard H; Merkies, Ingemar S J

2015-09-01

Clinical trials in multifocal motor neuropathy (MMN) have often used ordinal-based measures that may not accurately capture changes. We aimed to construct a disability interval outcome measure specifically for MMN using the Rasch model and to examine its clinimetric properties. A total of 146 preliminary activity and participation items were assessed twice (reliability studies) in 96 clinically stable MMN patients. These patients also assessed the ordinal-based overall disability sum score (construct, sample-dependent validity). The final Rasch-built overall disability scale for MMN (MMN-RODS(©) ) was serially applied in 26 patients with newly diagnosed or relapsing MMN, treated with intravenous immunoglobulin (IVIg) (1-year follow-up; responsiveness study). The magnitude of change for each patient was calculated using the minimum clinically important difference technique related to the individually obtained standard errors. A total of 121 items not fulfilling Rasch requirements were removed. The final 25-item MMN-RODS(©) fulfilled all Rasch model's expectations and showed acceptable reliability and validity including good discriminatory capacity. Most serially examined patients improved, but its magnitude was low, reflecting poor responsiveness. The constructed MMN-RODS(©) is a disease-specific, interval measure to detect activity limitations in patients with MMN and overcomes the shortcomings of ordinal scales. However, future clinimetric studies are needed to improve the MMN-RODS(©) 's responsiveness by longer observations and/or more rigorous treatment regimens. © 2015 Peripheral Nerve Society.
Psychometric evaluation of the WHOQOL-BREF, Taiwan version, across five kinds of Taiwanese cancer survivors: Rasch analysis and confirmatory factor analysis.

PubMed

Lin, Chung-Ying; Hwang, Jing-Shiang; Wang, Wen-Chung; Lai, Wu-Wei; Su, Wu-Chou; Wu, Tzu-Yi; Yao, Grace; Wang, Jung-Der

2018-04-13

Quality of life (QoL) is important for clinicians to evaluate how cancer survivors judge their sense of well-being, and WHOQOL-BREF may be a good tool for clinical use. However, at least three issues remain unresolved: (1) the psychometric properties of the WHOQOL-BREF for cancer patients are insufficient; (2) the scoring method used for WHOQOL-BREF needs to be clarify; (3) whether different types of cancer patients interpret the WHOQOL-BREF similarly. We recruited 1000 outpatients with head/neck cancer, 1000 with colorectal cancer, 965 with liver cancer, 1438 with lung cancer and 1299 with gynecologic cancers in a medical center. Data analyses included Rasch models, confirmatory factor analysis (CFA), and Pearson correlations. The mean WHOQOL-BREF domain scores were between 13.34 and 14.77 among all participants. CFA supported construct validity; Rasch models revealed that almost all items were embedded in their expected domains and were interpreted similarly across five types of cancer patients; all correlation coefficients between Rasch scores and original domain scores were above 0.9. The linear relationship between Rasch scores and domain scores suggested that the current calculations for domain scores were applicable and without serious bias. Clinical practitioners may regularly collect and record the WHOQOL-BREF domain scores into electronic health records. Copyright © 2018. Published by Elsevier B.V.
Validation of the brief version of the Recovery Self-Assessment (RSA-B) using Rasch measurement theory.

PubMed

Barbic, Skye P; Kidd, Sean A; Davidson, Larry; McKenzie, Kwame; O'Connell, Maria J

2015-12-01

In psychiatry, the recovery paradigm is increasingly identified as the overarching framework for service provision. Currently, the Recovery Self-Assessment (RSA), a 36-item rating scale, is commonly used to assess the uptake of a recovery orientation in clinical services. However, the consumer version of the RSA has been found challenging to complete because of length and the reading level required. In response to this feedback, a brief 12-item version of the RSA was developed (RSA-B). This article describes the development of the modified instrument and the application of traditional psychometric analysis and Rasch Measurement Theory to test the psychometrics properties of the RSA-B. Data from a multisite study of adults with serious mental illnesses (n = 1256) who were followed by assertive community treatment teams were examined for reliability, clinical meaning, targeting, response categories, model fit, reliability, dependency, and raw interval-level measurement. Analyses were performed using the Rasch Unidimensional Measurement Model (RUMM 2030). Adequate fit to the Rasch model was observed (χ2 = 112.46, df = 90, p = .06) and internal consistency was good (r = .86). However, Rasch analysis revealed limitations of the 12-item version, with items covering only 39% of the targeted theoretical continuum, 2 misfitting items, and strong evidence for the 5 option response categories not working as intended. This study revealed areas for improvement in the shortened version of the 12-item RSA-B. A revisit of the conceptual model and original 36-item rating scale is encouraged to select items that will help practitioners and researchers measure the full range of recovery orientation. (c) 2015 APA, all rights reserved).
Rasch Modeling of the Test of Early Mathematics Ability-Third Edition with a Sample of K1 Children in Singapore

ERIC Educational Resources Information Center

Yao, Shih-Ying; Muñez, David; Bull, Rebecca; Lee, Kerry; Khng, Kiat Hui; Poon, Kenneth

2017-01-01

The Test of Early Mathematics Ability-Third Edition (TEMA-3) is a commonly used measure of early mathematics knowledge for children aged 3 years to 8 years 11 months. In spite of its wide use, research on the psychometric properties of TEMA-3 remains limited. This study applied the Rasch model to investigate the psychometric properties of TEMA-3…
Study of Bias in 2012-Placement Test through Rasch Model in Terms of Gender Variable

ERIC Educational Resources Information Center

Turkan, Azmi; Cetin, Bayram

2017-01-01

Validity and reliability are among the most crucial characteristics of a test. One of the steps to make sure that a test is valid and reliable is to examine the bias in test items. The purpose of this study was to examine the bias in 2012 Placement Test items in terms of gender variable using Rasch Model in Turkey. The sample of this study was…
Optimizing the compatibility between rating scales and measures of productive second language competence.

PubMed

Weaver, Christopher

2011-01-01

This study presents a systematic investigation concerning the performance of different rating scales used in the English section of a university entrance examination to assess 1,287 Japanese test takers' ability to write a third-person introduction speech. Although the rating scales did not conform to all of the expectations of the Rasch model, they successfully defined a meaningful continuum of English communicative competence. In some cases, the expectations of the Rasch model needed to be weighed against the specific assessment needs of the university entrance examination. This investigation also found that the degree of compatibility between the number of points allotted to the different rating scales and the various requirements of an introduction speech played a considerable role in determining the extent to which the different rating scales conformed to the expectations of the Rasch model. Compatibility thus becomes an important factor to consider for optimal rating scale performance.
Rasch modeling to assess Albanian and South African learners' preferences for real-life situations to be used in mathematics: a pilot study.

PubMed

Kacerja, Suela; Julie, Cyril; Hadjerrouit, Said

2013-01-01

This paper reports on an investigation on the real-life situations students in grades 8 and 9 in South Africa and Albania prefer to use in Mathematics. The functioning of the instrument used to assess the order of preference learners from both countries have for contextual situations is assessed using Rasch modeling techniques. For both the cohorts, the data fit the Rasch model. The differential item functioning (DIF) analysis rendered 3 items operating differentially for the two cohorts. Explanations for these differences are provided in terms of differences in experiences learners in the two countries have related to some of the contextual situations. Implications for interpretation of international comparative tests are offered, as are the possibilities for the cross-country development of curriculum materials related to contexts that learners prefer to use in Mathematics.
Psychometric validation of the Persian Bergen Social Media Addiction Scale using classic test theory and Rasch models.

PubMed

Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H

2017-12-01

Background and aims The Bergen Social Media Addiction Scale (BSMAS), a six-item self-report scale that is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions Given the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.
Update on the Child's Challenging Behaviour Scale following evaluation using Rasch analysis.

PubMed

Bourke-Taylor, H M; Pallant, J F; Law, M

2014-03-01

The Child's Challenging Behaviour Scale (CCBS) was designed to measure a mother's rating of her child's challenging behaviours. The CCBS was initially developed for mothers of school-aged children with developmental disability and has previously been shown to have good psychometric properties using classical test theory techniques. The aim of this study was to use Rasch analysis to fully evaluate all aspects of the scale, including response format, item fit, dimensionality and targeting. The sample consisted of 152 mothers of a school-aged child (aged 5-18 years) with a disability. Mothers were recruited via websites and mail-out newsletters through not-for-profit organizations that supported families with disabilities. Respondents completed a survey which included the 11 items of the CCBS. Rasch analysis was conducted on these responses using the RUMM2030 package. Rasch analysis of the CCBS revealed serious threshold disordering for nine of the 11 items, suggesting problems with the 5-point response format used for the scale. The neutral midpoint of the response format was subsequently removed to create a 4-point scale. High levels of local dependency were detected among two pairs of items, resulting in the removal of two items (item 7 and item 1). The final nine-item version of the scale (CCBS Version 2) was unidimensional, well targeted, showed good fit to the Rasch model, and strong internal consistency. To achieve fit to the Rasch model it was necessary to make two modifications to the CCBS scale. The resulting nine-item scale with a 4-point response format showed excellent psychometric properties, supporting its internal validity. © 2013 John Wiley & Sons Ltd.
Classical, Generalizability, and Multifaceted Rasch Detection of Interrater Variability in Large, Sparse Data Sets.

ERIC Educational Resources Information Center

MacMillan, Peter D.

2000-01-01

Compared classical test theory (CTT), generalizability theory (GT), and multifaceted Rasch model (MFRM) approaches to detecting and correcting for rater variability using responses of 4,930 high school students graded by 3 raters on 9 scales. The MFRM approach identified far more raters as different than did the CTT analysis. GT and Rasch…
Investigating Young Children's Human Figure Drawings Using Rasch Analysis

ERIC Educational Resources Information Center

Campbell, Claire; Bond, Trevor

2017-01-01

The Goodenough-Harris Drawing Test (GHDT) is a non-verbal assessment designed to infer young children's levels of intellectual development and understanding via the collection of three human figure drawings (HFDs)--one each of a man, a woman and a self-portrait. This paper presents findings from a research project that applied the Rasch model for…
Stability of Rasch Scales over Time

ERIC Educational Resources Information Center

Taylor, Catherine S.; Lee, Yoonsun

2010-01-01

Item response theory (IRT) methods are generally used to create score scales for large-scale tests. Research has shown that IRT scales are stable across groups and over time. Most studies have focused on items that are dichotomously scored. Now Rasch and other IRT models are used to create scales for tests that include polytomously scored items.…
Psychometric Properties of the Chinese Version of the Beck Depression Inventory-II Using the Rasch Model

ERIC Educational Resources Information Center

Wu, Pei-Chen; Chang, Lily

2008-01-01

The authors investigated the Chinese version of the Beck Depression Inventory-II (BDI-II-C; Chinese Behavioral Science Corporation, 2000) within the Rasch framework in terms of dimensionality, item difficulty, and category functioning. Two underlying scale dimensions, relatively high item difficulties, and a need for collapsing 2 response…
A Response to Holster and Lake Regarding Guessing and the Rasch Model

ERIC Educational Resources Information Center

Stewart, Jeffrey; McLean, Stuart; Kramer, Brandon

2017-01-01

Stewart questioned vocabulary size estimation methods proposed by Beglar and Nation for the Vocabulary Size Test, further arguing Rasch mean square (MSQ) fit statistics cannot determine the proportion of random guesses contained in the average learner's raw score, because the average value will be near 1 by design. He illustrated this by…
Oral Performace Scoring Using Generalizability Theory and Many-Facet Rasch Measurement: A Comparison Study

ERIC Educational Resources Information Center

Alkahtani, Saif F.

2012-01-01

The principal aim of the present study was to better guide the Quranic recitation appraisal practice by presenting an application of Generalizability theory and Many-facet Rasch Measurement Model for assessing the dependability and fit of two suggested rubrics. Recitations of 93 students were rated holistically and analytically by 3 independent…

Examination of an eHealth literacy scale and a health literacy scale in a population with moderate to high cardiovascular risk: Rasch analyses.

PubMed

Richtering, Sarah S; Morris, Rebecca; Soh, Sze-Ee; Barker, Anna; Bampi, Fiona; Neubeck, Lis; Coorey, Genevieve; Mulley, John; Chalmers, John; Usherwood, Tim; Peiris, David; Chow, Clara K; Redfern, Julie

2017-01-01

Electronic health (eHealth) strategies are evolving making it important to have valid scales to assess eHealth and health literacy. Item response theory methods, such as the Rasch measurement model, are increasingly used for the psychometric evaluation of scales. This paper aims to examine the internal construct validity of an eHealth and health literacy scale using Rasch analysis in a population with moderate to high cardiovascular disease risk. The first 397 participants of the CONNECT study completed the electronic health Literacy Scale (eHEALS) and the Health Literacy Questionnaire (HLQ). Overall Rasch model fit as well as five key psychometric properties were analysed: unidimensionality, response thresholds, targeting, differential item functioning and internal consistency. The eHEALS had good overall model fit (χ2 = 54.8, p = 0.06), ordered response thresholds, reasonable targeting and good internal consistency (person separation index (PSI) 0.90). It did, however, appear to measure two constructs of eHealth literacy. The HLQ subscales (except subscale 5) did not fit the Rasch model (χ2: 18.18-60.60, p: 0.00-0.58) and had suboptimal targeting for most subscales. Subscales 6 to 9 displayed disordered thresholds indicating participants had difficulty distinguishing between response options. All subscales did, nonetheless, demonstrate moderate to good internal consistency (PSI: 0.62-0.82). Rasch analyses demonstrated that the eHEALS has good measures of internal construct validity although it appears to capture different aspects of eHealth literacy (e.g. using eHealth and understanding eHealth). Whilst further studies are required to confirm this finding, it may be necessary for these constructs of the eHEALS to be scored separately. The nine HLQ subscales were shown to measure a single construct of health literacy. However, participants' scores may not represent their actual level of ability, as distinction between response categories was unclear for the last four subscales. Reducing the response categories of these subscales may improve the ability of the HLQ to distinguish between different levels of health literacy.
Accounting for Local Dependence with the Rasch Model: The Paradox of Information Increase.

PubMed

Andrich, David

Test theories imply statistical, local independence. Where local independence is violated, models of modern test theory that account for it have been proposed. One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation between two items in the dichotomous Rasch model, this paper derives three related implications. First, it formalises how the polytomous Rasch model for an item constituted by summing the scores of the dependent items absorbs the dependence in its threshold structure. Second, it shows that as a consequence the unit when the dependence is accounted for is not the same as if the items had no response dependence. Third, it explains the paradox, known, but not explained in the literature, that the greater the dependence of the constituent items the greater the apparent information in the constituted polytomous item when it should provide less information.
Rasch model of a dynamic assessment: an investigation of the children's inferential thinking modifiability test.

PubMed

Rittner, Linda L; Pulos, Steven M

2014-01-01

The purpose of this study was to develop a general procedure for evaluation of a dynamic assessment and to demonstrate an analysis of a dynamic assessment, the CITM (Tzuriel, 1995b), as an objective measure for use as a group assessment. The techniques used to determine the fit of the CITM to a Rasch partial credit model are explicitly outlined. A modified format of the CITM was administered to 266 diverse second grade students in the USA; 58% of participants were identified as low SES. The participants (males n = 144) were White Anglo and Latino American students (55%), many of whom were first generation Mexican immigrants. The CITM was found to adequately fit a Rasch partial credit model (PCM) indicating that the CITM is a likely candidate for a group administered dynamic assessment that can be measured objectively. Data also supported that a model for objectively measuring change in learning ability for inferential thinking in the CITM was feasible.
An in-depth psychometric analysis of the Connor-Davidson Resilience Scale: calibration with Rasch-Andrich model.

PubMed

Arias González, Víctor B; Crespo Sierra, María Teresa; Arias Martínez, Benito; Martínez-Molina, Agustín; Ponce, Fernando P

2015-09-23

The Connor-Davidson Resilience Scale (CD-RISC) is inarguably one of the best-known instruments in the field of resilience assessment. However, the criteria for the psychometric quality of the instrument were based only on classical test theory. The aim of this paper has focused on the calibration of the CD-RISC with a nonclinical sample of 444 adults using the Rasch-Andrich Rating Scale Model, in order to clarify its structure and analyze its psychometric properties at the level of item. Two items showed misfit to the model and were eliminated. The remaining 22 items form basically a unidimensional scale. The CD-RISC has good psychometric properties. The fit of both the items and the persons to the Rasch model was good, and the response categories were functioning properly. Two of the items showed differential item functioning. The CD-RISC has an obvious ceiling effect, which suggests to include more difficult items in future versions of the scale.
Validation of the Dutch version of the Swallowing Quality-of-Life Questionnaire (DSWAL-QoL) and the adjusted DSWAL-QoL (aDSWAL-QoL) using item analysis with the Rasch model: a pilot study.

PubMed

Simpelaere, Ingeborg S; Van Nuffelen, Gwen; De Bodt, Marc; Vanderwegen, Jan; Hansen, Tina

2017-04-07

The Swallowing Quality-of-Life Questionnaire (SWAL-QoL) is considered the gold standard for assessing health-related QoL in oropharyngeal dysphagia. The Dutch translation (DSWAL-QoL) and its adjusted version (aDSWAL-QoL) have been validated using classical test theory (CTT). However, these scales have not been tested against the Rasch measurement model, which is required to establish the structural validity and objectivity of the total scale and subscale scores. Thus, the purpose of this study was to examine the psychometric properties of these scales using item analysis according to the Rasch model. Item analysis with the Rasch model was performed using RUMM2030 software with previously collected data from a validation study of 108 patients. The assessment included evaluations of overall model fit, reliability, unidimensionality, threshold ordering, individual item and person fits, differential item functioning (DIF), local item dependency (LID) and targeting. The analysis could not establish the psychometric properties of either of the scales or their subscales because they did not fit the Rasch model, and multidimensionality, disordered thresholds, DIF, and/or LID were found. The reliability and power of fit were high for the total scales (PSI = 0.93) but low for most of the subscales (PSI < 0.70). The targeting of persons and items was suboptimal. The main source of misfit was disordered thresholds for both the total scales and subscales. Based on the results of the analysis, adjustments to improve the scales were implemented as follows: disordered thresholds were rescaled, misfit items were removed and items were split for DIF. However, the multidimensionality and LID could not be resolved. The reliability and power of fit remained low for most of the subscales. This study represents the first analyses of the DSWAL-QoL and aDSWAL-QoL with the Rasch model. Relying on the DSWAL-QoL and aDSWAL-QoL total and subscale scores to make conclusions regarding dysphagia-related HRQoL should be treated with caution before the structural validity and objectivity of both scales have been established. A larger and well-targeted sample is recommended to derive definitive conclusions about the items and scales. Solutions for the psychometric weaknesses suggested by the model and practical implications are discussed.
Measuring situational avoidance in older drivers: An application of Rasch analysis.

PubMed

Davis, Jessica; Conlon, Elizabeth; Ownsworth, Tamara; Morrissey, Shirley

2016-02-01

Situational avoidance is a form of driving self-regulation at the strategic level of driving behaviour. It has typically been defined as the purposeful avoidance of driving situations perceived as challenging or potentially hazardous. To date, assessment of the psychometric properties of existing scales that measure situational avoidance has been sparse. This study examined the contribution of Rasch analysis to the situational avoidance construct. Three hundred and ninety-nine Australian drivers (M=66.75, SD=10.14, range: 48-91 years) completed the Situational Avoidance Questionnaire (SAQ). Following removal of the item Parallel Parking, the scale conformed to a Rasch model, showing good person separation, sufficient reliability, little disordering of thresholds, and no evidence of differential item functioning by age or gender. The residuals were independent supporting the assumption of unidimensionality and in conforming to a Rasch model, SAQ items were found to be hierarchical or cumulative. Increased avoidance was associated with factors known to be related to driving self-regulation more broadly, including older age, female gender, reduced driving space and frequency, reporting a change in driving in the past five years and poorer indices of health (i.e., self-rated mood, vision and cognitive function). Overall, these results support the use of the SAQ as a psychometrically sound measure of situational avoidance. Application of Rasch analysis to this area of research advances understanding of the driving self-regulation construct and its practice by drivers in baby boomer and older adult generations. Copyright © 2015 Elsevier Ltd. All rights reserved.
The Revised Body Awareness Rating Questionnaire: Development Into a Unidimensional Scale Using Rasch Analysis.

PubMed

Dragesund, Tove; Strand, Liv Inger; Grotle, Margreth

2018-02-01

The Body Awareness Rating Questionnaire (BARQ) is a self-report questionnaire aimed at capturing how people with long-lasting musculoskeletal pain reflect on their own body awareness. Methods based on classical test theory were applied to the development of the instrument and resulted in 4 subscales. However, the scales were not correlated, and construct validity might be questioned. The primary purpose of this study was to explore the possibility of developing a unidimensional scale from items initially collected for the BARQ using Rasch analysis. A secondary purpose was to investigate the test-retest reliability of a revised version of the BARQ. This was a methodological study. Rasch and reliability analyses were performed for 3 samples of participants with long-lasting musculoskeletal pain. The first Rasch analysis was carried out on 66 items generated for the original BARQ and scored by 300 participants. The items supported by the first analysis were scored by a new group of 127 participants and analyzed in a second Rasch analysis. For the test-retest reliability analysis, 48 participants scored the revised BARQ items twice within 1 week. The 2-step Rasch analysis resulted in a unidimensional 12-item revised version of the BARQ with a 4-point response scale (scores from 0 to 36). It showed a good fit to the Rasch model, with acceptable internal consistency, satisfactory fit residuals, and no disordered thresholds. Test-retest reliability was high, with an intraclass correlation coefficient of .83 (95% CI = .71-.89) and a smallest detectable change of 6.3 points. The small sample size in the second Rasch analysis was a study limitation. The revised BARQ is a unidimensional and feasible measurement of body awareness, recommended for use in the context of body-mind physical therapy approaches for musculoskeletal conditions. © 2017 American Physical Therapy Association
Understanding Rasch Measurement: Distractors with Information in Multiple Choice Items: A Rationale Based on the Rasch Model

ERIC Educational Resources Information Center

Andrich, David; Styles, Irene

2011-01-01

There is a substantial literature on attempts to obtain information on the proficiency of respondents from distractors in multiple choice items. Information in a distractor implies that a person who chooses that distractor has greater proficiency than if the person chose another distractor with no information. A further implication is that the…
Rasch Analysis of the Bruininks-Oseretsky Test of Motor Proficiency--Second Edition in Intellectual Disabilities

ERIC Educational Resources Information Center

Wuang, Yee-Pay; Lin, Yueh-Hsien; Su, Chwen-Yng

2009-01-01

The Bruininks-Oseretsky Test of Motor Proficiency-Second Edition (BOT-2) is widely used to assess motor skills for both clinical and research purposes; however, its validity has not been adequately assessed in intellectual disabilities (ID). This study used partial credit Rasch model to examine the measurement properties of the BOT-2 among 446…
Learning to Teach for Social Justice-Beliefs Scale: An Application of Rasch Measurement Principles

ERIC Educational Resources Information Center

Ludlow, Larry H.; Enterline, Sarah E.; Cochran-Smith, Marilyn

2008-01-01

The authors illustrate how a Rasch model can guide the development of a new affective measurement instrument--the Learning to Teach for Social Justice--Beliefs scale. The results provide strong evidence of a meaningful continuum of attitudes about teaching for social justice ranging from those easier to endorse to those more difficult to endorse.…
Validating independent ratings of executive functioning following acquired brain injury using Rasch analysis.

PubMed

Simblett, Sara K; Badham, Rachel; Greening, Kate; Adlam, Anna; Ring, Howard; Bateman, Andrew

2012-01-01

Assessment of everyday problems with executive functioning following acquired brain injury (ABI) is greatly valued by neurorehabilitation services. Reliance on self-report measures alone is problematic within this client group who may experience difficulties with awareness and memory. The construct validity and reliability of independent ratings (i.e., ratings provided by a carer/relative) on the Dysexecutive Questionnaire (DEX-I) was explored in this study. Consistent with the results recently reported on the self-rated version of the DEX (DEX-S; Simblett & Bateman, 2011 ), Rasch analysis completed on 271 responses to the DEX-I revealed that the scale did not fit the Rasch model and did not meet the assumption of unidimensionality, that is, a single underlying construct could not be found for the DEX-I that would allow development of an interval-level measure as a whole. Subscales, based on theoretical conceptualisations of executive functioning (Stuss, 2007 ) previously suggested for the DEX-S, were able to demonstrate fit to the Rasch model and unidimensionality. Reliability of independent responses to these subscales in comparison to self-reported ratings is discussed. These results contribute to a greater understanding of how assessment of executive functioning can be improved.
Rasch validation of the PHQ-9 in people with visual impairment in South India.

PubMed

Gothwal, Vijaya K; Bagga, Deepak K; Sumalini, Rebecca

2014-01-01

The Patient-Health Questionnaire (PHQ-9) is a widely used screening instrument for depression. Recently, its properties as a measure were investigated using Rasch analysis in an Australian population with visual impairment (VI) and it was demonstrated to possess excellent measurement properties, but the response scale required shortening (modified PHQ-9). However, further validation was recommended to substantiate its use with the growing population of VI. Therefore, we aimed to use Rasch analysis to evaluate the measurement properties of the modified PHQ-9 in an Indian population with VI. 303 patients with VI (mean age 40.2 years; 71% male) referred to Vision Rehabilitation Centres were administered the PHQ-9 by trained interviewer. Rasch analysis was used to investigate the psychometric properties of the modified PHQ-9. Rasch analysis showed good fit to the model, no misfitting items and an acceptable person separation reliability (0.82). Dimensionality testing supported combining 9 items to create a total score. Targeting was sub-optimal (-1.30 logits); more difficult items are needed. One item ('trouble falling asleep') showed notable differential item functioning, DIF (1.18 logits) by duration of VI. The generalisability of these results might be restricted to patients with VI presenting to a tertiary eye care centre. Except for DIF, the performance of the modified PHQ-9 is consistent with that of the original, albeit in a different cultural context (Indian population with VI). Clinicians/researchers can readily use the modified PHQ-9 without formal training in Rasch procedures given the provision of ready-to-use spreadsheets that convert raw to Rasch-scaled scores. However the conversions will apply only if the sample being tested is similar to that of the present study. Copyright © 2014 Elsevier B.V. All rights reserved.
A Method of Q-Matrix Validation for the Linear Logistic Test Model

PubMed Central

Baghaei, Purya; Hohensinn, Christine

2017-01-01

The linear logistic test model (LLTM) is a well-recognized psychometric model for examining the components of difficulty in cognitive tests and validating construct theories. The plausibility of the construct model, summarized in a matrix of weights, known as the Q-matrix or weight matrix, is tested by (1) comparing the fit of LLTM with the fit of the Rasch model (RM) using the likelihood ratio (LR) test and (2) by examining the correlation between the Rasch model item parameters and LLTM reconstructed item parameters. The problem with the LR test is that it is almost always significant and, consequently, LLTM is rejected. The drawback of examining the correlation coefficient is that there is no cut-off value or lower bound for the magnitude of the correlation coefficient. In this article we suggest a simulation method to set a minimum benchmark for the correlation between item parameters from the Rasch model and those reconstructed by the LLTM. If the cognitive model is valid then the correlation coefficient between the RM-based item parameters and the LLTM-reconstructed item parameters derived from the theoretical weight matrix should be greater than those derived from the simulated matrices. PMID:28611721
Validation of VARK learning modalities questionnaire using Rasch analysis

NASA Astrophysics Data System (ADS)

Fitkov-Norris, E. D.; Yeghiazarian, A.

2015-02-01

This article discusses the application of Rasch analysis to assess the internal validity of a four sub-scale VARK (Visual, Auditory, Read/Write and Kinaesthetic) learning styles instrument. The results from the analysis show that the Rasch model fits the majority of the VARK questionnaire data and the sample data support the internal validity of the four sub-constructs at 1% level of significance for all but one item. While this suggests that the instrument could potentially be used as a predictor for a person's learning preference orientation, further analysis is necessary to confirm the invariability of the instrument across different user groups across factors such as gender, age, educational and cultural background.
Emotional Intelligence and Nurse Recruitment: Rasch and confirmatory factor analysis of the trait emotional intelligence questionnaire short form.

PubMed

Snowden, Austyn; Watson, Roger; Stenhouse, Rosie; Hale, Claire

2015-12-01

To examine the construct validity of the Trait Emotional Intelligence Questionnaire Short form. Emotional intelligence involves the identification and regulation of our own emotions and the emotions of others. It is therefore a potentially useful construct in the investigation of recruitment and retention in nursing and many questionnaires have been constructed to measure it. Secondary analysis of existing dataset of responses to Trait Emotional Intelligence Questionnaire Short form using concurrent application of Rasch analysis and confirmatory factor analysis. First year undergraduate nursing and computing students completed Trait Emotional Intelligence Questionnaire-Short Form in September 2013. Responses were analysed by synthesising results of Rasch analysis and confirmatory factor analysis. Participants (N = 938) completed Trait Emotional Intelligence Questionnaire Short form. Rasch analysis showed the majority of the Trait Emotional Intelligence Questionnaire-Short Form items made a unique contribution to the latent trait of emotional intelligence. Five items did not fit the model and differential item functioning (gender) accounted for this misfit. Confirmatory factor analysis revealed a four-factor structure consisting of: self-confidence, empathy, uncertainty and social connection. All five misfitting items from the Rasch analysis belonged to the 'social connection' factor. The concurrent use of Rasch and factor analysis allowed for novel interpretation of Trait Emotional Intelligence Questionnaire Short form. Much of the response variation in Trait Emotional Intelligence Questionnaire Short form can be accounted for by the social connection factor. Implications for practice are discussed. © 2015 John Wiley & Sons Ltd.
An Application of the Rasch Measurement Theory to an Assessment of Geometric Thinking Levels

ERIC Educational Resources Information Center

Stols, Gerrit; Long, Caroline; Dunne, Tim

2015-01-01

The purpose of this study is to apply the Rasch model to investigate both the Van Hiele theory for geometric development and an associated test. In terms of the test, the objective is to investigate the functioning of a classic 25-item instrument designed to identify levels of geometric proficiency. The dataset of responses by 244 students (106…
[Rasch Model in the Validation of the Paediatric Quality of Life Inventory™ 4.0 (PedsQL 4.0™) in Colombian Children and Adolescents].

PubMed

Vélez, Claudia Marcela; Villada Ramírez, Adriana C; Arias, Ana Carolina Amaya; Eslava-Schmalbach, Javier H

2016-01-01

The aim of this study was to validate the PedsQL 4.0™ in Colombian children and adolescents using the Rasch model. The Paediatric Quality of Life Inventory (PedsQL 4.0™) has demonstrated to be a reliable and sensitive measurement to changes in health status, as well as being quick and easy to use. Validation study of measurement tools. The PedsQL 4.0™ was applied to a convenience sample of 375 children and adolescents between 5 and 17 years old and 500 caregivers of children between 2 and 18 years old in five Colombian cities. The psychometric properties were analysed according to the Rasch model, including adjustment, separation, and differential item functioning (DIF). The Rasch model provided adequate fits to data. The social dimension, for both versions, had greater difficulty than the physical health dimension. Internal consistency for the items was observed, while for individuals, the values of reliability and separation were lower than that established. The DIF occurred in very few variables, especially when comparing cities. The characteristic curves for the items presented disordered thresholds. The items had adequate internal consistency. Analysis showed adequate individual separation, but disordered thresholds were found in the response categories. No DIF was observed by sex or disease, but it is noteworthy that the DIF occurred between cities. Copyright © 2016 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
Psychometric properties of the International Classification of Functioning, Disability and Health set for spinal cord injury nursing based on Rasch analysis.

PubMed

Li, Kun; Yan, Tiebin; You, Liming; Xie, Sumei; Li, Yun; Tang, Jie; Wang, Yingmin; Gao, Yan

2018-02-01

To examine the psychometric properties of the International Classification of Functioning, Disability and Health (ICF) set for spinal cord injury nursing (ICF-SCIN) using Rasch analysis. A total of 140 spinal cord injury patients were recruited between December 2013 and March 2014 through convenience sampling. Nurses used the components body functions (BF), body structures (BS), and activities and participation (AP) of the ICF-SCIN to rate the patients' functioning. Rasch analysis was performed using RUMM 2030 software. In each component, categories were rescored from 01234 to 01112 because of reversed thresholds. Nine testlets were created to overcome local dependency. Four categories which fit to the Rasch model poorly were deleted. After modification, the components BF, BS, and AP showed good fit to the Rasch model with a Bonferroni-adjusted significant level (χ 2 = 86.29, p = 0.006; χ 2 = 22.44, p = 0.130; χ 2 = 39.92, p = 0.159). The person separation indices (PSIs) for the three components were 0.80, 0.54, and 0.97, respectively. No differential item functioning (DIF) was detected across age, gender, or educational level. The fit properties of the ICF set were satisfactory after modifications. The ICF-SCIN has the potential as a nursing assessment instrument for measuring the functioning of patients with spinal cord injury. Implications for rehabilitation The International Classification of Functioning, Disability and Health (ICF) set for spinal cord injury nursing contains a group of categories which can reflect the functioning of spinal cord injury patients from the perspective of nurses. The components body functions (BF), body structures (BS), and activities and participation (AP) of the ICF set for spinal cord injury achieved the fit to the Rasch model through rescoring, generating testlets, and deleting categories with poor fit. The ICF set for spinal cord injury nursing (ICF-SCIN) has the potential to be used as a clinical nursing assessment tool in measuring the functioning of patients with spinal cord injury.
Analysis of the psychometric properties of the American Orthopaedic Foot and Ankle Society Score (AOFAS) in rheumatoid arthritis patients: application of the Rasch model.

PubMed

Conceição, Cristiano Sena da; Neto, Mansueto Gomes; Neto, Anolino Costa; Mendes, Selena M D; Baptista, Abrahão Fontes; Sá, Kátia Nunes

2016-01-01

To tested the reliability and validity of Aofas in a sample of rheumatoid arthritis patients. The scale was applicable to rheumatoid arthritis patients, twice by the interviewer 1 and once by the interviewer 2. The Aofas was subjected to test-retest reliability analysis (with 20 Rheumatoid arthritis subjects). The psychometric properties were investigated using Rasch analysis on 33 Rheumatoid arthritis patients. Intra-Class Correlation Coefficient (ICC) were (0.90
Validating the European Health Literacy Survey Questionnaire in people with type 2 diabetes: Latent trait analyses applying multidimensional Rasch modelling and confirmatory factor analysis.

PubMed

Finbråten, Hanne Søberg; Pettersen, Kjell Sverre; Wilde-Larsson, Bodil; Nordström, Gun; Trollvik, Anne; Guttersrud, Øystein

2017-11-01

To validate the European Health Literacy Survey Questionnaire (HLS-EU-Q47) in people with type 2 diabetes mellitus. The HLS-EU-Q47 latent variable is outlined in a framework with four cognitive domains integrated in three health domains, implying 12 theoretically defined subscales. Valid and reliable health literacy measurers are crucial to effectively adapt health communication and education to individuals and groups of patients. Cross-sectional study applying confirmatory latent trait analyses. Using a paper-and-pencil self-administered approach, 388 adults responded in March 2015. The data were analysed using the Rasch methodology and confirmatory factor analysis. Response violation (response dependency) and trait violation (multidimensionality) of local independence were identified. Fitting the "multidimensional random coefficients multinomial logit" model, 1-, 3- and 12-dimensional Rasch models were applied and compared. Poor model fit and differential item functioning were present in some items, and several subscales suffered from poor targeting and low reliability. Despite multidimensional data, we did not observe any unordered response categories. Interpreting the domains as distinct but related latent dimensions, the data fit a 12-dimensional Rasch model and a 12-factor confirmatory factor model best. Therefore, the analyses did not support the estimation of one overall "health literacy score." To support the plausibility of claims based on the HLS-EU score(s), we suggest: removing the health care aspect to reduce the magnitude of multidimensionality; rejecting redundant items to avoid response dependency; adding "harder" items and applying a six-point rating scale to improve subscale targeting and reliability; and revising items to improve model fit and avoid bias owing to person factors. © 2017 John Wiley & Sons Ltd.

The integration of bioclimatic indices in an objective probabilistic model for establishing and mapping viticulture suitability in a region

NASA Astrophysics Data System (ADS)

Moral García, Francisco J.; Rebollo, Francisco J.; Paniagua, Luis L.; García, Abelardo

2014-05-01

Different bioclimatic indices have been proposed to determine the wine suitability in a region. Some of them are related to the air temperature, but the hydric component of climate should also be considered which, in turn, is influenced by the precipitation during the different stages of the grapevine growing and ripening periods. In this work we propose using the information obtained from 10 bioclimatic indices and variables (heliothermal index, HI, cool night index, CI, dryness index, DI, growing season temperature, GST, the Winkler index, WI, September mean thermal amplitude, MTA, annual precipitation, AP, precipitation during flowering, PDF, precipitation before flowering, PBF, and summer precipitation, SP) as inputs in an objective and probabilistic model, the Rasch model, with the aim of integrating the individual effects of them, obtaining the climate data that summarize all main bioclimatic indices which could influence on wine suitability, and utilize the Rasch measures to generate homogeneous climatic zones. The use of the Rasch model to estimate viticultural suitability constitutes a new application of great practical importance, enabling to rationally determine locations in a region where high viticultural potential exists and establishing a ranking of the bioclimatic indices or variables which exerts an important influence on wine suitability in a region. Furthermore, from the measures of viticultural suitability at some locations, estimates can be computed using a geostatistical algorithm, and these estimates can be utilized to map viticultural suitability potential in a region. To illustrate the process, an application to Extremadura, southewestern Spain, is shown. Keywords: Rasch model, bioclimatic indices, GIS.
Development of a short scale for assessing economic environmental aspects in patients with spinal diseases using Rasch analysis.

PubMed

Gecht, Judith; Mainz, Verena; Boecker, Maren; Clusmann, Hans; Geiger, Matthias Florian; Tingart, Markus; Quack, Valentin; Gauggel, Siegfried; Heinemann, Allen W; Müller, Christian-Andreas

2017-10-10

Economic environmental factors represent important barriers to participation and have deleterious effects on quality of life (QOL) in persons with spinal diseases (SpD). While economic factors are anchored in the International Classification of Functioning, Disability and Health, their influence on QOL and participation from patients' perspectives is an infrequent focus of research. The aim of the present research is to calibrate a culturally adapted Rasch-based questionnaire assessing economic QOL in patients with SpD. The 11-items of the German economic-QOL-scale were answered by 325 patients with SpD on a four-point Likert-scale. Fit to the Rasch measurement model was investigated by testing for stochastic ordering of the items, unidimensionality, local independence, and differential item functioning (DIF). After adjusting for local dependency, fit to the Rasch model was achieved with a non-significant item-trait interaction (chi-square df = 20 = 34.8, p = 0.021). The person separation reliability equaled 0.88, the scale was free from age- or gender-related DIF, and unidimensionality could be verified. The Rasch-based German version of the economic-QOL-scale represents a suitable instrument to investigate the influences of economic factors on patients' QOL at a group and individual level. It can be easily applied in research and practice and may be administered quickly in combination with other instruments. The short test duration implies a low test burden for patients and a minimum of time expenditure by clinicians when evaluating the results.
Measuring Math Anxiety (in Spanish) with the Rasch Rating Scale Model.

PubMed

Prieto, Gerardo; Delgado, Ana R

2007-01-01

Two successive studies probed the psychometric properties of a Math Anxiety questionnaire (in Spanish) by means of the Rasch Rating Scale Model. Participants were 411 and 216 Spanish adolescents. Convergent validity was examined by correlating the scale with both the Fennema and Sherman Attitude Scale and a math achievement test. The results show that the scores are psychometrically appropriate, and replicate those reported in meta-analyses: medium-sized negative correlations with achievement and with attitudes toward mathematics, as well as moderate sex-related differences (with girls presenting higher anxiety levels than boys).
Using Rasch Analysis to Examine the Dimensionality Structure and Differential Item Functioning of the Arabic Version of the Perceived Physical Ability Scale for Children

ERIC Educational Resources Information Center

Abd-El-Fattah, Sabry M.; AL-Sinani, Yousra; El Shourbagi, Sahar; Fakhroo, Hessa A.

2014-01-01

This study uses the Rasch model technique to examine the dimensionality structure and differential item functioning of the Arabic version of the Perceived Physical Ability Scale for Children (PPASC). A sample of 220 Omani fourth graders (120 males and 100 females) responded to an Arabic translated version of the PPASC. Data on students'…
A Rasch Differential Item Functioning Analysis of the Massachusetts Youth Screening Instrument: Identifying Race and Gender Differential Item Functioning among Juvenile Offenders

ERIC Educational Resources Information Center

Cauffman, Elizabeth; MacIntosh, Randall

2006-01-01

The juvenile justice system needs a tool that can identify and assess mental health problems among youths quickly with validity and reliability. The goal of this article is to evaluate the racial/ethnic and gender differential item functioning (DIF) of the Massachusetts Youth Screening Instrument-Second Version (MAYSI-2) using the Rasch Model.…
A measure of early physical functioning (EPF) post-stroke.

PubMed

Finch, Lois E; Higgins, Johanne; Wood-Dauphinee, Sharon; Mayo, Nancy E

2008-07-01

To develop a comprehensive measure of Early Physical Functioning (EPF) post-stroke quantified through Rasch analysis and conceptualized using the International Classification of Functioning Disability and Health (ICF). An observational cohort study. A cohort of 262 subjects (mean age 71.6 (standard deviation 12.5) years) hospitalized post-acute stroke. Functional assessments were made within 3 days of stroke with items from valid and reliable indices commonly utilized to evaluate stroke survivors. Information on important variables was also collected. Principal component and Rasch analysis confirmed the factor structure, and dimensionality of the measure. Rasch analysis combined items across ICF components to develop the measure. Items were deleted iteratively, those retained fit the model and were related to the construct; reliability and validity were assessed. A 38-item unidimensional measure of the EPF met all Rasch model requirements. The item difficulty matched the person ability (mean person measure: -0.31; standard error 0.37 logits), reliability of the person-item-hierarchy was excellent at 0.97. Initial validity was adequate. The 38-item EPF measure was developed. It expands the range of assessment post acute stroke; it covers a broad spectrum of difficulty with good initial psychometric properties that, once revalidated, can assist in planning and evaluating early interventions.
Application of Rasch analysis to the parent adherence report questionnaire in juvenile idiopathic arthritis.

PubMed

Toupin April, Karine; Higgins, Johanne; Ehrmann Feldman, Debbie

2016-07-28

Adherence to treatment in children with juvenile idiopathic arthritis (JIA) is associated with better outcomes. Assessing patient adherence in JIA, as well as attitudes and beliefs about prescribed treatments, is important for the clinician in order to optimize patient management. The objective of the current study was to evaluate the psychometric properties of the Parent (proxy-report) Adherence Report Questionnaires (PARQ), which assesses beliefs and behaviors related to adherence to treatments prescribed for JIA. A Rasch analysis was conducted on data collected with parents of children with JIA from two studies in which the PARQ was used as a measure of adherence. The PARQ showed preliminary evidence of multidimensionality with two factors, accounting for 38 % and 27 % of the variance respectively. The PARQ in its original version does not adhere to expectations of the Rasch model. A transformed version of the PARQ obtained by deletion of the general adherence scale and modification of visual analog scales into 5-point likert scales improved fit to the model and showed preliminary evidence of unidimensionality. The PARQ was transformed based on the results of the Rasch analysis. The transformed version of the PARQ shows preliminary evidence of unidimensionality and may allow computation of a total score, although further testing is needed to verify these findings.
Developing the Communicative Participation Item Bank: Rasch Analysis Results From a Spasmodic Dysphonia Sample

PubMed Central

Baylor, Carolyn R.; Yorkston, Kathryn M.; Eadie, Tanya L.; Miller, Robert M.; Amtmann, Dagmar

2011-01-01

Purpose The purpose of this study was to conduct the initial psychometric analyses of the Communicative Participation Item Bank—a new self-report instrument designed to measure the extent to which communication disorders interfere with communicative participation. This item bank is intended for community-dwelling adults across a range of communication disorders. Method A set of 141 candidate items was administered to 208 adults with spasmodic dysphonia. Participants rated the extent to which their condition interfered with participation in various speaking communication situations. Questionnaires were administered online or in a paper version per participant preference. Participants also completed the Voice Handicap Index (B. H. Jacobson et al., 1997) and a demographic questionnaire. Rasch analyses were conducted using Winsteps software (J. M. Linacre, 1991). Results The results show that items functioned better when the 5-category response format was recoded to a 4-category format. After removing 8 items that did not fit the Rasch model, the remaining 133 items demonstrated strong evidence of sufficient unidimensionality, with the model accounting for 89.3% of variance. Item location values ranged from −2.73 to 2.20 logits. Conclusions Preliminary Rasch analyses of the Communicative Participation Item Bank show strong psychometric properties. Further testing in populations with other communication disorders is needed. PMID:19717652
Rasch analysis of measurement instruments capturing psychological personal factors in persons with spinal cord injury.

PubMed

Peter, Claudio; Schulenberg, Stefan E; Buchanan, Erin M; Prodinger, Birgit; Geyh, Szilvia

2016-02-01

To evaluate the metric properties of distinct measures of psychological personal factors comprising feelings, beliefs, motives, and patterns of experience and behaviour assessed in the Swiss Spinal Cord Injury Cohort Study (SwiSCI), using Rasch methodology. SwiSCI Pathway 2 is a community-based, nationwide, cross-sectional survey for persons with spinal cord injury (SCI) (n = 511). The Rasch partial credit model was used for each subscale of the Positive Affect Negative Affect Scale (PANAS), Appraisal of Life Events Scale (ALE), Purpose in Life test - Short Form (PIL-SF), and the Big Five Inventory-K (BFI-K). The measures were unidimensional, with the exception of the positive affect items of the PANAS, where pairwise t-tests resulted in 10% significant cases, indicating multidimensionality. The BFI-K subscale agreeableness revealed low reliability (0.53). Other reliability estimates ranged between 0.61 and 0.89. Ceiling and floor effects were found for most measures. SCI-related differential item functioning (DIF) was rarely found. Language DIF was identified for several items of the BFI-K, PANAS and the ALE, but not for the PIL-SF. A majority of the measures satisfy the assumptions of the Rasch model, including unidimensionality. Invariance across language versions still represents a major challenge.
The pregnancy-related anxiety scale: A validity examination using Rasch analysis.

PubMed

Brunton, Robyn J; Dryer, Rachel; Krägeloh, Chris; Saliba, Anthony; Kohlhoff, Jane; Medvedev, Oleg

2018-04-27

Pregnancy-related anxiety is increasingly recognised as a common condition that is associated with many deleterious outcomes for both the mother and infant (e.g., preterm birth, postnatal depression). Limitations in the psychometric properties and/or breadth of existing scales for pregnancy-related anxiety highlight the need for a psychometrically sound measure to facilitate effective screening and possible early interventions. The recently developed Pregnancy-related Anxiety Scale (PrAS) was evaluated using Rasch analysis to explore how the scale's psychometric properties could be fine-tuned. A sample of 497 pregnant women completed the PrAS. Data were subjected to Rasch analysis, and the resulting scale structure examined using Confirmatory Factor Analysis. After minor modifications, the Rasch model with 33-items and 8-factors demonstrated good fit, unidimensionality and excellent targeting and internal consistency. Confirmatory Factor Analysis confirmed the final structure, and Cronbach's alpha demonstrated excellent reliability. The use of the same sample for all analyses was a potential limitation due to the possibility of sample-specific influences. The Rasch analysis further supports the internal construct validity of the PrAS. Ordinal to interval score conversions provide added precision to the analysis of the PrAS scores. The Rasch results, together with previous validation evidence, point to the PrAS as a comprehensive and psychometrically sound screening scale for pregnancy-related anxiety. The PrAS offers clinicians the ability to screen for pregnancy-related anxiety. The subscales provide additional insights into a woman's pregnancy-related anxiety and her specific areas of concern, enabling more targeted interventions. Copyright © 2018 Elsevier B.V. All rights reserved.
Using the Rasch model as an objective and probabilistic technique to integrate different soil properties

NASA Astrophysics Data System (ADS)

Rebollo, Francisco J.; Jesús Moral García, Francisco

2016-04-01

Soil apparent electrical conductivity (ECa) is one of the simplest, least expensive soil measurements that integrates many soil properties affecting crop productivity, including, for instance, soil texture, water content, and cation exchange capacity. The ECa measurements obtained with a 3100 Veris sensor, operating in both shallow (0-30 cm), ECs, and deep (0-90 cm), ECd, mode, can be used as an additional and essential information to be included in a probabilistic model, the Rasch model, with the aim of quantifying the overall soil fertililty potential in an agricultural field. This quantification should integrate the main soil physical and chemical properties, with different units. In this work, the formulation of the Rasch model integrates 11 soil properties (clay, silt and sand content, organic matter -OM-, pH, total nitrogen -TN-, available phosphorus -AP- and potassium -AK-, cation exchange capacity -CEC-, ECd, and ECs) measured at 70 locations in a field. The main outputs of the model include a ranking of all soil samples according to their relative fertility potential and the unexpected behaviours of some soil samples and properties. In the case study, the considered soil variables fit the model reasonably, having an important influence on soil fertility, except pH, probably due to its homogeneity in the field. Moreover, ECd, ECs are the most influential properties on soil fertility and, on the other hand, AP and AK the less influential properties. The use of the Rasch model to estimate soil fertility potential (always in a relative way, taking into account the characteristics of the studied soil) constitutes a new application of great practical importance, enabling to rationally determine locations in a field where high soil fertility potential exists and establishing those soil samples or properties which have any anomaly; this information can be necessary to conduct site-specific treatments, leading to a more cost-effective and sustainable field management. Furthermore, from the measures of soil fertility potential at sampled locations, estimates can be computed using, for instance, a geostatistical algorithm, and these estimates can be utilized to map soil fertility potential and delineate with a rational basis the management zones in the field. Keywords: Rasch model; soil management; soil electrical conductivity; probabilistic algorithm.
Rasch model based analysis of the Force Concept Inventory

NASA Astrophysics Data System (ADS)

Planinic, Maja; Ivanjek, Lana; Susac, Ana

2010-06-01

The Force Concept Inventory (FCI) is an important diagnostic instrument which is widely used in the field of physics education research. It is therefore very important to evaluate and monitor its functioning using different tools for statistical analysis. One of such tools is the stochastic Rasch model, which enables construction of linear measures for persons and items from raw test scores and which can provide important insight in the structure and functioning of the test (how item difficulties are distributed within the test, how well the items fit the model, and how well the items work together to define the underlying construct). The data for the Rasch analysis come from the large-scale research conducted in 2006-07, which investigated Croatian high school students’ conceptual understanding of mechanics on a representative sample of 1676 students (age 17-18 years). The instrument used in research was the FCI. The average FCI score for the whole sample was found to be (27.7±0.4)% , indicating that most of the students were still non-Newtonians at the end of high school, despite the fact that physics is a compulsory subject in Croatian schools. The large set of obtained data was analyzed with the Rasch measurement computer software WINSTEPS 3.66. Since the FCI is routinely used as pretest and post-test on two very different types of population (non-Newtonian and predominantly Newtonian), an additional predominantly Newtonian sample ( N=141 , average FCI score of 64.5%) of first year students enrolled in introductory physics course at University of Zagreb was also analyzed. The Rasch model based analysis suggests that the FCI has succeeded in defining a sufficiently unidimensional construct for each population. The analysis of fit of data to the model found no grossly misfitting items which would degrade measurement. Some items with larger misfit and items with significantly different difficulties in the two samples of students do require further examination. The analysis revealed some problems with item distribution in the FCI and suggested that the FCI may function differently in non-Newtonian and predominantly Newtonian population. Some possible improvements of the test are suggested.
A Comparison between Discrimination Indices and Item-Response Theory Using the Rasch Model in a Clinical Course Written Examination of a Medical School.

PubMed

Park, Jong Cook; Kim, Kwang Sig

2012-03-01

The reliability of test is determined by each items' characteristics. Item analysis is achieved by classical test theory and item response theory. The purpose of the study was to compare the discrimination indices with item response theory using the Rasch model. Thirty-one 4th-year medical school students participated in the clinical course written examination, which included 22 A-type items and 3 R-type items. Point biserial correlation coefficient (C(pbs)) was compared to method of extreme group (D), biserial correlation coefficient (C(bs)), item-total correlation coefficient (C(it)), and corrected item-total correlation coeffcient (C(cit)). Rasch model was applied to estimate item difficulty and examinee's ability and to calculate item fit statistics using joint maximum likelihood. Explanatory power (r2) of Cpbs is decreased in the following order: C(cit) (1.00), C(it) (0.99), C(bs) (0.94), and D (0.45). The ranges of difficulty logit and standard error and ability logit and standard error were -0.82 to 0.80 and 0.37 to 0.76, -3.69 to 3.19 and 0.45 to 1.03, respectively. Item 9 and 23 have outfit > or =1.3. Student 1, 5, 7, 18, 26, 30, and 32 have fit > or =1.3. C(pbs), C(cit), and C(it) are good discrimination parameters. Rasch model can estimate item difficulty parameter and examinee's ability parameter with standard error. The fit statistics can identify bad items and unpredictable examinee's responses.
Linguistic validation of stigmatisation degree, self-esteem and knowledge questionnaire among asthma patients using Rasch analysis.

PubMed

Ahmad, Sohail; Ismail, Ahmad Izuanuddin; Khan, Tahir Mehmood; Akram, Waqas; Mohd Zim, Mohd Arif; Ismail, Nahlah Elkudssiah

2017-04-01

The stigmatisation degree, self-esteem and knowledge either directly or indirectly influence the control and self-management of asthma. To date, there is no valid and reliable instrument that can assess these key issues collectively. The main aim of this study was to test the reliability and validity of the newly devised and translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" among adult asthma patients using the Rasch measurement model. This cross-sectional study recruited thirty adult asthma patients from two respiratory specialist clinics in Selangor, Malaysia. The newly devised self-administered questionnaire was adapted from relevant publications and translated into the Malay language using international standard translation guidelines. Content and face validation was done. The data were extracted and analysed for real item reliability and construct validation using the Rasch model. The translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" showed high real item reliability values of 0.90, 0.86 and 0.89 for stigmatisation degree, self-esteem, and knowledge of asthma, respectively. Furthermore, all values of point measure correlation (PTMEA Corr) analysis were within the acceptable specified range of the Rasch model. Infit/outfit mean square values and Z standard (ZSTD) values of each item verified the construct validity and suggested retaining all the items in the questionnaire. The reliability analyses and output tables of item measures for construct validation proved the translated Malaysian version of "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" as a valid and highly reliable questionnaire.
Identifying potential misfit items in cognitive process of learning engineering mathematics based on Rasch model

NASA Astrophysics Data System (ADS)

Ataei, Sh; Mahmud, Z.; Khalid, M. N.

2014-04-01

The students learning outcomes clarify what students should know and be able to demonstrate after completing their course. So, one of the issues on the process of teaching and learning is how to assess students' learning. This paper describes an application of the dichotomous Rasch measurement model in measuring the cognitive process of engineering students' learning of mathematics. This study provides insights into the perspective of 54 engineering students' cognitive ability in learning Calculus III based on Bloom's Taxonomy on 31 items. The results denote that some of the examination questions are either too difficult or too easy for the majority of the students. This analysis yields FIT statistics which are able to identify if there is data departure from the Rasch theoretical model. The study has identified some potential misfit items based on the measurement of ZSTD where the removal misfit item was accomplished based on the MNSQ outfit of above 1.3 or less than 0.7 logit. Therefore, it is recommended that these items be reviewed or revised to better match the range of students' ability in the respective course.
Measuring leader perceptions of school readiness for reforms: use of an iterative model combining classical and Rasch methods.

PubMed

Chatterji, Madhabi

2002-01-01

This study examines validity of data generated by the School Readiness for Reforms: Leader Questionnaire (SRR-LQ) using an iterative procedure that combines classical and Rasch rating scale analysis. Following content-validation and pilot-testing, principal axis factor extraction and promax rotation of factors yielded a five factor structure consistent with the content-validated subscales of the original instrument. Factors were identified based on inspection of pattern and structure coefficients. The rotated factor pattern, inter-factor correlations, convergent validity coefficients, and Cronbach's alpha reliability estimates supported the hypothesized construct properties. To further examine unidimensionality and efficacy of the rating scale structures, item-level data from each factor-defined subscale were subjected to analysis with the Rasch rating scale model. Data-to-model fit statistics and separation reliability for items and persons met acceptable criteria. Rating scale results suggested consistency of expected and observed step difficulties in rating categories, and correspondence of step calibrations with increases in the underlying variables. The combined approach yielded more comprehensive diagnostic information on the quality of the five SRR-LQ subscales; further research is continuing.
Scale invariance and longitudinal stability of the Physical Functioning Western Ontario and MacMaster Universities Osteoarthritis Index using the Rasch model.

PubMed

Ayala, Alba; Bilbao, Amaia; Garcia-Perez, Sonia; Escobar, Antonio; Forjaz, Maria João

2018-03-01

The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) measures the quality of life of patients with osteoarthritis (OA), and there is a specific scale for the physical functioning dimension, the short version with seven items WOMAC-pf. This study describes the application of the Rasch model to explore scale invariance and response stability of the WOMAC-pf short version across affected joint and over time. A sample of 884 patients with OA, from 15 hospitals in Spain, completed the WOMAC-pf before surgery (baseline) and at 3, 6 and 12 months post-surgery of hip or knee. The invariance by joint was explored through the differential item functioning (DIF) analysis of the Rasch model using baseline data, and time stability (DIF by time) were evaluated in stack data (each participant is represented four times, one by time point). Mean age of the patients was of 69.13 years (SD 10.01), 59.3% of them were women (n = 524), 59.2% had knee OA (n = 523) and 40.8% hip OA (n = 361). Item "putting on socks" showed DIF by joint and time. Fit to the Rasch model using stack data improved when this item was removed. Good reliability for individual use, local independency and unidimensionality of the models were confirmed. WOMAC-pf 7-item short version was invariant over time and joint when item "putting on socks" was removed. Researchers should carefully evaluate this item as it presents problems in scale invariance and stability, which could affect results when comparing data by joint or when computing change scores.
Construct validity of the Heart Failure Screening Tool (Heart-FaST) to identify heart failure patients at risk of poor self-care: Rasch analysis.

PubMed

Reynolds, Nicholas A; Ski, Chantal F; McEvedy, Samantha M; Thompson, David R; Cameron, Jan

2018-02-14

The aim of this study was to psychometrically evaluate the Heart Failure Screening Tool (Heart-FaST) via: (1) examination of internal construct validity; (2) testing of scale function in accordance with design; and (3) recommendation for change/s, if items are not well adjusted, to improve psychometric credential. Self-care is vital to the management of heart failure. The Heart-FaST may provide a prospective assessment of risk, regarding the likelihood that patients with heart failure will engage in self-care. Psychometric validation of the Heart-FaST using Rasch analysis. The Heart-FaST was administered to 135 patients (median age = 68, IQR = 59-78 years; 105 males) enrolled in a multidisciplinary heart failure management program. The Heart-FaST is a nurse-administered tool for screening patients with HF at risk of poor self-care. A Rasch analysis of responses was conducted which tested data against Rasch model expectations, including whether items serve as unbiased, non-redundant indicators of risk and measure a single construct and that rating scales operate as intended. The results showed that data met Rasch model expectations after rescoring or deleting items due to poor discrimination, disordered thresholds, differential item functioning, or response dependence. There was no evidence of multidimensionality which supports the use of total scores from Heart-FaST as indicators of risk. Aggregate scores from this modified screening tool rank heart failure patients according to their "risk of poor self-care" demonstrating that the Heart-FaST items constitute a meaningful scale to identify heart failure patients at risk of poor engagement in heart failure self-care. © 2018 John Wiley & Sons Ltd.
Validation of Catquest-9SF-A Visual Disability Instrument to Evaluate Patient Function After Corneal Transplantation.

PubMed

Claesson, Margareta; Armitage, W John; Byström, Berit; Montan, Per; Samolov, Branka; Stenvi, Ulf; Lundström, Mats

2017-09-01

Catquest-9SF is a 9-item visual disability questionnaire developed for evaluating patient-reported outcome measures after cataract surgery. The aim of this study was to use Rasch analysis to determine the responsiveness of Catquest-9SF for corneal transplant patients. Patients who underwent corneal transplantation primarily to improve vision were included. One group (n = 199) completed the Catquest-9SF questionnaire before corneal transplantation and a second independent group (n = 199) completed the questionnaire 2 years after surgery. All patients were recorded in the Swedish Cornea Registry, which provided clinical and demographic data for the study. Winsteps software v.3.91.0 (Winsteps.com, Beaverton, OR) was used to assess the fit of the Catquest-9SF data to the Rasch model. Rasch analysis showed that Catquest-9SF applied to corneal transplant patients was unidimensional (infit range, 0.73-1.32; outfit range, 0.81-1.35), and therefore, measured a single underlying construct (visual disability). The Rasch model explained 68.5% of raw variance. The response categories of the 9-item questionnaire were ordered, and the category thresholds were well defined. Item difficulty matched the level of patients' ability (0.36 logit difference between the means). Precision in terms of person separation (3.09) and person reliability (0.91) was good. Differential item functioning was notable for only 1 item (satisfaction with vision), which had a differential item functioning contrast of 1.08 logit. Rasch analysis showed that Catquest-9SF is a valid instrument for measuring visual disability in patients who have undergone corneal transplantation primarily to improve vision.
Investigating the application of Rasch theory in measuring change in middle school student performance in physical science

NASA Astrophysics Data System (ADS)

Cunningham, Jessica D.

Newton's Universe (NU), an innovative teacher training program, strives to obtain measures from rural, middle school science teachers and their students to determine the impact of its distance learning course on understanding of temperature. No consensus exists on the most appropriate and useful method of analysis to measure change in psychological constructs over time. Several item response theory (IRT) models have been deemed useful in measuring change, which makes the choice of an IRT model not obvious. The appropriateness and utility of each model, including a comparison to a traditional analysis of variance approach, was investigated using middle school science student performance on an assessment over an instructional period. Predetermined criteria were outlined to guide model selection based on several factors including research questions, data properties, and meaningful interpretations to determine the most appropriate model for this study. All methods employed in this study reiterated one common interpretation of the data -- specifically, that the students of teachers with any NU course experience had significantly greater gains in performance over the instructional period. However, clear distinctions were made between an analysis of variance and the racked and stacked analysis using the Rasch model. Although limited research exists examining the usefulness of the Rasch model in measuring change in understanding over time, this study applied these methods and detailed plausible implications for data-driven decisions based upon results for NU and others. Being mindful of the advantages and usefulness of each method of analysis may help others make informed decisions about choosing an appropriate model to depict changes to evaluate other programs. Results may encourage other researchers to consider the meaningfulness of using IRT for this purpose. Results have implications for data-driven decisions for future professional development courses, in science education and other disciplines. KEYWORDS: Item Response Theory, Rasch Model, Racking and Stacking, Measuring Change in Student Performance, Newton's Universe teacher training

Rasch Analysis for Instrument Development: Why, When, and How?

ERIC Educational Resources Information Center

Boone, William J.

2016-01-01

This essay describes Rasch analysis psychometric techniques and how such techniques can be used by life sciences education researchers to guide the development and use of surveys and tests. Specifically, Rasch techniques can be used to document and evaluate the measurement functioning of such instruments. Rasch techniques also allow researchers to…
Assessing the efficacy of the Measure of Understanding of Macroevolution as a valid tool for undergraduate non-science majors

NASA Astrophysics Data System (ADS)

Romine, William Lee; Walter, Emily Marie

2014-11-01

Efficacy of the Measure of Understanding of Macroevolution (MUM) as a measurement tool has been a point of contention among scholars needing a valid measure for knowledge of macroevolution. We explored the structure and construct validity of the MUM using Rasch methodologies in the context of a general education biology course designed with an emphasis on macroevolution content. The Rasch model was utilized to quantify item- and test-level characteristics, including dimensionality, reliability, and fit with the Rasch model. Contrary to previous work, we found that the MUM provides a valid, reliable, and unidimensional scale for measuring knowledge of macroevolution in introductory non-science majors, and that its psychometric behavior does not exhibit large changes across time. While we found that all items provide productive measurement information, several depart substantially from ideal behavior, warranting a collective effort to improve these items. Suggestions for improving the measurement characteristics of the MUM at the item and test levels are put forward and discussed.
Validation of the knowledge, attitude and perceived practice of asthma instrument among community pharmacists using Rasch analysis.

PubMed

Akram, Waqas; Hussein, Maryam S E; Ahmad, Sohail; Mamat, Mohd N; Ismail, Nahlah E

2015-10-01

There is no instrument which collectively assesses the knowledge, attitude and perceived practice of asthma among community pharmacists. Therefore, this study aimed to validate the instrument which measured the knowledge, attitude and perceived practice of asthma among community pharmacists by producing empirical evidence of validity and reliability of the items using Rasch model (Bond & Fox software®) for dichotomous and polytomous data. This baseline study recruited 33 community pharmacists from Penang, Malaysia. The results showed that all PTMEA Corr were in positive values, where an item was able to distinguish between the ability of respondents. Based on the MNSQ infit and outfit range (0.60-1.40), out of 55 items, 2 items from the instrument were suggested to be removed. The findings indicated that the instrument fitted with Rasch measurement model and showed the acceptable reliability values of 0.88 and 0.83 and 0.79 for knowledge, attitude and perceived practice respectively.
Rasch Analysis of the Adult Strabismus Quality of Life Questionnaire (AS-20) among Chinese Adult Patients with Strabismus.

PubMed

Wang, Zonghua; Zhou, Juan; Luo, Xingli; Xu, Yan; She, Xi; Chen, Ling; Yin, Honghua; Wang, Xianyuan

2015-01-01

The impact of strabismus on visual function, self-image, self-esteem, and social interactions decrease health-related quality of life (HRQoL).The purpose of this study was to evaluate and refine the adult strabismus quality of life questionnaire (AS-20) by using Rasch analysis among Chinese adult patients with strabismus. We evaluated the fitness of the AS-20 with Rasch model in Chinese population by assessing unidimensionality, infit and outfit, person and item separation index and reliability, response ordering, targeting and differential item functioning (DIF). The overall AS-20 did not demonstrate unidimensional; however, it was achieved separately in the two Rasch-revised subscales: the psychosocial subscale (11 items) and the function subscale (9 items). The features of good targeting, optimal item infit and outfit, and no notable local dependence were found for each of the subscales. The rating scale was appropriate for the psychosocial subscale but a reduction to four response categories was required for the function subscale. No significant DIF were revealed for any demographic and clinical factors (e.g., age, gender, and strabismus types). The AS-20 was demonstrated by Rasch analysis to be a rigorous instrument for measuring health-related quality of life in Chinese strabismus patents if some revisions were made regarding the subscale construct and response options.
Validation of instruments to measure students' mathematical knowledge

NASA Astrophysics Data System (ADS)

Khatimin, Nuraini; Zaharim, Azami; Aziz, Azrilah Abd

2015-02-01

This paper describes instruments' validation process to identify the suitability and accuracy of the final examination questions for engineering mathematics. As a compulsory subject for second year students from 4 departments in Faculty of Engineering and Built Environment Universiti Kebangsaan Malaysia, the Differential Equations 1 course (KKKQ2124) was considered in this study. The data used in this study consists of the raw marks for final examination of semester 2, 2012/2013 session. The data then will be run and analyzed using the Rasch measurement model. Rasch model can also examine the ability of students and redundancy of instrument constructs.
Practice and Problems in Language Testing 5. Non-Classical Test Theory; Final Examinations in Secondary Schools. Papers Presented at the International Language Testing Symposium (5th, Arnhem, Netherlands, March 25-26, 1982).

ERIC Educational Resources Information Center

van Weeren, J., Ed.

Presented in this symposium reader are nine papers, four of which deal with the theory and impact of the Rasch model on language testing and five of which discuss final examinations in secondary schools in both general and specific terms. The papers are: "Introduction to Rasch Measurement: Some Implications for Language Testing" (J. J.…
Lookup Tables Versus Stacked Rasch Analysis in Comparing Pre- and Postintervention Adult Strabismus-20 Data.

PubMed

Leske, David A; Hatt, Sarah R; Liebermann, Laura; Holmes, Jonathan M

2016-02-01

We compare two methods of analysis for Rasch scoring pre- to postintervention data: Rasch lookup table versus de novo stacked Rasch analysis using the Adult Strabismus-20 (AS-20). One hundred forty-seven subjects completed the AS-20 questionnaire prior to surgery and 6 weeks postoperatively. Subjects were classified 6 weeks postoperatively as "success," "partial success," or "failure" based on angle and diplopia status. Postoperative change in AS-20 scores was compared for all four AS-20 domains (self-perception, interactions, reading function, and general function) overall and by success status using two methods: (1) applying historical Rasch threshold measures from lookup tables and (2) performing a stacked de novo Rasch analysis. Change was assessed by analyzing effect size, improvement exceeding 95% limits of agreement (LOA), and score distributions. Effect sizes were similar for all AS-20 domains whether obtained from lookup tables or stacked analysis. Similar proportions exceeded 95% LOAs using lookup tables versus stacked analysis. Improvement in median score was observed for all AS-20 domains using lookup tables and stacked analysis ( P < 0.0001 for all comparisons). The Rasch-scored AS-20 is a responsive and valid instrument designed to measure strabismus-specific health-related quality of life. When analyzing pre- to postoperative change in AS-20 scores, Rasch lookup tables and de novo stacked Rasch analysis yield essentially the same results. We describe a practical application of lookup tables, allowing the clinician or researcher to score the Rasch-calibrated AS-20 questionnaire without specialized software.
Lookup Tables Versus Stacked Rasch Analysis in Comparing Pre- and Postintervention Adult Strabismus-20 Data

PubMed Central

Leske, David A.; Hatt, Sarah R.; Liebermann, Laura; Holmes, Jonathan M.

2016-01-01

Purpose We compare two methods of analysis for Rasch scoring pre- to postintervention data: Rasch lookup table versus de novo stacked Rasch analysis using the Adult Strabismus-20 (AS-20). Methods One hundred forty-seven subjects completed the AS-20 questionnaire prior to surgery and 6 weeks postoperatively. Subjects were classified 6 weeks postoperatively as “success,” “partial success,” or “failure” based on angle and diplopia status. Postoperative change in AS-20 scores was compared for all four AS-20 domains (self-perception, interactions, reading function, and general function) overall and by success status using two methods: (1) applying historical Rasch threshold measures from lookup tables and (2) performing a stacked de novo Rasch analysis. Change was assessed by analyzing effect size, improvement exceeding 95% limits of agreement (LOA), and score distributions. Results Effect sizes were similar for all AS-20 domains whether obtained from lookup tables or stacked analysis. Similar proportions exceeded 95% LOAs using lookup tables versus stacked analysis. Improvement in median score was observed for all AS-20 domains using lookup tables and stacked analysis (P < 0.0001 for all comparisons). Conclusions The Rasch-scored AS-20 is a responsive and valid instrument designed to measure strabismus-specific health-related quality of life. When analyzing pre- to postoperative change in AS-20 scores, Rasch lookup tables and de novo stacked Rasch analysis yield essentially the same results. Translational Relevance We describe a practical application of lookup tables, allowing the clinician or researcher to score the Rasch-calibrated AS-20 questionnaire without specialized software. PMID:26933524
Patient self-report section of the ASES questionnaire: a Spanish validation study using classical test theory and the Rasch model.

PubMed

Vrotsou, Kalliopi; Cuéllar, Ricardo; Silió, Félix; Rodriguez, Miguel Ángel; Garay, Daniel; Busto, Gorka; Trancho, Ziortza; Escobar, Antonio

2016-10-18

The aim of the current study was to validate the self-report section of the American Shoulder and Elbow Surgeons questionnaire (ASES-p) into Spanish. Shoulder pathology patients were recruited and followed up to 6 months post treatment. The ASES-p, Constant, SF-36 and Barthel scales were filled-in pre and post treatment. Reliability was tested with Cronbach's alpha, convergent validity with Spearman's correlations coefficients. Confirmatory factor analysis (CFA) and the Rasch model were implemented for assessing structural validity and unidimensionality of the scale. Models with and without the pain item were considered. Responsiveness to change was explored via standardised effect sizes. Results were acceptable for both tested models. Cronbach's alpha was 0.91, total scale correlations with Constant and physical SF-36 dimensions were >0.50. Factor loadings for CFA were >0.40. The Rasch model confirmed unidimensionality of the scale, even though item 10 "do usual sport" was suggested as non-informative. Finally, patients with improved post treatment shoulder function and those receiving surgery had higher standardised effect sizes. The adapted Spanish ASES-p version is a valid and reliable tool for shoulder evaluation and its unidimensionality is supported by the data.
Psychometric Properties of the Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite Scale.

PubMed

Barnett, Carolina; Merkies, Ingemar S J; Katzberg, Hans; Bril, Vera

2015-09-02

The Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite are two commonly used outcome measures in Myasthenia Gravis. So far, their measurement properties have not been compared, so we aimed to study their psychometric properties using the Rasch model. 251 patients with stable myasthenia gravis were assessed with both scales, and 211 patients returned for a second assessment. We studied fit to the Rasch model at the first visit, and compared item fit, thresholds, differential item functioning, local dependence, person separation index, and tests for unidimensionality. We also assessed test-retest reliability and estimated the Minimal Detectable Change. Neither scale fit the Rasch model (X2p < 0.05). The Myasthenia Gravis Composite had lower discrimination properties than the Quantitative Myasthenia Gravis Scale (Person Separation Index: 0.14 and 0.7). There was local dependence in both scales, as well as differential item functioning for ocular and generalized disease. Disordered thresholds were found in 6(60%) items of the Myasthenia Gravis Composite and in 4(31%) of the Quantitative Myasthenia Gravis Score. Both tools had adequate test-retest reliability (ICCs >0.8). The minimally detectable change was 4.9 points for the Myasthenia Gravis Composite and 4.3 points for the Quantitative Myasthenia Gravis Score. Neither scale fulfilled Rasch model expectations. The Quantitative Myasthenia Gravis Score has higher discrimination than the Myasthenia Gravis Composite. Both tools have items with disordered thresholds, differential item functioning and local dependency. There was evidence of multidimensionality in the QMGS. The minimal detectable change values are higher than previous studies on the minimal significant change. These findings might inform future modifications of these tools.
Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

PubMed

Cecilio-Fernandes, Dario; Medema, Harro; Collares, Carlos Fernando; Schuwirth, Lambert; Cohen-Schotanus, Janke; Tio, René A

2017-11-09

Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods. A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis. The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition. The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.
Reliability and responsiveness of measures of pain in people with osteoarthritis of the knee: a psychometric evaluation

PubMed Central

Turner, Katie V.; Moreton, Bryan M.; Walsh, David A.; Lincoln, Nadina B.

2017-01-01

Abstract Purpose: To examine the fit between data from the Short Form McGill Pain Questionnaire (SF-MPQ-2) and the Rasch model, and to explore the reliability and internal responsiveness of measures of pain in people with knee osteoarthritis. Methods: Participants with knee osteoarthritis completed the SF-MPQ-2, Intermittent and Constant Osteoarthritis Pain questionnaire (ICOAP) and painDETECT. Participants were sent the same questionnaires 3 and 6 months later. Results: Fit to the Rasch model was not achieved for the SF-MPQ-2 Total scale. The Continuous subscale yielded adequate fit statistics after splitting item 10 on uniform DIF for gender, and removing item 9. The Intermittent subscale fit the Rasch model after rescoring items. The Neuropathic subscale had relatively good fit to the model. Test–retest reliability was satisfactory for most scales using both original and Rasch scoring ranging from fair to substantial. Effect sizes ranged from 0.13 to 1.79 indicating good internal responsiveness for most scales. Conclusions: These findings support the use of ICOAP subscales as reliable and responsive measure of pain in people with knee osteoarthritis. The MPQ-SF-2 subscales found to be acceptable alternatives. Implications for RehabilitationThe McGill Pain Questionnaire short version 2 is not a unidimensional scale in people with knee osteoarthritis, whereas three of the subscales are unidimensional.The McGill Pain Questionnaire short version 2 Affective subscale does not have good measurement properties for people with knee osteoarthritis.The McGill Pain Questionnaire short version 2 and the Intermittent and Constant Osteoarthritis Pain scales can be used to assess change over time.The painDETECT performs better as a screening measure than as an outcome measure. PMID:27027698
Measuring practical knowledge about balanced meals: development and validation of the brief PKB-7 scale.

PubMed

Mötteli, S; Barbey, J; Keller, C; Bucher, T; Siegrist, M

2016-04-01

As a high-quality diet is associated with a lower risk for several diseases and all-cause mortality, current nutrition education tools provide people with information regarding how to build a healthy and a balanced meal. To assess this basic nutrition knowledge, the research aim was to develop and validate a brief scale to measure the Practical Knowledge about Balanced meals (PKB-7). A pool of 25 items was pretested with experts and laypeople before being tested on a random sample in Switzerland (n=517). For item selection, a Rasch model analysis was applied. The validity and reliability of the new scale were assessed by three additional studies including laypeople (n=597; n=145) and nutrition experts (n=59). The final scale consists of seven multiple-choice items, which met the assumptions of the Rasch model. The validity of the new scale was shown by several aspects: the Rasch model was replicated in a second study, and nutrition experts achieved significantly higher scores than laypeople (t(148)=20.27, P<0.001, d=1.78). In addition, the PKB-7 scale was correlated with other nutrition-related constructs and associated with reported vegetable consumption. Test-retest reliability (r=0.68, P<0.001) was acceptable. The PKB-7 scale is a reliable and a valid Rasch-based instrument in Swiss citizens aged between 18 and 80 years for measuring the practical knowledge about balanced meals based on current dietary guidelines. This brief and easy-to-use scale is intended for application in both research and practice.
Identifying Core Competencies of Infection Control Nurse Specialists in Hong Kong.

PubMed

Chan, Wai Fong; Bond, Trevor G; Adamson, Bob; Chow, Meyrick

2016-01-01

To confirm a core competency scale for Hong Kong infection control nurses at the advanced nursing practice level from the core competency items proposed in a previous phase of this study. This would serve as the foundation of competency assurance in Hong Kong hospitals. A cross-sectional survey design was used. All public and private hospitals in Hong Kong. All infection control nurses in hospitals of Hong Kong. The 83-item proposed core competency list established in an earlier study was transformed into a questionnaire and sent to 112 infection control nurses in 48 hospitals in Hong Kong. They were asked to rate the importance of each infection prevention and control item using Likert-style response categories. Data were analyzed using the Rasch model. The response rate of 81.25% was achieved. Seven items were removed from the proposed core competency list, leaving a scale of 76 items that fit the measurement requirements of the unidimensional Rasch model. Essential core competency items of advanced practice for infection control nurses in Hong Kong were identified based on the measurement criteria of the Rasch model. Several items of the scale that reflect local Hong Kong contextual characteristics are distinguished from the overseas standards. This local-specific competency list could serve as the foundation for education and for certification of infection control nurse specialists in Hong Kong. Rasch measurement is an appropriate analytical tool for identifying core competencies of advanced practice nurses in other specialties and in other locations in a manner that incorporates practitioner judgment and expertise.
Development of a Microsoft Excel tool for one-parameter Rasch model of continuous items: an application to a safety attitude survey.

PubMed

Chien, Tsair-Wei; Shao, Yang; Kuo, Shu-Chun

2017-01-10

Many continuous item responses (CIRs) are encountered in healthcare settings, but no one uses item response theory's (IRT) probabilistic modeling to present graphical presentations for interpreting CIR results. A computer module that is programmed to deal with CIRs is required. To present a computer module, validate it, and verify its usefulness in dealing with CIR data, and then to apply the model to real healthcare data in order to show how the CIR that can be applied to healthcare settings with an example regarding a safety attitude survey. Using Microsoft Excel VBA (Visual Basic for Applications), we designed a computer module that minimizes the residuals and calculates model's expected scores according to person responses across items. Rasch models based on a Wright map and on KIDMAP were demonstrated to interpret results of the safety attitude survey. The author-made CIR module yielded OUTFIT mean square (MNSQ) and person measures equivalent to those yielded by professional Rasch Winsteps software. The probabilistic modeling of the CIR module provides messages that are much more valuable to users and show the CIR advantage over classic test theory. Because of advances in computer technology, healthcare users who are familiar to MS Excel can easily apply the study CIR module to deal with continuous variables to benefit comparisons of data with a logistic distribution and model fit statistics.
Dimensionality and predictive validity of the HAM-Nat, a test of natural sciences for medical school admission

PubMed Central

2011-01-01

Background Knowledge in natural sciences generally predicts study performance in the first two years of the medical curriculum. In order to reduce delay and dropout in the preclinical years, Hamburg Medical School decided to develop a natural science test (HAM-Nat) for student selection. In the present study, two different approaches to scale construction are presented: a unidimensional scale and a scale composed of three subject specific dimensions. Their psychometric properties and relations to academic success are compared. Methods 334 first year medical students of the 2006 cohort responded to 52 multiple choice items from biology, physics, and chemistry. For the construction of scales we generated two random subsamples, one for development and one for validation. In the development sample, unidimensional item sets were extracted from the item pool by means of weighted least squares (WLS) factor analysis, and subsequently fitted to the Rasch model. In the validation sample, the scales were subjected to confirmatory factor analysis and, again, Rasch modelling. The outcome measure was academic success after two years. Results Although the correlational structure within the item set is weak, a unidimensional scale could be fitted to the Rasch model. However, psychometric properties of this scale deteriorated in the validation sample. A model with three highly correlated subject specific factors performed better. All summary scales predicted academic success with an odds ratio of about 2.0. Prediction was independent of high school grades and there was a slight tendency for prediction to be better in females than in males. Conclusions A model separating biology, physics, and chemistry into different Rasch scales seems to be more suitable for item bank development than a unidimensional model, even when these scales are highly correlated and enter into a global score. When such a combination scale is used to select the upper quartile of applicants, the proportion of successful completion of the curriculum after two years is expected to rise substantially. PMID:21999767
Evaluation properties of the French version of the OUT-PATSAT35 satisfaction with care questionnaire according to classical and item response theory analyses.

PubMed

Panouillères, M; Anota, A; Nguyen, T V; Brédart, A; Bosset, J F; Monnier, A; Mercier, M; Hardouin, J B

2014-09-01

The present study investigates the properties of the French version of the OUT-PATSAT35 questionnaire, which evaluates the outpatients' satisfaction with care in oncology using classical analysis (CTT) and item response theory (IRT). This cross-sectional multicenter study includes 692 patients who completed the questionnaire at the end of their ambulatory treatment. CTT analyses tested the main psychometric properties (convergent and divergent validity, and internal consistency). IRT analyses were conducted separately for each OUT-PATSAT35 domain (the doctors, the nurses or the radiation therapists and the services/organization) by models from the Rasch family. We examined the fit of the data to the model expectations and tested whether the model assumptions of unidimensionality, monotonicity and local independence were respected. A total of 605 (87.4%) respondents were analyzed with a mean age of 64 years (range 29-88). Internal consistency for all scales separately and for the three main domains was good (Cronbach's α 0.74-0.98). IRT analyses were performed with the partial credit model. No disordered thresholds of polytomous items were found. Each domain showed high reliability but fitted poorly to the Rasch models. Three items in particular, the item about "promptness" in the doctors' domain and the items about "accessibility" and "environment" in the services/organization domain, presented the highest default of fit. A correct fit of the Rasch model can be obtained by dropping these items. Most of the local dependence concerned items about "information provided" in each domain. A major deviation of unidimensionality was found in the nurses' domain. CTT showed good psychometric properties of the OUT-PATSAT35. However, the Rasch analysis revealed some misfitting and redundant items. Taking the above problems into consideration, it could be interesting to refine the questionnaire in a future study.
Dimensionality and predictive validity of the HAM-Nat, a test of natural sciences for medical school admission.

PubMed

Hissbach, Johanna C; Klusmann, Dietrich; Hampe, Wolfgang

2011-10-14

Knowledge in natural sciences generally predicts study performance in the first two years of the medical curriculum. In order to reduce delay and dropout in the preclinical years, Hamburg Medical School decided to develop a natural science test (HAM-Nat) for student selection. In the present study, two different approaches to scale construction are presented: a unidimensional scale and a scale composed of three subject specific dimensions. Their psychometric properties and relations to academic success are compared. 334 first year medical students of the 2006 cohort responded to 52 multiple choice items from biology, physics, and chemistry. For the construction of scales we generated two random subsamples, one for development and one for validation. In the development sample, unidimensional item sets were extracted from the item pool by means of weighted least squares (WLS) factor analysis, and subsequently fitted to the Rasch model. In the validation sample, the scales were subjected to confirmatory factor analysis and, again, Rasch modelling. The outcome measure was academic success after two years. Although the correlational structure within the item set is weak, a unidimensional scale could be fitted to the Rasch model. However, psychometric properties of this scale deteriorated in the validation sample. A model with three highly correlated subject specific factors performed better. All summary scales predicted academic success with an odds ratio of about 2.0. Prediction was independent of high school grades and there was a slight tendency for prediction to be better in females than in males. A model separating biology, physics, and chemistry into different Rasch scales seems to be more suitable for item bank development than a unidimensional model, even when these scales are highly correlated and enter into a global score. When such a combination scale is used to select the upper quartile of applicants, the proportion of successful completion of the curriculum after two years is expected to rise substantially.
Controlling Guessing Bias in the Dichotomous Rasch Model Applied to a Large-Scale, Vertically Scaled Testing Program

PubMed Central

Andrich, David; Marais, Ida; Humphry, Stephen Mark

2015-01-01

Recent research has shown how the statistical bias in Rasch model difficulty estimates induced by guessing in multiple-choice items can be eliminated. Using vertical scaling of a high-profile national reading test, it is shown that the dominant effect of removing such bias is a nonlinear change in the unit of scale across the continuum. The consequence is that the proficiencies of the more proficient students are increased relative to those of the less proficient. Not controlling the guessing bias underestimates the progress of students across 7 years of schooling with important educational implications. PMID:29795871
Rasch fit statistics and sample size considerations for polytomous data.

PubMed

Smith, Adam B; Rush, Robert; Fallowfield, Lesley J; Velikova, Galina; Sharpe, Michael

2008-05-29

Previous research on educational data has demonstrated that Rasch fit statistics (mean squares and t-statistics) are highly susceptible to sample size variation for dichotomously scored rating data, although little is known about this relationship for polytomous data. These statistics help inform researchers about how well items fit to a unidimensional latent trait, and are an important adjunct to modern psychometrics. Given the increasing use of Rasch models in health research the purpose of this study was therefore to explore the relationship between fit statistics and sample size for polytomous data. Data were collated from a heterogeneous sample of cancer patients (n = 4072) who had completed both the Patient Health Questionnaire - 9 and the Hospital Anxiety and Depression Scale. Ten samples were drawn with replacement for each of eight sample sizes (n = 25 to n = 3200). The Rating and Partial Credit Models were applied and the mean square and t-fit statistics (infit/outfit) derived for each model. The results demonstrated that t-statistics were highly sensitive to sample size, whereas mean square statistics remained relatively stable for polytomous data. It was concluded that mean square statistics were relatively independent of sample size for polytomous data and that misfit to the model could be identified using published recommended ranges.

Rasch fit statistics and sample size considerations for polytomous data

PubMed Central

Smith, Adam B; Rush, Robert; Fallowfield, Lesley J; Velikova, Galina; Sharpe, Michael

2008-01-01

Background Previous research on educational data has demonstrated that Rasch fit statistics (mean squares and t-statistics) are highly susceptible to sample size variation for dichotomously scored rating data, although little is known about this relationship for polytomous data. These statistics help inform researchers about how well items fit to a unidimensional latent trait, and are an important adjunct to modern psychometrics. Given the increasing use of Rasch models in health research the purpose of this study was therefore to explore the relationship between fit statistics and sample size for polytomous data. Methods Data were collated from a heterogeneous sample of cancer patients (n = 4072) who had completed both the Patient Health Questionnaire – 9 and the Hospital Anxiety and Depression Scale. Ten samples were drawn with replacement for each of eight sample sizes (n = 25 to n = 3200). The Rating and Partial Credit Models were applied and the mean square and t-fit statistics (infit/outfit) derived for each model. Results The results demonstrated that t-statistics were highly sensitive to sample size, whereas mean square statistics remained relatively stable for polytomous data. Conclusion It was concluded that mean square statistics were relatively independent of sample size for polytomous data and that misfit to the model could be identified using published recommended ranges. PMID:18510722
An introduction to the partial credit model for developing nursing assessments.

PubMed

Fox, C

1999-11-01

The partial credit model, which is a special case of the Rasch measurement model, was presented as a useful way to develop and refine complex nursing assessments. The advantages of the Rasch model over the classical psychometric model were presented including the lack of bias in the measurement process, the ability to highlight those items in need of refinement, the provision of information on congruence between the data and the model, and feedback on the usefulness of the response categories. The partial credit model was introduced as a way to develop complex nursing assessments such as performance-based assessments, because of the model's ability to accommodate a variety of scoring procedures. Finally, an application of the partial credit model was illustrated using the Practical Knowledge Inventory for Nurses, a paper-and-pencil instrument that measures on-the-job decision-making for nurses.
Measurement properties of the CLOX Executive Clock Drawing Task in an inpatient stroke rehabilitation setting.

PubMed

Zuverza-Chavarria, Virginia; Tsanadis, John

2011-05-01

The goal of this study was to explore the psychometric properties of the CLOX Executive Clock Drawing Task (Royall, Cordes, & Polk, 1998) in persons who had sustained a stroke and were receiving inpatient rehabilitation. Rasch modeling was utilized to examine the psychometric properties of the CLOX. Separate analyses were conducted for the free draw (CLOX 1) and copy (CLOX 2) portions of the measure to investigate each presentation mode independently. The sample consisted of 66 inpatient adults who had sustained a stroke. CLOX 1 met most Rasch model expectations for item fit, unidimensionality, test reliability, and sample targeting. CLOX 2 was less psychometrically sound and contained two items with significant misfit. CLOX 2 demonstrated a significant ceiling effect that resulted in poor sample targeting. CLOX 1 is a psychometrically sound screening instrument for assessing persons with stroke receiving inpatient rehabilitation. In addition to the psychometric weaknesses of CLOX 2, its interpretive yield is minimal and clinicians may consider omitting it. Recommendations are made for using the Rasch item-person maps in clinical practice.
Improving Measurement of Trait Competitiveness: A Rasch Analysis of the Revised Competitiveness Index With Samples From New Zealand and US University Students.

PubMed

Krägeloh, Christian U; Medvedev, Oleg N; Hill, Erin M; Webster, Craig S; Booth, Roger J; Henning, Marcus A

2018-01-01

Measuring competitiveness is necessary to fully understand variables affecting student learning. The 14-item Revised Competitiveness Index has become a widely used measure to assess trait competitiveness. The current study reports on a Rasch analysis to investigate the psychometric properties of the Revised Competitiveness Index and to improve its precision for international comparisons. Students were recruited from medical studies at a university in New Zealand, undergraduate health sciences courses at another New Zealand university, and a psychology undergraduate class at a university in the United States. Rasch model estimate parameters were affected by local dependency and item misfit. Best fit to the Rasch model (χ 2 (20) = 15.86, p = .73, person separation index = .95) was obtained for the Enjoyment of Competition subscale after combining locally dependent items into a subtest and discarding the highly misfitting Item 9. The only modifications required to obtain a suitable fit (χ 2 (25) = 25.81, p = .42, person separation index = .77) for the Contentiousness subscale were a subtest to combine two locally dependent items and splitting this subtest by country to deal with differential item functioning. The results support reliability and internal construct validity of the modified Revised Competitiveness Index. Precision of the measure may be enhanced using the ordinal-to-interval conversion algorithms presented here, allowing the use of parametric statistics without breaking fundamental statistical assumptions.
Development of Rasch-based item banks for the assessment of work performance in patients with musculoskeletal diseases.

PubMed

Mueller, Evelyn A; Bengel, Juergen; Wirtz, Markus A

2013-12-01

This study aimed to develop a self-description assessment instrument to measure work performance in patients with musculoskeletal diseases. In terms of the International Classification of Functioning, Disability and Health (ICF), work performance is defined as the degree of meeting the work demands (activities) at the actual workplace (environment). To account for the fact that work performance depends on the work demands of the job, we strived to develop item banks that allow a flexible use of item subgroups depending on the specific work demands of the patients' jobs. Item development included the collection of work tasks from literature and content validation through expert surveys and patient interviews. The resulting 122 items were answered by 621 patients with musculoskeletal diseases. Exploratory factor analysis to ascertain dimensionality and Rasch analysis (partial credit model) for each of the resulting dimensions were performed. Exploratory factor analysis resulted in four dimensions, and subsequent Rasch analysis led to the following item banks: 'impaired productivity' (15 items), 'impaired cognitive performance' (18), 'impaired coping with stress' (13) and 'impaired physical performance' (low physical workload 20 items, high physical workload 10 items). The item banks exhibited person separation indices (reliability) between 0.89 and 0.96. The assessment of work performance adds the activities component to the more commonly employed participation component of the ICF-model. The four item banks can be adapted to specific jobs where necessary without losing comparability of person measures, as the item banks are based on Rasch analysis.
Quality of life for post-polio syndrome: a patient derived, Rasch standard scale.

PubMed

Young, Carolyn A; Quincey, Anne-Marie C; Wong, Samantha M; Tennant, Alan

2018-03-01

To design a disease-specific quality of life (QoL) questionnaire for people with post-polio syndrome (PPS). Qualitative interviews were conducted with 45 people with PPS to identify themes and derive potential items reflecting impact upon QoL. After cognitive debriefing, these were made into a questionnaire pack along with comparative questionnaires and posted to 319 patients. The 271 (85%) returned questionnaires were subjected to exploratory factor analysis (EFA) and Rasch analysis. A 25 item scale, the post-polio quality of life scale (PP-QoL), showed good fit to the Rasch model (conditional chi-square p = 0.156), unidimensionality (% t-tests 2.0: CI 0.7-3.8), and Cronbach's alpha of 0.87. With the latent estimate transformed to a 0-100 scale, the mean score was 56.9 (SD 18.5) with only 3.3% of respondents at the floor or ceiling of the scale. Test-retest reliability showed an intraclass correlation coefficient (ICC) (2.1) of 0.916, and correlation of 0.85. The disease-specific PP-QoL demonstrated excellent reliability, appropriate concurrent validity, and satisfied the standards of the Rasch model. It enables examination of the impact of health status upon perceived QoL, and the impact of rehabilitation interventions. The scale is freely available for academic or not-for-profit users to improve research in this neglected, disabling condition. Implications for Rehabilitation In post-polio syndrome (PPS), existing work examines aspects of health-related quality of life (HRQoL), such as activity limitations. A disease-specific QoL measure would enable researchers to model the impact of health status, such as fatigue or mobility restrictions, upon QoL in PPS. The post-polio quality of life scale (PP-QoL) is based on the patients' lived experience, meets Rasch standards and is free for use for academic and not-for-profit researchers. The raw score is reliable for individual use in clinical settings, and interval scale transformation is available for parametric applications and the calculation of change scores.
Effects of demographic and health variables on Rasch scaled cognitive scores.

PubMed

Zelinski, Elizabeth M; Gilewski, Michael J

2003-08-01

To determine whether demographic and health variables interact to predict cognitive scores in Asset and Health Dynamics of the Oldest-Old (AHEAD), a representative survey of older Americans, as a test of the developmental discontinuity hypothesis. Rasch modeling procedures were used to rescale cognitive measures into interval scores, equating scales across measures, making it possible to compare predictor effects directly. Rasch scaling also reduces the likelihood of obtaining spurious interactions. Tasks included combined immediate and delayed recall, the Telephone Interview for Cognitive Status (TICS), Series 7, and an overall cognitive score. Demographic variables most strongly predicted performance on all scores, with health variables having smaller effects. Age interacted with both demographic and health variables, but patterns of effects varied. Demographic variables have strong effects on cognition. The developmental discontinuity hypothesis that health variables have stronger effects than demographic ones on cognition in older adults was not supported.
Factor and Rasch analysis of the Fonseca anamnestic index for the diagnosis of myogenous temporomandibular disorder.

PubMed

Rodrigues-Bigaton, Delaine; de Castro, Ester M; Pires, Paulo F

Rasch analysis has been used in recent studies to test the psychometric properties of a questionnaire. The conditions for use of the Rasch model are one-dimensionality (assessed via prior factor analysis) and local independence (the probability of getting a particular item right or wrong should not be conditioned upon success or failure in another). To evaluate the dimensionality and the psychometric properties of the Fonseca anamnestic index (FAI), such as the fit of the data to the model, the degree of difficulty of the items, and the ability to respond in patients with myogenous temporomandibular disorder (TMD). The sample consisted of 94 women with myogenous TMD, diagnosed by the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD), who answered the FAI. For the factor analysis, we applied the Kaiser-Meyer-Olkin test, Bartlett's sphericity, Spearman's correlation, and the determinant of the correlation matrix. For extraction of the factors/dimensions, an eigenvalue >1.0 was used, followed by oblique oblimin rotation. The Rasch analysis was conducted on the dimension that showed the highest proportion of variance explained. Adequate sample "n" and FAI multidimensionality were observed. Dimension 1 (primary) consisted of items 1, 2, 3, 6, and 7. All items of dimension 1 showed adequate fit to the model, being observed according to the degree of difficulty (from most difficult to easiest), respectively, items 2, 1, 3, 6, and 7. The FAI presented multidimensionality with its main dimension consisting of five reliable items with adequate fit to the composition of its structure. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Item analysis using Rasch models confirms that the Danish versions of the DISABKIDS® chronic-generic and diabetes-specific modules are valid and reliable.

PubMed

Nielsen, Julie Bøjstrup; Kyvsgaard, Julie Nyholm; Sildorf, Stine Møller; Kreiner, Svend; Svensson, Jannet

2017-03-01

Type 1 Diabetes (T1D) has a negative impact on psychological and overall well-being. Screening for Health-related Quality of Life (HrQoL) and addressing HrQoL issues in the clinic leads to improved well-being and metabolic outcomes. The aim of this study was to translate the generic and diabetes-specific validated multinational DISABKIDS® questionnaires into Danish, and then determine their validity and reliability. The questionnaires were translated using a validated translation procedure and completed by 99 children and adolescents from our diabetes-department; all diagnosed with T1D and were aged between 8 and 18 years old. The Rasch and the graphical log linear Rasch model (GLLRM) were used to determine validity. Monte Carlo methods and Cronbach's α were used to confirm reliability. The data did not fit a pure Rasch model but did fit a GLLRM when item six in the independence scale is excluded. The six subscales measure different aspects of HrQoL indicating that all the subscales are necessary. The questionnaire shows local dependency between items and differential item functioning (DIF). Therefore age, gender, and glycated hemoglobin (HbA1c) levels must be taken into account when comparing HrQoL between groups. The Danish versions of the DISABKIDS® chronic-generic and diabetes-specific modules provide valid and objective measurements with adequate reliability. These Danish versions are useful tools for evaluating HrQoL in Danish patients with T1D. However, guidelines on how to manage DIF and local independence will be required, and item six should be rephrased.
A Rasch analysis of the Brief Pain Inventory Interference subscale reveals three dimensions and an age bias.

PubMed

Walton, David M; Beattie, Tyler; Putos, Joseph; MacDermid, Joy C

2016-06-01

The Brief Pain Inventory is composed of two quantifiable scales: pain severity and pain interference. The reported factor structure of the interference subscale is not consistent in the extant literature, with no clear choice between a single- or two-factor structure. Here, we report on the results of Rasch-based analysis of the interference subscale using a large population-based ambulatory patient database (the Quebec Pain Registry). Observational cohort. A total of 1,000 responses were randomly drawn from a total database of 5,654 for this analysis. Both the original 7-item and an expanded 10-item version (Tyler 2002) of the interference subscale were evaluated. Rasch analysis revealed significant misfit of both versions of the scale, with the original 7-item version outperforming the expanded 10-item version. Analysis of dimensionality revealed that both versions showed improved model fit when considered two subscales (affective and physical interference) with the item on sleep interference removed or considered separately. Additionally, significant uniform differential item functioning was identified for 6 of the 7 original items when the sample was stratified by age above or below 55 years. The interference subscale achieved adequate model fit when considered as two separate subscales with age as a mediator of response, while interpreting the sleep interference item separately. A transformation matrix revealed that in all cases, ordinal-level change at the extreme ends of the scale appears to be more meaningful than does a similar change at the midpoints. The Interference subscale of the BPI should be interpreted as two separate subscales (Affective Interference, Physical Interference) with the sleep item removed or interpreted separately for optimal fit to the Rasch model. Implications for research and clinical use are discussed. Copyright © 2016 Elsevier Inc. All rights reserved.
Improving the measurement of health-related quality of life in adolescent with idiopathic scoliosis: the SRS-7, a Rasch-developed short form of the SRS-22 questionnaire.

PubMed

Caronni, Antonio; Zaina, Fabio; Negrini, Stefano

2014-04-01

Scoliosis Research Society-22 (SRS-22) questionnaire was developed to evaluate health-related quality of life (HRQL) in adolescent idiopathic scoliosis (AIS) patients. Rasch analysis (RA) is a statistical procedure which turns questionnaire ordinal scores into interval measures. Measures from Rasch-compatible questionnaires can be used, similar to body temperature or blood pressure, to quantify disease severity progression and treatment efficacy. Purpose of the current work is to present Rasch analysis (RA) of the SRS-22 questionnaire and to develop an SRS-22 Rasch-approved short form. 300 SRS-22 were randomly collected from 2447 consecutive IS adolescents at their first evaluation (229 females; 13.9 ± 1.9 years; 26.9 ± 14.7 Cobb°) in a scoliosis outpatient clinic. RA showed both disordered thresholds and overall misfit of the SRS-22. Sixteen items were re-scored and two misfitting items (6 and 14) removed to obtain a Rasch-compatible questionnaire. Participants HRQL measured too high with the rearranged questionnaire, indicating a severe SRS-22 ceiling effect. RA also highlighted SRS-22 multidimensionality, with pain/function not merging with self-image/mental health items. Item 3 showed differential item functioning (DIF) for both curve and hump amplitude. A 7-item questionnaire (SRS-7) was prepared by selecting single items from the original SRS-22. SRS-7 showed fit to the model, unidimensionality and no DIF. Compared with the SRS-22, the short form scale shows better targeting of the participants' population. RA shows that SRS-22 has poor clinimetric properties; moreover, when used with AIS at first evaluation, SRS-22 is affected by a severe ceiling effect. SRS-7, an SRS-22 7-item short form questionnaire, provides an HRQL interval measure better tailored to these participants. Copyright © 2014 Elsevier Ltd. All rights reserved.
An Introduction to the Partial Credit Model for Developing Nursing Assessments.

ERIC Educational Resources Information Center

Fox, Christine

1999-01-01

Demonstrates how the partial credit model, a variation of the Rasch Measurement Model, can be used to develop performance-based assessments for nursing education. Applies the model using the Practical Knowledge Inventory for Nurses. (SK)
Assessing Pre-Service Physics Teachers’ Energy Literacy: An Application of Rasch measurement

NASA Astrophysics Data System (ADS)

Yusup, M.; Setiawan, A.; Rustaman, N. Y.; Kaniawati, I.

2017-09-01

This paper aims to present a summary of pre-service physics teachers’ responses on energy literacy assessment. A total of 123 pre-service physics teacher in first through third year of education participated. Data were analyzed using Rasch modeling. Research findings indicate that pre-service physics teachers show their low self-system toward energy conservation. They were also still lack of metacognitive and cognitive competencies. These finding provide information for the future development of curriculum, teaching and learning that can improve pre-service physics teachers’ energy literacy.
Examining the validity and reliability of the Taita symptom checklist using Rasch analysis.

PubMed

Chen, Yun-Ling; Pan, Ay-Woan; Chung, LyInn; Chen, Tsyr-Jang

2015-03-01

The Taita symptom checklist (TSCL) is a standardized self-rating psychiatric symptom scale for outpatients with mental illness in Taiwan. This study aimed to examine the validity and reliability of the TSCL using Rasch analysis. The TSCL was given to 583 healthy people and 479 people with mental illness. Rasch analysis was used to examine the appropriateness of the rating scale, the unidimensionality of the scale, the differential item functioning across sex and diagnosis, and the Rasch cut-off score of the scale. Rasch analysis confirmed that the revised 37 items with a three-point rating scale of the TSCL demonstrated good internal consistency and met criteria for unidimensionality. The person and item reliability indices were high. The TSCL could reliably measure healthy participants and patients with mental illness. Differential item functioning due to sex or psychiatric diagnosis was evident for three items. A Rasch cut-off score for TSCL was produced for detecting participants' psychiatric symptoms based on an eight-level classification. The TSCL is a reliable and valid assessment to evaluate the participants' perceived disturbance of psychiatric symptoms based on Rasch analysis. Copyright © 2013. Published by Elsevier B.V.
Rasch analysis of the Italian Lower Extremity Functional Scale: insights on dimensionality and suggestions for an improved 15-item version.

PubMed

Bravini, Elisabetta; Giordano, Andrea; Sartorio, Francesco; Ferriero, Giorgio; Vercelli, Stefano

2017-04-01

To investigate dimensionality and the measurement properties of the Italian Lower Extremity Functional Scale using both classical test theory and Rasch analysis methods, and to provide insights for an improved version of the questionnaire. Rasch analysis of individual patient data. Rehabilitation centre. A total of 135 patients with musculoskeletal diseases of the lower limb. Patients were assessed with the Lower Extremity Functional Scale before and after the rehabilitation. Rasch analysis showed some problems related to rating scale category functioning, items fit, and items redundancy. After an iterative process, which resulted in the reduction of rating scale categories from 5 to 4, and in the deletion of 5 items, the psychometric properties of the Italian Lower Extremity Functional Scale improved. The retained 15 items with a 4-level response format fitted the Rasch model (internal construct validity), and demonstrated unidimensionality and good reliability indices (person-separation reliability 0.92; Cronbach's alpha 0.94). Then, the analysis showed differential item functioning for six of the retained items. The sensitivity to change of the Italian 15-item Lower Extremity Functional Scale was nearly equal to the one of the original version (effect size: 0.93 and 0.98; standardized response mean: 1.20 and 1.28, respectively for the 15-item and 20-item versions). The Italian Lower Extremity Functional Scale had unsatisfactory measurement properties. However, removing five items and simplifying the scoring from 5 to 4 levels resulted in a more valid measure with good reliability and sensitivity to change.
The Val30Met familial amyloid polyneuropathy specific Rasch-built overall disability scale (FAP-RODS(©) ).

PubMed

Pruppers, Mariëlle H J; Merkies, Ingemar S J; Faber, Catharina G; Da Silva, Ana M; Costa, Vanessa; Coelho, Teresa

2015-09-01

Familial amyloid polyneuropathy (FAP) is a chronic debilitating multi-organic disorder, mainly assessed using ordinal-based impairment measures. To date, no outcome measure at the activity and participation level has been constructed in FAP. The current study aimed to design an interval activity/participation scale for FAP through Rasch methodology. A preliminary FAP Rasch-built overall disability scale (pre-FAP-RODS) containing 146 activity/participation items was assessed twice (interval: 2-4 week; test-retest reliability) in 248 patients with Val30Met FAP examined in Porto, Portugal, of which 65.7% have received liver transplantation. An ordinal-based 24-item FAP-symptoms inventory questionnaire (FAP-SIQ) was also assessed (validity purposes). The pre-FAP-RODS and FAP-SIQ data were subjected to Rasch analyses. The pre-FAP-RODS did not meet model's expectations. On the basis of requirements such as misfit statistics, differential item functioning, and local dependency, items were systematically removed until a final 34-item FAP-RODS(©) was constructed fulfilling all Rasch requirements. Acceptable reliability/validity scores were demonstrated. In conclusion, the 34-item FAP-RODS(©) is a disease-specific interval measure suitable for detecting activity and participation restrictions in patients with FAP. The use of the FAP-RODS(©) is recommended for future international clinical trials in patients with Val30Met FAP determining its responsiveness and its cross-cultural validation. Its expansion to other forms of FAP should also be focus of future clinical studies. © 2015 Peripheral Nerve Society.
Testing of the SEE and OEE post-hip fracture.

PubMed

Resnick, Barbara; Orwig, Denise; Zimmerman, Sheryl; Hawkes, William; Golden, Justine; Werner-Bronzert, Michelle; Magaziner, Jay

2006-08-01

The purpose of this study was to test the reliability and validity of the Self-Efficacy for Exercise (SEE) and the Outcome Expectations for Exercise (OEE) scales in a sample of 166 older women post-hip fracture. There was some evidence of validity of the SEE and OEE based on confirmatory factor analysis and Rasch model testing, criterion based and convergent validity, and evidence of internal consistency based on alpha coefficients and separation indices and reliability based on R2 estimates. Rasch model testing demonstrated that some items had high variability. Based on these findings suggestions are made for how items could be revised and the scales improved for future use.
Rasch Analysis for Instrument Development: Why, When, and How?

PubMed Central

Boone, William J.

2016-01-01

This essay describes Rasch analysis psychometric techniques and how such techniques can be used by life sciences education researchers to guide the development and use of surveys and tests. Specifically, Rasch techniques can be used to document and evaluate the measurement functioning of such instruments. Rasch techniques also allow researchers to construct “Wright maps” to explain the meaning of a test score or survey score and develop alternative forms of tests and surveys. Rasch techniques provide a mechanism by which the quality of life sciences–related tests and surveys can be optimized and the techniques can be used to provide a context (e.g., what topics a student has mastered) when explaining test and survey results. PMID:27856555
Validity of the impact on participation and autonomy questionnaire: a comparison between two countries.

PubMed

Kersten, Paula; Cardol, Mieke; George, Steve; Ward, Christopher; Sibley, Andrew; White, Barney

2007-10-15

To evaluate the cross-cultural validity of the five subscales of the Impact on Participation and Autonomy (IPA) measure and the full 31-item scale. Data from two validation studies (Dutch and English) were pooled (n = 106). Participants (aged 18-75), known to rehabilitation services or GP practices, had conditions ranging from minor ailments to significant disability. Validity of the five subscales and the total scale was examined using Rasch analysis (Partial Credit Model). P values smaller than 0.01 were employed to allow for multiple testing. A number of items in all the subscales except 'Outdoor Autonomy' needed rescoring. One 'Indoor Autonomy' item showed uniform DIF by country and was split by country. One 'Work and Education' item displayed uniform and non-uniform DIF by gender. All the subscales fitted the Rasch model and were invariant across country. A 30-item IPA also fitted the Rasch model. The IPA subscales and a 30-item scale are invariant across the two cultures and gender. The IPA can be used validly to assess participation and autonomy in these populations. Further analyses are required to examine whether the IPA is invariant across differing levels of disability and other disease groups not included in this study.
Biases and power for groups comparison on subjective health measurements.

PubMed

Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique

2012-01-01

Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald's test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative.

Exploring Secondary Students' Knowledge and Misconceptions about Influenza: Development, validation, and implementation of a multiple-choice influenza knowledge scale

NASA Astrophysics Data System (ADS)

Romine, William L.; Barrow, Lloyd H.; Folk, William R.

2013-07-01

Understanding infectious diseases such as influenza is an important element of health literacy. We present a fully validated knowledge instrument called the Assessment of Knowledge of Influenza (AKI) and use it to evaluate knowledge of influenza, with a focus on misconceptions, in Midwestern United States high-school students. A two-phase validation process was used. In phase 1, an initial factor structure was calculated based on 205 students of grades 9-12 at a rural school. In phase 2, one- and two-dimensional factor structures were analyzed from the perspectives of classical test theory and the Rasch model using structural equation modeling and principal components analysis (PCA) on Rasch residuals, respectively. Rasch knowledge measures were calculated for 410 students from 6 school districts in the Midwest, and misconceptions were verified through the χ 2 test. Eight items measured knowledge of flu transmission, and seven measured knowledge of flu management. While alpha reliability measures for the subscales were acceptable, Rasch person reliability measures and PCA on residuals advocated for a single-factor scale. Four misconceptions were found, which have not been previously documented in high-school students. The AKI is the first validated influenza knowledge assessment, and can be used by schools and health agencies to provide a quantitative measure of impact of interventions aimed at increasing understanding of influenza. This study also adds significantly to the literature on misconceptions about influenza in high-school students, a necessary step toward strategic development of educational interventions for these students.
Cross-national health comparisons using the Rasch model: findings from the 2012 US Health and Retirement Study and the 2012 Mexican Health and Aging Study.

PubMed

Hong, Ickpyo; Reistetter, Timothy A; Díaz-Venegas, Carlos; Michaels-Obregon, Alejandra; Wong, Rebeca

2018-05-10

Cross-national comparisons of patterns of population aging have emerged as comparable national micro-data have become available. This study creates a metric using Rasch analysis and determines the health of American and Mexican older adult populations. Secondary data analysis using representative samples aged 50 and older from 2012 U.S. Health and Retirement Study (n = 20,554); 2012 Mexican Health and Aging Study (n = 14,448). We developed a function measurement scale using Rasch analysis of 22 daily tasks and physical function questions. We tested psychometrics of the scale including factor analysis, fit statistics, internal consistency, and item difficulty. We investigated differences in function using multiple linear regression controlling for demographics. Lastly, we conducted subgroup analyses for chronic conditions. The created common metric demonstrated a unidimensional structure with good item fit, an acceptable precision (person reliability = 0.78), and an item difficulty hierarchy. The American adults appeared less functional than adults in Mexico (β = - 0.26, p < 0.0001) and across two chronic conditions (arthritis, β = - 0.36; lung problems, β = - 0.62; all p < 0.05). However, American adults with stroke were more functional than Mexican adults (β = 0.46, p = 0.047). The Rasch model indicates that Mexican adults were more functional than Americans at the population level and across two chronic conditions (arthritis and lung problems). Future studies would need to elucidate other factors affecting the function differences between the two countries.
Using the Rasch Measurement Model in Psychometric Analysis of the Family Effectiveness Measure

PubMed Central

McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.

2013-01-01

Background Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID:23636342
Planning a Study for Testing the Rasch Model given Missing Values due to the use of Test-booklets.

PubMed

Yanagida, Takuya; Kubinger, Klaus D; Rasch, Dieter

2015-01-01

Though calibration of an achievement test within a psychological and educational context is very often carried out by the Rasch model, data sampling is hardly designed according to statistical foundations. However, Kubinger, Rasch, and Yanagida (2009, 2011) suggested an approach for the determination of sample size according to a given Type-I and Type-II risk and a certain effect of model contradiction when testing the Rasch model. The approach uses a three-way analysis of variance design with mixed classification. For the while, their simulation studies deal with complete data, meaning every examinee is administered with all of the items of an item pool. The simulation study now presented in this paper deals with the practical relevant case, in particular for large-scale assessments, that item presentation happens to use several test-booklets. As a consequence, there are missing values by design. Therefore, the question to be considered is, whether this approach works in this case as well. Besides the fact, that data are not normally distributed but there is a dichotomous variable (an examinee either solves an item or fails to solve it), only a single entry for each cell exists in the given three-way analysis of variance design, if at all, due to missing values. Hence, the obligatory test-statistic's distribution may not be retained, in contrast to the case of having no missing values. The result of our simulation study, despite applying only to a very special scenario, is that this approach works, indeed: Whether test-booklets were used or every examinee is administered all of the items changes nothing in respect to the actual Type-I risk or to the power of the test, given almost the same amount of information of examinees per item. However, as the results are limited to a special scenario, we currently recommend any interested researcher to simulate the appropriate one in advance by him/herself.
Psychometric Evaluation of the HIV Disclosure Belief Scale: A Rasch Model Approach.

PubMed

Hu, Jinxiang; Serovich, Julianne M; Chen, Yi-Hsin; Brown, Monique J; Kimberly, Judy A

2017-01-01

This study provides psychometric assessment of an HIV disclosure belief scale (DBS) among men who have sex with men (MSM). This study used baseline data from a clinical trial evaluating the effectiveness of an HIV serostatus disclosure intervention of 338 HIV-positive MSM. The Rasch model was used after unidimensionality and local independence assumptions were tested for application of the model. Results suggest that there was only one item that did not fit the model well. After removing the item, the DBS showed good model-data fit and high item and person reliabilities. This instrument showed measurement invariance across two different age groups, but some items showed differential item functioning between Caucasian and other minority groups. The findings suggest that the DBS is suitable for measuring the HIV disclosure beliefs, but it should be cautioned when the DBS is used to compare the disclosure beliefs between different racial/ethnic groups.
Rasch Analyses of Very Low Food Security among Households and Children in the Three City Study*

PubMed Central

Moffitt, Robert A.; Ribar, David C.

2017-01-01

The longitudinal Three City Study of low-income families with children measures food hardships using fewer questions and some different questions from the standard U.S. instrument for measuring food security, the Household Food Security Survey Module (HFSSM) in the Current Population Survey (CPS). We utilize a Rasch measurement model to identify thresholds of very low food security among households and very low food security among children in the Three City Study that are comparable to thresholds from the HFSSM. We also use the Three City Study to empirically investigate the determinants of food insecurity and of these specific food insecurity outcomes, estimating a multivariate behavioral Rasch model that is adapted to address longitudinal data. The estimation results indicate that participation in the Supplemental Nutrition Assistance Program and the Temporary Assistance for Needy Families program reduce food insecurity, while poverty and disability among caregivers increase it. Besides its longitudinal structure, the Three City Study measures many more characteristics about households than the CPS. Our estimates reveal that financial assistance through social networks and a household's own financial assets reduce food insecurity, while its outstanding loans increase insecurity. PMID:29187764
Rasch Analyses of Very Low Food Security among Households and Children in the Three City Study.

PubMed

Moffitt, Robert A; Ribar, David C

2016-04-01

The longitudinal Three City Study of low-income families with children measures food hardships using fewer questions and some different questions from the standard U.S. instrument for measuring food security, the Household Food Security Survey Module (HFSSM) in the Current Population Survey (CPS). We utilize a Rasch measurement model to identify thresholds of very low food security among households and very low food security among children in the Three City Study that are comparable to thresholds from the HFSSM. We also use the Three City Study to empirically investigate the determinants of food insecurity and of these specific food insecurity outcomes, estimating a multivariate behavioral Rasch model that is adapted to address longitudinal data. The estimation results indicate that participation in the Supplemental Nutrition Assistance Program and the Temporary Assistance for Needy Families program reduce food insecurity, while poverty and disability among caregivers increase it. Besides its longitudinal structure, the Three City Study measures many more characteristics about households than the CPS. Our estimates reveal that financial assistance through social networks and a household's own financial assets reduce food insecurity, while its outstanding loans increase insecurity.
Validation of the Spanish Short Self-Regulation Questionnaire (SSSRQ) through Rasch Analysis.

PubMed

Garzón Umerenkova, Angélica; de la Fuente Arias, Jesús; Martínez-Vicente, José Manuel; Zapata Sevillano, Lucía; Pichardo, Mari Carmen; García-Berbén, Ana Belén

2017-01-01

Background: The aim of the study was to psychometrically characterize the Spanish Short Self-Regulation Questionnaire (SSSRQ) through Rasch analysis. Materials and Methods: 831 Spaniard university students (262 men), between 17 and 39 years of age and ranging from the first to the 5th year of studies, completed the SSSRQ questionnaire. Confirmatory factor analysis (CFA) was carried out in order to establish structural adequacy. Afterward, by means of the Rasch model, a study of each sub scale was conducted to test for dimensionality, fit of the sample questions, functionality of the response categories, reliability and estimation of Differential Item Functioning by gender and course. Results: The four sub-scales comply with the unidimensionality criteria, the questions are in line with the model, the response categories operate properly and the reliability of the sample is acceptable. Nonetheless, the test could benefit from the inclusion of additional items of both high and low difficulty in order to increase construct validity, discrimination and reliability for the respondents. Several items with differences in gender and course were also identified. Discussion: The results evidence the need and adequacy of this complementary psychometric analysis strategy, in relation to the CFA to enhance the instrument.
Validation of the Spanish Short Self-Regulation Questionnaire (SSSRQ) through Rasch Analysis

PubMed Central

Garzón Umerenkova, Angélica; de la Fuente Arias, Jesús; Martínez-Vicente, José Manuel; Zapata Sevillano, Lucía; Pichardo, Mari Carmen; García-Berbén, Ana Belén

2017-01-01

Background: The aim of the study was to psychometrically characterize the Spanish Short Self-Regulation Questionnaire (SSSRQ) through Rasch analysis. Materials and Methods: 831 Spaniard university students (262 men), between 17 and 39 years of age and ranging from the first to the 5th year of studies, completed the SSSRQ questionnaire. Confirmatory factor analysis (CFA) was carried out in order to establish structural adequacy. Afterward, by means of the Rasch model, a study of each sub scale was conducted to test for dimensionality, fit of the sample questions, functionality of the response categories, reliability and estimation of Differential Item Functioning by gender and course. Results: The four sub-scales comply with the unidimensionality criteria, the questions are in line with the model, the response categories operate properly and the reliability of the sample is acceptable. Nonetheless, the test could benefit from the inclusion of additional items of both high and low difficulty in order to increase construct validity, discrimination and reliability for the respondents. Several items with differences in gender and course were also identified. Discussion: The results evidence the need and adequacy of this complementary psychometric analysis strategy, in relation to the CFA to enhance the instrument. PMID:28298898
Improving the evaluation of therapeutic interventions in multiple sclerosis: the role of new psychometric methods.

PubMed

Hobart, J; Cano, S

2009-02-01

In this monograph we examine the added value of new psychometric methods (Rasch measurement and Item Response Theory) over traditional psychometric approaches by comparing and contrasting their psychometric evaluations of existing sets of rating scale data. We have concentrated on Rasch measurement rather than Item Response Theory because we believe that it is the more advantageous method for health measurement from a conceptual, theoretical and practical perspective. Our intention is to provide an authoritative document that describes the principles of Rasch measurement and the practice of Rasch analysis in a clear, detailed, non-technical form that is accurate and accessible to clinicians and researchers in health measurement. A comparison was undertaken of traditional and new psychometric methods in five large sets of rating scale data: (1) evaluation of the Rivermead Mobility Index (RMI) in data from 666 participants in the Cannabis in Multiple Sclerosis (CAMS) study; (2) evaluation of the Multiple Sclerosis Impact Scale (MSIS-29) in data from 1725 people with multiple sclerosis; (3) evaluation of test-retest reliability of MSIS-29 in data from 150 people with multiple sclerosis; (4) examination of the use of Rasch analysis to equate scales purporting to measure the same health construct in 585 people with multiple sclerosis; and (5) comparison of relative responsiveness of the Barthel Index and Functional Independence Measure in data from 1400 people undergoing neurorehabilitation. Both Rasch measurement and Item Response Theory are conceptually and theoretically superior to traditional psychometric methods. Findings from each of the five studies show that Rasch analysis is empirically superior to traditional psychometric methods for evaluating rating scales, developing rating scales, analysing rating scale data, understanding and measuring stability and change, and understanding the health constructs we seek to quantify. There is considerable added value in using Rasch analysis rather than traditional psychometric methods in health measurement. Future research directions include the need to reproduce our findings in a range of clinical populations, detailed head-to-head comparisons of Rasch analysis and Item Response Theory, and the application of Rasch analysis to clinical practice.
The Nature of Science Instrument-Elementary (NOSI-E): Using Rasch principles to develop a theoretically grounded scale to measure elementary student understanding of the nature of science

NASA Astrophysics Data System (ADS)

Peoples, Shelagh

The purpose of this study was to determine which of three competing models will provide, reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS understanding. The NOS construct was conceptualized using five construct dimensions (Empirical, Inventive, Theory-laden, Certainty and Socially & Culturally Embedded). The competing models represent three internal models for the NOS construct. One postulate is that the NOS construct is unidimensional where one latent construct explains the relationship between the 28 items of the NOSI-E. Alternatively, the NOS construct is composed of five independent unidimensional constructs (the consecutive approach). Lastly, the NOS construct is multidimensional and composed of five inter-related but separate dimensions. A validity argument was developed that hypothesized that the internal structure of the NOS construct is best represented by the multidimensional Rasch model. Four sets of analyses were performed in which the three representations were compared. These analyses addressed five validity aspects (content, substantive, generalizability, structural and external) of construct validity. The vast body of evidence supported the claim that the NOS construct is composed of five separate but inter-related dimensions that is best represented by the multidimensional Rasch model. The results of the multidimensional analyses indicated that the items of the five subscales were of excellent technical quality, exhibited no differential item functioning (based on gender), had an item hierarchy that conformed to theoretical expectations; and together formed subscales of reasonable reliability (> 0.7 on each subscale) that were responsive to change in the construct. Theory-laden scores from the multidimensional model predicted students' science achievement with scores from all five NOS dimensions significantly predicting students' perceptions of the constructivist nature of their classroom learning environment. The NOSI-E instrument is a theoretically grounded scale that can measure elementary students' NOS understanding and appears suitable for use in science education research.
A Rasch Analysis of Assessments of Morning and Evening Fatigue in Oncology Patients Using the Lee Fatigue Scale.

PubMed

Lerdal, Anners; Kottorp, Anders; Gay, Caryl; Aouizerat, Bradley E; Lee, Kathryn A; Miaskowski, Christine

2016-06-01

To accurately investigate diurnal variations in fatigue, a measure needs to be psychometrically sound and demonstrate stable item function in relationship to time of day. Rasch analysis is a modern psychometric approach that can be used to evaluate these characteristics. To evaluate, using Rasch analysis, the psychometric properties of the Lee Fatigue Scale (LFS) in a sample of oncology patients. The sample comprised 587 patients (mean age 57.3 ± 11.9 years, 80% women) undergoing chemotherapy for breast, gastrointestinal, gynecological, or lung cancer. Patients completed the 13-item LFS within 30 minutes of awakening (i.e., morning fatigue) and before going to bed (i.e., evening fatigue). Rasch analysis was used to assess validity and reliability. In initial analyses of differential item function, eight of the 13 items functioned differently depending on whether the LFS was completed in the morning or in the evening. Subsequent analyses were conducted separately for the morning and evening fatigue assessments. Nine of the morning fatigue items and 10 of the evening fatigue items demonstrated acceptable goodness-of-fit to the Rasch model. Principal components analyses indicated that both morning and evening assessments demonstrated unidimensionality. Person-separation indices indicated that both morning and evening fatigue scales were able to distinguish four distinct strata of fatigue severity. Excluding four items from the morning fatigue scale and three items from the evening fatigue scale improved the psychometric properties of the LFS for assessing diurnal variations in fatigue severity in oncology patients. Copyright © 2016 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
With hiccups and bumps: the development of a Rasch-based instrument to measure elementary students' understanding of the nature of science.

PubMed

Peoples, Shelagh M; O'Dwyer, Laura M; Shields, Katherine A; Wang, Yang

2013-01-01

This research describes the development process, psychometric analyses and part validation study of a theoretically-grounded Rasch-based instrument, the Nature of Science Instrument-Elementary (NOSI-E). The NOSI-E was designed to measure elementary students' understanding of the Nature of Science (NOS). Evidence is provided for three of the six validity aspects (content, substantive and generalizability) needed to support the construct validity of the NOSI-E. A future article will examine the structural and external validity aspects. Rasch modeling proved especially productive in scale improvement efforts. The instrument, designed for large-scale assessment use, is conceptualized using five construct domains. Data from 741 elementary students were used to pilot the Rasch scale, with continuous improvements made over three successive administrations. The psychometric properties of the NOSI-E instrument are consistent with the basic assumptions of Rasch measurement, namely that the items are well-fitting and invariant. Items from each of the five domains (Empirical, Theory-Laden, Certainty, Inventive, and Socially and Culturally Embedded) are spread along the scale's continuum and appear to overlap well. Most importantly, the scale seems appropriately calibrated and responsive for elementary school-aged children, the target age group. As a result, the NOSI-E should prove beneficial for science education research. As the United States' science education reform efforts move toward students' learning science through engaging in authentic scientific practices (NRC, 2011), it will be important to assess whether this new approach to teaching science is effective. The NOSI-E can be used as one measure of whether this reform effort has an impact.
Aligning physical elements with persons' attitude: an approach using Rasch measurement theory

NASA Astrophysics Data System (ADS)

Camargo, F. R.; Henson, B.

2013-09-01

Affective engineering uses mathematical models to convert the information obtained from persons' attitude to physical elements into an ergonomic design. However, applications in the domain have not in many cases met measurement assumptions. This paper proposes a novel approach based on Rasch measurement theory to overcome the problem. The research demonstrates that if data fit the model, further variables can be added to a scale. An empirical study was designed to determine the range of compliance where consumers could obtain an impression of a moisturizer cream when touching some product containers. Persons, variables and stimulus objects were parameterised independently on a linear continuum. The results showed that a calibrated scale preserves comparability although incorporating further variables.
Caffeine expectancy: instrument development in the Rasch measurement framework.

PubMed

Heinz, Adrienne J; Kassel, Jon D; Smith, Everett V

2009-09-01

Although caffeine is the most widely consumed psychoactive drug in the world, the mechanisms associated with consumption are not well understood. Nonetheless, outcome expectancies for caffeine use are thought to underlie caffeine's reinforcing properties. To date, however, there is no available, sufficient measure by which to assess caffeine expectancy. Therefore, the current study sought to develop such a measure employing Rasch measurement models. Unlike traditional measurement development techniques, Rasch analyses afford dynamic and interactive control of the analysis process and generate helpful information to guide instrument construction. A 5-stage developmental process is described, ultimately yielding a 37-item Caffeine Expectancy Questionnaire (CEQ) comprised of 4 factors representing "withdrawal symptoms," "positive effects," "acute negative effects," and "mood effects." Initial evaluation of the CEQ yielded sufficient evidence for various aspects of validity. Although additional research with more heterogeneous samples is required to further assess the measure's reliability and validity, the CEQ demonstrates potential with regard to its utility in experimental laboratory research and clinical application. 2009 APA, all rights reserved.
Comparative study of middle school students' attitudes towards science: Rasch analysis of entire TIMSS 2011 attitudinal data for England, Singapore and the U.S.A. as well as psychometric properties of attitudes scale

NASA Astrophysics Data System (ADS)

Pey Tee, Oon; Subramaniam, R.

2018-02-01

We report here on a comparative study of middle school students' attitudes towards science involving three countries: England, Singapore and the U.S.A. Complete attitudinal data sets from TIMSS (Trends in International Mathematics and Science Study) 2011 were used, thus giving a very large sample size (N = 20,246), compared to other studies in the journal literature. The Rasch model was used to analyse the data, and the findings have shed some useful light on not only how the Western and Asian students responded on a comparative basis in the various scales related to attitudes but also on the validity, reliability, and unidimensionality of the attitudes instrument used in TIMSS 2011. There may be a need for TIMSS test developers to consider doing away with negatively phrased items in the attitudes instrument and phrasing these positively as the Rasch framework shows that response bias is associated with these statements.
Measuring coping style following acquired brain injury: a modification of the Coping Inventory for Stressful Situations Using Rasch analysis.

PubMed

Simblett, Sara K; Gracey, Fergus; Ring, Howard; Bateman, Andrew

2015-09-01

The importance of coping style factors in the process of emotional adjustment following acquired brain injury (ABI) has been gaining increased attention. To assess ways of coping with distress accurately, clear conceptual definitions and measurement precision is vital. The purpose of this study was to investigate the psychometric properties of a well-known measure of coping, the Coping Inventory for Stressful Situations (CISS), for people who have experienced an ABI; and to modify the CISS, where necessary, to create a more reliable and valid measurement tool for this clinical group. Psychometric properties were investigated using Rasch analysis of responses from a sample of adults with ABI (n = 207). The internal consistency reliability and construct validity of the scale were examined. All originally proposed subscales were not valid or reliable and, as such, were incapable of interval-level measurement within this sample - Task: χ(2) (32, N = 207) = 105.1, p < .001; Emotion: χ(2) (32, N = 204) = 121.9, p < .001; Avoidance: χ(2) (32, N = 207) = 66.7, p < .001. Three valid and reliable subscales were derived measuring emotion-, task-, and avoidance-oriented coping styles by removing items that provided the most unreliable information and exploring fit to the Rasch model. The original version of the CISS may not be a valid and reliable measure of coping style following ABI. Modified subscales of the three distinct coping domains have been proposed that would help to improve measurement of coping style following ABI in future research and clinical practice. How people cope with difficulties following an ABI has been shown to impact upon emotional outcomes and functional recovery. The original version of the CISS was found to be an imprecise measure of coping following ABI. A modified version of the CISS was found to be a valid and reliable measure of three styles of coping (task-focused, emotion-focused, and avoidance-focused) that conforms to the properties of interval-level measurement as represented by the Rasch model. This structure is in keeping with previous theoretical models of coping. We advise caution about including items (1, 6, 7, 22, 24, 28, 29, 33, 34, and 46) that were found to diverge from the expectations of the Rasch measurement model in total subscale scores for measuring change in coping style. A conversion table for the three modified subscales is included in this paper to convert total raw scores into Rasch transformed logit values. Identifying strengths and weaknesses in coping style could be a means of guiding psychological intervention to promote good recovery following ABI. The sample included mainly people who had experienced non-traumatic brain injuries (e.g., a stroke). This research could be extended to include broader sample of people with differing brain injury aetiologies and neurological disorders. © 2014 The British Psychological Society.
Comparison is key.

PubMed

Stone, Mark H; Stenner, A Jackson

2014-01-01

Several concepts from Georg Rasch's last papers are discussed. The key one is comparison because Rasch considered the method of comparison fundamental to science. From the role of comparison stems scientific inference made operational by a properly developed frame of reference producing specific objectivity. The exact specifications Rasch outlined for making comparisons are explicated from quotes, and the role of causality derived from making comparisons is also examined. Understanding causality has implications for what can and cannot be produced via Rasch measurement. His simple examples were instructive, but the implications are far reaching upon first establishing the key role of comparison.
Using Rasch-models to compare the 30-, 20-, and 12-items version of the general health questionnaire taking four recoding schemes into account.

PubMed

Alexandrowicz, Rainer W; Friedrich, Fabian; Jahn, Rebecca; Soulier, Nathalie

2015-01-01

The present study compares the 30-, 20-, and 12-items versions of the General Health Questionnaire (GHQ) in the original coding and four different recoding schemes (Bimodal, Chronic, Modified Likert and a newly proposed Modified Chronic) with respect to their psychometric qualities. The dichotomized versions (i.e. Bimodal, Chronic and Modified Chronic) were evaluated with the Rasch-Model and the polytomous original version and the Modified Likert version were evaluated with the Partial Credit Model. In general, the versions under consideration showed agreement with the model assumption. However, the recoded versions exhibited some deficits with respect to the Outfit index. Because of the item deficits and for theoretical reasons we argue in favor of using the any of the three length versions with the original four-categorical coding scheme. Nevertheless, any of the versions appears apt for clinical use from a psychometric perspective.
Comparison of scoring approaches for the NEI VFQ-25 in low vision.

PubMed

Dougherty, Bradley E; Bullimore, Mark A

2010-08-01

The aim of this study was to evaluate different approaches to scoring the National Eye Institute Visual Functioning Questionnaire-25 (NEI VFQ-25) in patients with low vision including scoring by the standard method, by Rasch analysis, and by use of an algorithm created by Massof to approximate Rasch person measure. Subscale validity and use of a 7-item short form instrument proposed by Ryan et al. were also investigated. NEI VFQ-25 data from 50 patients with low vision were analyzed using the standard method of summing Likert-type scores and calculating an overall average, Rasch analysis using Winsteps software, and the Massof algorithm in Excel. Correlations between scores were calculated. Rasch person separation reliability and other indicators were calculated to determine the validity of the subscales and of the 7-item instrument. Scores calculated using all three methods were highly correlated, but evidence of floor and ceiling effects was found with the standard scoring method. None of the subscales investigated proved valid. The 7-item instrument showed acceptable person separation reliability and good targeting and item performance. Although standard scores and Rasch scores are highly correlated, Rasch analysis has the advantages of eliminating floor and ceiling effects and producing interval-scaled data. The Massof algorithm for approximation of the Rasch person measure performed well in this group of low-vision patients. The validity of the subscales VFQ-25 should be reconsidered.

Application of Rasch Measurement to a Measure of Musical Performance.

ERIC Educational Resources Information Center

Haley, Kathleen A.

1999-01-01

Describes the Rasch calibration of a portion of the Watkins Farnum Performance Scale (J. Watkins and S. Farnum, 1954), a test of instructional music performance, for 218 sixth graders. Results show how Rasch scaling allows item difficulties to be estimated, the test to be administered more efficiently, and diagnostic information to be obtained.…
Examination of a Social-Networking Site Activities Scale (SNSAS) Using Rasch Analysis

ERIC Educational Resources Information Center

Alhaythami, Hassan; Karpinski, Aryn; Kirschner, Paul; Bolden, Edward

2017-01-01

This study examined the psychometric properties of a social-networking site (SNS) activities scale (SNSAS) using Rasch Analysis. Items were also examined with Rasch Principal Components Analysis (PCA) and Differential Item Functioning (DIF) across groups of university students (i.e., males and females from the United States [US] and Europe; N =…
Using Rasch Measurement Theory to Examine Two Instructional Approaches for Teaching and Learning of French Grammar

ERIC Educational Resources Information Center

Vogel, Severine P.; Engelhard, George, Jr.

2011-01-01

The authors describe a quantitative approach based on Rasch measurement theory for evaluating classroom assessments within the context of foreign language classes. A secondary purpose was to examine the effects of two instructional approaches to teach grammar, a guided inductive and a deductive approach, through the lens of Rasch measurement…
The Assessment Revolution that Has Passed England By: Rasch Measurement

ERIC Educational Resources Information Center

Panayides, Panayiotis; Robinson, Colin; Tymms, Peter

2010-01-01

Assessment has been dominated by Classical Test Theory for the last half century although the radically different approach known as Rasch measurement briefly blossomed in England during the 1960s and 1970s. Its open development was stopped dead in the 1980s, whilst some work has continued almost surreptitiously. Elsewhere Rasch has assumed…
Identifying Measurement Disturbance Effects Using Rasch Item Fit Statistics and the Logit Residual Index.

ERIC Educational Resources Information Center

Mount, Robert E.; Schumacker, Randall E.

1998-01-01

A Monte Carlo study was conducted using simulated dichotomous data to determine the effects of guessing on Rasch item fit statistics and the Logit Residual Index. Results indicate that no significant differences were found between the mean Rasch item fit statistics for each distribution type as the probability of guessing the correct answer…
Using Rasch Measurement to Validate the Instrument of Students' Understanding of Models in Science (SUMS)

ERIC Educational Resources Information Center

Wei, Silin; Liu, Xiufeng; Jia, Yuane

2014-01-01

Scientific models and modeling play an important role in science, and students' understanding of scientific models is essential for their understanding of scientific concepts. The measurement instrument of "Students' Understanding of Models in Science" (SUMS), developed by Treagust, Chittleborough & Mamiala ("International…
Tree-Based Global Model Tests for Polytomous Rasch Models

ERIC Educational Resources Information Center

Komboz, Basil; Strobl, Carolin; Zeileis, Achim

2018-01-01

Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these…
The development and psychometric validation of the Ethical Awareness Scale.

PubMed

Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela

2018-04-19

To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
Biases and Power for Groups Comparison on Subjective Health Measurements

PubMed Central

Hamel, Jean-François; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Roquelaure, Yves; Sébille, Véronique

2012-01-01

Subjective health measurements are increasingly used in clinical research, particularly for patient groups comparisons. Two main types of analytical strategies can be used for such data: so-called classical test theory (CTT), relying on observed scores and models coming from Item Response Theory (IRT) relying on a response model relating the items responses to a latent parameter, often called latent trait. Whether IRT or CTT would be the most appropriate method to compare two independent groups of patients on a patient reported outcomes measurement remains unknown and was investigated using simulations. For CTT-based analyses, groups comparison was performed using t-test on the scores. For IRT-based analyses, several methods were compared, according to whether the Rasch model was considered with random effects or with fixed effects, and the group effect was included as a covariate or not. Individual latent traits values were estimated using either a deterministic method or by stochastic approaches. Latent traits were then compared with a t-test. Finally, a two-steps method was performed to compare the latent trait distributions, and a Wald test was performed to test the group effect in the Rasch model including group covariates. The only unbiased IRT-based method was the group covariate Wald’s test, performed on the random effects Rasch model. This model displayed the highest observed power, which was similar to the power using the score t-test. These results need to be extended to the case frequently encountered in practice where data are missing and possibly informative. PMID:23115620
Validation of Malaysian Versions of Perceived Diabetes Self-Management Scale (PDSMS), Medication Understanding and Use Self-Efficacy Scale (MUSE) and 8-Morisky Medication Adherence Scale (MMAS-8) Using Partial Credit Rasch Model.

PubMed

Al Abboud, Safaa Ahmed; Ahmad, Sohail; Bidin, Mohamed Badrulnizam Long; Ismail, Nahlah Elkudssiah

2016-11-01

The Diabetes Mellitus (DM) is a common silent epidemic disease with frequent morbidity and mortality. The psychological and psychosocial health factors are negatively influencing the glycaemic control in diabetic patients. Therefore, various questionnaires were developed to address the psychological and psychosocial well-being of the diabetic patients. Most of these questionnaires were first developed in English and then translated into different languages to make them useful for the local communities. The main aim of this study was to translate and validate the Malaysian versions of Perceived Diabetes Self-Management Scale (PDSMS), Medication Understanding and Use Self-Efficacy Scale (MUSE), and to revalidate 8-Morisky Medication Adherence Scale (MMAS-8) by Partial Credit Rasch Model (Modern Test Theory). Permission was obtained from respective authors to translate the English versions of PDSMS, MUSE and MMAS-8 into Malay language according to established standard international translation guidelines. In this cross-sectional study, 62 adult DM patients were recruited from Hospital Kuala Lumpur by purposive sampling method. The data were extracted from the self-administered questionnaires and entered manually in the Ministeps (Winsteps) software for Partial Credit Rasch Model. The item and person reliability, infit/outfit Z-Standard (ZSTD), infit/outfit Mean Square (MNSQ) and point measure correlation (PTMEA Corr) values were analysed for the reliability analyses and construct validation. The Malay version of PDSMS, MUSE and MMAS-8 found to be valid and reliable instrument for the Malaysian diabetic adults. The instrument showed good overall reliability value of 0.76 and 0.93 for item and person reliability, respectively. The values of infit/outfit ZSTD, infit/outfit MNSQ, and PTMEA Corr were also within the stipulated range of the Rasch Model proving the valid item constructs of the questionnaire. The translated Malay version of PDSMS, MUSE and MMAS-8 was found to be a highly reliable and valid questionnaire by Partial Credit Model. The Malay version was conceptually equivalent to original version, easy to understand and can be used for the Malaysian adult diabetic patients for future studies.
Validation of the Korean version of the pediatric quality of life inventory 4.0 (PedsQL) generic core scales in school children and adolescents using the Rasch model.

PubMed

Kook, Seung Hee; Varni, James W

2008-06-02

The Pediatric Quality of Life Inventory (PedsQL) is a child self-report and parent proxy-report instrument designed to assess health-related quality of life (HRQOL) in healthy and ill children and adolescents. It has been translated into over 70 international languages and proposed as a valid and reliable pediatric HRQOL measure. This study aimed to assess the psychometric properties of the Korean translation of the PedsQL 4.0 Generic Core Scales. Following the guidelines for linguistic validation, the original US English scales were translated into Korean and cognitive interviews were administered. The field testing responses of 1425 school children and adolescents and 1431 parents to the Korean version of PedsQL 4.0 Generic Core Scales were analyzed utilizing confirmatory factor analysis and the Rasch model. Consistent with studies using the US English instrument and other translation studies, score distributions were skewed toward higher HRQOL in a predominantly healthy population. Confirmatory factor analysis supported a four-factor and a second order-factor model. The analysis using the Rasch model showed that person reliabilities are low, item reliabilities are high, and the majority of items fit the model's expectation. The Rasch rating scale diagnostics showed that PedsQL 4.0 Generic Core Scales in general have the optimal number of response categories, but category 4 (almost always a problem) is somewhat problematic for the healthy school sample. The agreements between child self-report and parent proxy-report were moderate. The results demonstrate the feasibility, validity, item reliability, item fit, and agreement between child self-report and parent proxy-report of the Korean version of PedsQL 4.0 Generic Core Scales for school population health research in Korea. However, the utilization of the Korean version of the PedsQL 4.0 Generic Core Scales for healthy school populations needs to consider low person reliability, ceiling effects and cultural differences, and further validation studies on Korean clinical samples are required.
Funding Medical Research Projects: Taking into Account Referees' Severity and Consistency through Many-Faceted Rasch Modeling of Projects' Scores.

PubMed

Tesio, Luigi; Simone, Anna; Grzeda, Mariuzs T; Ponzio, Michela; Dati, Gabriele; Zaratin, Paola; Perucca, Laura; Battaglia, Mario A

2015-01-01

The funding policy of research projects often relies on scores assigned by a panel of experts (referees). The non-linear nature of raw scores and the severity and inconsistency of individual raters may generate unfair numeric project rankings. Rasch measurement (many-facets version, MFRM) provides a valid alternative to scoring. MFRM was applied to the scores achieved by 75 research projects on multiple sclerosis sent in response to a previous annual call by FISM-Italian Foundation for Multiple Sclerosis. This allowed to simulate, a posteriori, the impact of MFRM on the funding scenario. The applications were each scored by 2 to 4 independent referees (total = 131) on a 10-item, 0-3 rating scale called FISM-ProQual-P. The rotation plan assured "connection" of all pairs of projects through at least 1 shared referee.The questionnaire fulfilled satisfactorily the stringent criteria of Rasch measurement for psychometric quality (unidimensionality, reliability and data-model fit). Arbitrarily, 2 acceptability thresholds were set at a raw score of 21/30 and at the equivalent Rasch measure of 61.5/100, respectively. When the cut-off was switched from score to measure 8 out of 18 acceptable projects had to be rejected, while 15 rejected projects became eligible for funding. Some referees, of various severity, were grossly inconsistent (z-std fit indexes less than -1.9 or greater than 1.9). The FISM-ProQual-P questionnaire seems a valid and reliable scale. MFRM may help the decision-making process for allocating funds to MS research projects but also in other fields. In repeated assessment exercises it can help the selection of reliable referees. Their severity can be steadily calibrated, thus obviating the need to connect them with other referees assessing the same projects.
Developing the Polish Educational Needs Assessment Tool (Pol-ENAT) in rheumatoid arthritis and systemic sclerosis: a cross-cultural validation study using Rasch analysis.

PubMed

Sierakowska, Matylda; Sierakowski, Stanisław; Sierakowska, Justyna; Horton, Mike; Ndosi, Mwidimi

2015-03-01

To undertake cross-cultural adaptation and validation of the educational needs assessment tool (ENAT) for use with people with rheumatoid arthritis (RA) and systemic sclerosis (SSc) in Poland. The study involved two main phases: (1) cross-cultural adaptation of the ENAT from English into Polish and (2) Cross-cultural validation of Polish Educational Needs Assessment Tool (Pol-ENAT). The first phase followed an established process of cross-cultural adaptation of self-report measures. The second phase involved completion of the Pol-ENAT by patients and subjecting the data to Rasch analysis to assess the construct validity, unidimensionality, internal consistency and cross-cultural invariance. An adequate conceptual equivalence was achieved following the adaptation process. The dataset for validation comprised a total of 278 patients, 237 (85.3 %) of which were female. In each disease group (145, RA and 133, SSc), the 7 domains of the Pol-ENAT were found to fit the Rasch model, X (2)(df) = 16.953(14), p = 0.259 and 8.132(14), p = 0.882 for RA and SSc, respectively. Internal consistency of the Pol-ENAT was high (patient separation index = 0.85 and 0.89 for SSc and RA, respectively), and unidimensionality was confirmed. Cross-cultural differential item functioning (DIF) was detected in some subscales, and DIF-adjusted conversion tables were calibrated to enable cross-cultural comparison of data between Poland and the UK. Using a standard process in cross-cultural adaptation, conceptual equivalence was achieved between the original (UK) ENAT and the adapted Pol-ENAT. Fit to the Rasch model, confirmed that the construct validity, unidimensionality and internal consistency of the ENAT have been preserved.
Using Hospital Anxiety and Depression Scale (HADS) on patients with epilepsy: Confirmatory factor analysis and Rasch models.

PubMed

Lin, Chung-Ying; Pakpour, Amir H

2017-02-01

The problems of mood disorders are critical in people with epilepsy. Therefore, there is a need to validate a useful tool for the population. The Hospital Anxiety and Depression Scale (HADS) has been used on the population, and showed that it is a satisfactory screening tool. However, more evidence on its construct validity is needed. A total of 1041 people with epilepsy were recruited in this study, and each completed the HADS. Confirmatory factor analysis (CFA) and Rasch analysis were used to understand the construct validity of the HADS. In addition, internal consistency was tested using Cronbachs' α, person separation reliability, and item separation reliability. Ordering of the response descriptors and the differential item functioning (DIF) were examined using the Rasch models. The HADS showed that 55.3% of our participants had anxiety; 56.0% had depression based on its cutoffs. CFA and Rasch analyses both showed the satisfactory construct validity of the HADS; the internal consistency was also acceptable (α=0.82 in anxiety and 0.79 in depression; person separation reliability=0.82 in anxiety and 0.73 in depression; item separation reliability=0.98 in anxiety and 0.91 in depression). The difficulties of the four-point Likert scale used in the HADS were monotonically increased, which indicates no disordering response categories. No DIF items across male and female patients and across types of epilepsy were displayed in the HADS. The HADS has promising psychometric properties on construct validity in people with epilepsy. Moreover, the additive item score is supported for calculating the cutoff. Copyright © 2016 British Epilepsy Association. Published by Elsevier Ltd. All rights reserved.
Construct Validity of the Spanish Versions of the Memorial Symptom Assessment Scale Short Form and Condensed Form: Rasch Analysis of Responses in Oncology Outpatients.

PubMed

Llamas-Ramos, Inés; Llamas-Ramos, Rocío; Buz, José; Cortés-Rodríguez, María; Martín-Nogueras, Ana María

2018-06-01

The Memorial Symptom Assessment Scale (MSAS) is a self-rating instrument for the assessment of symptom distress in cancer patients. The Spanish version of the MSAS has recently been validated. However, we lack evidence of the internal construct validity of the shorter versions (short form [MSAS-SF] and condensed form [CMSAS]). In addition, rigorous testing of these scales with modern psychometric methods is needed. The aim of this study was to evaluate the internal construct validity and reliability of the Spanish versions of the MSAS-SF and CMSAS in oncology outpatients using Rasch analysis. Data from a convenience sample of oncology outpatients receiving chemotherapy (n = 306; mean age 60 years; 63% women) at a university hospital were analyzed. The Rasch unidimensional measurement model was used to examine response category functioning, item hierarchy, targeting, unidimensionality, reliability, and differential item functioning by age, gender, and marital status. The response category structure of the symptom distress items was improved by collapsing two categories. The scales were adequately targeted to the study patients, showed overall Rasch model fit (mean Infit MnSq ranged from 0.98 to 1.05), met criteria for unidimensionality, and the reliability of scores was good (person reliability > 0.80), except for the CMSAS prevalence scale. Only four items showed differential item functioning. The present study demonstrated that the Spanish versions of the MSAS-SF and CMSAS have adequate psychometric properties to evaluate symptom distress in oncology outpatients. Additional studies of the CMSAS are recommended. Copyright © 2018 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Vision and Quality of Life Index: validation of the Indian version using Rasch analysis.

PubMed

Gothwal, Vijaya K; Bagga, Deepak K

2013-07-18

A multi-attribute utility instrument (MAUI) consists of a descriptive system in which the items and responses seek information about a concept of the universe of health-related quality of life (QoL), and responses to these items then are weighted and combined to produce the index. To our knowledge, the 6-item Vision and Quality of Life Index (VisQoL) is the only available vision-related MAUI, developed and validated in Australia, specifically for visually impaired (VI) populations. To our knowledge, the psychometric properties of the VisQoL have not yet been investigated in an Indian VI sample; this was the aim of our study. The Indian VisQoL was administered to 349 VI adults face-to-face by a trained interviewer at the Vision Rehabilitation Centres of a tertiary eye care facility, South India. Rasch analysis was used to assess the psychometric properties. Rescoring was necessary for all except one item before ordered thresholds were obtained. All items fit the Rasch model and unidimensionality was confirmed. Person separation was acceptable (2.01), indicating that the instrument can discriminate among three strata of participants" vision-related QoL (VRQoL). The VisQoL items were targeted substantially to the participants" VRQoL (-0.69 logits). One item ("ability to have friendships") demonstrated large differential item functioning by work status; working participants reported the item to be more difficult (-1.13 logits) relative to other items when compared to the nonworking participants. The 6-item Indian VisQoL satisfies unidimensional Rasch model expectations in VI patients. Disordering of response categories was evident; replication is required before a common rescoring option should be considered.
ABILOCO-Kids: a Rasch-built 10-item questionnaire for assessing locomotion ability in children with cerebral palsy.

PubMed

Caty, Gilles D; Gilles, Caty D; Arnould, Carlyne; Thonnard, Jean-Louis; Lejeune, Thierry M

2008-11-01

To develop a questionnaire (ABILOCO-Kids) based on the Rasch measurement model that assesses locomotion ability in children with cerebral palsy. Prospective study and questionnaire development. A total of 113 children with cerebral palsy (10 (standard deviation 2.5) years old). A 41-item questionnaire was developed based on existing scales and on the clinical experience of professionals in the field of rehabilitation. This questionnaire was tested separately on the 113 children with cerebral palsy and their parents. Their responses were analysed using the Rasch model (RUMM-2020) to select items that had an ordered rating scale and that fit a unidimensional model. The final ABILOCO-Kids scale consisted of 10 locomotion activities, of which difficulty was rated by the parents. The parents gave a more precise assessment of their children's ability than the children themselves, leading to a wider range of measurement that was well-targeted on the sample population and that had good reliability (r=0.97) and reproducibility (intraclass correlation coefficient=0.96). Item calibration did not vary with age, sex or clinical presentation (hemiplegia, diplegia, quadriplegia). The concurrent validity of the ABILOCO-Kids questionnaire was also shown by its correlation with the Gross Motor Function Classification System. The ABILOCO-Kids questionnaire has good psychometric qualities for measuring a wide range of locomotion abilities in children with cerebral palsy.
Blooms' separation of the final exam of Engineering Mathematics II: Item reliability using Rasch measurement model

NASA Astrophysics Data System (ADS)

Fuaad, Norain Farhana Ahmad; Nopiah, Zulkifli Mohd; Tawil, Norgainy Mohd; Othman, Haliza; Asshaari, Izamarlina; Osman, Mohd Hanif; Ismail, Nur Arzilah

2014-06-01

In engineering studies and researches, Mathematics is one of the main elements which express physical, chemical and engineering laws. Therefore, it is essential for engineering students to have a strong knowledge in the fundamental of mathematics in order to apply the knowledge to real life issues. However, based on the previous results of Mathematics Pre-Test, it shows that the engineering students lack the fundamental knowledge in certain topics in mathematics. Due to this, apart from making improvements in the methods of teaching and learning, studies on the construction of questions (items) should also be emphasized. The purpose of this study is to assist lecturers in the process of item development and to monitor the separation of items based on Blooms' Taxonomy and to measure the reliability of the items itself usingRasch Measurement Model as a tool. By using Rasch Measurement Model, the final exam questions of Engineering Mathematics II (Linear Algebra) for semester 2 sessions 2012/2013 were analysed and the results will provide the details onthe extent to which the content of the item providesuseful information about students' ability. This study reveals that the items used in Engineering Mathematics II (Linear Algebra) final exam are well constructed but the separation of the items raises concern as it is argued that it needs further attention, as there is abig gap between items at several levels of Blooms' cognitive skill.
Low back pain in 17 countries, a Rasch analysis of the ICF core set for low back pain.

PubMed

Røe, Cecilie; Bautz-Holter, Erik; Cieza, Alarcos

2013-03-01

Previous studies indicate that a worldwide measurement tool may be developed based on the International Classification of Functioning Disability and Health (ICF) Core Sets for chronic conditions. The aim of the present study was to explore the possibility of constructing a cross-cultural measurement of functioning for patients with low back pain (LBP) on the basis of the Comprehensive ICF Core Set for LBP and to evaluate the properties of the ICF Core Set. The Comprehensive ICF Core Set for LBP was scored by health professionals for 972 patients with LBP from 17 countries. Qualifier levels of the categories, invariance across age, sex and countries, construct validity and the ordering of the categories in the components of body function, body structure, activities and participation were explored by Rasch analysis. The item-trait χ2-statistics showed that the 53 categories in the ICF Core Set for LBP did not fit the Rasch model (P<0.001). The main challenge was the invariance in the responses according to country. Analysis of the four countries with the largest sample sizes indicated that the data from Germany fit the Rasch model, and the data from Norway, Serbia and Kuwait in terms of the components of body functions and activities and participation also fit the model. The component of body functions and activity and participation had a negative mean location, -2.19 (SD 1.19) and -2.98 (SD 1.07), respectively. The negative location indicates that the ICF Core Set reflects patients with a lower level of function than the present patient sample. The present results indicate that it may be possible to construct a clinical measure of function on the basis of the Comprehensive ICF Core Set for LBP by calculating country-specific scores before pooling the data.
Candidate Evaluation Using Targeted Construct Assessment in the Multiple Mini-Interview: A Multifaceted Rasch Model Analysis.

PubMed

McLaughlin, Jacqueline E; Singer, David; Cox, Wendy C

2017-01-01

Construct: A 7-station multiple mini-interview (MMI) circuit was implemented and assessed for 214 candidates rated by 37 interviewers (N = 1,498 ratings). The MMI stations were designed to assess 6 specific constructs (adaptability, empathy, integrity, critical thinking, teamwork [receiving instruction], teamwork [giving instruction]) and one open station about the candidate's interest in the school. Despite the apparent benefits of the MMI, construct-irrelevant variance continues to be a topic of study. Refining the MMI to more effectively measure candidate ability is critical to improving our ability to identify and select candidates that are equipped for success within health professions education and the workforce. Each station assessed a single construct and was rated by a single interviewer who was provided only the name of the candidate and no additional information about the candidate's background, application, or prior academic performance. All interviewers received online and in-person training in the fall prior to the MMI and the morning of the MMI. A 3-facet multifaceted Rasch measurement analysis was completed to determine interviewer severity, candidate ability, and MMI station difficulty and examine how the model performed overall (e.g., rating scale). Altogether, the Rasch measures explained 62.84% of the variance in the ratings. Differences in candidate ability explained 45.28% of the variance in the data, whereas differences in interviewer severity explained 16.09% of the variance in the data. None of the interviewers had Infit or Outfit mean-square scores greater than 1.7, and only 2 (5.4%) had mean-square scores less than 0.5. The data demonstrated acceptable fit to the multifaceted Rasch measurement model. This work is the first of its kind in pharmacy and provides insight into the development of an MMI that provides useful and meaningful candidate assessment ratings for institutional decision making.

Rasch Analysis of the Locus-of-Hope Scale. Brief Report

ERIC Educational Resources Information Center

Gadiana, Leny G.; David, Adonis P.

2015-01-01

The Locus-of-Hope Scale (LHS) was developed as a measure of the locus-of-hope dimensions (Bernardo, 2010). The present study adds to the emerging literature on locus-of-hope by assessing the psychometric properties of the LHS using Rasch analysis. The results from the Rasch analyses of the four subscales of LHS provided evidence on the…
On the Bayesian Nonparametric Generalization of IRT-Type Models

ERIC Educational Resources Information Center

San Martin, Ernesto; Jara, Alejandro; Rolin, Jean-Marie; Mouchart, Michel

2011-01-01

We study the identification and consistency of Bayesian semiparametric IRT-type models, where the uncertainty on the abilities' distribution is modeled using a prior distribution on the space of probability measures. We show that for the semiparametric Rasch Poisson counts model, simple restrictions ensure the identification of a general…
Estimation Methods for One-Parameter Testlet Models

ERIC Educational Resources Information Center

Jiao, Hong; Wang, Shudong; He, Wei

2013-01-01

This study demonstrated the equivalence between the Rasch testlet model and the three-level one-parameter testlet model and explored the Markov Chain Monte Carlo (MCMC) method for model parameter estimation in WINBUGS. The estimation accuracy from the MCMC method was compared with those from the marginalized maximum likelihood estimation (MMLE)…
An Evaluation of Three Approximate Item Response Theory Models for Equating Test Scores.

ERIC Educational Resources Information Center

Marco, Gary L.; And Others

Three item response models were evaluated for estimating item parameters and equating test scores. The models, which approximated the traditional three-parameter model, included: (1) the Rasch one-parameter model, operationalized in the BICAL computer program; (2) an approximate three-parameter logistic model based on coarse group data divided…
Measuring Model-Based High School Science Instruction: Development and Application of a Student Survey

ERIC Educational Resources Information Center

Fulmer, Gavin W.; Liang, Ling L.

2013-01-01

This study tested a student survey to detect differences in instruction between teachers in a modeling-based science program and comparison group teachers. The Instructional Activities Survey measured teachers' frequency of modeling, inquiry, and lecture instruction. Factor analysis and Rasch modeling identified three subscales, Modeling and…
Another Look at the PART-O Using the Traumatic Brain Injury Model Systems National Database: Scoring to Optimize Psychometrics.

PubMed

Malec, James F; Whiteneck, Gale G; Bogner, Jennifer A

2016-02-01

To integrate previous approaches to scoring the Participation Assessment with Recombined Tools-Objective (PART-O) in a unidimensional scale. Retrospective analysis of PART-O data from the Traumatic Brain Injury Model Systems. Community. Data from individuals (N=469) selected randomly from participants who completed 1-year follow-up in the Traumatic Brain Injury Model Systems were used in Rasch model development. The model was subsequently tested on data from additional random samples of similar size at 1-, 2-, 5-, 10-, and >15-year follow-ups. Not applicable. PART-O. After combining items for productivity and social interaction, the initial analysis at 1-year follow-up indicated relatively good fit to the Rasch model (person reliability=.80) but also suggested item misfit and that the 0-to-5 scale used for most items did not consistently show clear separation between rating levels. Reducing item rating scales to 3 levels (except combined and dichotomous items) resolved these issues and demonstrated good item level discrimination, fit, and person reliability (.81), with no evidence of multidimensionality. These results replicated in analyses at each additional follow-up period. Modifications to item scoring for the PART-O resulted in a unidimensional parametric equivalent measure that addresses previous concerns about competing item relations, and it fit the Rasch model consistently across follow-up periods. The person-item map shows a progression toward greater community participation from solitary and dyadic activities, such as leaving the house and having a friend through social and productivity activities, to group activities with others who share interests or beliefs. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

PubMed Central

Pallant, Julie F; Miller, Renée L; Tennant, Alan

2006-01-01

Background The Edinburgh Postnatal Depression Scale (EPDS) is a 10 item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6), was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results Rasch analysis of the data from the ten item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (chi Square = 82.8, df = 40; p < .001). Removal of two items (items 7 and 8) resulted in a non-significant Item-Trait Interaction total chi-square with a residual mean value for items of -0.467 with a standard deviation of 0.850, showing fit to the model. No DIF existed in the final 8-item scale (EPDS-8) and all items showed fit to model expectations. Principal Components Analysis of the residuals supported the local independence assumption, and unidimensionality of the revised EPDS-8 scale. Revised cut points were identified for EPDS-8 to maintain the case identification of the original scale. Conclusion The results of this study suggest that EPDS, in its original 10 item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight item version (EPDS-8) would provide a more psychometrically robust scale. The revised cut points of 7/8 and 9/10 for the EPDS-8 show high levels of agreement with the original case identification for the EPDS-10. PMID:16768803
A Rasch Perspective

ERIC Educational Resources Information Center

Schumacker, Randall E.; Smith, Everett V., Jr.

2007-01-01

Measurement error is a common theme in classical measurement models used in testing and assessment. In classical measurement models, the definition of measurement error and the subsequent reliability coefficients differ on the basis of the test administration design. Internal consistency reliability specifies error due primarily to poor item…
Parametric analyses of summative scores may lead to conflicting inferences when comparing groups: A simulation study.

PubMed

Khan, Asaduzzaman; Chien, Chi-Wen; Bagraith, Karl S

2015-04-01

To investigate whether using a parametric statistic in comparing groups leads to different conclusions when using summative scores from rating scales compared with using their corresponding Rasch-based measures. A Monte Carlo simulation study was designed to examine between-group differences in the change scores derived from summative scores from rating scales, and those derived from their corresponding Rasch-based measures, using 1-way analysis of variance. The degree of inconsistency between the 2 scoring approaches (i.e. summative and Rasch-based) was examined, using varying sample sizes, scale difficulties and person ability conditions. This simulation study revealed scaling artefacts that could arise from using summative scores rather than Rasch-based measures for determining the changes between groups. The group differences in the change scores were statistically significant for summative scores under all test conditions and sample size scenarios. However, none of the group differences in the change scores were significant when using the corresponding Rasch-based measures. This study raises questions about the validity of the inference on group differences of summative score changes in parametric analyses. Moreover, it provides a rationale for the use of Rasch-based measures, which can allow valid parametric analyses of rating scale data.
A Comparison of Graded Response and Rasch Partial Credit Models with Subjective Well-Being.

ERIC Educational Resources Information Center

Baker, John G.; Rounds, James B.; Zevon, Michael A.

2000-01-01

Compared two multiple category item response theory models using a data set of 52 mood terms with 713 undergraduate psychology students. Comparative model fit for the Samejima (F. Samejima, 1966) logistic model for graded responses and the Masters (G. Masters, 1982) partial credit model favored the former model for this data set. (SLD)
Is the Parkinson Anxiety Scale comparable across raters?

PubMed

Forjaz, Maria João; Ayala, Alba; Martinez-Martin, Pablo; Dujardin, Kathy; Pontone, Gregory M; Starkstein, Sergio E; Weintraub, Daniel; Leentjens, Albert F G

2015-04-01

The Parkinson Anxiety Scale is a new scale developed to measure anxiety severity in Parkinson's disease specifically. It consists of three dimensions: persistent anxiety, episodic anxiety, and avoidance behavior. This study aimed to assess the measurement properties of the scale while controlling for the rater (self- vs. clinician-rated) effect. The Parkinson Anxiety Scale was administered to a cross-sectional multicenter international sample of 362 Parkinson's disease patients. Both patients and clinicians rated the patient's anxiety independently. A many-facet Rasch model design was applied to estimate and remove the rater effect. The following measurement properties were assessed: fit to the Rasch model, unidimensionality, reliability, differential item functioning, item local independency, interrater reliability (self or clinician), and scale targeting. In addition, test-retest stability, construct validity, precision, and diagnostic properties of the Parkinson Anxiety Scale were also analyzed. A good fit to the Rasch model was obtained for Parkinson Anxiety Scale dimensions A and B, after the removal of one item and rescoring of the response scale for certain items, whereas dimension C showed marginal fit. Self versus clinician rating differences were of small magnitude, with patients reporting higher anxiety levels than clinicians. The linear measure for Parkinson Anxiety Scale dimensions A and B showed good convergent construct with other anxiety measures and good diagnostic properties. Parkinson Anxiety Scale modified dimensions A and B provide valid and reliable measures of anxiety in Parkinson's disease that are comparable across raters. Further studies are needed with dimension C. © 2014 International Parkinson and Movement Disorder Society.
Sustaining Equipment and the Rapid Acquisition Process: The Forgotten Phase

DTIC Science & Technology

2012-02-24

Operation of the Defense Acquisition System,” December 8, 2008. 7 Rasch , Robert. A, Jr. Lessons Learned from Rapid Acquisition: Better, Faster, Cheaper...Life Cycle Management Responsibilities,” Defense AR Journal, 17.2 (April 2010): 183. 37 Robert A. Rasch , Lessons Learned from Rapid Acquisition: Better...Accountability Office (GAO) Report, Subject: Rapid Acquisition of Mine Resistant Protected Vehicles, July 15, 2008, 4. 39 Ibid. 40 Ibid. 41 Robert A. Rasch
Integration of climatic indices in an objective probabilistic model for establishing and mapping viticultural climatic zones in a region

NASA Astrophysics Data System (ADS)

Moral, Francisco J.; Rebollo, Francisco J.; Paniagua, Luis L.; García, Abelardo; Honorio, Fulgencio

2016-05-01

Different climatic indices have been proposed to determine the wine suitability in a region. Some of them are related to the air temperature, but the hydric component of climate should also be considered which, in turn, is influenced by the precipitation during the different stages of the grapevine growing and ripening periods. In this study, we propose using the information obtained from ten climatic indices [heliothermal index (HI), cool night index (CI), dryness index (DI), growing season temperature (GST), the Winkler index (WI), September mean thermal amplitude (MTA), annual precipitation (AP), precipitation during flowering (PDF), precipitation before flowering (PBF), and summer precipitation (SP)] as inputs in an objective and probabilistic model, the Rasch model, with the aim of integrating the individual effects of them, obtaining the climate data that summarize all main climatic indices, which could influence on wine suitability from a climate viewpoint, and utilizing the Rasch measures to generate homogeneous climatic zones. The use of the Rasch model to estimate viticultural climatic suitability constitutes a new application of great practical importance, enabling to rationally determine locations in a region where high viticultural potential exists and establishing a ranking of the climatic indices which exerts an important influence on wine suitability in a region. Furthermore, from the measures of viticultural climatic suitability at some locations, estimates can be computed using a geostatistical algorithm, and these estimates can be utilized to map viticultural climatic zones in a region. To illustrate the process, an application to Extremadura, southwestern Spain, is shown.
Using the Rasch Model to Measure the Extent to which Students Work Conceptually with Mathematics.

PubMed

Kaspersen, Eivind

2015-01-01

Differences between working conceptually and procedurally with mathematics are well documented. In short, working procedurally can be characterized as learning and applying rules without reason. Working conceptually, in contrast, means creating and applying a web of knowledge. To continue this line of research, an instrument that is able to measure the level of conceptual work, and that is based on the basic requirements of measurement, is desireable. As such, this paper presents a Rasch calibrated instrument that measures the extent to which students work conceptually with mathematics. From a sample of 133 student teachers and 185 Civil Engineering students, 20 items are concluded as being productive for measurement.
Parameter Recovery for the 1-P HGLLM with Non-Normally Distributed Level-3 Residuals

ERIC Educational Resources Information Center

Kara, Yusuf; Kamata, Akihito

2017-01-01

A multilevel Rasch model using a hierarchical generalized linear model is one approach to multilevel item response theory (IRT) modeling and is referred to as a one-parameter hierarchical generalized linear logistic model (1-P HGLLM). Although it has the flexibility to model nested structure of data with covariates, the model assumes the normality…
Using Rasch Measurement to Develop a Computer Modeling-Based Instrument to Assess Students' Conceptual Understanding of Matter

ERIC Educational Resources Information Center

Wei, Silin; Liu, Xiufeng; Wang, Zuhao; Wang, Xingqiao

2012-01-01

Research suggests that difficulty in making connections among three levels of chemical representations--macroscopic, submicroscopic, and symbolic--is a primary reason for student alternative conceptions of chemistry concepts, and computer modeling is promising to help students make the connections. However, no computer modeling-based assessment…
The Detection and Correction of Bias in Student Ratings of Instruction.

ERIC Educational Resources Information Center

Haladyna, Thomas; Hess, Robert K.

1994-01-01

A Rasch model was used to detect and correct bias in Likert rating scales used to assess student perceptions of college teaching, using a database of ratings. Statistical corrections were significant, supporting the model's potential utility. Recommendations are made for a theoretical rationale and further research on the model. (Author/MSE)
Comparing Latent Structures of the Grade of Membership, Rasch, and Latent Class Models

ERIC Educational Resources Information Center

Erosheva, Elena A.

2005-01-01

This paper focuses on model interpretation issues and employs a geometric approach to compare the potential value of using the Grade of Membership (GoM) model in representing population heterogeneity. We consider population heterogeneity manifolds generated by letting subject specific parameters vary over their natural range, while keeping other…
Clinical outcome measurement: Models, theory, psychometrics and practice.

PubMed

McClimans, Leah; Browne, John; Cano, Stefan

In the last decade much has been made of the role that models play in the epistemology of measurement. Specifically, philosophers have been interested in the role of models in producing measurement outcomes. This discussion has proceeded largely within the context of the physical sciences, with notable exceptions considering measurement in economics. However, models also play a central role in the methods used to develop instruments that purport to quantify psychological phenomena. These methods fall under the umbrella term 'psychometrics'. In this paper, we focus on Clinical Outcome Assessments (COAs) and discuss two measurement theories and their associated models: Classical Test Theory (CTT) and Rasch Measurement Theory. We argue that models have an important role to play in coordinating theoretical terms with empirical content, but to do so they must serve: 1) as a representation of the measurement interaction; and 2) in conjunction with a theory of the attribute in which we are interested. We conclude that Rasch Measurement Theory is a more promising approach than CTT in these regards despite the latter's popularity with health outcomes researchers. Copyright © 2017. Published by Elsevier Ltd.
An alternative to Rasch analysis using triadic comparisons and multi-dimensional scaling

NASA Astrophysics Data System (ADS)

Bradley, C.; Massof, R. W.

2016-11-01

Rasch analysis is a principled approach for estimating the magnitude of some shared property of a set of items when a group of people assign ordinal ratings to them. In the general case, Rasch analysis not only estimates person and item measures on the same invariant scale, but also estimates the average thresholds used by the population to define rating categories. However, Rasch analysis fails when there is insufficient variance in the observed responses because it assumes a probabilistic relationship between person measures, item measures and the rating assigned by a person to an item. When only a single person is rating all items, there may be cases where the person assigns the same rating to many items no matter how many times he rates them. We introduce an alternative to Rasch analysis for precisely these situations. Our approach leverages multi-dimensional scaling (MDS) and requires only rank orderings of items and rank orderings of pairs of distances between items to work. Simulations show one variant of this approach - triadic comparisons with non-metric MDS - provides highly accurate estimates of item measures in realistic situations.

The Swedish version of the Acceptance of Chronic Health Conditions Scale for people with multiple sclerosis: Translation, cultural adaptation and psychometric properties.

PubMed

Forslin, Mia; Kottorp, Anders; Kierkegaard, Marie; Johansson, Sverker

2016-11-11

To translate and culturally adapt the Acceptance of Chronic Health Conditions (ACHC) Scale for people with multiple sclerosis into Swedish, and to analyse the psychometric properties of the Swedish version. Ten people with multiple sclerosis participated in translation and cultural adaptation of the ACHC Scale; 148 people with multiple sclerosis were included in evaluation of the psychometric properties of the scale. Translation and cultural adaptation were carried out through translation and back-translation, by expert committee evaluation and pre-test with cognitive interviews in people with multiple sclerosis. The psychometric properties of the Swedish version were evaluated using Rasch analysis. The Swedish version of the ACHC Scale was an acceptable equivalent to the original version. Seven of the original 10 items fitted the Rasch model and demonstrated ability to separate between groups. A 5-item version, including 2 items and 3 super-items, demonstrated better psychometric properties, but lower ability to separate between groups. The Swedish version of the ACHC Scale with the original 10 items did not fit the Rasch model. Two solutions, either with 7 items (ACHC-7) or with 2 items and 3 super-items (ACHC-5), demonstrated acceptable psychometric properties. Use of the ACHC-5 Scale with super-items is recommended, since this solution adjusts for local dependency among items.
Validity of personality measurement in adults with anxiety disorders: psychometric properties of the Spanish NEO-FFI-R using Rasch analyses

PubMed Central

Inchausti, Felix; Mole, Joe; Fonseca-Pedrero, Eduardo; Ortuño-Sierra, Javier

2015-01-01

The aim of this study was to analyse the psychometric properties of the Spanish NEO Five Factor Inventory–Revised (NEO-FFI-R) using Rasch analyses, in order to test its rating scale functioning, the reliability of scores, internal structure, and differential item functioning (DIF) by gender in a psychiatric sample. The NEO-FFI-R responses of 433 Spanish adults (154 males) with an anxiety disorder as primary diagnosis were analysed using the Rasch model for rating scales. Two intermediate categories of response (‘neutral’ and ‘agree’) malfunctioned in the Neuroticism and Conscientiousness scales. In addition, model reliabilities were lower than expected in Agreeableness and Neuroticism, and the item fit values indicated each scale had items that did not achieve moderate to high discrimination on its dimension, particularly in the Agreeableness scale. Concerning unidimensionality, the five NEO-FFI-R scales showed large first components of unexplained variance. Finally, DIF by gender was detected in many items. The results suggest that the scores of the Spanish NEO-FFI-R are unreliable in psychiatric samples and cannot be generalized between males and females, especially in the Openness, Conscientiousness, and Agreeableness scales. Future directions for testing and refinement should be developed before the NEO-FFI-R can be used reliably in clinical samples. PMID:25954224
Limits on Log Cross-Product Ratios for Item Response Models. Research Report. ETS RR-06-10

ERIC Educational Resources Information Center

Haberman, Shelby J.; Holland, Paul W.; Sinharay, Sandip

2006-01-01

Bounds are established for log cross-product ratios (log odds ratios) involving pairs of items for item response models. First, expressions for bounds on log cross-product ratios are provided for unidimensional item response models in general. Then, explicit bounds are obtained for the Rasch model and the two-parameter logistic (2PL) model.…
Measuring the impact of health problems among adults with limited mobility in Thailand: further validation of the Perceived Impact of Problem Profile

PubMed Central

Misajon, RoseAnne; Pallant, Julie F; Manderson, Lenore; Chirawatkul, Siriporn

2008-01-01

Background The Perceived Impact of Problem Profile (PIPP) was developed to provide a tool for measuring the impact of a health condition from the individual's perspective, using the ICF model as a framework. One of the aims of the ICF is to enable the comparison of data across countries, however, relatively little is known about the subjective experience of disability in middle and low-income countries. The aim of this study was to assess the validity of the Perceived Impact of Problem Profile (PIPP) for use among adults with a disability in Thailand using Rasch analysis. Methods A total of 210 adults with mobility impairment from the urban, rural and remote areas of northeast Thailand completed the PIPP, which contains 23 items assessing both impact and distress across five key domains (Self-care, Mobility, Participation, Relationships, and Psychological Well-being). Rasch analysis, using RUMM2020, was conducted to assess the internal validity and psychometric properties of the PIPP Impact subscales. Validation of the PIPP Impact scales was conducted by comparing scores across the different response levels of the EQ5D items. Results Rasch analysis indicated that participants did not clearly differentiate between 'impact' and 'distress,' the two aspects assessed by the PIPP. Further analyses were therefore limited to the PIPP Impact subscales. These showed adequate psychometric properties, demonstrating fit to the Rasch model and good person separation reliability. Preliminary validity testing using the EQ5D items provided support for the PIPP Impact subscales. Conclusion The results provide further support for the psychometric properties of the PIPP Impact scales and indicate that it is a suitable tool for use among adults with a locomotor disability in Thailand. Further research is needed to validate the PIPP across different cultural contexts and health conditions and to assess the usefulness of separate Impact and Distress subscales. PMID:18208616
Recent advances in analysis of differential item functioning in health research using the Rasch model.

PubMed

Hagquist, Curt; Andrich, David

2017-09-19

Rasch analysis with a focus on Differential Item Functioning (DIF) is increasingly used for examination of psychometric properties of health outcome measures. To take account of DIF in order to retain precision of measurement, split of DIF-items into separate sample specific items has become a frequently used technique. The purpose of the paper is to present and summarise recent advances of analysis of DIF in a unified methodology. In particular, the paper focuses on the use of analysis of variance (ANOVA) as a method to simultaneously detect uniform and non-uniform DIF, the need to distinguish between real and artificial DIF and the trade-off between reliability and validity. An illustrative example from health research is used to demonstrate how DIF, in this case between genders, can be identified, quantified and under specific circumstances accounted for using the Rasch model. Rasch analyses of DIF were conducted of a composite measure of psychosomatic problems using Swedish data from the Health Behaviour in School-aged Children study for grade 9 students collected during the 1985-2014 time periods. The procedures demonstrate how DIF can be identified efficiently by ANOVA of residuals, and how the magnitude of DIF can be quantified and potentially accounted for by resolving items according to identifiable groups and using principles of test equating on the resolved items. The results of the analysis also show that the real DIF in some items does affect person measurement estimates. Firstly, in order to distinguish between real and artificial DIF, the items showing DIF initially should not be resolved simultaneously but sequentially. Secondly, while resolving instead of deleting a DIF item may retain reliability, both options may affect the content validity negatively. Resolving items with DIF is not justified if the source of the DIF is relevant for the content of the variable; then resolving DIF may deteriorate the validity of the instrument. Generally, decisions on resolving items to deal with DIF should also rely on external information.
Rasch Analysis for Binary Data with Nonignorable Nonresponses

ERIC Educational Resources Information Center

Bertoli-Barsotti, Lucio; Punzo, Antonio

2013-01-01

This paper introduces a two-dimensional Item Response Theory (IRT) model to deal with nonignorable nonresponses in tests with dichotomous items. One dimension provides information about the omitting behavior, while the other dimension is related to the person's "ability". The idea of embedding an IRT model for missingness into the measurement…
Consistency of Rasch Model Parameter Estimation: A Simulation Study.

ERIC Educational Resources Information Center

van den Wollenberg, Arnold L.; And Others

1988-01-01

The unconditional--simultaneous--maximum likelihood (UML) estimation procedure for the one-parameter logistic model produces biased estimators. The UML method is inconsistent and is not a good alternative to conditional maximum likelihood method, at least with small numbers of items. The minimum Chi-square estimation procedure produces unbiased…
Comparing the High School English Curriculum in Turkey through Multi-Analysis

ERIC Educational Resources Information Center

Batdi, Veli

2017-01-01

This study aimed to compare the High School English Curriculum (HSEC) in accordance with Stufflebeam's context, input, process and product (CIPP) model through multi-analysis. The research includes both quantitative and qualitative aspects. A descriptive analysis was operated through Rasch Measurement Model; SPSS program for the quantitative…
The Analysis of Measurement Equivalence in International Studies Using the Rasch Model

ERIC Educational Resources Information Center

Schulz, Wolfram; Fraillon, Julian

2011-01-01

When comparing data derived from tests or questionnaires in cross-national studies, researchers commonly assume measurement invariance in their underlying scaling models. However, different cultural contexts, languages, and curricula can have powerful effects on how students respond in different countries. This article illustrates how the…
Rasch analysis of the Mini-Mental Adjustment to Cancer Scale (mini-MAC) among a heterogeneous sample of long-term cancer survivors: A cross-sectional study

PubMed Central

2012-01-01

Background The mini-Mental Adjustment to Cancer Scale (mini-MAC) is a well-recognised, popular measure of coping in psycho-oncology and assesses five cancer-specific coping strategies. It has been suggested that these five subscales could be grouped to form the over-arching adaptive and maladptive coping subscales to facilitate the interpretation and clinical application of the scale. Despite the popularity of the mini-MAC, few studies have examined its psychometric properties among long-term cancer survivors, and further validation of the mini-MAC is needed to substantiate its use with the growing population of survivors. Therefore, this study examined the psychometric properties and dimensionality of the mini-MAC in a sample of long-term cancer survivors using Rasch analysis. Methods RUMM 2030 was used to analyse the mini-MAC data (n=851). Separate Rasch analyses were conducted for each of the original mini-MAC subscales as well as the over-arching adaptive and maladaptive coping subscales to examine summary and individual model fit statistics, person separation index (PSI), response format, local dependency, targeting, item bias (or differential item functioning -DIF), and dimensionality. Results For the fighting spirit, fatalism, and helplessness-hopelessness subscales, a revised three-point response format seemed more optimal than the original four-point response. To achieve model fit, items were deleted from four of the five subscales – Anxious Preoccupation items 7, 25, and 29; Cognitive Avoidance items 11 and 17; Fighting Spirit item 18; and Helplessness-Hopelessness items 16 and 20. For those subscales with sufficient items, analyses supported unidimensionality. Combining items to form the adaptive and maladaptive subscales was partially supported. Conclusions The original five subscales required item deletion and/or rescaling to improve goodness of fit to the Rasch model. While evidence was found for overarching subscales of adaptive and maladaptive coping, extensive modifications were necessary to achieve this result. Further exploration and validation of over-arching subscales assessing adaptive and maladaptive coping is necessary with cancer survivors. PMID:22607052
Validation of the Epworth Sleepiness Scale for Children and Adolescents using Rasch analysis.

PubMed

Janssen, Kitty C; Phillipson, Sivanes; O'Connor, Justen; Johns, Murray W

2017-05-01

A validated measure of daytime sleepiness for adolescents is needed to better explore emerging relationships between sleepiness and the mental and physical health of adolescents. The Epworth Sleepiness Scale (ESS) is a widely used scale for daytime sleepiness in adults but contains references to alcohol and driving. The Epworth Sleepiness Scale for Children and Adolescents (ESS-CHAD) has been proposed as the official modified version of the ESS for children and adolescents. This study describes the psychometric analysis of the ESS-CHAD as a measure of daytime sleepiness for adolescents. The ESS-CHAD was completed by 297 adolescents, 12-18 years old, from two independent schools in Victoria, Australia. Exploratory factor analysis and Rasch analysis was conducted to determine the validity of the scale. Exploratory factor analysis and Rasch analysis indicated that ESS-CHAD has internal validity and a unidimensional structure with good model fit. Rasch analysis of four subgroups based on gender and year-level were consistent with the overall results. The results were consistent with published ESS results, which strongly indicates that the changes to the scale do not affect the scale's capacity to measure daytime sleepiness. It is concluded that the ESS-CHAD is a reliable and internally valid measure of daytime sleepiness in adolescents 12-18 years old. Further studies are needed to establish the internal validity of the ESS-CHAD for children under 12 years, and to establish external validity and accurate cut-off points for children and adolescents. Copyright © 2017 Elsevier B.V. All rights reserved.
Assessment of the dimensionality of the Wijma delivery expectancy/experience questionnaire using factor analysis and Rasch analysis.

PubMed

Pallant, J F; Haines, H M; Green, P; Toohill, J; Gamble, J; Creedy, D K; Fenwick, J

2016-11-21

Fear of childbirth has negative consequences for a woman's physical and emotional wellbeing. The most commonly used measurement tool for childbirth fear is the Wijma Delivery Expectancy Questionnaire (WDEQ-A). Although originally conceptualized as unidimensional, subsequent investigations have suggested it is multidimensional. This study aimed to undertake a detailed psychometric assessment of the WDEQ-A; exploring the dimensionality and identifying possible subscales that may have clinical and research utility. WDEQ-A was administered to a sample of 1410 Australian women in mid-pregnancy. The dimensionality of WDEQ-A was explored using exploratory (EFA) and confirmatory factor analysis (CFA), and Rasch analysis. EFA identified a four factor solution. CFA failed to support the unidimensional structure of the original WDEQ-A, but confirmed the four factor solution identified by EFA. Rasch analysis was used to refine the four subscales (Negative emotions: five items; Lack of positive emotions: five items; Social isolation: four items; Moment of birth: three items). Each WDEQ-A Revised subscale showed good fit to the Rasch model and adequate internal consistency reliability. The correlation between Negative emotions and Lack of positive emotions was strong, however Moment of birth and Social isolation showed much lower intercorrelations, suggesting they should not be added to create a total score. This study supports the findings of other investigations that suggest the WDEQ-A is multidimensional and should not be used in its original form. The WDEQ-A Revised may provide researchers with a more refined, psychometrically sound tool to explore the differential impact of aspects of childbirth fear.
Using Rasch Analysis to Evaluate the Reliability and Validity of the Swallowing Quality of Life Questionnaire: An Item Response Theory Approach.

PubMed

Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica

2018-02-01

The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
Calibrating Communication Competencies

NASA Astrophysics Data System (ADS)

Surges Tatum, Donna

2016-11-01

The Many-faceted Rasch measurement model is used in the creation of a diagnostic instrument by which communication competencies can be calibrated, the severity of observers/raters can be determined, the ability of speakers measured, and comparisons made between various groups.
A mixed Rasch model of dual-process conditional reasoning.

PubMed

Bonnefon, Jean-François; Eid, Michael; Vautier, Stéphane; Jmel, Saïd

2008-05-01

A fine-grained dual-process approach to conditional reasoning is advocated: Responses to conditional syllogisms are reached through the operation of either one of two systems, each of which can rely on two different mechanisms. System1 relies either on pragmatic implicatures or on the retrieval of information from semantic memory; System2 operates first through inhibition of System1, then (but not always) through activation of analytical processes. It follows that reasoners will fall into one of four groups of increasing reasoning ability, each group being uniquely characterized by (a) the modal pattern of individual answers to blocks of affirming the consequent (AC), denying the antecedent (DA), and modus tollens (MT) syllogisms featuring the same conditional; and (b) the average rate of determinate answers to AC, DA, and MT. This account receives indirect support from the extant literature and direct support from a mixed Rasch model of responses given to 18 syllogisms by 486 adult reasoners.
Conceptualising computerized adaptive testing for measurement of latent variables associated with physical objects

NASA Astrophysics Data System (ADS)

Camargo, F. R.; Henson, B.

2015-02-01

The notion of that more or less of a physical feature affects in different degrees the users' impression with regard to an underlying attribute of a product has frequently been applied in affective engineering. However, those attributes exist only as a premise that cannot directly be measured and, therefore, inferences based on their assessment are error-prone. To establish and improve measurement of latent attributes it is presented in this paper the concept of a stochastic framework using the Rasch model for a wide range of independent variables referred to as an item bank. Based on an item bank, computerized adaptive testing (CAT) can be developed. A CAT system can converge into a sequence of items bracketing to convey information at a user's particular endorsement level. It is through item banking and CAT that the financial benefits of using the Rasch model in affective engineering can be realised.
Evaluating a technical university's placement test using the Rasch measurement model

NASA Astrophysics Data System (ADS)

Salleh, Tuan Salwani; Bakri, Norhayati; Zin, Zalhan Mohd

2016-10-01

This study discusses the process of validating a mathematics placement test at a technical university. The main objective is to produce a valid and reliable test to measure students' prerequisite knowledge to learn engineering technology mathematics. It is crucial to have a valid and reliable test as the results will be used in a critical decision making to assign students into different groups of Technical Mathematics 1. The placement test which consists of 50 mathematics questions were tested on 82 new diplomas in engineering technology students at a technical university. This study employed rasch measurement model to analyze the data through the Winsteps software. The results revealed that there are ten test questions lower than less able students' ability. Nevertheless, all the ten questions satisfied infit and outfit standard values. Thus, all the questions can be reused in the future placement test at the technical university.
Rasch measurement of self-regulated learning in an information and communication technology (ICT)-rich environment.

PubMed

Njiru, Joseph N; Waugh, Russell F

2007-01-01

This report describes how a linear scale of self-regulated learning in an ICT-rich environment was created by analysing student data using the Rasch measurement model. A person convenience sample of (N = 409) university students in Western Australia was used. The stem-item sample was initially 41, answered in two perspectives ("I aim for this" and "I actually do this"), and reduced to 16 that fitted the measurement model to form a unidimensional scale. Items for motivation (extrinsic rewards, intrinsic rewards, and social rewards), academic goals (fear of performing poorly) (but not standards), self-learning beliefs (ability and interest), task management (strategies and time management) (but not cooperative learning), Volition (action control (but not environmental control), and self-evaluation (cognitive self-evaluation and metacognition) fitted the measurement model. The proportion of observed variance considered true was 0.90. A new instrument is proposed to handle the conceptually valid but non-fitting items. Characteristics of high self-regulated learners are measured.
Rasch-modeling the Portuguese SOCRATES in a clinical sample.

PubMed

Lopes, Paulo; Prieto, Gerardo; Delgado, Ana R; Gamito, Pedro; Trigo, Hélder

2010-06-01

The Stages of Change Readiness and Treatment Eagerness Scale (SOCRATES) assesses motivation for treatment in the drug-dependent population. The development of adequate measures of motivation is needed in order to properly understand the role of this construct in rehabilitation. This study probed the psychometric properties of the SOCRATES in the Portuguese population by means of the Rasch Rating Scale Model, which allows the conjoint measurement of items and persons. The participants were 166 substance abusers under treatment for their addiction. Results show that the functioning of the five response categories is not optimal; our re-analysis indicates that a three-category system is the most appropriate one. By using this response category system, both model fit and estimation accuracy are improved. The discussion takes into account other factors such as item format and content in order to make suggestions for the development of better motivation-for-treatment scales. (PsycINFO Database Record (c) 2010 APA, all rights reserved).
Fitting the Rasch Model to Account for Variation in Item Discrimination

ERIC Educational Resources Information Center

Weitzman, R. A.

2009-01-01

Building on the Kelley and Gulliksen versions of classical test theory, this article shows that a logistic model having only a single item parameter can account for varying item discrimination, as well as difficulty, by using item-test correlations to adjust incorrect-correct (0-1) item responses prior to an initial model fit. The fit occurs…

Item Construction Using Reflective, Formative, or Rasch Measurement Models: Implications for Group Work

ERIC Educational Resources Information Center

Peterson, Christina Hamme; Gischlar, Karen L.; Peterson, N. Andrew

2017-01-01

Measures that accurately capture the phenomenon are critical to research and practice in group work. The vast majority of group-related measures were developed using the reflective measurement model rooted in classical test theory (CTT). Depending on the construct definition and the measure's purpose, the reflective model may not always be the…
Cross-cultural validation of the National Eye Institute Visual Function Questionnaire.

PubMed

Mollazadegan, Kaziwe; Huang, Jinhai; Khadka, Jyoti; Wang, Qinmei; Yang, Feng; Gao, Rongrong; Pesudovs, Konrad

2014-05-01

To assess the native and the previously Rasch-modified National Eye Institute Visual Function Questionnaire (NEI VFQ) scales in a Chinese population. Eye Hospital of Wenzhou Medical University, Wenzhou, China. Questionnaire development. Patients on the waiting list for cataract surgery completed the 39-item NEI VFQ (NEI VFQ-39). Rasch analysis was performed in 3 steps as follows: (1) Assess the psychometric properties of the original NEI VFQ. (2) Reassess the previously proposed Rasch-modified NEI VFQ scales by Pesudovs et al. (2010) in Chinese populations. (3) Compare the scores of previously recommended scales of the NEI VFQ with new Rasch-modified scales of the same questionnaire using Bland-Altman plots. Four hundred thirty-five patients (median age 70 years; range 35 to 90 years) completed the NEI VFQ-39. Response categories for 4 question types were dysfunctional and therefore repaired. The original NEI VFQ-39 and NEI VFQ-25 showed good measurement precision. However, both versions showed multidimensionality, misfitting items, suboptimum targeting, and nonfunctioning subscales. Using the previously proposed Rasch-modified scales of the NEI VFQ yielded valid measurement of each construct in the 39-item and 25-item questionnaire. Comparison between the earlier proposed NEI VFQ scales and the new versions developed in this population showed good agreement. The original NEI VFQ was once again found to be flawed. The previously proposed Rasch-analyzed versions of the NEI VFQ and the new Chinese versions showed good agreement. Copyright © 2014 ASCRS and ESCRS. Published by Elsevier Inc. All rights reserved.
Rasch Measurement of Collaborative Problem Solving in an Online Environment.

PubMed

Harding, Susan-Marie E; Griffin, Patrick E

2016-01-01

This paper describes an approach to the assessment of human to human collaborative problem solving using a set of online interactive tasks completed by student dyads. Within the dyad, roles were nominated as either A or B and students selected their own roles. The question as to whether role selection affected individual student performance measures is addressed. Process stream data was captured from 3402 students in six countries who explored the problem space by clicking, dragging the mouse, moving the cursor and collaborating with their partner through a chat box window. Process stream data were explored to identify behavioural indicators that represented elements of a conceptual framework. These indicative behaviours were coded into a series of dichotomous items. These items represented actions and chats performed by students. The frequency of occurrence was used as a proxy measure of item difficulty. Then given a measure of item difficulty, student ability could be estimated using the difficulty estimates of the range of items demonstrated by the student. The Rasch simple logistic model was used to review the indicators to identify those that were consistent with the assumptions of the model and were invariant across national samples, language, curriculum and age of the student. The data were analysed using a one and two dimension, one parameter model. Rasch separation reliability, fit to the model, distribution of students and items on the underpinning construct, estimates for each country and the effect of role differences are reported. This study provides evidence that collaborative problem solving can be assessed in an online environment involving human to human interaction using behavioural indicators shown to have a consistent relationship between the estimate of student ability, and the probability of demonstrating the behaviour.
Harmonizing routinely collected health information for strengthening quality management in health systems: requirements and practice.

PubMed

Prodinger, Birgit; Tennant, Alan; Stucki, Gerold; Cieza, Alarcos; Üstün, Tevfik Bedirhan

2016-10-01

Our aim was to specify the requirements of an architecture to serve as the foundation for standardized reporting of health information and to provide an exemplary application of this architecture. The World Health Organization's International Classification of Functioning, Disability and Health (ICF) served as the conceptual framework. Methods to establish content comparability were the ICF Linking Rules. The Rasch measurement model, as a special case of additive conjoint measurement, which satisfies the required criteria for fundamental measurement, allowed for the development of a common metric foundation for measurement unit conversion. Secondary analysis of data from the North Yorkshire Survey was used to illustrate these methods. Patients completed three instruments and the items were linked to the ICF. The Rasch measurement model was applied, first to each scale, and then to items across scales which were linked to a common domain. Based on the linking of items to the ICF, the majority of items were grouped into two domains, Mobility and Self-care. Analysis of the individual scales and of items linked to a common domain across scales satisfied the requirements of the Rasch measurement model. The measurement unit conversion between items from the three instruments linked to the Mobility and Self-care domains, respectively, was demonstrated. The realization of an ICF-based architecture for information on patients' functioning enables harmonization of health information while allowing clinicians and researchers to continue using their existing instruments. This architecture will facilitate access to comprehensive and consistently reported health information to serve as the foundation for informed decision-making. © The Author(s) 2016.
Using the Rasch measurement model to design a report writing assessment instrument.

PubMed

Carlson, Wayne R

2013-01-01

This paper describes how the Rasch measurement model was used to develop an assessment instrument designed to measure student ability to write law enforcement incident and investigative reports. The ability to write reports is a requirement of all law enforcement recruits in the state of Michigan and is a part of the state's mandatory basic training curriculum, which is promulgated by the Michigan Commission on Law Enforcement Standards (MCOLES). Recently, MCOLES conducted research to modernize its training and testing in the area of report writing. A structured validation process was used, which included: a) an examination of the job tasks of a patrol officer, b) input from content experts, c) a review of the professional research, and d) the creation of an instrument to measure student competency. The Rasch model addressed several measurement principles that were central to construct validity, which were particularly useful for assessing student performances. Based on the results of the report writing validation project, the state established a legitimate connectivity between the report writing standard and the essential job functions of a patrol officer in Michigan. The project also produced an authentic instrument for measuring minimum levels of report writing competency, which generated results that are valid for inferences of student ability. Ultimately, the state of Michigan must ensure the safety of its citizens by licensing only those patrol officers who possess a minimum level of core competency. Maintaining the validity and reliability of both the training and testing processes can ensure that the system for producing such candidates functions as intended.
A Practitioner's Instrument for Measuring Secondary Mathematics Teachers' Beliefs Surrounding Learner-Centered Classroom Practice.

PubMed

Lischka, Alyson E; Garner, Mary

In this paper we present the development and validation of a Mathematics Teaching Pedagogical and Discourse Beliefs Instrument (MTPDBI), a 20 item partial-credit survey designed and analyzed using Rasch measurement theory. Items on the MTPDBI address beliefs about the nature of mathematics, teaching and learning mathematics, and classroom discourse practices. A Rasch partial credit model (Masters, 1982) was estimated from the pilot study data. Results show that item separation reliability is .96 and person separation reliability is .71. Other analyses indicate the instrument is a viable measure of secondary teachers' beliefs about reform-oriented mathematics teaching and learning. This instrument is proposed as a useful measure of teacher beliefs for those working with pre-service and in-service teacher development.
Using the Rasch analysis for the psychometric validation of the Irregular Word Reading Test (TeLPI): A Portuguese test for the assessment of premorbid intelligence.

PubMed

Freitas, Sandra; Prieto, Gerardo; Simões, Mário R; Nogueira, Joana; Santana, Isabel; Martins, Cristina; Alves, Lara

2018-05-03

The present study aims to analyze the psychometric characteristics of the TeLPI (Irregular Words Reading Test), a Portuguese premorbid intelligence test, using the Rasch model for dichotomous items. The results reveal an overall adequacy and a good fit of values regarding both items and persons. A high variability of cognitive performance level and a good quality of the measurements were also found. The TeLPI has proved to be a unidimensional measure with reduced DIF effects. The present findings contribute to overcome an important gap in the psychometric validity of this instrument and provide good evidence of the overall psychometric validity of TeLPI results.
An introduction to multidimensional measurement using Rasch models.

PubMed

Briggs, Derek C; Wilson, Mark

2003-01-01

The act of constructing a measure requires a number of important assumptions. Principle among these assumptions is that the construct is unidimensional. In practice there are many instances when the assumption of unidimensionality does not hold, and where the application of a multidimensional measurement model is both technically appropriate and substantively advantageous. In this paper we illustrate the usefulness of a multidimensional approach to measurement with the Multidimensional Random Coefficient Multinomial Logit (MRCML) model, an extension of the unidimensional Rasch model. An empirical example is taken from a collection of embedded assessments administered to 541 students enrolled in middle school science classes with a hands-on science curriculum. Student achievement on these assessments are multidimensional in nature, but can also be treated as consecutive unidimensional estimates, or as is most common, as a composite unidimensional estimate. Structural parameters are estimated for each model using ConQuest, and model fit is compared. Student achievement in science is also compared across models. The multidimensional approach has the best fit to the data, and provides more reliable estimates of student achievement than under the consecutive unidimensional approach. Finally, at an interpretational level, the multidimensional approach may well provide richer information to the classroom teacher about the nature of student achievement.
Applying the Mixed Rasch Model to the Runco Ideational Behavior Scale

ERIC Educational Resources Information Center

Sen, Sedat

2016-01-01

Previous research using creativity assessments has used latent class models and identified multiple classes (a 3-class solution) associated with various domains. This study explored the latent class structure of the Runco Ideational Behavior Scale, which was designed to quantify ideational capacity. A robust state-of the-art technique called the…
Rasch analysis of the Multiple Sclerosis Impact Scale (MSIS-29)

PubMed Central

Ramp, Melina; Khan, Fary; Misajon, Rose Anne; Pallant, Julie F

2009-01-01

Background Multiple Sclerosis (MS) is a degenerative neurological disease that causes impairments, including spasticity, pain, fatigue, and bladder dysfunction, which negatively impact on quality of life. The Multiple Sclerosis Impact Scale (MSIS-29) is a disease-specific health-related quality of life (HRQoL) instrument, developed using the patient's perspective on disease impact. It consists of two subscales assessing the physical (MSIS-29-PHYS) and psychological (MSIS-29-PSYCH) impact of MS. Although previous studies have found support for the psychometric properties of the MSIS-29 using traditional methods of scale evaluation, the scale has not been subjected to a detailed Rasch analysis. Therefore, the objective of this study was to use Rasch analysis to assess the internal validity of the scale, and its response format, item fit, targeting, internal consistency and dimensionality. Methods Ninety-two persons with definite MS residing in the community were recruited from a tertiary hospital database. Patients completed the MSIS-29 as part of a larger study. Rasch analysis was undertaken to assess the psychometric properties of the MSIS-29. Results Rasch analysis showed overall support for the psychometric properties of the two MSIS-29 subscales, however it was necessary to reduce the response format of the MSIS-29-PHYS to a 3-point response scale. Both subscales were unidimensional, had good internal consistency, and were free from item bias for sex and age. Dimensionality testing indicated it was not appropriate to combine the two subscales to form a total MSIS score. Conclusion In this first study to use Rasch analysis to fully assess the psychometric properties of the MSIS-29 support was found for the two subscales but not for the use of the total scale. Further use of Rasch analysis on the MSIS-29 in larger and broader samples is recommended to confirm these findings. PMID:19545445
Expert Panels, Consumers, and Chemistry.

ERIC Educational Resources Information Center

Rehfeldt, Thomas K.

2000-01-01

Studied the attributes, properties, and consumer acceptance of antiperspirant products through responses of 400 consumers (consumer data), expert panel data, and analytical data about the products. Results show how the Rasch model can provide the tool necessary to combine data from several sources. (SLD)
Rasch validation of the Chinese parent-child interaction scale (CPCIS).

PubMed

Ip, Patrick; Tso, Winnie; Rao, Nirmala; Ho, Frederick Ka Wing; Chan, Ko Ling; Fu, King Wa; Li, Sophia Ling; Goh, Winnie; Wong, Wilfred Hing-Sang; Chow, Chun Bong

2018-03-15

Proper parent-child interaction is crucial for child development, but an assessment tool in Chinese is currently lacking. This study aimed to develop and validate a parent-reported parent-child interaction scale for Chinese preschool children. The Chinese parent-child interaction scale (CPCIS) was designed by an expert panel based on the literature and clinical observations in the Chinese context. The initial CPCIS had 14 parent-child interactive activity items. Psychometric properties of the CPCIS were examined using the Rasch model and confirmatory factor analysis (CFA). Convergent validity was investigated by the associations between CPCIS and family income, maternal education level, and children's school readiness. The study recruited 567 Chinese parent-child pairs from diverse socioeconomic backgrounds, who completed the CPCIS. Six out of the 14 items in the initial CPCIS were dropped due to suboptimal fit values. The refined 8-item CPCIS was shown to be valid and reliable by Rasch models and CFA. The person separation reliability and Cronbach's α of the CPCIS were 0.81 and 0.82, respectively. The CPCIS scores were positively associated with family's socioeconomic status (η 2 = 0.05, P < 0.001), maternal education level (η 2 = 0.08, P < 0.001), and children's school readiness (η 2 = 0.01, P < 0.01). CPCIS is an easily administered, valid, and reliable tool for the assessment of parent-child interactions in Chinese families.
Construct Validity of Science Motivation and Beliefs Instrument (SLA-MB): A Case study in Sumedang, Indonesia

NASA Astrophysics Data System (ADS)

Rachmatullah, A.; Octavianda, R. P.; Ha, M.; Rustaman, N. Y.; Diana, S.

2017-02-01

Along with numerous instruments developed and used in science education researches, some of those instruments have been translated to local language in the country where the instruments were used. Most of researchers that used those translated instruments did not report the quality of those translated instruments. One of the instruments is the Scientific Literacy Assessment (SLA) including the Science Motivation and Beliefs (SLA-MB) as part of the SLA. In this study, the SLA-MB has been translated into Indonesian Language (Bahasa). The purpose of this study is to investigate the SLA-MB instrument that has been translated to Indonesian language from the view of dimensionality, reliability, item quality and differential item functioning (DIF) based on IRT-Rasch analysis. We used Conquest and Winstep as the program for IRT-Rasch analysis. We employed quantitative research method with school-survey on this study. Research subjects are 223 Indonesian Middle school students (age 13-16), with 64 boys and 159 girls. IRT-Rasch analysis of the SLA-MB Indonesian version indicated that a three-dimensional model fit significantly better than one-dimension model, and the reliability of each dimensions are about 0.60 to 0.82. As well as those findings, fit values of all items are acceptable, moreover we found no DIF for all of the SLA-MB items. Overall, our study suggests that Indonesian version of SLA-MB is acceptable to be implemented as research instrument conducted in Indonesia.
A Comparison of the Fit of Empirical Data to Two Latent Trait Models. Report No. 92.

ERIC Educational Resources Information Center

Hutten, Leah R.

Goodness of fit of raw test score data were compared, using two latent trait models: the Rasch model and the Birnbaum three-parameter logistic model. Data were taken from various achievement tests and the Scholastic Aptitude Test (Verbal). A minimum sample size of 1,000 was required, and the minimum test length was 40 items. Results indicated that…
Modeling Local Item Dependence Due to Common Test Format with a Multidimensional Rasch Model

ERIC Educational Resources Information Center

Baghaei, Purya; Aryadoust, Vahid

2015-01-01

Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…
Associations between the Classroom Learning Environment and Student Engagement in Learning 2: A Structural Equation Modelling Approach

ERIC Educational Resources Information Center

Harbaugh, Allen G.; Cavanagh, Robert F.

2012-01-01

This report is about the second of two phases in an investigation into associations between student engagement in classroom learning and the classroom-learning environment. Whereas the first phase utilized Rasch modelling (Cavanagh, 2012), this report uses latent variable modelling to explore the data. The investigations in both phases of this…
The Assessment of Physiotherapy Practice (APP) is a valid measure of professional competence of physiotherapy students: a cross-sectional study with Rasch analysis.

PubMed

Dalton, Megan; Davidson, Megan; Keating, Jenny

2011-01-01

Is the Assessment of Physiotherapy Practice (APP) a valid instrument for the assessment of entry-level competence in physiotherapy students? Cross-sectional study with Rasch analysis of initial (n=326) and validation samples (n=318). Students were assessed on completion of 4, 5, or 6-week clinical placements across one university semester. 298 clinical educators and 456 physiotherapy students at nine universities in Australia and New Zealand provided 644 completed APP instruments. APP data in both samples showed overall fit to a Rasch model of expected item functioning for interval scale measurement. Item 6 (Written communication) exhibited misfit in both samples, but was retained as an important element of competence. The hierarchy of item difficulty was the same in both samples with items related to professional behaviour and communication the easiest to achieve and items related to clinical reasoning the most difficult. Item difficulty was well targeted to person ability. No Differential Item Functioning was identified, indicating that the scale performed in a comparable way regardless of the student's age, gender or amount of prior clinical experience, and the educator's age, gender, or experience as an educator, or the type of facility, university, or clinical area. The instrument demonstrated unidimensionality confirming the appropriateness of summing the scale scores on each item to provide an overall score of clinical competence and was able to discriminate four levels of professional competence (Person Separation Index=0.96). Person ability and raw APP scores had a linear relationship (r(2)=0.99). Rasch analysis supports the interpretation that a student's APP score is an indication of their underlying level of professional competence in workplace practice. Copyright © 2011 Australian Physiotherapy Association. Published by .. All rights reserved.
Emotional vitality in caregivers: application of Rasch Measurement Theory with secondary data to development and test a new measure.

PubMed

Barbic, Skye P; Bartlett, Susan J; Mayo, Nancy E

2015-07-01

To describe the practical steps in identifying items and evaluating scoring strategies for a new measure of emotional vitality in informal caregivers of individuals who have experienced a significant health event. The psychometric properties of responses to selected items from validated health-related quality of life and other psychosocial questionnaires administered four times over a one-year period were evaluated using Rasch Measurement Theory. Community. A total of 409 individuals providing informal care at home to older adults who had experienced a recent stroke. Rasch Measurement Theory was used to test the ordering of response option thresholds, fit, spread of the item locations, residual correlations, person separation index, and stability across time. Based on a theoretical framework developed in earlier work, we identified 22 candidate items from a pool of relevant psychosocial measures available. Of these, additional evaluation resulted in 19 items that could be used to assess the five core domains. The overall model fit was reasonable (χ(2) = 202.26, DF = 117, p = 0.06), stable across time, with borderline evidence of multidimensionality (10%). Items and people covered a continuum ranging from -3.7 to +2.7 logits, reflecting coverage of the measurement continuum, with a person separation index of 0.85. Mean fit of caregivers was lower than expected (-1.31 ±1.10 logits). Established methods from the Rasch Measurement Theory were applied to develop a prototype measure of emotional vitality that is acceptable, reliable, and can be used to obtain an interval level score for use in future research and clinical settings. © The Author(s) 2014.
Affective stress responses during leisure time: Validity evaluation of a modified version of the Stress-Energy Questionnaire.

PubMed

Hadžibajramović, Emina; Ahlborg, Gunnar; Håkansson, Carita; Lundgren-Nilsson, Åsa; Grimby-Ekman, Anna

2015-12-01

Psychosocial stress at work is one of the most important factors behind increasing sick-leave rates. In addition to work stressors, it is important to account for non-work-related stressors when assessing stress responses. In this study, a modified version of the Stress-Energy Questionnaire (SEQ), the SEQ during leisure time (SEQ-LT) was introduced for assessing the affective stress response during leisure time. The aim of this study was to investigate the internal construct validity of the SEQ-LT. A second aim was to define the cut-off points for the scales, which could indicate high and low levels of leisure-time stress and energy, respectively. Internal construct validity of the SEQ-LT was evaluated using a Rasch analysis. We examined the unidimensionality and other psychometric properties of the scale by the fit to the Rasch model. A criterion-based approach was used for classification into high and low stress/energy levels. The psychometric properties of the stress and energy scales of the SEQ-LT were satisfactory, having accommodated for local dependency. The cut-off point for low stress was proposed to be in the interval between 2.45 and 3.02 on the Rasch metric score; while for high stress, it was between 3.65 and 3.90. The suggested cut-off points for the low and high energy levels were values between 1.73-1.97 and 2.66-3.08, respectively. The stress and energy scale of the SEQ-LT satisfied the measurement criteria defined by the Rasch analysis and it provided a useful tool for non-work-related assessment of stress responses. We provide guidelines on how to interpret the scale values. © 2015 the Nordic Societies of Public Health.
Self-esteem among nursing assistants: reliability and validity of the Rosenberg Self-Esteem Scale.

PubMed

McMullen, Tara; Resnick, Barbara

2013-01-01

To establish the reliability and validity of the Rosenberg Self-Esteem Scale (RSES) when used with nursing assistants (NAs). Testing the RSES used baseline data from a randomized controlled trial testing the Res-Care Intervention. Female NAs were recruited from nursing homes (n = 508). Validity testing for the positive and negative subscales of the RSES was based on confirmatory factor analysis (CFA) using structural equation modeling and Rasch analysis. Estimates of reliability were based on Rasch analysis and the person separation index. Evidence supports the reliability and validity of the RSES in NAs although we recommend minor revisions to the measure for subsequent use. Establishing reliable and valid measures of self-esteem in NAs will facilitate testing of interventions to strengthen workplace self-esteem, job satisfaction, and retention.

Psychometric analysis of the Multidimensional Fatigue Inventory in a sample of persons treated for myocardial infarction.

PubMed

Fredriksson-Larsson, Ulla; Brink, Eva; Alsén, Pia; Falk, Kristin; Lundgren-Nilsson, Åsa

2015-01-01

Fatigue after myocardial infarction is a frequent and distressing symptom in the early recovery phase. The purpose of this study is to psychometrically evaluate the Multidimensional Fatigue Inventory (MFI-20). The MFI-20 was evaluated using Rasch analysis. The result showed that the MFI-20 can be used to obtain a global score reflecting an underlying unidimensional trait of fatigue; a transformation of the summarized raw scale scores into interval scale scores could be made. Also, 4 of the 5 original dimensions separately fitted the Rasch model. Calculation of a global score increases the possibility of identifying persons experiencing fatigue after myocardial infarction, and using the MFI-20 dimension scores increases the possibility of determining each person's specific fatigue profile.
Controlling the judge variable in grading essay-type items: an application of Rasch analyses to the recruitment exam for Korean public school teachers.

PubMed

Chae, S

1998-01-01

The purpose of this paper is to show how the Rasch measurement model can be used to control the effects of judge variable on the grading of essay-type items in the recruitment test for Korean teachers. Special attention is given to two aspects of judges' involvement in the grading. One is to identify a way to minimize the variation of grading due to judge severity. The other concern is to figure out a way to reduce the number of judges without threatening objectivity of ability estimates. Results from the FACETS analyses tell us not only how much grading standards vary among judges and how to adjust them but also it produces comparably reliable ability estimates with fewer judges.
Checking Dimensionality in Item Response Models with Principal Component Analysis on Standardized Residuals

ERIC Educational Resources Information Center

Chou, Yeh-Tai; Wang, Wen-Chung

2010-01-01

Dimensionality is an important assumption in item response theory (IRT). Principal component analysis on standardized residuals has been used to check dimensionality, especially under the family of Rasch models. It has been suggested that an eigenvalue greater than 1.5 for the first eigenvalue signifies a violation of unidimensionality when there…
A Theory of the Measurement of Knowledge Content, Access, and Learning.

ERIC Educational Resources Information Center

Pirolli, Peter; Wilson, Mark

1998-01-01

An approach to the measurement of knowledge content, knowledge access, and knowledge learning is developed. First a theoretical view of cognition is described, and then a class of measurement models, based on Rasch modeling, is presented. Knowledge access and content are viewed as determining the observable actions selected by an agent to achieve…
School-Level Contextual Effects of Parent Involvement on Children's Achievement during Elementary Grades

ERIC Educational Resources Information Center

Oh, Yoonkyung

2012-01-01

This study used the ECLS-K to examine the contextual influences of parent involvement on children's achievement growth in reading and math during elementary grades. The study used Rasch models and HLM measurement models to develop reliable and valid constructs of parent involvement both at the student and at the school level. Piecewise linear…
Sample Size and Item Parameter Estimation Precision When Utilizing the One-Parameter "Rasch" Model

ERIC Educational Resources Information Center

Custer, Michael

2015-01-01

This study examines the relationship between sample size and item parameter estimation precision when utilizing the one-parameter model. Item parameter estimates are examined relative to "true" values by evaluating the decline in root mean squared deviation (RMSD) and the number of outliers as sample size increases. This occurs across…
Alternative Measurement Paradigms for Measuring Executive Functions: SEM (Formative and Reflective Models) and IRT (Rasch Models)

ERIC Educational Resources Information Center

Engelhard, George, Jr.; Wang, Jue

2014-01-01

The authors of the Focus article pose important questions regarding whether or not performance-based tasks related to executive functioning are best viewed as reflective or formative indicators. Miyake and Friedman (2012) define executive functioning (EF) as "a set of general-purpose control mechanisms, often linked to the prefrontal cortex…
Using Rasch Modeling to Investigate a Learning Progression for Energy Ideas

ERIC Educational Resources Information Center

Herrmann-Abell, Cari F.; DeBoer, George E.

2016-01-01

Energy is a core concept in the teaching of science. Therefore, it is important to know how students' thinking about energy develops so that elementary, middle, and high school students can be appropriately supported in their understanding of energy. This study tests the validity of a proposed theoretical model of students' growth of understanding…
Direct Estimation of Correlation as a Measure of Association Strength Using Multidimensional Item Response Models

ERIC Educational Resources Information Center

Wang, Wen-Chung

2004-01-01

The Pearson correlation is used to depict effect sizes in the context of item response theory. Amultidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…
Psychometric evaluation of Persian Nomophobia Questionnaire: Differential item functioning and measurement invariance across gender.

PubMed

Lin, Chung-Ying; Griffiths, Mark D; Pakpour, Amir H

2018-03-01

Background and aims Research examining problematic mobile phone use has increased markedly over the past 5 years and has been related to "no mobile phone phobia" (so-called nomophobia). The 20-item Nomophobia Questionnaire (NMP-Q) is the only instrument that assesses nomophobia with an underlying theoretical structure and robust psychometric testing. This study aimed to confirm the construct validity of the Persian NMP-Q using Rasch and confirmatory factor analysis (CFA) models. Methods After ensuring the linguistic validity, Rasch models were used to examine the unidimensionality of each Persian NMP-Q factor among 3,216 Iranian adolescents and CFAs were used to confirm its four-factor structure. Differential item functioning (DIF) and multigroup CFA were used to examine whether males and females interpreted the NMP-Q similarly, including item content and NMP-Q structure. Results Each factor was unidimensional according to the Rach findings, and the four-factor structure was supported by CFA. Two items did not quite fit the Rasch models (Item 14: "I would be nervous because I could not know if someone had tried to get a hold of me;" Item 9: "If I could not check my smartphone for a while, I would feel a desire to check it"). No DIF items were found across gender and measurement invariance was supported in multigroup CFA across gender. Conclusions Due to the satisfactory psychometric properties, it is concluded that the Persian NMP-Q can be used to assess nomophobia among adolescents. Moreover, NMP-Q users may compare its scores between genders in the knowledge that there are no score differences contributed by different understandings of NMP-Q items.
Diagnosis of students' ability in a statistical course based on Rasch probabilistic outcome

NASA Astrophysics Data System (ADS)

Mahmud, Zamalia; Ramli, Wan Syahira Wan; Sapri, Shamsiah; Ahmad, Sanizah

2017-06-01

Measuring students' ability and performance are important in assessing how well students have learned and mastered the statistical courses. Any improvement in learning will depend on the student's approaches to learning, which are relevant to some factors of learning, namely assessment methods carrying out tasks consisting of quizzes, tests, assignment and final examination. This study has attempted an alternative approach to measure students' ability in an undergraduate statistical course based on the Rasch probabilistic model. Firstly, this study aims to explore the learning outcome patterns of students in a statistics course (Applied Probability and Statistics) based on an Entrance-Exit survey. This is followed by investigating students' perceived learning ability based on four Course Learning Outcomes (CLOs) and students' actual learning ability based on their final examination scores. Rasch analysis revealed that students perceived themselves as lacking the ability to understand about 95% of the statistics concepts at the beginning of the class but eventually they had a good understanding at the end of the 14 weeks class. In terms of students' performance in their final examination, their ability in understanding the topics varies at different probability values given the ability of the students and difficulty of the questions. Majority found the probability and counting rules topic to be the most difficult to learn.
A Rasch scaling validation of a 'core' near-death experience.

PubMed

Lange, Rense; Greyson, Bruce; Houran, James

2004-05-01

For those with true near-death experiences (NDEs), Greyson's (1983, 1990) NDE Scale satisfactorily fits the Rasch rating scale model, thus yielding a unidimensional measure with interval-level scaling properties. With increasing intensity, NDEs reflect peace, joy and harmony, followed by insight and mystical or religious experiences, while the most intense NDEs involve an awareness of things occurring in a different place or time. The semantics of this variable are invariant across True-NDErs' gender, current age, age at time of NDE, and latency and intensity of the NDE, thus identifying NDEs as 'core' experiences whose meaning is unaffected by external variables, regardless of variations in NDEs' intensity. Significant qualitative and quantitative differences were observed between True-NDErs and other respondent groups, mostly revolving around the differential emphasis on paranormal/mystical/religious experiences vs. standard reactions to threat. The findings further suggest that False-Positive respondents reinterpret other profound psychological states as NDEs. Accordingly, the Rasch validation of the typology proposed by Greyson (1983) also provides new insights into previous research, including the possibility of embellishment over time (as indicated by the finding of positive, as well as negative, latency effects) and the potential roles of religious affiliation and religiosity (as indicated by the qualitative differences surrounding paranormal/mystical/religious issues).
Measuring trust in nurses - Psychometric properties of the Trust in Nurses Scale in four countries.

PubMed

Stolt, Minna; Charalambous, Andreas; Radwin, Laurel; Adam, Christina; Katajisto, Jouko; Lemonidou, Chryssoula; Patiraki, Elisabeth; Sjövall, Katarina; Suhonen, Riitta

2016-12-01

The purpose of this study was to examine psychometric properties of three translated versions of the Trust in Nurses Scale (TNS) and cancer patients' perceptions of trust in nurses in a sample of cancer patients from four European countries. A cross-sectional, cross-cultural, multi-site survey design was used. The data were collected with the Trust in Nurses Scale from patients with different types of malignancies in 17 units within five clinical sites (n = 599) between 09/2012 and 06/2014. Data were analyzed using descriptive and inferential statistics, multivariate methods and psychometrics using exploratory factor analysis, Cronbach's alpha coefficients, item analysis and Rasch analysis. The psychometric properties of the data were consistent in all countries. Within the exploratory factor analysis the principal component analysis supported the one component structure (unidimensionality) of the TNS. The internal consistency reliability was acceptable. The Rasch analysis supported the unidimensionality of the TNS cross-culturally. All items of the TNS demonstrated acceptable goodness-of-fit to the Rasch model. Cancer patients trusted nurses to a great extent although between-country differences were found. The Trust in Nurses Scale proved to be a valid and reliable tool for measuring patients' trust in nurses in oncological settings in international contexts. Copyright © 2016 Elsevier Ltd. All rights reserved.
Exploring the Effects of Rater Linking Designs and Rater Fit on Achievement Estimates within the Context of Music Performance Assessments

ERIC Educational Resources Information Center

Wind, Stefanie A.; Engelhard, George, Jr.; Wesolowski, Brian

2016-01-01

When good model-data fit is observed, the Many-Facet Rasch (MFR) model acts as a linking and equating model that can be used to estimate student achievement, item difficulties, and rater severity on the same linear continuum. Given sufficient connectivity among the facets, the MFR model provides estimates of student achievement that are equated to…
Use of Robust z in Detecting Unstable Items in Item Response Theory Models

ERIC Educational Resources Information Center

Huynh, Huynh; Meyer, Patrick

2010-01-01

The first part of this paper describes the use of the robust z[subscript R] statistic to link test forms using the Rasch (or one-parameter logistic) model. The procedure is then extended to the two-parameter and three-parameter logistic and two-parameter partial credit (2PPC) models. A real set of data was used to illustrate the extension. The…
Combining partially ranked data in plant breeding and biology: II. Analysis with Rasch model.

USDA-ARS?s Scientific Manuscript database

Many years of breeding experiments, germplasm screening, and molecular biologic experimentation have generated volumes of sequence, genotype, and phenotype information that have been stored in public data repositories. These resources afford genetic and genomic researchers the opportunity to handle ...
Objective Measurement of Subjective Well-Being.

ERIC Educational Resources Information Center

Hahn, Elizabeth A.

2000-01-01

Demonstrates the usefulness of the Rasch model in evaluating the cross cultural equivalence of health-related quality of life instruments (HRQOL). Results from 195 U.S. cancer patients and 118 Austrian cancer patients identity biased items, providing a better estimate of each cultural group's HRQOL. (SLD)
Examining the psychometric properties of a sport-related concussion survey: a Rasch measurement approach.

PubMed

Hecimovich, Mark; Marais, Ida

2017-06-26

Awareness of sport-related concussion (SRC) is an essential step in increasing the number of athletes or parents who report on SRC. This awareness is important, as there is no established data on medical care at youth-level sports and may be limited to individuals with only first aid training. In this circumstance, aside from the coach, it is the players and their parents who need to be aware of possible signs and symptoms. The aim of this study was to examine the psychometric properties of a parent and player concussion survey intended for use before and after an education campaign regarding SRC. 1441 questionnaires were received from parents and 284 questionnaires from players. The responses to the sixteen-item section of the questionnaire's 'recognition of signs and symptoms' were submitted to psychometric analysis using the dichotomous and polytomous Rasch model via the Rasch Unidimensional Measurement Model software RUMM2030. The Rasch model of Modern Test Theory can be considered a refinement of, or advance on, traditional analyses of an instrument's psychometric properties. The main finding is that these sixteen items measure two factors: items that are symptoms of concussion and items that are not symptoms of concussion. Parents and athletes were able to identify most or all of the symptoms, but were not as good at distinguishing symptoms that are not symptoms of concussion. Analyzing these responses revealed differential item functioning for parents and athletes on non-symptom items. When the DIF was resolved a significant difference was found between parents and athletes. The main finding is that the items measure two 'dimensions' in concussion symptom recognition. The first dimension consists of those items that are symptoms of concussion and the second dimension of those items that are not symptoms of concussion. Parents and players were able to identify most or all of the symptoms of concussion, so one would not expect to pick up any positive change on these items after an education campaign. Parents and players were not as good at distinguishing symptoms that are not symptoms of concussion. It is on these items that one may possibly expect improvement to manifest, so to evaluate the effectiveness of an education campaign it would pay to look for improvement in distinguishing symptoms that are not symptoms of concussion.
Item-Level Psychometrics of the Glasgow Outcome Scale: Extended Structured Interviews.

PubMed

Hong, Ickpyo; Li, Chih-Ying; Velozo, Craig A

2016-04-01

The Glasgow Outcome Scale-Extended (GOSE) structured interview captures critical components of activities and participation, including home, shopping, work, leisure, and family/friend relationships. Eighty-nine community dwelling adults with mild-moderate traumatic brain injury (TBI) were recruited (average = 2.7 year post injury). Nine items of the 19 items were used for the psychometrics analysis purpose. Factor analysis and item-level psychometrics were investigated using the Rasch partial-credit model. Although the principal components analysis of residuals suggests that a single measurement factor dominates the measure, the instrument did not meet the factor analysis criteria. Five items met the rating scale criteria. Eight items fit the Rasch model. The instrument demonstrated low person reliability (0.63), low person strata (2.07), and a slight ceiling effect. The GOSE demonstrated limitations in precisely measuring activities/participation for individuals after TBI. Future studies should examine the impact of the low precision of the GOSE on effect size. © The Author(s) 2016.
Cultural differences in functional status measurement: analyses of person fit according to the Rasch model.

PubMed

Custers, J W; Hoijtink, H; van der Net, J; Helders, P J

2000-01-01

For many reasons it is preferable to use established health related outcome instruments. The validity of an instrument, however, can be affected when used in another culture or language other than what it was originally developed. In this paper, the outcome on functional status measurement using a preliminary version of the Dutch translated 'Pediatric Evaluation of Disability Inventory' (PEDI) was studied involving a sample of 20 non-disabled Dutch children and American peers, to see if a cross-cultural validation procedure is needed before using the instrument in the Netherlands. The Rasch model was used to analyse the Dutch data. Score profiles were not found to be compatible with the score profiles of American children. In particular, ten items were scored differently with strong indications that these were based on inter-cultural differences. Based on our study, it is argued that cross-cultural validation of the PEDI is necessary before using the instrument in the Netherlands.

Investigation of the prominent barriers to lean manufacturing implementation in Malaysian food and beverages industry using Rasch Model

NASA Astrophysics Data System (ADS)

Khusaini, N. S.; Ismail, A.; Rashid, A. A.

2016-02-01

This paper presents a preliminary study on the prominent barriers to lean manufacturing implementation in Malaysian Food and Beverages Industry. A survey was carried out to determine the most prominent barriers of lean manufacturing implementation that are currently being faced in this industry. The amount of barriers identified for this study is twenty seven. Out of 1309 available organizations, a total of 300 organizations have been randomly selected as respondents, and 53 organizations responded. From the variable map, the analysis shows that, the negative perception towards lean manufacturing top the list as the most agreeable barrier, while the technical barriers came after it. It can also be seen from the variable map that averagely, lack of vision and direction is the barrier that is being faced. Finally, this is perhaps the first attempt in investigating the prominent barriers to Lean Manufacturing implementation in Malaysian food and beverages industry using Rasch Model.
Measuring health-related problem solving among African Americans with multiple chronic conditions: application of Rasch analysis.

PubMed

Fitzpatrick, Stephanie L; Hill-Briggs, Felicia

2015-10-01

Identification of patients with poor chronic disease self-management skills can facilitate treatment planning, determine effectiveness of interventions, and reduce disease complications. This paper describes the use of a Rasch model, the Rating Scale Model, to examine psychometric properties of the 50-item Health Problem-Solving Scale (HPSS) among 320 African American patients with high risk for cardiovascular disease. Items on the positive/effective HPSS subscales targeted patients at low, moderate, and high levels of positive/effective problem solving, whereas items on the negative/ineffective problem solving subscales mostly targeted those at moderate or high levels of ineffective problem solving. Validity was examined by correlating factor scores on the measure with clinical and behavioral measures. Items on the HPSS show promise in the ability to assess health-related problem solving among high risk patients. However, further revisions of the scale are needed to increase its usability and validity with large, diverse patient populations in the future.
Validating Quantitative Measurement Using Qualitative Data: Combining Rasch Scaling and Latent Semantic Analysis in Psychiatry

NASA Astrophysics Data System (ADS)

Lange, Rense

2015-02-01

An extension of concurrent validity is proposed that uses qualitative data for the purpose of validating quantitative measures. The approach relies on Latent Semantic Analysis (LSA) which places verbal (written) statements in a high dimensional semantic space. Using data from a medical / psychiatric domain as a case study - Near Death Experiences, or NDE - we established concurrent validity by connecting NDErs qualitative (written) experiential accounts with their locations on a Rasch scalable measure of NDE intensity. Concurrent validity received strong empirical support since the variance in the Rasch measures could be predicted reliably from the coordinates of their accounts in the LSA derived semantic space (R2 = 0.33). These coordinates also predicted NDErs age with considerable precision (R2 = 0.25). Both estimates are probably artificially low due to the small available data samples (n = 588). It appears that Rasch scalability of NDE intensity is a prerequisite for these findings, as each intensity level is associated (at least probabilistically) with a well- defined pattern of item endorsements.
Rasch scaling paranormal belief and experience: structure and semantics of Thalbourne's Australian Sheep-Goat Scale.

PubMed

Lange, Rense; Thalbourne, Michael A

2002-12-01

Research on the relation between demographic variables and paranormal belief remains controversial given the possible semantic distortions introduced by item and test level biases. We illustrate how Rasch scaling can be used to detect such biases and to quantify their effects, using the Australian Sheep-Goal Scale as a substantive example. Based on data from 1.822 respondents, this test was Rasch scalable, reliable, and unbiased at the test level. Consistent with other research in which unbiased measures of paranormal belief were used, extremely weak age and sex effects were found (partial eta2 = .005 and .012, respectively).
Responsiveness of a Neuromuscular Recovery Scale for Spinal Cord Injury: Inpatient and Outpatient Rehabilitation

DTIC Science & Technology

2013-10-01

Velozo’s research focus is on the development of functional outcome measures using Rasch measurement theory. Dr. Velozo’s research team has...functional outcome measures using Rasch measurement theory. Dr. Velozo’s research team has developed computerized adaptive measurement of physical
Measurement Musings.

ERIC Educational Resources Information Center

Fisher, William P., Jr.; Choi, Ellie; Fisher, William P.; Stenner, A. Jackson; Horabin, Ivan; Wright, Benjamin D.

1998-01-01

Comments on measurement aspects are presented in discussions of (1) methodology and morality (W. P. Fisher); (2) Rasch measurement (E. Choi); (3) novel wisdom of the Rasch approach (W. P. Fisher); (4) development of construct definition and calibration (A. J. Stenner and I. Horabin); and (5) origin of dimensions (B. D. Wright). (SLD)
Detecting Multidimensionality: Which Residual Data-Type Works Best?

ERIC Educational Resources Information Center

Linacre, John Michael

1998-01-01

Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. (SLD)
A new look at the WHOQOL as health-related quality of life instrument among visually impaired people using Rasch analysis.

PubMed

Gothwal, Vijaya K; Srinivas, Marmamula; Rao, Gullapalli N

2013-05-01

To examine the psychometric characteristics of the World Health Organization quality of life instrument-modified Indian version (modified WHOQOL) and its subscales in adults with visual impairment (VI) using Rasch analysis. Cross-sectional data were of people aged ≥40 years with VI (n = 1,333) who responded to the modified WHOQOL in the Andhra Pradesh Eye Disease Study, India. Rasch analysis was used to explore the instrument and its subscales for key indices such as measurement precision by person separation reliability, PSR (i.e., discrimination between strata of participants' health-related QOL [HRQOL], recommended minimum value 0.8), unidimensionality (i.e., measurement of a single construct), and targeting (i.e., matching of item difficulty to participants' HRQOL). Rasch-guided iterative approach including category re-organization to enable threshold ordering and item deletion to overcome multidimensionality resulted in a unidimensional 9-item WHOQOL and a 6-item level of independence (LOI) subscale with adequate PSR (0.81 and 0.82, respectively). Targeting was sub-optimal for both (-1.58 logits for WHOQOL and -2.55 logits for the subscale). Remaining subscales were dysfunctional. The WHOQOL and LOI subscale can be improved and shortened, and the Rasch-revised versions are likely to assess the HROQL of VI patients best because of their brevity, reliability, and unidimensionality.
Modern psychometrics for assessing achievement goal orientation: a Rasch analysis.

PubMed

Muis, Krista R; Winne, Philip H; Edwards, Ordene V

2009-09-01

A program of research is needed that assesses the psychometric properties of instruments designed to quantify students' achievement goal orientations to clarify inconsistencies across previous studies and to provide a stronger basis for future research. We conducted traditional psychometric and modern Rasch-model analyses of the Achievement Goals Questionnaire (AGQ, Elliot & McGregor, 2001) and the Patterns of Adaptive Learning Scale (PALS, Midgley et al., 2000) to provide an in-depth analysis of the two most popular instruments in educational psychology. For Study 1, 217 undergraduate students enrolled in educational psychology courses participated. Thirty-four were male and 181 were female (two did not respond). Participants completed the AGQ in the context of their educational psychology class. For Study 2, 126 undergraduate students enrolled in educational psychology courses participated. Thirty were male and 95 were female (one did not respond). Participants completed the PALS in the context of their educational psychology class. Traditional psychometric assessments of the AGQ and PALS replicated previous studies. For both, reliability estimates ranged from good to very good for raw subscale scores and fit for the models of goal orientations were good. Based on traditional psychometrics, the AGQ and PALS are valid and reliable indicators of achievement goals. Rasch analyses revealed that estimates of reliability for items were very good but respondent ability estimates varied from poor to good for both the AGQ and PALS. These findings indicate that items validly and reliably reflect a group's aggregate goal orientation, but using either instrument to characterize an individual's goal orientation is hazardous.
The patient satisfaction questionnaire of EUprimecare project: measurement properties.

PubMed

Cimas, Marta; Ayala, Alba; García-Pérez, Sonia; Sarria-Santamera, Antonio; Forjaz, Maria João

2016-06-01

The measurement of patient satisfaction is considered an essential outcome indicator to evaluate health care quality. Patient satisfaction is considered a multi-dimensional construct, which would include a variety of domains. Although a large number of studies have proposed scales to measure patient satisfaction, there is a lack of psychometric information on them. This study aims to describe the psychometric properties of the Primary Care Satisfaction Scale (PCSS) of the EUprimecare project. A cross-sectional survey of patient satisfaction with primary care was carried out by telephone interview. Primary care services of Estonia, Finland, Germany, Hungary, Lithuania, Italy and Spain. A total of 3020 adult patients aged 18-65 years old attending primary care services. Classic psychometric properties were analysed and Rasch analysis was used to assess the following measurement properties: fit to the Rasch model; uni-dimensionality; reliability; differential item functioning (DIF) by gender, age, civil status, area of residency and country; local independency; adequacy of response scale; and scale targeting. To achieve good fit to the Rasch model, the original response scales of three items (1, 2 and 6) were rescored and Item 3 (waiting time in the room) was removed. The scale was uni-dimensional and Person Separation Index was 0.79, indicating a good reliability. All items were free from bias. PCSS linear measure displayed satisfactory convergent validity with overall satisfaction with primary care. PCSS, as a reliable and valid scale, could be used to measure patient satisfaction in primary care in Europe. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Rasch Analysis of the 9-Item Shared Decision Making Questionnaire in Women With Breast Cancer.

PubMed

Wu, Tzu-Yi; Chen, Cheng-Te; Huang, Yi-Jing; Hou, Wen-Hsuan; Wang, Jung-Der; Hsieh, Ching-Lin

2018-04-19

Shared decision making (SDM) is a best practice to help patients make optimal decisions by a process of healthcare, especially for women diagnosed with breast cancer and having heavy burden in long-term treatments. To promote successful SDM, it is crucial to assess the level of perceived involvement in SDM in women with breast cancer. The aims of this study were to apply Rasch analysis to examine the construct validity and person reliability of the 9-item Shared Decision Making Questionnaire (SDM-Q-9) in women with breast cancer. The construct validity of SDM-Q-9 was confirmed when the items fit the Rasch model's assumptions of unidimensionality: (1) infit and outfit mean square ranged from 0.6 to 1.4; (2) the unexplained variance of the first dimension of the principal component analysis was less than 20%. Person reliability was calculated. A total of 212 participants were recruited in this study. Item 1 did not fit the model's assumptions and was deleted. The unidimensionality of the remaining 8 items (SDM-Q-8) was supported with good item fit (infit and outfit mean square ranging from 0.6 to 1.3) and very low unexplained variance of the first dimension (5.3%) of the principal component analysis. The person reliability of the SDM-Q-8 was 0.90. The SDM-Q-8 was unidimensional and had good person reliability in women with breast cancer. The SDM-Q-8 has shown its potential for assessing the level of perceived involvement in SDM in women with breast cancer for both research and clinical purposes.
Clarification to "Examining Rater Errors in the Assessment of Written Composition with a Many-Faceted Rasch Model."

ERIC Educational Resources Information Center

Englehard, George, Jr.

1996-01-01

Data presented in figure three of the article cited may be misleading in that the automatic scaling procedure used by the computer program that generated the histogram highlighted spikes that would look different with different histogram methods. (SLD)
Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches

ERIC Educational Resources Information Center

Kopf, Julia; Zeileis, Achim; Strobl, Carolin

2015-01-01

Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…
A Simulation Study on Methods of Correcting for the Effects of Extreme Response Style

ERIC Educational Resources Information Center

Wetzel, Eunike; Böhnke, Jan R.; Rose, Norman

2016-01-01

The impact of response styles such as extreme response style (ERS) on trait estimation has long been a matter of concern to researchers and practitioners. This simulation study investigated three methods that have been proposed for the correction of trait estimates for ERS effects: (a) mixed Rasch models, (b) multidimensional item response models,…
Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach

ERIC Educational Resources Information Center

Finch, Holmes; Edwards, Julianne M.

2016-01-01

Standard approaches for estimating item response theory (IRT) model parameters generally work under the assumption that the latent trait being measured by a set of items follows the normal distribution. Estimation of IRT parameters in the presence of nonnormal latent traits has been shown to generate biased person and item parameter estimates. A…
An introduction to Item Response Theory and Rasch Analysis of the Eating Assessment Tool (EAT-10).

PubMed

Kean, Jacob; Brodke, Darrel S; Biber, Joshua; Gross, Paul

2018-03-01

Item response theory has its origins in educational measurement and is now commonly applied in health-related measurement of latent traits, such as function and symptoms. This application is due in large part to gains in the precision of measurement attributable to item response theory and corresponding decreases in response burden, study costs, and study duration. The purpose of this paper is twofold: introduce basic concepts of item response theory and demonstrate this analytic approach in a worked example, a Rasch model (1PL) analysis of the Eating Assessment Tool (EAT-10), a commonly used measure for oropharyngeal dysphagia. The results of the analysis were largely concordant with previous studies of the EAT-10 and illustrate for brain impairment clinicians and researchers how IRT analysis can yield greater precision of measurement.
Using Rasch Analysis to Explore What Students Learn about Probability Concepts

ERIC Educational Resources Information Center

Mahmud, Zamalia; Porter, Anne

2015-01-01

Students' understanding of probability concepts have been investigated from various different perspectives. This study was set out to investigate perceived understanding of probability concepts of forty-four students from the STAT131 Understanding Uncertainty and Variation course at the University of Wollongong, NSW. Rasch measurement which is…
Using Rasch Analysis to Inform Rating Scale Development

ERIC Educational Resources Information Center

Van Zile-Tamsen, Carol

2017-01-01

The use of surveys, questionnaires, and rating scales to measure important outcomes in higher education is pervasive, but reliability and validity information is often based on problematic Classical Test Theory approaches. Rasch Analysis, based on Item Response Theory, provides a better alternative for examining the psychometric quality of rating…
A Rasch Analysis of the Junior Metacognitive Awareness Inventory with Singapore Students

ERIC Educational Resources Information Center

Ning, Hoi Kwan

2018-01-01

The psychometric properties of the 2 versions of the Junior Metacognitive Awareness Inventory were examined with Singapore student samples. Other than 2 misfitting items and an underutilized response scale, Rasch analysis demonstrated that the instruments have good measurement precision, and no differential item functioning was detected across…
Rasch Analysis of Professional Behavior in Medical Education

ERIC Educational Resources Information Center

Lange, R.; Verhulst, S. J.; Roberts, N. K.; Dorsey, J. K.

2015-01-01

The use of students' "consumer feedback" to assess faculty behavior and improve the process of medical education is a significant challenge. We used quantitative Rasch measurement to analyze pre-categorized student comments listed by 385 graduating medical students. We found that students differed little with respect to the number of…

Examining Teacher Grades Using Rasch Measurement Theory

ERIC Educational Resources Information Center

Randall, Jennifer; Engelhard, George, Jr.

2009-01-01

In this study, we present an approach to questionnaire design within educational research based on Guttman's mapping sentences and Many-Facet Rasch Measurement Theory. We designed a 54-item questionnaire using Guttman's mapping sentences to examine the grading practices of teachers. Each item in the questionnaire represented a unique student…
Bayesian Estimation in the One-Parameter Latent Trait Model.

DTIC Science & Technology

1980-03-01

Journal of Mathematical and Statistical Psychology , 1973, 26, 31-44. (a) Andersen, E. B. A goodness of fit test for the Rasch model. Psychometrika, 1973, 28...technique for estimating latent trait mental test parameters. Educational and Psychological Measurement, 1976, 36, 705-715. Lindley, D. V. The...Lord, F. M. An analysis of verbal Scholastic Aptitude Test using Birnbaum’s three-parameter logistic model. Educational and Psychological
Variability in depression prevalence in early rheumatoid arthritis: a comparison of the CES-D and HAD-D Scales

PubMed Central

Covic, Tanya; Pallant, Julie F; Tennant, Alan; Cox, Sally; Emery, Paul; Conaghan, Philip G

2009-01-01

Background Depression is common in rheumatoid arthritis (RA), however reported prevalence varies considerably. Two frequently used instruments to identify depression are the Center for Epidemiological Studies Depression (CES-D) scale, and the Hospital Anxiety and Depression Scale (HADS). The objectives of this study were to test if the CES-D and HADS-D (a) satisfy current modern psychometric standards for unidimensional measurement in an early RA sample; (b) measure the same construct (i.e. depression); and (c) identify similar levels of depression. Methods Data from the two scales completed by patients with early RA were fitted to the Rasch measurement model to show that (a) each scale satisfies the criteria of fit to the model, including strict unidimensionality; (b) that the scales can be co-calibrated onto a single underlying continuum of depression and to (c) examine the location of the cut points on the underlying continuum as indication of the prevalence of depression. Results Ninety-two patients with early RA (62% female; mean age = 56.3, SD = 13.7) gave 141 sets of paired CES-D and HAD-D data. Fit of the data from the CES-D was found to be poor, and the scale had to be reduced to 13 items to satisfy Rasch measurement criteria whereas the HADS-D met model expectations from the outset. The 20 items combined (CES-D13 and HADS-D) satisfied Rasch model expectations. The CES-D gave a much higher prevalence of depression than the HADS-D. Conclusion The CES-D in its present form is unsuitable for use in patients with early RA, and needs to be reduced to a 13-item scale. The HADS-D is valid for early RA and the two scales measure the same underlying construct but their cut points lead to different estimates of the level of depression. Revised cut points on the CES-D13 provide comparative prevalence rates. PMID:19200388
Evaluation of Internal Construct Validity and Unidimensionality of the Brachial Assessment Tool, A Patient-Reported Outcome Measure for Brachial Plexus Injury.

PubMed

Hill, Bridget; Pallant, Julie; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea

2016-12-01

To evaluate the internal construct validity and dimensionality of a new patient-reported outcome measure for people with traumatic brachial plexus injury (BPI) based on the International Classification of Functioning, Disability and Health definition of activity. Cross-sectional study. Outpatient clinics. Adults (age range, 18-82y) with a traumatic BPI (N=106). There were 106 people with BPI who completed a 51-item 5-response questionnaire. Responses were analyzed in 4 phases (missing responses, item correlations, exploratory factor analysis, and Rasch analysis) to evaluate the properties of fit to the Rasch model, threshold response, local dependency, dimensionality, differential item functioning, and targeting. Not applicable, as this study addresses the development of an outcome measure. Six items were deleted for missing responses, and 10 were deleted for high interitem correlations >.81. The remaining 35 items, while demonstrating fit to the Rasch model, showed evidence of local dependency and multidimensionality. Items were divided into 3 subscales: dressing and grooming (8 items), arm and hand (17 items), and no hand (6 items). All 3 subscales demonstrated fit to the model with no local dependency, minimal disordered thresholds, no unidimensionality or differential item functioning for age, time postinjury, or self-selected dominance. Subscales were combined into 3 subtests and demonstrated fit to the model, no misfit, and unidimensionality, allowing calculation of a summary score. This preliminary analysis supports the internal construct validity of the Brachial Assessment Tool, a unidimensional targeted 4-response patient-reported outcome measure designed to solely assess activity after traumatic BPI regardless of level of injury, age at recruitment, premorbid limb dominance, and time postinjury. Further examination is required to determine test-retest reliability and responsiveness. Copyright Â© 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
An Application of the Rasch Model to Computerized Adaptive Testing.

ERIC Educational Resources Information Center

Wisniewski, Dennis R.

Three questions concerning the Binary Search Method (BSM) of computerized adaptive testing were studied: (1) whether it provided a reliable and valid estimation of examinee ability; (2) its effect on examinee attitudes toward computerized adaptive testing and conventional paper-and-pencil testing; and (3) the relationship between item response…
Computerized Classification Testing with the Rasch Model

ERIC Educational Resources Information Center

Eggen, Theo J. H. M.

2011-01-01

If classification in a limited number of categories is the purpose of testing, computerized adaptive tests (CATs) with algorithms based on sequential statistical testing perform better than estimation-based CATs (e.g., Eggen & Straetmans, 2000). In these computerized classification tests (CCTs), the Sequential Probability Ratio Test (SPRT) (Wald,…
Conditional Versus Unconditional Procedures for Sample-Free Item Analysis

ERIC Educational Resources Information Center

Wright, Benjamin D.; Douglas, Graham A.

1977-01-01

Procedures for the Rasch model, sample free item calibration are reviewed and compared for accuracy. The theoretically ideal procedure is shown to have practical limitations. Two alternatives to the ideal are presented and discussed. A correction for bias in the most widely used alternative is presented. (Author/JKS)
Identification of Hierarchies of Student Learning about Percentages Using Rasch Analysis

ERIC Educational Resources Information Center

Burfitt, Joan

2013-01-01

A review of the research literature indicated that there were probable orders in which students develop understandings and skills for calculating with percentages. Such calculations might include using models to represent percentages, knowing fraction equivalents, selection of strategies to solve problems and determination of percentage change. To…
The Rasch Model for Evaluating Italian Student Performance

ERIC Educational Resources Information Center

Camminatiello, Ida; Gallo, Michele; Menini, Tullio

2010-01-01

In 1997 the Organisation for Economic Co-operation and Development (OECD) launched the OECD Programme for International Student Assessment (PISA) for collecting information about 15-year-old students in participating countries. Our study analyses the PISA 2006 cognitive test for evaluating the Italian student performance in mathematics, reading…
Item Banking. ERIC/AE Digest.

ERIC Educational Resources Information Center

Rudner, Lawrence

This digest discusses the advantages and disadvantages of using item banks, and it provides useful information for those who are considering implementing an item banking project in their school districts. The primary advantage of item banking is in test development. Using an item response theory method, such as the Rasch model, items from multiple…
Constructing the Exact Significance Level for a Person-Fit Statistic.

ERIC Educational Resources Information Center

Liou, Michelle; Chang, Chih-Hsin

1992-01-01

An extension is proposed for the network algorithm introduced by C.R. Mehta and N.R. Patel to construct exact tail probabilities for testing the general hypothesis that item responses are distributed according to the Rasch model. A simulation study indicates the efficiency of the algorithm. (SLD)
Historical Views of Invariance: Evidence from the Measurement Theories of Thorndike, Thurstone, and Rasch.

ERIC Educational Resources Information Center

Engelhard, George, Jr.

1992-01-01

A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)
Rasch Analysis of the Geriatric Depression Scale--Short Form

ERIC Educational Resources Information Center

Chiang, Karl S.; Green, Kathy E.; Cox, Enid O.

2009-01-01

Purpose: The purpose of this study was to examine scale dimensionality, reliability, invariance, targeting, continuity, cutoff scores, and diagnostic use of the Geriatric Depression Scale-Short Form (GDS-SF) over time with a sample of 177 English-speaking U.S. elders. Design and Methods: An item response theory, Rasch analysis, was conducted with…
Measuring Engagement in Later Life Activities: Rasch-Based Scenario Scales for Work, Caregiving, Informal Helping, and Volunteering

ERIC Educational Resources Information Center

Ludlow, Larry H.; Matz-Costa, Christina; Johnson, Clair; Brown, Melissa; Besen, Elyssa; James, Jacquelyn B.

2014-01-01

The development of Rasch-based "comparative engagement scenarios" based on Guttman's facet theory and sentence mapping procedures is described. The scenario scales measuring engagement in work, caregiving, informal helping, and volunteering illuminate the lived experiences of role involvement among older adults and offer multiple…
Students' Appreciation of Expectation and Variation as a Foundation for Statistical Understanding

ERIC Educational Resources Information Center

Watson, Jane M.; Callingham, Rosemary A.; Kelly, Ben A.

2007-01-01

This study presents the results of a partial credit Rasch analysis of in-depth interview data exploring statistical understanding of 73 school students in 6 contextual settings. The use of Rasch analysis allowed the exploration of a single underlying variable across contexts, which included probability sampling, representation of temperature…
Rasch Analysis: A Primer for School Psychology Researchers and Practitioners

ERIC Educational Resources Information Center

Boone, William J.; Noltemeyer, Amity

2017-01-01

In order to progress as a field, school psychology research must be informed by effective measurement techniques. One approach to address the need for careful measurement is Rasch analysis. This technique can (a) facilitate the development of instruments that provide useful data, (b) provide data that can be used confidently for both descriptive…
Judging Anomalies at the 2010 Olympics in Men's Figure Skating

ERIC Educational Resources Information Center

Looney, Marilyn A.

2012-01-01

The purpose of this study was to determine if the 2010 Olympic figure skating judges had trouble scoring Plushenko and the transitions program component, and if the International Skating Union's (ISU) "corridor" method flagged the same judging anomalies as the Rasch analyses. A 3-facet (skater by program component by judge) Rasch rating…
The Psychometric Properties of the Invitational School Survey (ISS): An Australian Study

ERIC Educational Resources Information Center

Smith, Kenneth H.; Barnard, John

2004-01-01

This study provides psychometric data on the Inviting School Survey (Purkey & Fuller, 1995) using a rating scale analysis within the framework of the Rasch measurement philosophy (Bond & Fox, 2001; Rasch, 1980). The Inviting School Survey's factor structure and internal consistency are examined and compared with the Invitational Education…
Interpretation of the Rasch Ability and Difficulty Scales for Educational Purposes.

ERIC Educational Resources Information Center

Woodcock, Richard W.

Though many test developers have utilized item response theory in their work, few have taken advantage of the potential of item response theory for providing new interpretation procedures that accentuate the educational implications to be drawn from test scores. This paper describes several features, based upon the Rasch difficulty and ability…
Historical Perspectives on Invariant Measurement: Guttman, Rasch, and Mokken

ERIC Educational Resources Information Center

Engelhard, George, Jr.

2008-01-01

The purpose of this study is to describe how Guttman, Rasch, and Mokken approached issues related to invariant measurement. These measurement theorists were chosen to illustrate the evolution of our conceptualizations of invariant measurement during the 20th century within the research tradition of item response theory. Item response theory can be…

A Rasch Analysis of the Substance Abuse Subtle Screening Inventory-3

ERIC Educational Resources Information Center

Hill, Tara M.; Laux, John M.; Stone, Gregory; Dupuy, Paula; Scott, Holly

2013-01-01

Rasch analysis of the Substance Abuse Subtle Screening Inventory-3 (SASSI-3; F. G. Miller & Lazowski, 1999) indicated that the SASSI-3 meets fundamental measurement properties; however, the authors of the current study recommend the elimination of nonfunctioning items and the improvement of response options for the face valid scales to…
Using Rasch Rating Scale Methodology to Examine a Behavioral Screener for Preschoolers at Risk

ERIC Educational Resources Information Center

DiStefano, Christine; Greer, Fred W.; Kamphaus, R. W.; Brown, William H.

2014-01-01

A screening instrument used to identify young children at risk for behavioral and emotional difficulties, the Behavioral and Emotional Screening System Teacher Rating Scale-Preschool was examined. The Rasch Rating Scale Method was used to provide additional information about psychometric properties of items, respondents, and the response scale.…
A Psychometric Investigation of the Marlowe-Crowne Social Desirability Scale Using Rasch Measurement

ERIC Educational Resources Information Center

Seol, Hyunsoo

2007-01-01

The author used Rasch measurement to examine the reliability and validity of 382 Korean university students' scores on the Marlowe-Crowne Social Desirability Scale (MCSDS; D. P. Crowne and D. Marlowe, 1960). Results revealed that item-fit statistics and principal component analysis with standardized residuals provide evidence of MCSDS'…
Educational Leadership Effectiveness: A Rasch Analysis

ERIC Educational Resources Information Center

Sinnema, Claire; Ludlow, Larry; Robinson, Viviane

2016-01-01

Purpose: The purposes of this paper are, first, to establish the psychometric properties of the ELP tool, and, second, to test, using a Rasch item response theory analysis, the hypothesized progression of challenge presented by the items included in the tool. Design/ Methodology/ Approach: Data were collected at two time points through a survey of…
Rasch analysis for psychometric improvement of science attitude rating scales

NASA Astrophysics Data System (ADS)

Oon, Pey-Tee; Fan, Xitao

2017-04-01

Students' attitude towards science (SAS) is often a subject of investigation in science education research. Survey of rating scale is commonly used in the study of SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information of SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an existing dataset of The Trends in International Mathematics and Science Study (TIMSS) (2011). Data of all the eight-grade participants from Hong Kong and Singapore (N = 9942) were retrieved for analyses. Additional insights from Rasch analysis that are not commonly available from conventional test and item analyses were discussed, such as invariance measurement of SAS, unidimensionality of SAS construct, optimum utilization of SAS rating categories, and item difficulty hierarchy in the SAS scale. Recommendations on how TIMSS items on the measurement of SAS can be better designed were discussed. The study also highlights the importance of using Rasch estimates for statistical parametric tests (e.g. ANOVA, t-test) that are common in science education research for group comparisons.
Improving Consensus Scoring of Crowdsourced Data Using the Rasch Model: Development and Refinement of a Diagnostic Instrument.

PubMed

Brady, Christopher John; Mudie, Lucy Iluka; Wang, Xueyang; Guallar, Eliseo; Friedman, David Steven

2017-06-20

Diabetic retinopathy (DR) is a leading cause of vision loss in working age individuals worldwide. While screening is effective and cost effective, it remains underutilized, and novel methods are needed to increase detection of DR. This clinical validation study compared diagnostic gradings of retinal fundus photographs provided by volunteers on the Amazon Mechanical Turk (AMT) crowdsourcing marketplace with expert-provided gold-standard grading and explored whether determination of the consensus of crowdsourced classifications could be improved beyond a simple majority vote (MV) using regression methods. The aim of our study was to determine whether regression methods could be used to improve the consensus grading of data collected by crowdsourcing. A total of 1200 retinal images of individuals with diabetes mellitus from the Messidor public dataset were posted to AMT. Eligible crowdsourcing workers had at least 500 previously approved tasks with an approval rating of 99% across their prior submitted work. A total of 10 workers were recruited to classify each image as normal or abnormal. If half or more workers judged the image to be abnormal, the MV consensus grade was recorded as abnormal. Rasch analysis was then used to calculate worker ability scores in a random 50% training set, which were then used as weights in a regression model in the remaining 50% test set to determine if a more accurate consensus could be devised. Outcomes of interest were the percent correctly classified images, sensitivity, specificity, and area under the receiver operating characteristic (AUROC) for the consensus grade as compared with the expert grading provided with the dataset. Using MV grading, the consensus was correct in 75.5% of images (906/1200), with 75.5% sensitivity, 75.5% specificity, and an AUROC of 0.75 (95% CI 0.73-0.78). A logistic regression model using Rasch-weighted individual scores generated an AUROC of 0.91 (95% CI 0.88-0.93) compared with 0.89 (95% CI 0.86-92) for a model using unweighted scores (chi-square P value<.001). Setting a diagnostic cut-point to optimize sensitivity at 90%, 77.5% (465/600) were graded correctly, with 90.3% sensitivity, 68.5% specificity, and an AUROC of 0.79 (95% CI 0.76-0.83). Crowdsourced interpretations of retinal images provide rapid and accurate results as compared with a gold-standard grading. Creating a logistic regression model using Rasch analysis to weight crowdsourced classifications by worker ability improves accuracy of aggregated grades as compared with simple majority vote. ©Christopher John Brady, Lucy Iluka Mudie, Xueyang Wang, Eliseo Guallar, David Steven Friedman. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 20.06.2017.
Development of a patient reported outcome scale for fatigue in multiple sclerosis: The Neurological Fatigue Index (NFI-MS)

PubMed Central

2010-01-01

Background Fatigue is a common and debilitating symptom in multiple sclerosis (MS). Best-practice guidelines suggest that health services should repeatedly assess fatigue in persons with MS. Several fatigue scales are available but concern has been expressed about their validity. The objective of this study was to examine the reliability and validity of a new scale for MS fatigue, the Neurological Fatigue Index (NFI-MS). Methods Qualitative analysis of 40 MS patient interviews had previously contributed to a coherent definition of fatigue, and a potential 52 item set representing the salient themes. A draft questionnaire was mailed out to 1223 people with MS, and the resulting data subjected to both factor and Rasch analysis. Results Data from 635 (51.9% response) respondents were split randomly into an 'evaluation' and 'validation' sample. Exploratory factor analysis identified four potential subscales: 'physical', 'cognitive', 'relief by diurnal sleep or rest' and 'abnormal nocturnal sleep and sleepiness'. Rasch analysis led to further item reduction and the generation of a Summary scale comprising items from the Physical and Cognitive subscales. The scales were shown to fit Rasch model expectations, across both the evaluation and validation samples. Conclusion A simple 10-item Summary scale, together with scales measuring the physical and cognitive components of fatigue, were validated for MS fatigue. PMID:20152031
Validation and reliability of the VF-14 questionnaire in a German population.

PubMed

Chiang, Peggy Pei-Chia; Fenwick, Eva; Marella, Manjula; Finger, Robert; Lamoureux, Ecosse

2011-11-21

To evaluate the validity, reliability, and measurement characteristics of the Visual Function 14 (VF-14) in a German sample using Rasch analysis. This was a clinic-based, cross-sectional study with 184 patients with low vision recruited from an outpatient clinic at a German eye hospital. Participants underwent a clinical examination and completed the German VF-14 scale. The validity of the VF-14 scale was assessed using Rasch analysis. The main outcome measure was the overall functional score provided by the VF-14. After collapsing two response categories for items 13 and 14, the VF-14 scale satisfied fundamental criteria to achieve fit to the Rasch model, namely, ordered thresholds, the ability to distinguish between different strata of participant ability, absence of misfitting items, no evidence of unidimensionality, and no significant differential item functioning for key sociodemographic covariates. The VF-14 is able to discriminate between participants with different levels of vision impairment and across different cultural groups. The VF-14 is a valid, reliable, and unidimensional questionnaire for use in a German population. These findings contribute to the growing evidence base for second generation patient reported outcome measures in ophthalmology, and support the use of the German VF-14 in tertiary eye clinics in Germany to capture the impact of visual impairment on visual function from the patient's perspective and to inform low vision rehabilitation and interventions.
Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation

ERIC Educational Resources Information Center

Sideridis, Georgios D.

2016-01-01

The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…
The Divergent Meanings of Life Satisfaction: Item Response Modeling of the Satisfaction with Life Scale in Greenland and Norway

ERIC Educational Resources Information Center

Vitterso, Joar; Biswas-Diener, Robert; Diener, Ed

2005-01-01

Cultural differences in response to the Satisfaction With Life Scale (SWLS) items is investigated. Data were fit to a mixed Rasch model in order to identify latent classes of participants in a combined sample of Norwegians (N = 461) and Greenlanders (N = 180). Initial analyses showed no mean difference in life satisfaction between the two…
Modeling School Violence across Grade Levels in the U.S. Using the Third International Mathematics and Science Study (TIMSS).

ERIC Educational Resources Information Center

Yu, Lei

School violence has increasingly captured public attention due to deadly school shootings. Controversy on school violence is demonstrated by a mixed picture of school safety and the lack of consensus on the definition of violence, which makes comparison of findings across studies difficult. This study extended the application of the Rasch model to…
A Psychometric Measurement Model for Adult English Language Learners: Pearson Test of English Academic

ERIC Educational Resources Information Center

Pae, Hye K.

2012-01-01

The aim of this study was to apply Rasch modeling to an examination of the psychometric properties of the "Pearson Test of English Academic" (PTE Academic). Analyzed were 140 test-takers' scores derived from the PTE Academic database. The mean age of the participants was 26.45 (SD = 5.82), ranging from 17 to 46. Conformity of the participants'…
Factors Associated with Knowledge of Diabetes in Patients with Type 2 Diabetes Using the Diabetes Knowledge Test Validated with Rasch Analysis

PubMed Central

Fenwick, Eva K.; Xie, Jing; Rees, Gwyn; Finger, Robert P.; Lamoureux, Ecosse L.

2013-01-01

Objective In patients with Type 2 diabetes, to determine the factors associated with diabetes knowledge, derived from Rasch analysis, and compare results with a traditional raw scoring method. Research Design & Methods Participants in this cross-sectional study underwent a comprehensive clinical and biochemical assessment. Diabetes knowledge (main outcome) was assessed using the Diabetes Knowledge Test (DKT) which was psychometrically validated using Rasch analysis. The relationship between diabetes knowledge and risk factors identified during univariate analyses was examined using multivariable linear regression. The results using raw and Rasch-transformed methods were descriptively compared. Results 181 patients (mean age±standard deviation = 66.97±9.17 years; 113 (62%) male) were included. Using Rasch-derived DKT scores, those with greater education (β = 1.14; CI: 0.25,2.04, p = 0.013); had seen an ophthalmologist (β = 1.65; CI: 0.63,2.66, p = 0.002), and spoke English at home (β = 1.37; CI: 0.43,2.31, p = 0.005) had significantly better diabetes knowledge than those with less education, had not seen an ophthalmologist and spoke a language other than English, respectively. Patients who were members of the National Diabetes Service Scheme (NDSS) and had seen a diabetes educator also had better diabetes knowledge than their counterparts. Higher HbA1c level was independently associated with worse diabetes knowledge. Using raw measures, access to an ophthalmologist and NDSS membership were not independently associated with diabetes knowledge. Conclusions Sociodemographic, clinical and service use factors were independently associated with diabetes knowledge based on both raw scores and Rasch-derived scores, which supports the implementation of targeted interventions to improve patients' knowledge. Choice of psychometric analytical method can affect study outcomes and should be considered during intervention development. PMID:24312484
Is the Berg Balance Scale an effective tool for the measurement of early postural control impairments in patients with Parkinson's disease? Evidence from Rasch analysis.

PubMed

La Porta, F; Giordano, A; Caselli, S; Foti, C; Franchignoni, F

2015-12-01

It is unclear whether the BBS is an effective tool for the measurement of early postural control impairments in patients with Parkinson's disease (PD). The aim of this paper was to evaluate BBS' content validity, internal construct validity, reliability and targeting in patients with PD within the Rasch analysis framework. Observational, cross-sectional study. Outpatient Rehabilitation Unit. A sample of 285 outpatients with PD. The content validity of the BBS was assessed using standard linking techniques. The BBS was administered by trained physiotherapists. The data collected then underwent Rasch analysis. Content validity analysis showed a lack of items assessing postural responses to tripping and slips and stability during walking. On Rasch analysis, the BBS failed the requirements of monotonicity, local independence, unidimensionality and invariance. After rescoring 7 items, grouping of locally dependent items into testlets, and deletion of the static sitting balance item because mistargeted and underdiscriminating, the Rasch-modified BBS for PD (BBS-PD) showed adequate internal construct validity (χ(2)24=39.693; P=0.023), including absence of differential item functioning (DIF) across gender and age, and was, as a whole, sufficiently precise for individual person measurement (PSI=0.894). However, the scale was not well targeted to the sample in view of the prevalence of higher scores. This study demonstrated the internal construct validity and reliability of the BBS-PD as a measurement tool for patients with PD within the Rasch analysis framework. However, the lack of items critical to the assessment of postural control impairments typical of PD, affected negatively the targeting, so that a significant percentage of patients was located in the higher ability range of the measurement continuum, where precision of measurement is reduced. These findings suggest that the BBS, even if modified, may not be an effective tool for the measurement of early postural control in patients with PD.
Reliability and Validity of the Visual, Musculoskeletal, and Balance Complaints Questionnaire.

PubMed

Lundqvist, Lars-Olov; Zetterlund, Christina; Richter, Hans O

2016-09-01

To evaluate the reliability and validity of the 15-item Visual, Musculoskeletal, and Balance Complaints Questionnaire (VMB) for people with visual impairments, using confirmatory factor analysis (CFA) and with Rasch analysis for use as an outcome measure. Two studies evaluated the VMB. In Study 1, VMB data were collected from 1249 out of 3063 individuals between 18 and 104 years old who were registered at a low vision center. CFA evaluated VMB factor structure and Rasch analysis evaluated VMB scale properties. In Study 2, a subsample of 52 individuals between 27 and 67 years old with visual impairments underwent further measurements. Visual clinical assessments, neck/scapular pain, and balance assessments were collected to evaluate the convergent validity of the VMB (i.e. the domain relationship with other, theoretically predicted measures). CFA supported the a priori three-factor structure of the VMB. The factor loadings of the items on their respective domains were all statistically significant. Rasch analysis indicated disordered categories and the original 10-point scale was subsequently replaced with a 5-point scale. Each VMB domain fitted the Rasch model, showing good metric properties, including unidimensionality (explained variances ≥66% and eigenvalues <1.9), person separation (1.86 to 2.29), reliability (0.87 to 0.94), item fit (infit MnSq's >0.72 and outfit MnSq's <1.47), targeting (0.30 to 0.50 logits), and insignificant differential item functioning (all DIFs but one <0.50 logits) from gender, age, and visual status. The three VMB domains correlated significantly with relevant visual, musculoskeletal, and balance assessments, demonstrating adequate convergent validity of the VMB. The VMB is a simple, inexpensive, and quick yet reliable and valid way to screen and evaluate concurrent visual, musculoskeletal, and balance complaints, with contribution to epidemiological and intervention research and potential clinical implications for the field of health services and low vision rehabilitation.
Psychometric properties of the Zarit Caregiver Burden Interview administered to caregivers to patients with Duchenne muscular dystrophy: a Rasch analysis.

PubMed

Landfeldt, Erik; Mayhew, Anna; Straub, Volker; Bushby, Katharine; Lochmüller, Hanns; Lindgren, Peter

2017-12-18

To explore the psychometric properties of the full 22-item English (UK and US) version of the Zarit Caregiver Burden Interview administered to caregivers to patients with Duchenne muscular dystrophy. Caregivers to patients with Duchenne muscular dystrophy from the United Kingdom and the United States, recruited through the TREAT-NMD network, completed the Zarit Caregiver Burden Interview online. The psychometric properties of the Zarit Caregiver Burden Interview were examined using Rasch analysis. A total of 475 caregivers completed the Zarit Caregiver Burden Interview. Model misfit was identified for 9 of 22 items (mean item fit residual 0.061, SD: 2.736) and 13 of 22 items displayed disordered thresholds. The overall item-trait interaction chi-square value was 499 (198 degrees of freedom, p < 0.001). The mean person fit residual was estimated at -0.213 (SD: 1.235). The Person Separation Index and Cronbach's α were estimated at 0.902 and 0.914, respectively. Item dependency was low and we found no significant differential item functioning by country or sex. Our Rasch analysis shows that the Zarit Caregiver Burden Interview fails to fully operationalize a quantitative conceptualization of caregiver burden among caregivers to patients with Duchenne muscular dystrophy from the United Kingdom and the United States. Further research is needed to understand the psychometric properties of the Zarit Caregiver Burden Interview in other populations and settings. Implications for Rehabilitation Duchenne muscular dystrophy is a terminal disease characterized by progressive muscle degeneration resulting in substantial disability and a significant burden on family caregivers. The Zarit Caregiver Burden Interview is one of the most widely applied measures of caregiver burden. Our Rasch analysis suggests that the Zarit Caregiver Burden Interview is not fit for purpose to measure burden in UK and US caregivers to patients with Duchenne muscular dystrophy. Clinicians and decision-makers should interpret Zarit Caregiver Burden Interview data from these populations with caution.
Revised Olweus Bully/Victim Questionnaire: evaluation in visually impaired.

PubMed

Gothwal, Vijaya K; Sumalini, Rebecca; Irfan, Shaik Mohammad; Giridhar, Avula; Bharani, Seelam

2013-08-01

To explore the psychometric properties of the revised Olweus Bully/Victim Questionnaire (OBVQ) in children with visual impairment (VI) using Rasch analysis. One hundred fifty Indian children with VI between 8 and 16 years (mean age, 11.6 years; 69% male; mean acuity in the better eye of 0.80 logMAR [Snellen, 20/126]) were administered the revised OBVQ. The 40-item revised OBVQ was developed to assess victimization (i.e., being bullied) and bullying (bullying others) in normally sighted schoolchildren. Only 16 items are used for Rasch analysis and are divided into two parts: I (victimization, eight items) and II (bullying others, eight items). Separate Rasch analysis was conducted for both parts, and the psychometric properties investigated included behavior of rating scale, extent to which the items measured a single construct (unidimensionality by fit statistics and principal component analysis [PCA] of residuals); ability to discriminate among participants' victimization and bullying behaviors (measurement precision as assessed by person separation reliability [PSR] minimum recommended value, 0.80); and targeting of items to participants' victimization and bullying. Response categories were misused for both parts I and II, which required repair before further analysis. Measurement precision was inadequate for both parts (PSR, 0.64 for part I and 0.19 for part II), indicating poor discriminatory ability. All items fit the Rasch model well in part I, indicating unidimensionality that was further confirmed using PCA of residuals. However, an item misfit in part II that required deletion following which the remaining items fit and PCA of residuals also supported unidimensionality. Targeting was -0.58 logits for part I, indicating that the items were matched well with the participants' victimization. By comparison, targeting was suboptimal for part II (-1.97 logits). In its current state, the revised OBVQ is not a valid psychometric instrument to assess victimization and bullying among children with VI.
Reliability and Validity of the Visual, Musculoskeletal, and Balance Complaints Questionnaire

PubMed Central

Lundqvist, Lars-Olov; Zetterlund, Christina; Richter, Hans O.

2016-01-01

ABSTRACT Purpose To evaluate the reliability and validity of the 15-item Visual, Musculoskeletal, and Balance Complaints Questionnaire (VMB) for people with visual impairments, using confirmatory factor analysis (CFA) and with Rasch analysis for use as an outcome measure. Methods Two studies evaluated the VMB. In Study 1, VMB data were collected from 1249 out of 3063 individuals between 18 and 104 years old who were registered at a low vision center. CFA evaluated VMB factor structure and Rasch analysis evaluated VMB scale properties. In Study 2, a subsample of 52 individuals between 27 and 67 years old with visual impairments underwent further measurements. Visual clinical assessments, neck/scapular pain, and balance assessments were collected to evaluate the convergent validity of the VMB (i.e. the domain relationship with other, theoretically predicted measures). Results CFA supported the a priori three-factor structure of the VMB. The factor loadings of the items on their respective domains were all statistically significant. Rasch analysis indicated disordered categories and the original 10-point scale was subsequently replaced with a 5-point scale. Each VMB domain fitted the Rasch model, showing good metric properties, including unidimensionality (explained variances ≥66% and eigenvalues <1.9), person separation (1.86 to 2.29), reliability (0.87 to 0.94), item fit (infit MnSq’s >0.72 and outfit MnSq’s <1.47), targeting (0.30 to 0.50 logits), and insignificant differential item functioning (all DIFs but one <0.50 logits) from gender, age, and visual status. The three VMB domains correlated significantly with relevant visual, musculoskeletal, and balance assessments, demonstrating adequate convergent validity of the VMB. Conclusions The VMB is a simple, inexpensive, and quick yet reliable and valid way to screen and evaluate concurrent visual, musculoskeletal, and balance complaints, with contribution to epidemiological and intervention research and potential clinical implications for the field of health services and low vision rehabilitation. PMID:27309524
Health- and vision-related quality of life in intellectually disabled children.

PubMed

Cui, Yu; Stapleton, Fiona; Suttle, Catherine; Bundy, Anita

2010-01-01

To investigate the psychometric properties of instruments for the assessment of self-reported functional vision performance and health-related quality of life in children with intellectual disabilities (IDs). Two instruments [Autoquestionnaire Enfant Image (AUQUEI), LV Prasad-Functional Vision Questionnaire (LVP-FVQ)] designed for the assessment of functional vision and health-related quality of life were adapted and administered to 168 school children with ID, aged 8 to 18 years. Rasch analysis was used to determine the appropriateness of the rating scales of these instruments and to identify any redundant items. Redundant items were excluded based on descriptive statistics and Rasch analysis, leaving 17 of 23 items in the revised AUQUEI and 16 of 22 in the LVP-FVQ. The AUQUEI items showed disordered thresholds on the rating scale. A modified step calibration (collapsed from four categories to three categories) resulted in ordered response thresholds for all items. The adjusted instrument produced an overall fit to the model (mean item infit = 1.06, SD = 0.32; mean item outfit = 1.11, SD = 0.35), indicating good construct validity. After Rasch analysis, the AUQUEI showed good content validity (person separation = 2.18; item reliability = 0.99; Cronbach alpha = 0.89). Increased similarity of person and item means and SDs on the logit scale after modification would indicate that the instrument was more applicable to the target population in its modified form. In contrast, the LVP-FVQ had a low person separation (1.35), suggesting that a more appropriate instrument is needed for assessment of vision-related quality of life in children with ID. The psychometric properties of two instruments were explored using Rasch analysis. By rescaling and reduction of items, the instruments were modified for use in a population of children with at least mild to moderate ID. However, an alternative instrument is needed for the assessment of vision-related quality of life in intellectually disabled children with normal vision or mild visual abnormalities.
Catquest-9SF questionnaire: validation of Malay and Chinese-language versions using Rasch analysis.

PubMed

Adnan, Tassha Hilda; Mohamed Apandi, Mokhlisoh; Kamaruddin, Haireen; Salowi, Mohamad Aziz; Law, Kian Boon; Haniff, Jamaiyah; Goh, Pik Pin

2018-01-05

Catquest questionnaire was originally developed in Swedish to measure patients' self-assessed visual function to evaluate the benefit of cataract surgery. The result of the Rasch analysis leading to the creation of the nine-item short form of Catquest, (Catquest-9SF), and it had been translated and validated in English. The aim is therefore to evaluate the translated Catquest-9SF questionnaire in Malay and Chinese (Mandarin) language version for measuring patient-reported visual function among cataract population in Malaysia. The English version of Catquest-9SF questionnaire was translated and back translated into Malay and Chinese languages. The Malay and Chinese translated versions were self-administered by 236 and 202 pre-operative patients drawn from a cataract surgery waiting list, respectively. The translated Catquest-9SF data and its four response options were assessed for fit to the Rasch model. The Catquest-9SF performed well in the Malay and Chinese translated versions fulfilling all criteria for valid measurement, as demonstrated by Rasch analysis. Both versions of questionnaire had ordered response thresholds, with a good person separation (Malay 2.84; and Chinese 2.59) and patient separation reliability (Malay 0.89; Chinese 0.87). Targeting was 0.30 and -0.11 logits in Malay and Chinese versions respectively, indicating that the item difficulty was well suited to the visual abilities of the patients. All items fit a single overall construct (Malay infit range 0.85-1.26, outfit range 0.73-1.13; Chinese infit range 0.80-1.51, outfit range 0.71-1.36), unidimensional by principal components analysis, and was free of Differential Item Functioning (DIF). These results support the good overall functioning of the Catquest-9SF in patients with cataract. The translated questionnaire to Malay and Chinese-language versions are reliable and valid in measuring visual disability outcomes in the Malaysian cataract population.

Pharmacy students' opinions of direct-to-consumer advertising: a pilot study at one university.

PubMed

Harrington, Amanda R; Desselle, Shane P; Apgar, David A; Hesselbacher, Elizabeth; Pié, Aaron; Quesnel, Aimee; Warholak, Terri L

2013-01-01

Direct-to-consumer advertisement (DTCA) of prescription medications has become an important informational source for health care consumers. As future health care professionals on the front line of potential communication and dispensing of products emerging from DTCA, it is important to elicit the attitudes of student-pharmacists. This study aims to (1) evaluate the validity of the DTCA attitudinal questionnaire using Rasch rating scale analysis and (2) investigate the attitudes of pharmacy students toward DTCA and determine whether these attitudes were associated with years of pharmacy education and demographic characteristics. This investigation used a cross-sectional print-based questionnaire to evaluate the attitudes of pharmacy students toward DTCA of prescription medications. The 16-item questionnaire included items addressing the attitudes of pharmacy students toward DTCA with respect to patients' knowledge of medications, pharmacists' interaction with patients, and overall consumer judgment of medical prescriptions. Analyses included Rasch analysis and a multiple linear regression. A total of 243 students submitted usable questionnaires (85% response rate). Item response categories were collapsed from 5 categories to 3, and 4 items were removed to achieve acceptable Rasch model fit. Pharmacy students demonstrated little difficulty in agreeing with the statements suggesting that DTCA helps patients take a more active role in health care and had the most difficulty in agreeing with items suggesting that DTCA may lead to inappropriate prescribing to satisfy patient requests. Students' overall support for DTCA was the only variable that predicted the questionnaire score (P<.001). In conclusion, the Rasch analysis evaluated the psychometric properties of the instrument and identified the necessity to adapt the questionnaire from previous iterations to adequately fit the student population. Future research should examine factors that contribute to the variance in attitudes toward DTCA among a larger and more heterogeneous population. Copyright © 2013 Elsevier Inc. All rights reserved.
A Generalized Measurement Model to Quantify Health: The Multi-Attribute Preference Response Model

PubMed Central

Krabbe, Paul F. M.

2013-01-01

After 40 years of deriving metric values for health status or health-related quality of life, the effective quantification of subjective health outcomes is still a challenge. Here, two of the best measurement tools, the discrete choice and the Rasch model, are combined to create a new model for deriving health values. First, existing techniques to value health states are briefly discussed followed by a reflection on the recent revival of interest in patients’ experience with regard to their possible role in health measurement. Subsequently, three basic principles for valid health measurement are reviewed, namely unidimensionality, interval level, and invariance. In the main section, the basic operation of measurement is then discussed in the framework of probabilistic discrete choice analysis (random utility model) and the psychometric Rasch model. It is then shown how combining the main features of these two models yields an integrated measurement model, called the multi-attribute preference response (MAPR) model, which is introduced here. This new model transforms subjective individual rank data into a metric scale using responses from patients who have experienced certain health states. Its measurement mechanism largely prevents biases such as adaptation and coping. Several extensions of the MAPR model are presented. The MAPR model can be applied to a wide range of research problems. If extended with the self-selection of relevant health domains for the individual patient, this model will be more valid than existing valuation techniques. PMID:24278141
The development and validation of the core competencies scale (CCS) for the college and university students.

PubMed

Ruan, Bin; Mok, Magdalena Mo Ching; Edginton, Christopher R; Chin, Ming Kai

2012-01-01

This article describes the development and validation of the Core Competencies Scale (CCS) using Bok's (2006) competency framework for undergraduate education. The framework included: communication, critical thinking, character development, citizenship, diversity, global understanding, widening of interest, and career and vocational development. The sample comprised 70 college and university students. Results of analysis using Rasch rating scale modelling showed that there was strong empirical evidence on the validity of the measures in contents, structure, interpretation, generalizability, and response options of the CCS scale. The implication of having developed Rasch-based valid and dependable measures in this study for gauging the value added of college and university education to their students is that the feedback generated from CCS will enable evidence-based decision and policy making to be implemented and strategized. Further, program effectiveness can be measured and thus accountability on the achievement of the program objectives.
Rasch Analysis of Scientific Literacy in an Astronomical Citizen Science Project

NASA Astrophysics Data System (ADS)

Price, A.

2012-06-01

(Abstract only) We investigate change in attitudes towards science and belief in the nature of science by participants in a citizen science project about astronomy. A pre-test was given to 1,385 participants and a post-test was given six months later to 165 participants. Nine participants were interviewed. Responses were analyzed using the Rasch Rating Scale Model to place Likert data on an interval scale allowing for more sensitive parametric analysis. Results show that overall attitudes did not change, p = .225. However, there was significant change towards attitudes relating to science news (positive) and scientific self efficacy (negative), p = .001 and p = .035, respectively. This change was related to social activity in the project. Beliefs in the nature of science exhibited a small but significant increase, p = .04. Relative positioning of scores on the belief items suggests the increase is mostly due to reinforcement of current beliefs.
Rasch Analysis of Scientific Literacy in an Astronomical Citizen Science Project

NASA Astrophysics Data System (ADS)

Price, Aaron

2011-05-01

We investigate change in attitudes towards science and belief in the nature of science by participants in a citizen science project about astronomy. A pre-test was given to 1,385 participants and a post-test was given six months later to 165 participants. Nine participants were interviewed. Responses were analyzed using the Rasch Rating Scale Model to place Likert data on an interval scale allowing for more sensitive parametric analysis. Results show that overall attitudes did not change, p = .225. However, there was significant change towards attitudes relating to science news (positive) and scientific self efficacy (negative), p < .001 and p = .035 respectively. This change was related to social activity in the project. Beliefs in the nature of science exhibited a small, but significant increase, p = .04. Relative positioning of scores on the belief items suggests the increase is mostly due to reinforcement of current beliefs.
Parental Health Attributions of Childhood Health and Illness: Development of the Pediatric Cultural Health Attributions Questionnaire (Pedi-CHAQ).

PubMed

Vaughn, Lisa M; McLinden, Daniel J; Shellmer, Diana; Baker, Raymond C

2011-01-01

The causes attributed to childhood health and illness across cultures (cultural health attributions) are key factors that are now more frequently identified as affecting the health outcomes of children. Research suggests that the causes attributed to an event such as illness are thought to affect subsequent motivation, emotional response, decision making, and behavior. To date, there is no measure of health attributions appropriate for use with parents of pediatric patients. Using the Many-Facets approach to Rasch analysis, this study assesses the psychometrics of a newly developed instrument, the Pediatric Health Attributions Questionnaire (Pedi-CHAQ), a measure designed to assess the cultural health attributions of parents in diverse communities. Results suggest acceptable Rasch model statistics of fit and reliability for the Pedi-CHAQ. A shortened version of the questionnaire was developed as a result of this study and next steps are discussed.
Post-hoc simulation study to adopt a computerized adaptive testing (CAT) for a Korean Medical License Examination.

PubMed

Seo, Dong Gi; Choi, Jeongwook

2018-05-17

Computerized adaptive testing (CAT) has been adopted in license examinations due to a test efficiency and accuracy. Many research about CAT have been published to prove the efficiency and accuracy of measurement. This simulation study investigated scoring method and item selection methods to implement CAT in Korean medical license examination (KMLE). This study used post-hoc (real data) simulation design. The item bank used in this study was designed with all items in a 2017 KMLE. All CAT algorithms for this study were implemented by a 'catR' package in R program. In terms of accuracy, Rasch and 2parametric logistic (PL) model performed better than 3PL model. Modal a Posteriori (MAP) or Expected a Posterior (EAP) provided more accurate estimates than MLE and WLE. Furthermore Maximum posterior weighted information (MPWI) or Minimum expected posterior variance (MEPV) performed better than other item selection methods. In terms of efficiency, Rasch model was recommended to reduce test length. Simulation study should be performed under varied test conditions before adopting a live CAT. Based on a simulation study, specific scoring and item selection methods should be predetermined before implementing a live CAT.
An approach to studying scale for students in higher education: a Rasch measurement model analysis.

PubMed

Waugh, R F; Hii, T K; Islam, A

2000-01-01

A questionnaire comprising 80 self-report items was designed to measure student Approaches to Studying in a higher education context. The items were conceptualized and designed from five learning orientations: a Deep Approach, a Surface Approach, a Strategic Approach, Clarity of Direction and Academic Self-Confidence, to include 40 attitude items and 40 corresponding behavior items. The study aimed to create a scale and investigate its psychometric properties using a Rasch measurement model. The convenience sample consisted of 350 students at an Australian university in 1998. The analysis supported the conceptual structure of the Scale as involving studying attitudes and behaviors towards five orientations to learning. Attitudes are mostly easier than behaviors, in line with the theory. Sixty-eight items fit the model and have good psychometric properties. The proportion of observed variance considered true is 92% and the Scale is well-targeted against the students. Some harder items are needed to improve the targeting and some further testing work needs to be done on the Surface Approach. In the Surface Approach and Clarity of Direction in Studying, attitudes make a lesser contribution than behaviors to the variable, Approaches to Studying.
Thorndike, Thurstone and Rasch: A Comparison of Their Approaches to Item-Invariant Measurement.

ERIC Educational Resources Information Center

Englehard, George, Jr.

The methods used by E. L. Thorndike, L. L. Thurstone, and G. Rasch to address issues related to item-invariant measurement and the scoring of individual performance are compared. The analyses highlight the close connection among the three methods, and suggest that progress in measurement theory reflects the movement from essentially ad hoc methods…
Developing Information Skills Test for Malaysian Youth Students Using Rasch Analysis

ERIC Educational Resources Information Center

Karim, Aidah Abdul; Shah, Parilah M.; Din, Rosseni; Ahmad, Mazalah; Lubis, Maimun Aqhsa

2014-01-01

This study explored the psychometric properties of a locally developed information skills test for youth students in Malaysia using Rasch analysis. The test was a combination of 24 structured and multiple choice items with a 4-point grading scale. The test was administered to 72 technical college students and 139 secondary school students. The…
Multidimensional Rasch Analysis of a Psychological Test with Multiple Subtests: A Statistical Solution for the Bandwidth-Fidelity Dilemma

ERIC Educational Resources Information Center

Cheng, Ying-Yao; Wang, Wen-Chung; Ho, Yi-Hui

2009-01-01

Educational and psychological tests are often composed of multiple short subtests, each measuring a distinct latent trait. Unfortunately, short subtests suffer from low measurement precision, which makes the bandwidth-fidelity dilemma inevitable. In this study, the authors demonstrate how a multidimensional Rasch analysis can be employed to take…
An Evaluation of the Environmental Literacy of Preservice Teachers in Turkey through Rasch Analysis

ERIC Educational Resources Information Center

Teksoz, G. Tuncer; Boone, J. W.; Tuzun, O. Yilmaz; Oztekin, C.

2014-01-01

The purpose of this study was to make use of proposed definitions of environmental literacy to (1) guide the application of Rasch analysis and (2) utilize the developed instrumentation to further inform the work of environmental educators. A total of 2311 preservice teachers attending Faculty of Education departments of four public universities…
A Comparison of the Rasch Separate Calibration and Between-Fit Methods of Detecting Item Bias.

ERIC Educational Resources Information Center

Smith, Richard M.

1996-01-01

The separate calibration t-test approach of B. Wright and M. Stone (1979) and the common calibration between-fit approach of B. Wright, R. Mead, and R. Draba (1976) appeared to have similar Type I error rates and similar power to detect item bias within a Rasch framework. (SLD)
Measuring Longitudinal Gains in Student Learning: A Comparison of Rasch Scoring and Summative Scoring Approaches

ERIC Educational Resources Information Center

Zhao, Yue; Huen, Jenny M. Y.; Chan, Y. W.

2017-01-01

This study pioneers a Rasch scoring approach and compares it to a conventional summative approach for measuring longitudinal gains in student learning. In this methodological note, our proposed methodology is demonstrated using an example of rating scales in a student survey as part of a higher education outcome assessment. Such assessments have…
Rasch Based Analysis of Oral Proficiency Test Data.

ERIC Educational Resources Information Center

Nakamura, Yuji

2001-01-01

This paper examines the rating scale data of oral proficiency tests analyzed by a Rasch Analysis focusing on an item map and factor analysis. In discussing the item map, the difficulty order of six items and students' answering patterns are analyzed using descriptive statistics and measures of central tendency of test scores. The data ranks the…
Development and initial validation of the Pharmacist Frequency of Interprofessional Collaboration Instrument (FICI-P) in primary care.

PubMed

Van, Connie; Costa, Daniel; Mitchell, Bernadette; Abbott, Penny; Krass, Ines

2012-01-01

Existing validated measures of pharmacist-physician collaboration focus on measuring attitudes toward collaboration and do not measure frequency of collaborative interactions. To develop and validate an instrument to measure the frequency of collaboration between pharmacists and general practitioners (GPs) from the pharmacist's perspective. An 11-item Pharmacist Frequency of Interprofessional Collaboration Instrument (FICI-P) was developed and administered to 586 pharmacists in 8 divisions of general practice in New South Wales, Australia. The initial items were informed by a review of the literature in addition to interviews of pharmacists and GPs. Items were subjected to principal component and Rasch analyses to determine each item's and the overall measure's psychometric properties and for any needed refinements. Two hundred and twenty four (38%) of pharmacist surveys were completed and returned. Principal component analysis suggested removal of 1 item for a final 1-factor solution. The refined 10-item FICI-P demonstrated internal consistency reliability at Cronbach's alpha=0.90. After collapsing the original 5-point response scale to a 4-point response scale, the refined FICI-P demonstrated fit to the Rasch model. Criterion validity of the FICI-P was supported by the correlation of FICI-P scores with scores on a previously validated Physician-Pharmacist Collaboration Instrument. Validity was also supported by predicted differences in FICI-P scores between subgroups of respondents stratified on age, colocation with GPs, and interactions during the intern-training period. The refined 10-item FICI-P was shown to have good internal consistency, criterion validity, and fit to the Rasch model. The creation of such a tool may allow for the measure of impact in the evaluation of interventions designed to improve interprofessional collaboration between GPs and pharmacists. Copyright © 2012 Elsevier Inc. All rights reserved.
[Examination of calibrated item banks for the assessment of work capacity in an outpatient sample of cardiological patients].

PubMed

Haschke, A; Abberger, B; Schröder, K; Wirtz, M; Bengel, J; Baumeister, H

2013-12-01

Work capacity is a major outcome variable in cardiological rehabilitation. However, there is a lacks of capacious and economic assessment instruments for work capacity. By developing item response theory based item banks a first step to close this gap is done. The present study aims to validate the work capacity item banks for cardiovascular rehabilitation inpatients (WCIB-Cardio) in a sample of cardiovascular rehabilitation outpatients. Additionally, we examined differences between in- and outpatients with regard to their work capacity. Data of 283 cardiovascular rehabilitation inpatients and 77 cardiovascular rehabilitation outpatients were collected in 15 rehabilitation centres. The WCIB-Cardio contains the 2 domains of "cognitive work capacity"(20 items) and "physical work capacity"(18 items). Validation of the item bank for cardiological outpatients was conducted with separate Rasch analysis for each domain. For the domain of cognitive work capacity 10 items showed satisfying quality criteria (Rasch reliability=0.71; overall model fit=0.07). For the domain of physical work capacity good values for Rasch-reliability (0.83) and overall -model fit (0.65) could be proven after exclusion of 3 items. Unidimensionality and a broad ability spectrum could be covered for both domains. With regard to content, outpatients evaluate themselves less burdened than inpatients for the domain of cognitive work capacity (‾X outpatient =-2.06 vs. ‾X inpatient =-2.49; p<0.07) similarly for the domain of physical work capacity (‾X outpatient =-3.68 vs. ‾X inpatient =-2.88; p<0.01). With the WCIB-Cardio II there is a precondition to develop self-report instruments of work capacity in cardiological in- and outpatients. © Georg Thieme Verlag KG Stuttgart · New York.
Reliability and validity of the Turkish version of the Rapid Estimate of Adult Literacy in Dentistry (TREALD-30).

PubMed

Peker, Kadriye; Köse, Taha Emre; Güray, Beliz; Uysal, Ömer; Erdem, Tamer Lütfi

2017-04-01

To culturally adapt the Turkish version of Rapid Estimate of Adult Literacy in Dentistry (TREALD-30) for Turkish-speaking adult dental patients and to evaluate its psychometric properties. After translation and cross-cultural adaptation, TREALD-30 was tested in a sample of 127 adult patients who attended a dental school clinic in Istanbul. Data were collected through clinical examinations and self-completed questionnaires, including TREALD-30, the Oral Health Impact Profile (OHIP), the Rapid Estimate of Adult Literacy in Medicine (REALM), two health literacy screening questions, and socio-behavioral characteristics. Psychometric properties were examined using Classical Test Theory (CTT) and Rasch analysis. Internal consistency (Cronbach's Alpha = 0.91) and test-retest reliability (Intraclass correlation coefficient = 0.99) were satisfactory for TREALD-30. It exhibited good convergent and predictive validity. Monthly family income, years of education, dental flossing, health literacy, and health literacy skills were found as stronger predictors of patients'oral health literacy (OHL). Confirmatory factor analysis (CFA) confirmed a two-factor model. The Rasch model explained 37.9% of the total variance in this dataset. In addition, TREALD-30 had eleven misfitting items, which indicated evidence of multidimensionality. The reliability indeces provided in Rasch analysis (person separation reliability = 0.91 and expected-a-posteriori/plausible reliability = 0.94) indicated that TREALD-30 had acceptable reliability. TREALD-30 showed satisfactory psychometric properties. It may be used to identify patients with low OHL. Socio-demographic factors, oral health behaviors and health literacy skills should be taken into account when planning future studies to assess the OHL in both clinical and community settings.
The reliability and validity of the English version of the Evaluation of Daily Activity Questionnaire for people with rheumatoid arthritis

PubMed Central

Tennant, Alan; Tyson, Sarah F.; Nordenskiöld, Ulla; Hawkins, Ruth; Prior, Yeliz

2015-01-01

Objectives. The Evaluation of Daily Activity Questionnaire (EDAQ) includes 138 items in 14 domains identified as important by people with RA. The aim of this study was to test the validity and reliability of the English EDAQ. Methods. A total of 502 participants completed two questionnaires 3 weeks apart. The first consisted of the EDAQ, HAQ, RA Quality of Life (RAQoL) and the Medical Outcomes Scale (MOS) 36-item Short-Form Health Survey (SF-36v2), and the second consisted of the EDAQ only. The 14 EDAQ domains were tested for: unidimensionality—using confirmatory factor analysis; fit, response dependency, invariance across groups (differential item functioning)—using Rasch analysis; internal consistency [Person Separation Index (PSI)]; concurrent validity—by correlations with the HAQ, SF-36v2 and RAQoL; and test–retest reliability (Spearman’s correlations). Results. Confirmatory factor analysis of the 14 EDAQ domains indicated unidimensionality, after adjustment for local dependency in each domain. All domains achieved a root mean square error of approximation <0.10 and satisfied Rasch model expectations for local dependency. DIF by age, gender and employment status was largely absent. The PSI was consistent with individual use (PSI = 0.94 for all 14 domains). For all domains, except Caring, concurrent validity was good: HAQ (rs = 0.72–0.91), RAQoL (rs = 0.67–0.82) and SF36v2 Physical Function scale (rs = −0.60 to −0.84) and test–retest reliability was good (rs = 0.70–0.89). Conclusion. Analysis supported a 14-domain, two-component structure (Self care and Mobility) of the EDAQ, where each domain, and both components, satisfied Rasch model requirements, and have robust reliability and validity. PMID:25863045
Measurement of Perceived Stress in Age-Related Macular Degeneration.

PubMed

Dougherty, Bradley E; Cooley, San-San L; Davidorf, Frederick H

2017-03-01

To validate the Perceived Stress Scale (PSS) in patients with age-related macular degeneration (AMD) using Rasch analysis. Study participants with AMD were recruited from the retina service of the Department of Ophthalmology at the Ohio State University during clinical visits for treatment or observation. Visual acuity with habitual distance correction was assessed. A 10-item version of the PSS was administered in large print or by reading the items to the patient. Rasch analysis was used to investigate the measurement properties of the PSS, including fit to the model, ability to separate between people with different levels of perceived stress, category response structure performance, and unidimensionality. A total of 137 patients with a diagnosis of AMD were enrolled. The mean (±SD) age of participants was 82 ± 9 years. Fifty-four percent were female. Median Early Treatment of Diabetic Retinopathy Study (ETDRS) visual acuity of the better eye was 65 letters (Snellen 20/50), with a range of approximately 20/800 to 20/15. Forty-seven percent of participants were receiving an anti-VEGF injection on the day of the study visit. The response category structure was appropriate. One item, "How often have you felt confident in your ability to handle your personal problems?" was removed due to poor fit statistics. The remaining nine items showed good fit to the model, acceptable measurement precision as assessed by the Rasch person separation statistic, and unidimensionality. There was some evidence of differential item functioning by age and visual acuity. The Perceived Stress Scale demonstrated acceptable measurement properties and may be useful for the measurement of perceived stress in patients with AMD.

Applying the Mixed Methods Instrument Development and Construct Validation Process: the Transformative Experience Questionnaire

ERIC Educational Resources Information Center

Koskey, Kristin L. K.; Sondergeld, Toni A.; Stewart, Victoria C.; Pugh, Kevin J.

2018-01-01

Onwuegbuzie and colleagues proposed the Instrument Development and Construct Validation (IDCV) process as a mixed methods framework for creating and validating measures. Examples applying IDCV are lacking. We provide an illustrative case integrating the Rasch model and cognitive interviews applied to the development of the Transformative…
Toddlers' Expressive Vocabulary Outcomes after One Year of Parent-Child Home Program Services

ERIC Educational Resources Information Center

Manz, Patricia H.; Bracaliello, Catherine B.; Pressimone, Vanessa J.; Eisenberg, Rachel A.; Gernhart, Amanda C.; Fu, Qiong; Zuniga, Cesar

2016-01-01

This quasi-experimental study examined expressive vocabulary outcomes for Parent-Child Home Program (PCHP) toddlers, after one year of home-visiting services. First, this study applied Rasch modelling to establish the construct validity and reliability of a widely used expressive vocabulary measure, as modified for a sample of ethnic and…
Combined Common Person and Common Item Equating of Medical Science Examinations.

ERIC Educational Resources Information Center

Kelley, Paul R.

This equating study of the National Board of Medical Examiners Examinations was a combined common persons and common items equating, using the Rasch model. The 1,000-item test was administered to about 3,000 second-year medical students in seven equal-length subtests: anatomy, physiology, biochemistry, pathology, microbiology, pharmacology, and…
Other Historical and Philosophical Perspectives on Invariance in Measurement

ERIC Educational Resources Information Center

Fisher, William P., Jr.

2008-01-01

Engelhard draws out the similarities and differences in Guttman's, Rasch's, and Mokken's perspectives on invariance in measurement. He provides a valuable model in evaluating the extent to which different measurement theories and methods serve as a basis for achieving the fundamental goals of quantification. The full extent of this point will…
Introducing "Emotioncy" as a Potential Source of Test Bias: A Mixed Rasch Modeling Study

ERIC Educational Resources Information Center

Pishghadam, Reza; Baghaei, Purya; Seyednozadi, Zahra

2017-01-01

This article attempts to present emotioncy as a potential source of test bias to inform the analysis of test item performance. Emotioncy is defined as a hierarchy, ranging from "exvolvement" (auditory, visual, and kinesthetic) to "involvement" (inner and arch), to emphasize the emotions evoked by the senses. This study…
Test Bias: An Objective Definition for Test Items.

ERIC Educational Resources Information Center

Durovic, Jerry J.

A test bias definition, applicable at the item-level of a test is presented. The definition conceptually equates test bias with measuring different things in different groups, and operationally equates test bias with a difference in item fit to the Rasch Model, greater than one, between groups. It is suggested that the proposed definition avoids…
Logistic Achievement Test Scaling and Equating with Fixed versus Estimated Lower Asymptotes.

ERIC Educational Resources Information Center

Phillips, S. E.

This study compared the lower asymptotes estimated by the maximum likelihood procedures of the LOGIST computer program with those obtained via application of the Norton methodology. The study also compared the equating results from the three-parameter logistic model with those obtained from the equipercentile, Rasch, and conditional…
Measuring Teacher Dispositions: An Application of the Rasch Model to a Complex Accreditation Requirement

ERIC Educational Resources Information Center

Wilkerson, Judy R.; Lang, William Steve

2004-01-01

The construct of dispositions is well defined in national standards, and U.S. colleges of education are required to assess candidate dispositions to meet accreditation requirements. Measurement, however, is virtually non-existent. On-line reviews of college accreditation reports indicate that colleges are attempting to assess dispositions without…
Assessing Attitudes toward Mathematics across Teacher Education Contexts

ERIC Educational Resources Information Center

Jong, Cindy; Hodges, Thomas E.

2015-01-01

This article reports on the development of attitudes toward mathematics among pre-service elementary teachers (n = 146) in relation to their experiences as K-12 learners of mathematics and experiences within a teacher education program. Using a combination of the Rasch Rating Scale Model and traditional parametric analyses, results indicate that…
Diagnostic Opportunities Using Rasch Measurement in the Context of a Misconceptions-Based Physical Science Assessment

ERIC Educational Resources Information Center

Wind, Stefanie A.; Gale, Jessica D.

2015-01-01

Multiple-choice (MC) items that are constructed such that distractors target known misconceptions for a particular domain provide useful diagnostic information about student misconceptions (Herrmann-Abell & DeBoer, 2011, 2014; Sadler, 1998). Item response theory models can be used to examine misconceptions distractor-driven multiple-choice…
Development of a Measurement Instrument to Assess Students' Electrolyte Conceptual Understanding

ERIC Educational Resources Information Center

Lu, Shanshan; Bi, Hualin

2016-01-01

To assess students' conceptual understanding levels and diagnose alternative frameworks of the electrolyte concept, a measurement instrument was developed using the Rasch model. This paper reports the use of the measurement instrument to assess 559 students from grade 10 to grade 12 in two cities. The results provided both diagnostic and summative…
Analysis of Open-Ended Statistics Questions with Many Facet Rasch Model

ERIC Educational Resources Information Center

Güler, Nese

2014-01-01

Problem Statement: The most significant disadvantage of open-ended items that allow the valid measurement of upper level cognitive behaviours, such as synthesis and evaluation, is scoring. The difficulty associated with objectively scoring the answers to the items contributes to the reduction of the reliability of the scores. Moreover, other…
Information Needs within a Multi-District Environment.

ERIC Educational Resources Information Center

Thomas, Gregory P.

This paper argues that no single measurement strategy serves all purposes and that applying methods and techniques which allow a variety of data elements to be retrieved and juxtaposed may be an investment in the future. Item response theory, Rasch model, and latent trait theory are all approaches to a single conceptual topic. An abbreviated look…
Measuring Student Teachers' Practices and Beliefs about Teaching Mathematics Using the Rasch Model

ERIC Educational Resources Information Center

Kaspersen, Eivind; Pepin, Birgit; Sikko, Svein Arne

2017-01-01

Several attempts have been made to measure and categorize beliefs and practices of mathematics teachers [Swan, M. 2006. "Designing and Using Research Instruments to Describe the Beliefs and Practices of Mathematics Teachers." "Research in Education" 75 (1): 58-70]. One of the reasons for measuring both beliefs and practices is…
Refining Change Measure with the Rasch Model

ERIC Educational Resources Information Center

Zaporozhets, Olga; Fox, Christine M.; Beltyukova, Svetlana A.; Laux, John M.; Piazza, Nick J.; Salyers, Kathleen

2015-01-01

This study was to develop a linear measure of change using University of Rhode Island Change Assessment items that represented Prochaska and DiClemente's theory. The resulting Toledo Measure of Change is short, is easy to use, and provides reliable scores for identification of individuals' stage of change and progression within that stage.
Students' Progression of Understanding the Matter Concept from Elementary to High School

ERIC Educational Resources Information Center

Liu, Xiufeng; Lesniak, Kathleen M.

2005-01-01

Using the US national sample from the Third International Mathematics and Science Study (TIMSS) and the Rasch modeling method, this study identified the conceptual progression sequence of various matter concept aspects, and compared students' latent abilities against the sequence. We found that the four matter aspects, i.e. conservation, physical…
Improving Students' Attitudes toward Science Using Instructional Congruence

ERIC Educational Resources Information Center

Zain, Ahmad Nurulazam Md; Samsudin, Mohd Ali; Rohandi, Robertus; Jusoh, Azman

2010-01-01

The objective of this study was to improve students' attitudes toward science using instructional congruence. The study was conducted in Malaysia, in three low-performing secondary schools in the state of Penang. Data collected with an Attitudes in Science instrument were analysed using Rasch modeling. Qualitative data based on the reflections of…
Evaluating the Bookmark Judgments of Standard-Setting Panelists

ERIC Educational Resources Information Center

Engelhard, George, Jr.

2011-01-01

The purpose of this study is to describe a new approach for evaluating the judgments of standard-setting panelists within the context of the bookmark procedure. The bookmark procedure is widely used for setting performance standards on high-stakes assessments. A many-faceted Rasch (MFR) model is proposed for evaluating the bookmark judgments of…
Psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale: A Rasch rating scale analysis and confirmatory factor analysis.

PubMed

Pilatti, Angelina; Lozano, Oscar M; Cyders, Melissa A

2015-12-01

The present study was aimed at determining the psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale in a sample of college students. Participants were 318 college students (36.2% men; mean age = 20.9 years, SD = 6.4 years). The psychometric properties of this Spanish version were analyzed using the Rasch model, and the factor structure was examined using confirmatory factor analysis. The verification of the global fit of the data showed adequate indexes for persons and items. The reliability estimates were high for both items and persons. Differential item functioning across gender was found for 23 items, which likely reflects known differences in impulsivity levels between men and women. The factor structure of the Spanish version of the UPPS-P replicates previous work with the original UPPS-P Scale. Overall, results suggest that test scores from the Spanish version of the UPPS-P show adequate psychometric properties to accurately assess the multidimensional model of impulsivity, which represents the most exhaustive measure of this construct. (c) 2015 APA, all rights reserved).
Development of an Item Bank for the Assessment of Knowledge on Biology in Argentine University Students.

PubMed

Cupani, Marcos; Zamparella, Tatiana Castro; Piumatti, Gisella; Vinculado, Grupo

The calibration of item banks provides the basis for computerized adaptive testing that ensures high diagnostic precision and minimizes participants' test burden. This study aims to develop a bank of items to measure the level of Knowledge on Biology using the Rasch model. The sample consisted of 1219 participants that studied in different faculties of the National University of Cordoba (mean age = 21.85 years, SD = 4.66; 66.9% are women). The items were organized in different forms and into separate subtests, with some common items across subtests. The students were told they had to answer 60 questions of knowledge on biology. Evaluation of Rasch model fit (Zstd >|2.0|), differential item functioning, dimensionality, local independence, item and person separation (>2.0), and reliability (>.80) resulted in a bank of 180 items with good psychometric properties. The bank provides items with a wide range of content coverage and may serve as a sound basis for computerized adaptive testing applications. The contribution of this work is significant in the field of educational assessment in Argentina.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.