multidimensional item response: Topics by Science.gov

Sample records for multidimensional item response

On Multidimensional Item Response Theory: A Coordinate-Free Approach. Research Report. ETS RR-07-30

ERIC Educational Resources Information Center

Antal, Tamás

2007-01-01

A coordinate-free definition of complex-structure multidimensional item response theory (MIRT) for dichotomously scored items is presented. The point of view taken emphasizes the possibilities and subtleties of understanding MIRT as a multidimensional extension of the classical unidimensional item response theory models. The main theorem of the…
A Multidimensional Ideal Point Item Response Theory Model for Binary Data

ERIC Educational Resources Information Center

Maydeu-Olivares, Albert; Hernandez, Adolfo; McDonald, Roderick P.

2006-01-01

We introduce a multidimensional item response theory (IRT) model for binary data based on a proximity response mechanism. Under the model, a respondent at the mode of the item response function (IRF) endorses the item with probability one. The mode of the IRF is the ideal point, or in the multidimensional case, an ideal hyperplane. The model…
Evaluating Item Fit for Multidimensional Item Response Models

ERIC Educational Resources Information Center

Zhang, Bo; Stone, Clement A.

2008-01-01

This research examines the utility of the S-X² statistic proposed by Orlando and Thissen (2000) in evaluating item fit for multidimensional item response models. Monte Carlo simulation was conducted to investigate both the Type I error and statistical power of this fit statistic in analyzing two kinds of multidimensional test…
Assessing Construct Validity Using Multidimensional Item Response Theory.

ERIC Educational Resources Information Center

Ackerman, Terry A.

The concept of a user-specified validity sector is discussed. The idea of the validity sector combines the work of M. D. Reckase (1986) and R. Shealy and W. Stout (1991). Reckase developed a methodology to represent an item in a multidimensional latent space as a vector. Item vectors are computed using multidimensional item response theory item…
A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

ERIC Educational Resources Information Center

Fukuhara, Hirotaka; Kamata, Akihito

2011-01-01

A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…
The Definition of Difficulty and Discrimination for Multidimensional Item Response Theory Models.

ERIC Educational Resources Information Center

Reckase, Mark D.; McKinley, Robert L.

A study was undertaken to develop guidelines for the interpretation of the parameters of three multidimensional item response theory models and to determine the relationship between the parameters and traditional concepts of item difficulty and discrimination. The three models considered were multidimensional extensions of the one-, two-, and…
A Multidimensional Partial Credit Model with Associated Item and Test Statistics: An Application to Mixed-Format Tests

ERIC Educational Resources Information Center

Yao, Lihua; Schwarz, Richard D.

2006-01-01

Multidimensional item response theory (IRT) models have been proposed for better understanding the dimensional structure of data or to define diagnostic profiles of student learning. A compensatory multidimensional two-parameter partial credit model (M-2PPC) for constructed-response items is presented that is a generalization of those proposed to…
Applying Multidimensional Item Response Theory Models in Validating Test Dimensionality: An Example of K-12 Large-Scale Science Assessment

ERIC Educational Resources Information Center

Li, Ying; Jiao, Hong; Lissitz, Robert W.

2012-01-01

This study investigated the application of multidimensional item response theory (IRT) models to validate test structure and dimensionality. Multiple content areas or domains within a single subject often exist in large-scale achievement tests. Such areas or domains may cause multidimensionality or local item dependence, which both violate the…
Best Design for Multidimensional Computerized Adaptive Testing With the Bifactor Model

PubMed Central

Seo, Dong Gi; Weiss, David J.

2015-01-01

Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm (MCAT) with a bifactor model using simulated data. Four item selection methods in MCAT were examined for three bifactor pattern designs using two multidimensional item response theory models. To compare MCAT item selection and estimation methods, a fixed test length was used. The Ds-optimality item selection improved θ estimates with respect to a general factor, and either D- or A-optimality improved estimates of the group factors in three bifactor pattern designs under two multidimensional item response theory models. The MCAT model without a guessing parameter functioned better than the MCAT model with a guessing parameter. The MAP (maximum a posteriori) estimation method provided more accurate θ estimates than the EAP (expected a posteriori) method under most conditions, and MAP showed lower observed standard errors than EAP under most conditions, except for a general factor condition using Ds-optimality item selection. PMID:29795848
Item Vector Plots for the Multidimensional Three-Parameter Logistic Model

ERIC Educational Resources Information Center

Bryant, Damon; Davis, Larry

2011-01-01

This brief technical note describes how to construct item vector plots for dichotomously scored items fitting the multidimensional three-parameter logistic model (M3PLM). As multidimensional item response theory (MIRT) shows promise of being a very useful framework in the test development life cycle, graphical tools that facilitate understanding…
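As a rough illustration of the quantities an item vector plot typically displays, the sketch below computes Reckase's multidimensional discrimination (MDISC), multidimensional difficulty (MDIFF), and direction angles for a single compensatory M3PL item. The slope/intercept parameterization (`a`, `d`, `c`), the function names, and the example values are assumptions made for illustration; they are not taken from the note itself.

```python
import math

def m3pl_probability(theta, a, d, c):
    """Compensatory M3PL: P = c + (1 - c) * logistic(a . theta + d)."""
    z = sum(ak * tk for ak, tk in zip(a, theta)) + d
    return c + (1.0 - c) / (1.0 + math.exp(-z))

def item_vector(a, d):
    """Return (MDISC, MDIFF, direction angles in degrees) for one item.

    MDISC is the length of the slope vector, MDIFF = -d / MDISC is the
    signed distance from the origin to the vector's tail, and each angle
    is measured from the corresponding coordinate axis.
    """
    mdisc = math.sqrt(sum(ak * ak for ak in a))
    mdiff = -d / mdisc
    angles = [math.degrees(math.acos(ak / mdisc)) for ak in a]
    return mdisc, mdiff, angles

# Hypothetical two-dimensional item: slopes (1.2, 0.5), intercept -0.65.
mdisc, mdiff, angles = item_vector(a=[1.2, 0.5], d=-0.65)
print(round(mdisc, 3), round(mdiff, 3), [round(x, 2) for x in angles])
```

In a plot, the item would be drawn as an arrow of length MDISC pointing in the direction given by the angles, with its tail placed MDIFF units from the origin along that direction.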
Development of a Computerized Adaptive Testing for Diagnosing the Cognitive Process of Grade 7 Students in Learning Algebra, Using Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Senarat, Somprasong; Tayraukham, Sombat; Piyapimonsit, Chatsiri; Tongkhambanjong, Sakesan

2013-01-01

The purpose of this research is to develop a multidimensional computerized adaptive test for diagnosing the cognitive process of grade 7 students in learning algebra by applying multidimensional item response theory. The research is divided into 4 steps: 1) the development of an algebra item bank, 2) the development of the multidimensional…
The Role of Psychometric Modeling in Test Validation: An Application of Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Schilling, Stephen G.

2007-01-01

In this paper the author examines the role of item response theory (IRT), particularly multidimensional item response theory (MIRT) in test validation from a validity argument perspective. The author provides justification for several structural assumptions and interpretations, taking care to describe the role he believes they should play in any…
Effect Size Measures for Differential Item Functioning in a Multidimensional IRT Model

ERIC Educational Resources Information Center

Suh, Youngsuk

2016-01-01

This study adapted an effect size measure used for studying differential item functioning (DIF) in unidimensional tests and extended the measure to multidimensional tests. Two effect size measures were considered in a multidimensional item response theory model: signed weighted P-difference and unsigned weighted P-difference. The performance of…
A Multidimensional Scaling Approach to Dimensionality Assessment for Measurement Instruments Modeled by Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Toro, Maritsa

2011-01-01

The statistical assessment of dimensionality provides evidence of the underlying constructs measured by a survey or test instrument. This study focuses on educational measurement, specifically tests composed of items described as multidimensional, that is, items that require examinee proficiency in multiple content areas and/or multiple cognitive…
Applications of Multidimensional Item Response Theory Models with Covariates to Longitudinal Test Data. Research Report. ETS RR-16-21

ERIC Educational Resources Information Center

Fu, Jianbin

2016-01-01

The multidimensional item response theory (MIRT) models with covariates proposed by Haberman and implemented in the "mirt" program provide a flexible way to analyze data based on item response theory. In this report, we discuss applications of the MIRT models with covariates to longitudinal test data to measure skill differences at the…
Modelling Mathematics Problem Solving Item Responses Using a Multidimensional IRT Model

ERIC Educational Resources Information Center

Wu, Margaret; Adams, Raymond

2006-01-01

This research examined students' responses to mathematics problem-solving tasks and applied a general multidimensional IRT model at the response category level. In doing so, cognitive processes were identified and modelled through item response modelling to extract more information than would be provided using conventional practices in scoring…
Unidimensional Interpretations for Multidimensional Test Items

ERIC Educational Resources Information Center

Kahraman, Nilufer

2013-01-01

This article considers potential problems that can arise in estimating a unidimensional item response theory (IRT) model when some test items are multidimensional (i.e., show a complex factorial structure). More specifically, this study examines (1) the consequences of model misfit on IRT item parameter estimates due to unintended minor item-level…
Projective Item Response Model for Test-Independent Measurement

ERIC Educational Resources Information Center

Ip, Edward Hak-Sing; Chen, Shyh-Huei

2012-01-01

The problem of fitting unidimensional item-response models to potentially multidimensional data has been extensively studied. The focus of this article is on response data that contains a major dimension of interest but that may also contain minor nuisance dimensions. Because fitting a unidimensional model to multidimensional data results in…
Observed Score and True Score Equating Procedures for Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Brossman, Bradley Grant

2010-01-01

The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the Multidimensional Item Response Theory (MIRT) framework. Currently, MIRT scale linking procedures exist to place item parameter estimates and ability estimates on the same scale after separate calibrations are conducted.…
Innovative Application of a Multidimensional Item Response Model in Assessing the Influence of Social Desirability on the Pseudo-Relationship between Self-Efficacy and Behavior

ERIC Educational Resources Information Center

Watson, Kathy; Baranowski, Tom; Thompson, Debbe; Jago, Russell; Baranowski, Janice; Klesges, Lisa M.

2006-01-01

This study examined multidimensional item response theory (MIRT) modeling to assess social desirability (SocD) influences on self-reported physical activity self-efficacy (PASE) and fruit and vegetable self-efficacy (FVSE). The observed sample included 473 Houston-area adolescent males (10-14 years). SocD (nine items), PASE (19 items) and FVSE (21…

A Framework for Dimensionality Assessment for Multidimensional Item Response Models

ERIC Educational Resources Information Center

Svetina, Dubravka; Levy, Roy

2014-01-01

A framework is introduced for considering dimensionality assessment procedures for multidimensional item response models. The framework characterizes procedures in terms of their confirmatory or exploratory approach, parametric or nonparametric assumptions, and applicability to dichotomous, polytomous, and missing data. Popular and emerging…
Measuring change for a multidimensional test using a generalized explanatory longitudinal item response model.

PubMed

Cho, Sun-Joo; Athay, Michele; Preacher, Kristopher J

2013-05-01

Even though many educational and psychological tests are known to be multidimensional, little research has been done to address how to measure individual differences in change within an item response theory framework. In this paper, we suggest a generalized explanatory longitudinal item response model to measure individual differences in change. New longitudinal models for multidimensional tests and existing models for unidimensional tests are presented within this framework and implemented with software developed for generalized linear models. In addition to the measurement of change, the longitudinal models we present can also be used to explain individual differences in change scores for person groups (e.g., learning disabled students versus non-learning disabled students) and to model differences in item difficulties across item groups (e.g., number operation, measurement, and representation item groups in a mathematics test). An empirical example illustrates the use of the various models for measuring individual differences in change when there are person groups and multiple skill domains which lead to multidimensionality at a time point.
A Multidimensional Ideal Point Item Response Theory Model for Binary Data.

PubMed

Maydeu-Olivares, Albert; Hernández, Adolfo; McDonald, Roderick P

2006-12-01

We introduce a multidimensional item response theory (IRT) model for binary data based on a proximity response mechanism. Under the model, a respondent at the mode of the item response function (IRF) endorses the item with probability one. The mode of the IRF is the ideal point, or in the multidimensional case, an ideal hyperplane. The model yields closed form expressions for the cell probabilities. We estimate and test the goodness of fit of the model using only information contained in the univariate and bivariate moments of the data. Also, we pit the new model against the multidimensional normal ogive model estimated using NOHARM in four applications involving (a) attitudes toward censorship, (b) satisfaction with life, (c) attitudes of morality and equality, and (d) political efficacy. The normal PDF model is not invariant to simple operations such as reverse scoring. Thus, when there is no natural category to be modeled, as in many personality applications, it should be fit separately with and without reverse scoring for comparisons.
The Discriminating Power of Items that Measure More than One Dimension.

ERIC Educational Resources Information Center

Reckase, Mark D.

The work presented in this paper conceptually defined multidimensional discrimination and information, derived mathematical expressions for these concepts for a particular multidimensional item response theory (IRT) model, and applied them to actual test data. Multidimensional discrimination was defined as a function of the…
Assessment of health surveys: fitting a multidimensional graded response model.

PubMed

Depaoli, Sarah; Tiemensma, Jitske; Felt, John M

The multidimensional graded response model, an item response theory (IRT) model, can be used to improve the assessment of surveys, even when sample sizes are restricted. Typically, health-based survey development utilizes classical statistical techniques (e.g. reliability and factor analysis). In a review of four prominent journals within the field of Health Psychology, we found that IRT-based models were used in less than 10% of the studies examining scale development or assessment. However, implementing IRT-based methods can provide more details about individual survey items, which is useful when determining the final item content of surveys. An example using a quality of life survey for Cushing's syndrome (CushingQoL) highlights the main components for implementing the multidimensional graded response model. Patients with Cushing's syndrome (n = 397) completed the CushingQoL. Results from the multidimensional graded response model supported a 2-subscale scoring process for the survey. All items were deemed as worthy contributors to the survey. The graded response model can accommodate unidimensional or multidimensional scales, be used with relatively lower sample sizes, and is implemented in free software (example code provided in online Appendix). Use of this model can help to improve the quality of health-based scales being developed within the Health Sciences.
Assessing Dimensionality of Noncompensatory Multidimensional Item Response Theory with Complex Structures

ERIC Educational Resources Information Center

Svetina, Dubravka

2013-01-01

The purpose of this study was to investigate the effect of complex structure on dimensionality assessment in noncompensatory multidimensional item response models using dimensionality assessment procedures based on DETECT (dimensionality evaluation to enumerate contributing traits) and NOHARM (normal ogive harmonic analysis robust method). Five…
Comparison of Multidimensional Item Response Models: Multivariate Normal Ability Distributions versus Multivariate Polytomous Ability Distributions. Research Report. ETS RR-08-45

ERIC Educational Resources Information Center

Haberman, Shelby J.; von Davier, Matthias; Lee, Yi-Hsuan

2008-01-01

Multidimensional item response models can be based on multivariate normal ability distributions or on multivariate polytomous ability distributions. For the case of simple structure in which each item corresponds to a unique dimension of the ability vector, some applications of the two-parameter logistic model to empirical data are employed to…
Bayesian Analysis of Multidimensional Item Response Theory Models: A Discussion and Illustration of Three Response Style Models

ERIC Educational Resources Information Center

Leventhal, Brian C.; Stone, Clement A.

2018-01-01

Interest in Bayesian analysis of item response theory (IRT) models has grown tremendously due to the appeal of the paradigm among psychometricians, advantages of these methods when analyzing complex models, and availability of general-purpose software. Possible models include models which reflect multidimensionality due to designed test structure,…
Item Selection in Multidimensional Computerized Adaptive Testing--Gaining Information from Different Angles

ERIC Educational Resources Information Center

Wang, Chun; Chang, Hua-Hua

2011-01-01

Over the past thirty years, obtaining diagnostic information from examinees' item responses has become an increasingly important feature of educational and psychological testing. The objective can be achieved by sequentially selecting multidimensional items to fit the class of latent traits being assessed, and therefore Multidimensional…
Reporting of Subscores Using Multidimensional Item Response Theory

ERIC Educational Resources Information Center

Haberman, Shelby J.; Sinharay, Sandip

2010-01-01

Recently, there has been increasing interest in reporting subscores. This paper examines reporting of subscores using multidimensional item response theory (MIRT) models (e.g., Reckase in "Appl. Psychol. Meas." 21:25-36, 1997; C.R. Rao and S. Sinharay (Eds), "Handbook of Statistics, vol. 26," pp. 607-642, North-Holland, Amsterdam, 2007; Beguin &…
Bi-Factor Multidimensional Item Response Theory Modeling for Subscores Estimation, Reliability, and Classification

ERIC Educational Resources Information Center

Md Desa, Zairul Nor Deana

2012-01-01

In recent years, there has been increasing interest in estimating and improving subscore reliability. In this study, multidimensional item response theory (MIRT) and the bi-factor model were combined to estimate subscores and to obtain subscore reliability and subscore classification. Both the compensatory and partially compensatory MIRT…
Quantifying traditional Chinese medicine patterns using modern test theory: an example of functional constipation.

PubMed

Shen, Minxue; Cui, Yuanwu; Hu, Ming; Xu, Linyong

2017-01-13

The study aimed to validate a scale to assess the severity of the "Yin deficiency, intestine heat" pattern of functional constipation based on modern test theory. Pooled longitudinal data of 237 patients with the "Yin deficiency, intestine heat" pattern of constipation from a prospective cohort study were used to validate the scale. Exploratory factor analysis was used to examine the common factors of items. A multidimensional item response model was used to assess the scale in the presence of multidimensionality. Cronbach's alpha ranged from 0.79 to 0.89, and the split-half reliability ranged from 0.67 to 0.79 at different measurements. Exploratory factor analysis identified two common factors, and all items had cross-factor loadings. The bidimensional model had better goodness of fit than the unidimensional model. The multidimensional item response model showed that all items had moderate to high discrimination parameters. Parameters indicated that the first latent trait signified intestine heat, while the second trait characterized Yin deficiency. The information function showed that items demonstrated the highest discrimination power among patients with moderate to high levels of disease severity. Multidimensional item response theory provides a useful and rational approach to validating scales for assessing the severity of patterns in traditional Chinese medicine.
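For readers unfamiliar with the reliability coefficient reported in this record, the sketch below shows how Cronbach's alpha is conventionally computed from a respondents-by-items score matrix. The function name and the toy data are hypothetical and are not drawn from the study; only the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), is assumed.

```python
from statistics import pvariance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([row[i] for row in scores]) for i in range(k)]
    # Variance of the respondents' total scores.
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)

# Toy data (hypothetical): two perfectly consistent items, then two unrelated items.
consistent = [[1, 1], [2, 2], [3, 3]]
unrelated = [[0, 0], [0, 1], [1, 0], [1, 1]]
alpha_consistent = cronbach_alpha(consistent)
alpha_unrelated = cronbach_alpha(unrelated)
```

The same variance estimator is used for the items and the total, so the choice between population and sample variance does not change the result.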
The Multidimensional Assessment of Interoceptive Awareness (MAIA)

PubMed Central

Mehling, Wolf E.; Price, Cynthia; Daubenmier, Jennifer J.; Acree, Mike; Bartmess, Elizabeth; Stewart, Anita

2012-01-01

This paper describes the development of a multidimensional self-report measure of interoceptive body awareness. The systematic mixed-methods process involved reviewing the current literature, specifying a multidimensional conceptual framework, evaluating prior instruments, developing items, and analyzing focus group responses to scale items by instructors and patients of body awareness-enhancing therapies. Following refinement by cognitive testing, items were field-tested in students and instructors of mind-body approaches. Final item selection was achieved by submitting the field test data to an iterative process using multiple validation methods, including exploratory cluster and confirmatory factor analyses, comparison between known groups, and correlations with established measures of related constructs. The resulting 32-item multidimensional instrument assesses eight concepts. The psychometric properties of these final scales suggest that the Multidimensional Assessment of Interoceptive Awareness (MAIA) may serve as a starting point for research and further collaborative refinement. PMID:23133619
A Multidimensional Item Response Model: Constrained Latent Class Analysis Using the Gibbs Sampler and Posterior Predictive Checks.

ERIC Educational Resources Information Center

Hoijtink, Herbert; Molenaar, Ivo W.

1997-01-01

This paper shows that a certain class of constrained latent class models may be interpreted as a special case of nonparametric multidimensional item response models. Parameters of this latent class model are estimated using an application of the Gibbs sampler, and model fit is investigated using posterior predictive checks. (SLD)
Maximizing the Information and Validity of a Linear Composite in the Factor Analysis Model for Continuous Item Responses

ERIC Educational Resources Information Center

Ferrando, Pere J.

2008-01-01

This paper develops results and procedures for obtaining linear composites of factor scores that maximize: (a) test information, and (b) validity with respect to external variables in the multiple factor analysis (FA) model. I treat FA as a multidimensional item response theory model, and use Ackerman's multidimensional information approach based…
Item usage in a multidimensional computerized adaptive test (MCAT) measuring health-related quality of life.

PubMed

Paap, Muirne C S; Kroeze, Karel A; Terwee, Caroline B; van der Palen, Job; Veldkamp, Bernard P

2017-11-01

Examining item usage is an important step in evaluating the performance of a computerized adaptive test (CAT). We study item usage for a newly developed multidimensional CAT which draws items from three PROMIS domains, as well as a disease-specific one. The multidimensional item bank used in the current study contained 194 items from four domains: the PROMIS domains fatigue, physical function, and ability to participate in social roles and activities, and a disease-specific domain (the COPD-SIB). The item bank was calibrated using the multidimensional graded response model and data of 795 patients with chronic obstructive pulmonary disease. To evaluate the item usage rates of all individual items in our item bank, CAT simulations were performed on responses generated based on a multivariate uniform distribution. The outcome variables included active bank size and item overuse (usage rate larger than the expected item usage rate). For average θ-values, the overall active bank size was 9-10%; this number quickly increased as θ-values became more extreme. For values of -2 and +2, the overall active bank size equaled 39-40%. There was 78% overlap between overused items and active bank size for average θ-values. For more extreme θ-values, the overused items made up a much smaller part of the active bank size: here the overlap was only 35%. Our results strengthen the claim that relatively short item banks may suffice when using polytomous items (and no content constraints/exposure control mechanisms), especially when using MCAT.
A Note on Explaining Away and Paradoxical Results in Multidimensional Item Response Theory. Research Report. ETS RR-12-13

ERIC Educational Resources Information Center

van Rijn, Peter W.; Rijmen, Frank

2012-01-01

Hooker and colleagues addressed a paradoxical situation that can arise in the application of multidimensional item response theory (MIRT) models to educational test data. We demonstrate that this MIRT paradox is an instance of the explaining-away phenomenon in Bayesian networks, and we attempt to enhance the understanding of MIRT models by placing…
Development and psychometric properties of the Suicidality of Adolescent Screening Scale (SASS) using Multidimensional Item Response Theory.

PubMed

Sukhawaha, Supattra; Arunpongpaisal, Suwanna; Hurst, Cameron

2016-09-30

Suicide prevention in adolescents by early detection using screening tools to identify high suicidal risk is a priority. Our objective was to build a multidimensional scale, the "Suicidality of Adolescent Screening Scale (SASS)", to identify adolescents at risk of suicide. An initial pool of items was developed using in-depth interviews, focus groups and a literature review. Initially, 77 items were administered to 307 adolescents and analyzed using exploratory Multidimensional Item Response Theory (MIRT) to remove unnecessary items. A subsequent exploratory factor analysis revealed 35 items that grouped into 4 factors: Stressors, Pessimism, Suicidality and Depression. To confirm this structure, a new sample of 450 adolescents was collected and confirmatory MIRT factor analysis was performed. The resulting scale was shown to be both construct valid and able to discriminate well between adolescents who had and had not previously attempted suicide.
Vegetable parenting practices scale: Item response modeling analyses

USDA-ARS's Scientific Manuscript database

Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...
A Two-Decision Model for Responses to Likert-Type Items

ERIC Educational Resources Information Center

Thissen-Roe, Anne; Thissen, David

2013-01-01

Extreme response set, the tendency to prefer the lowest or highest response option when confronted with a Likert-type response scale, can lead to misfit of item response models such as the generalized partial credit model. Recently, a series of intrinsically multidimensional item response models have been hypothesized, wherein tendency toward…

Best Design for Multidimensional Computerized Adaptive Testing with the Bifactor Model

ERIC Educational Resources Information Center

Seo, Dong Gi; Weiss, David J.

2015-01-01

Most computerized adaptive tests (CATs) have been studied using the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CATs. This study investigated the accuracy, fidelity, and efficiency of a fully multidimensional CAT algorithm…
Validating the European Health Literacy Survey Questionnaire in people with type 2 diabetes: Latent trait analyses applying multidimensional Rasch modelling and confirmatory factor analysis.

PubMed

Finbråten, Hanne Søberg; Pettersen, Kjell Sverre; Wilde-Larsson, Bodil; Nordström, Gun; Trollvik, Anne; Guttersrud, Øystein

2017-11-01

To validate the European Health Literacy Survey Questionnaire (HLS-EU-Q47) in people with type 2 diabetes mellitus. The HLS-EU-Q47 latent variable is outlined in a framework with four cognitive domains integrated in three health domains, implying 12 theoretically defined subscales. Valid and reliable health literacy measures are crucial to effectively adapt health communication and education to individuals and groups of patients. Cross-sectional study applying confirmatory latent trait analyses. Using a paper-and-pencil self-administered approach, 388 adults responded in March 2015. The data were analysed using the Rasch methodology and confirmatory factor analysis. Response violation (response dependency) and trait violation (multidimensionality) of local independence were identified. Fitting the "multidimensional random coefficients multinomial logit" model, 1-, 3- and 12-dimensional Rasch models were applied and compared. Poor model fit and differential item functioning were present in some items, and several subscales suffered from poor targeting and low reliability. Despite multidimensional data, we did not observe any unordered response categories. Interpreting the domains as distinct but related latent dimensions, the data fit a 12-dimensional Rasch model and a 12-factor confirmatory factor model best. Therefore, the analyses did not support the estimation of one overall "health literacy score." To support the plausibility of claims based on the HLS-EU score(s), we suggest: removing the health care aspect to reduce the magnitude of multidimensionality; rejecting redundant items to avoid response dependency; adding "harder" items and applying a six-point rating scale to improve subscale targeting and reliability; and revising items to improve model fit and avoid bias owing to person factors.
Lord's Wald Test for Detecting DIF in Multidimensional IRT Models: A Comparison of Two Estimation Approaches

ERIC Educational Resources Information Center

Lee, Soo; Suh, Youngsuk

2018-01-01

Lord's Wald test for differential item functioning (DIF) has not been studied extensively in the context of the multidimensional item response theory (MIRT) framework. In this article, Lord's Wald test was implemented using two estimation approaches, marginal maximum likelihood estimation and Bayesian Markov chain Monte Carlo estimation, to detect…
An Item Response Theory Model for Test Bias.

ERIC Educational Resources Information Center

Shealy, Robin; Stout, William

This paper presents a conceptualization of test bias for standardized ability tests which is based on multidimensional, non-parametric, item response theory. An explanation of how individually-biased items can combine through a test score to produce test bias is provided. It is contended that bias, although expressed at the item level, should be…
IRTPRO 2.1 for Windows (Item Response Theory for Patient-Reported Outcomes)

ERIC Educational Resources Information Center

Paek, Insu; Han, Kyung T.

2013-01-01

This article reviews a new item response theory (IRT) model estimation program, IRTPRO 2.1, for Windows that is capable of unidimensional and multidimensional IRT model estimation for existing and user-specified constrained IRT models for dichotomously and polytomously scored item response data. (Contains 1 figure and 2 notes.)
Analyzing Longitudinal Item Response Data via the Pairwise Fitting Method

ERIC Educational Resources Information Center

Fu, Zhi-Hui; Tao, Jian; Shi, Ning-Zhong; Zhang, Ming; Lin, Nan

2011-01-01

Multidimensional item response theory (MIRT) models can be applied to longitudinal educational surveys where a group of individuals are administered different tests over time with some common items. However, computational problems typically arise as the dimension of the latent variables increases. This is especially true when the latent variable…
Exploring the Robustness of a Unidimensional Item Response Theory Model with Empirically Multidimensional Data

ERIC Educational Resources Information Center

Anderson, Daniel; Kahn, Joshua D.; Tindal, Gerald

2017-01-01

Unidimensionality and local independence are two common assumptions of item response theory. The former implies that all items measure a common latent trait, while the latter implies that responses are independent, conditional on respondents' location on the latent trait. Yet, few tests are truly unidimensional. Unmodeled dimensions may result in…
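As a worked statement of the local independence assumption described in this abstract (standard textbook notation, not taken from the article itself): for item responses $X_1, \dots, X_n$ and a respondent's latent trait $\theta$, the assumption can be written as

```latex
P(X_1 = x_1, \dots, X_n = x_n \mid \theta) \;=\; \prod_{i=1}^{n} P(X_i = x_i \mid \theta)
```

Under unidimensionality $\theta$ is a single number; in a multidimensional model the same factorization is assumed with $\theta$ replaced by a vector of traits.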
Extreme Response Style: Which Model Is Best?

ERIC Educational Resources Information Center

Leventhal, Brian

2017-01-01

More robust and rigorous psychometric models, such as multidimensional Item Response Theory models, have been advocated for survey applications. However, item responses may be influenced by construct-irrelevant variance factors such as preferences for extreme response options. Through empirical and simulation methods, this study evaluates the use…
Taking the Test Taker's Perspective: Response Process and Test Motivation in Multidimensional Forced-Choice Versus Rating Scale Instruments.

PubMed

Sass, Rachelle; Frick, Susanne; Reips, Ulf-Dietrich; Wetzel, Eunike

2018-03-01

The multidimensional forced-choice (MFC) format has been proposed as an alternative to the rating scale (RS) response format. However, it is unclear how changing the response format may affect the response process and test motivation of participants. In Study 1, we investigated the MFC response process using the think-aloud technique. In Study 2, we compared test motivation between the RS format and different versions of the MFC format (presenting 2, 3, 4, and 5 items simultaneously). The response process to MFC item blocks was similar to the RS response process but involved an additional step of weighing the items within a block against each other. The RS and MFC response format groups did not differ in their test motivation. Thus, from the test taker's perspective, the MFC format is somewhat more demanding to respond to, but this does not appear to decrease test motivation.
What can we learn from PISA?: Investigating PISA's approach to scientific literacy

NASA Astrophysics Data System (ADS)

Schwab, Cheryl Jean

This dissertation is an investigation of the relationship between the multidimensional conception of scientific literacy and its assessment. The Programme for International Student Assessment (PISA), developed under the auspices of the Organization for Economic Cooperation and Development (OECD), offers a unique opportunity to evaluate the assessment of scientific literacy. PISA developed a continuum of performance for scientific literacy across three competencies (i.e., process, content, and situation). Foundational to the interpretation of PISA science assessment is PISA's definition of scientific literacy, which I argue incorporates three themes drawn from history: (a) scientific way of thinking, (b) everyday relevance of science, and (c) scientific literacy for all students. Three coordinated studies were conducted to investigate the validity of PISA science assessment and offer insight into the development of items to assess scientific literacy. Multidimensional models of the internal structure of the PISA 2003 science items were found not to reflect the complex character of PISA's definition of scientific literacy. Although the multidimensional models across the three competencies significantly decreased the G2 statistic from the unidimensional model, high correlations between the dimensions suggest that the dimensions are similar. A cognitive analysis of student verbal responses to PISA science items revealed that students were using competencies of scientific literacy, but the competencies were not elicited by the PISA science items at the depth required by PISA's definition of scientific literacy. Although student responses contained only knowledge of scientific facts and simple scientific concepts, students were using more complex skills to interpret and communicate their responses. Finally, the investigation of different scoring approaches and item response models illustrated different ways to interpret student responses to assessment items. These analyses highlighted the complexities of students' responses to the PISA science items and the use of the ordered partition model to accommodate different but equal item responses. The results of the three investigations are used to discuss ways to improve the development and interpretation of PISA's science items.
Data Visualization of Item-Total Correlation by Median Smoothing

ERIC Educational Resources Information Center

Yu, Chong Ho; Douglas, Samantha; Lee, Anna; An, Min

2016-01-01

This paper aims to illustrate how data visualization could be utilized to identify errors prior to modeling, using an example with multi-dimensional item response theory (MIRT). MIRT combines item response theory and factor analysis to identify a psychometric model that investigates two or more latent traits. While it may seem convenient to…
Factor structure and gender stability in the multidimensional condom attitudes scale.

PubMed

Starosta, Amy J; Berghoff, Christopher R; Earleywine, Mitch

2015-06-01

Sexually transmitted infections continue to trouble the United States and can be attenuated through increased condom use. Attitudes about condoms are an important multidimensional factor that can affect sexual health choices and have been successfully measured using the Multidimensional Condom Attitudes Scale (MCAS). Such attitudes have the potential to vary between men and women, yet little work has been undertaken to identify if the MCAS accurately captures attitudes without being influenced by underlying gender biases. We examined the factor structure and gender invariance on the MCAS using confirmatory factor analysis and item response theory, within-subscale differential item functioning analyses. More than 770 participants provided data via the Internet. Results of differential item functioning analyses identified three items as differentially functioning between the genders, and removal of these items is recommended. Findings confirmed the previously hypothesized multidimensional nature of condom attitudes and the five-factor structure of the MCAS even after the removal of the three problematic items. In general, comparisons across genders using the MCAS seem reasonable from a methodological standpoint. Results are discussed in terms of improving sexual health research and interventions. © The Author(s) 2014.
Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

ERIC Educational Resources Information Center

Lee, Eunjung

2013-01-01

The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…
Development and Application of Methods for Estimating Operating Characteristics of Discrete Test Item Responses without Assuming any Mathematical Form.

ERIC Educational Resources Information Center

Samejima, Fumiko

In latent trait theory the latent space, or space of the hypothetical construct, is usually represented by some unidimensional or multi-dimensional continuum of real numbers. Like the latent space, the item response can either be treated as a discrete variable or as a continuous variable. Latent trait theory relates the item response to the latent…
A Person Fit Test for IRT Models for Polytomous Items

ERIC Educational Resources Information Center

Glas, C. A. W.; Dagohoy, Anna Villa T.

2007-01-01

A person fit test based on the Lagrange multiplier test is presented for three item response theory models for polytomous items: the generalized partial credit model, the sequential model, and the graded response model. The test can also be used in the framework of multidimensional ability parameters. It is shown that the Lagrange multiplier…
Model-Based Collaborative Filtering Analysis of Student Response Data: Machine-Learning Item Response Theory

ERIC Educational Resources Information Center

Bergner, Yoav; Droschler, Stefan; Kortemeyer, Gerd; Rayyan, Saif; Seaton, Daniel; Pritchard, David E.

2012-01-01

We apply collaborative filtering (CF) to dichotomously scored student response data (right, wrong, or no interaction), finding optimal parameters for each student and item based on cross-validated prediction accuracy. The approach is naturally suited to comparing different models, both unidimensional and multidimensional in ability, including a…
Acquiescent Responding in Balanced Multidimensional Scales and Exploratory Factor Analysis

ERIC Educational Resources Information Center

Lorenzo-Seva, Urbano; Rodriguez-Fornells, Antoni

2006-01-01

Personality tests often consist of a set of dichotomous or Likert items. These response formats are known to be susceptible to an agreeing-response bias called acquiescence. The common assumption in balanced scales is that the sum of appropriately reversed responses should be reasonably free of acquiescence. However, inter-item correlation (or…
Using a Multivariate Multilevel Polytomous Item Response Theory Model to Study Parallel Processes of Change: The Dynamic Association between Adolescents' Social Isolation and Engagement with Delinquent Peers in the National Youth Survey

ERIC Educational Resources Information Center

Hsieh, Chueh-An; von Eye, Alexander A.; Maier, Kimberly S.

2010-01-01

The application of multidimensional item response theory models to repeated observations has demonstrated great promise in developmental research. It allows researchers to take into consideration both the characteristics of item response and measurement error in longitudinal trajectory analysis, which improves the reliability and validity of the…
Affective Outcomes of Schooling: Full-Information Item Factor Analysis of a Student Questionnaire.

ERIC Educational Resources Information Center

Muraki, Eiji; Engelhard, George, Jr.

Recent developments in dichotomous factor analysis based on multidimensional item response models (Bock and Aitkin, 1981; Muthen, 1978) provide an effective method for exploring the dimensionality of questionnaire items. Implemented in the TESTFACT program, this "full information" item factor analysis accounts not only for the pairwise joint…
Multidimensional Extension of Multiple Indicators Multiple Causes Models to Detect DIF

ERIC Educational Resources Information Center

Lee, Soo; Bulut, Okan; Suh, Youngsuk

2017-01-01

A number of studies have found multiple indicators multiple causes (MIMIC) models to be an effective tool in detecting uniform differential item functioning (DIF) for individual items and item bundles. A recently developed MIMIC-interaction model is capable of detecting both uniform and nonuniform DIF in the unidimensional item response theory…

Estimating a Noncompensatory IRT Model Using Metropolis within Gibbs Sampling

ERIC Educational Resources Information Center

Babcock, Ben

2011-01-01

Relatively little research has been conducted with the noncompensatory class of multidimensional item response theory (MIRT) models. A Monte Carlo simulation study was conducted exploring the estimation of a two-parameter noncompensatory item response theory (IRT) model. The estimation method used was a Metropolis-Hastings within Gibbs algorithm…
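To make the compensatory/noncompensatory distinction concrete, here is a minimal sketch. The abstract does not give the model's functional form, so the product-of-logistics form below is an assumption (it is a commonly used noncompensatory form), and the function and parameter names are hypothetical. It does not implement the Metropolis-Hastings within Gibbs estimation.

```python
import math


def logistic(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def noncompensatory_2pl(theta, a, b):
    """Assumed noncompensatory form: the success probability is a product
    of per-dimension logistic terms, so a low trait on one dimension
    cannot be offset by a high trait on another."""
    p = 1.0
    for t, a_k, b_k in zip(theta, a, b):
        p *= logistic(a_k * (t - b_k))
    return p


def compensatory_2pl(theta, a, d):
    """Compensatory form for contrast: traits are summed inside a single
    logistic term, so dimensions can trade off against each other."""
    return logistic(sum(a_k * t for a_k, t in zip(a, theta)) + d)
```

With traits `[5.0, -5.0]`, unit slopes and zero locations, the compensatory probability is 0.5 while the noncompensatory probability is below 0.01, which is the behavior the model class is named for.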
Vegetable parenting practices scale. Item response modeling analyses

PubMed Central

Chen, Tzu-An; O’Connor, Teresia; Hughes, Sheryl; Beltran, Alicia; Baranowski, Janice; Diep, Cassandra; Baranowski, Tom

2015-01-01

Objective: To evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We also tested for differences in the ways items function (called differential item functioning) across child’s gender, ethnicity, age, and household income groups. Method: Parents of 3–5 year old children completed a self-reported vegetable parenting practices scale online. Vegetable parenting practices consisted of 14 effective vegetable parenting practices and 12 ineffective vegetable parenting practices items, each with three subscales (responsiveness, structure, and control). Multidimensional polytomous item response modeling was conducted separately on effective vegetable parenting practices and ineffective vegetable parenting practices. Results: One effective vegetable parenting practice item did not fit the model well in the full sample or across demographic groups, and another was a misfit in differential item functioning analyses across child’s gender. Significant differential item functioning was detected across children’s age and ethnicity groups, and more among effective vegetable parenting practices than ineffective vegetable parenting practices items. Wright maps showed items only covered parts of the latent trait distribution. The harder- and easier-to-respond ends of the construct were not covered by items for effective vegetable parenting practices and ineffective vegetable parenting practices, respectively. Conclusions: Several effective vegetable parenting practices and ineffective vegetable parenting practices scale items functioned differently on the basis of child’s demographic characteristics; therefore, researchers should use these vegetable parenting practices scales with caution. Item response modeling should be incorporated in analyses of parenting practice questionnaires to better assess differences across demographic characteristics. PMID:25895694
A Multilevel Multidimensional Item Response Theory Model to Address the Role of Response Style on Measurement of Attitudes in PISA 2006

ERIC Educational Resources Information Center

Lu, Yi

2012-01-01

Cross-national comparisons of responses to survey items are often affected by response style, particularly extreme response style (ERS). ERS varies across cultures, and has the potential to bias inferences in cross-national comparisons. For example, in both PISA and TIMSS assessments, it has been documented that when examined within countries,…
Item Parameter Estimation for the MIRT Model: Bias and Precision of Confirmatory Factor Analysis-Based Models

ERIC Educational Resources Information Center

Finch, Holmes

2010-01-01

The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context is one that has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…
Students' Proficiency Scores within Multitrait Item Response Theory

ERIC Educational Resources Information Center

Scott, Terry F.; Schumayer, Daniel

2015-01-01

In this paper we present a series of item response models of data collected using the Force Concept Inventory. The Force Concept Inventory (FCI) was designed to poll the Newtonian conception of force viewed as a multidimensional concept, that is, as a complex of distinguishable conceptual dimensions. Several previous studies have developed…
Unidimensional and Multidimensional Models for Item Response Theory.

ERIC Educational Resources Information Center

McDonald, Roderick P.

This paper provides an up-to-date review of the relationship between item response theory (IRT) and (nonlinear) common factor theory and draws out of this relationship some implications for current and future research in IRT. Nonlinear common factor analysis yields a natural embodiment of the weak principle of local independence in appropriate…
Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

ERIC Educational Resources Information Center

Wang, Wei

2013-01-01

Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…
Application of the Bifactor Model to Computerized Adaptive Testing

ERIC Educational Resources Information Center

Seo, Dong Gi

2011-01-01

Most computerized adaptive tests (CAT) have been studied under the framework of unidimensional item response theory. However, many psychological variables are multidimensional and might benefit from using a multidimensional approach to CAT. In addition, a number of psychological variables (e.g., quality of life, depression) can be conceptualized…
The Development of a Psychometrically-Sound Instrument to Measure Teachers' Multidimensional Attitudes toward Inclusive Education

ERIC Educational Resources Information Center

Mahat, Marian

2008-01-01

The "Multidimensional Attitudes toward Inclusive Education Scale" (MATIES) was developed to effectively measure affective, cognitive and behavioural aspects of attitudes, within the realm of inclusive education that includes physical, social and curricular inclusion. Models within Item Response Theory and Classical Test Theory were used…
Examining the Reliability of Student Growth Percentiles Using Multidimensional IRT

ERIC Educational Resources Information Center

Monroe, Scott; Cai, Li

2015-01-01

Student growth percentiles (SGPs, Betebenner, 2009) are used to locate a student's current score in a conditional distribution based on the student's past scores. Currently, following Betebenner (2009), quantile regression (QR) is most often used operationally to estimate the SGPs. Alternatively, multidimensional item response theory (MIRT) may…
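The idea of locating a current score in a conditional distribution can be sketched with a naive empirical calculation. This is an illustration only: it is neither the quantile regression approach nor the MIRT approach discussed in the article, and the function name, the `tol` matching window, and the data layout are all assumptions.

```python
def growth_percentile(prior, current, peers, tol=0.0):
    """Percentage of peers with a similar prior score whose current
    score falls below the student's current score.

    peers: list of (prior_score, current_score) pairs.
    tol:   hypothetical half-width of the window used to decide which
           peers count as having a 'similar' prior score.
    """
    # Conditional distribution: current scores of peers matched on prior score.
    conditional = [c for p, c in peers if abs(p - prior) <= tol]
    if not conditional:
        raise ValueError("no peers with a comparable prior score")
    below = sum(1 for c in conditional if c < current)
    return 100.0 * below / len(conditional)
```

For example, among four peers who all scored 50 previously and now score 40, 50, 60, and 70, a student with a prior score of 50 and a current score of 65 is placed at the 75th percentile.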
Method of data mining including determining multidimensional coordinates of each item using a predetermined scalar similarity value for each item pair

DOEpatents

Meyers, Charles E.; Davidson, George S.; Johnson, David K.; Hendrickson, Bruce A.; Wylie, Brian N.

1999-01-01

A method of data mining represents related items in a multidimensional space. Distance between items in the multidimensional space corresponds to the extent of relationship between the items. The user can select portions of the space to perceive. The user also can interact with and control the communication of the space, focusing attention on aspects of the space of most interest. The multidimensional spatial representation allows more ready comprehension of the structure of the relationships among the items.
A Comparative Study of Online Item Calibration Methods in Multidimensional Computerized Adaptive Testing

ERIC Educational Resources Information Center

Chen, Ping

2017-01-01

Calibration of new items online has been an important topic in item replenishment for multidimensional computerized adaptive testing (MCAT). Several online calibration methods have been proposed for MCAT, such as multidimensional "one expectation-maximization (EM) cycle" (M-OEM) and multidimensional "multiple EM cycles"…
Using Explanatory Item Response Models to Evaluate Complex Scientific Tasks Designed for the Next Generation Science Standards

NASA Astrophysics Data System (ADS)

Chiu, Tina

This dissertation includes three studies that analyze a new set of assessment tasks developed by the Learning Progressions in Middle School Science (LPS) Project. These assessment tasks were designed to measure science content knowledge on the structure of matter domain and scientific argumentation, while following the goals from the Next Generation Science Standards (NGSS). The three studies focus on the evidence available for the success of this design and its implementation, generally labelled as "validity" evidence. I use explanatory item response models (EIRMs) as the overarching framework to investigate these assessment tasks. These models can be useful when gathering validity evidence for assessments as they can help explain student learning and group differences. In the first study, I explore the dimensionality of the LPS assessment by comparing the fit of unidimensional, between-item multidimensional, and Rasch testlet models to see which is most appropriate for this data. By applying multidimensional item response models, multiple relationships can be investigated, and in turn, allow for a more substantive look into the assessment tasks. The second study focuses on person predictors through latent regression and differential item functioning (DIF) models. Latent regression models show the influence of certain person characteristics on item responses, while DIF models test whether one group is differentially affected by specific assessment items, after conditioning on latent ability. Finally, the last study applies the linear logistic test model (LLTM) to investigate whether item features can help explain differences in item difficulties.
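The linear logistic test model mentioned at the end of this abstract can be sketched as follows. The decomposition of item difficulty into a weighted sum of feature effects is the standard LLTM formulation; the function names and the feature coding are hypothetical, and nothing here reproduces the dissertation's actual item features or estimates.

```python
import math


def lltm_difficulty(q_row, eta):
    """Item difficulty as a weighted sum of item-feature effects.

    q_row: feature indicators (or weights) for one item.
    eta:   effect of each feature on difficulty.
    """
    return sum(q * e for q, e in zip(q_row, eta))


def rasch_probability(theta, difficulty):
    """Rasch probability of a correct response for ability theta."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


def lltm_probability(theta, q_row, eta):
    """Rasch probability with the difficulty explained by item features."""
    return rasch_probability(theta, lltm_difficulty(q_row, eta))
```

The design choice is that items are no longer given free difficulty parameters: two items with the same feature row are predicted to be equally difficult, which is what lets item features "explain" difficulty differences.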
How IRT Can Solve Problems of Ipsative Data in Forced-Choice Questionnaires

ERIC Educational Resources Information Center

Brown, Anna; Maydeu-Olivares, Alberto

2013-01-01

In multidimensional forced-choice (MFC) questionnaires, items measuring different attributes are presented in blocks, and participants have to rank order the items within each block (fully or partially). Such comparative formats can reduce the impact of numerous response biases often affecting single-stimulus items (aka rating or Likert scales).…
Psychometric properties of the Multidimensional Assessment of Fatigue scale in traumatic brain injury: an NIDRR Traumatic Brain Injury Model Systems study.

PubMed

Lequerica, Anthony; Bushnik, Tamara; Wright, Jerry; Kolakowsky-Hayner, Stephanie A; Hammond, Flora M; Dijkers, Marcel P; Cantor, Joshua

2012-01-01

To investigate the psychometric properties of the Multidimensional Assessment of Fatigue (MAF) scale in a traumatic brain injury (TBI) sample. Prospective survey study. Community. One hundred sixty-seven individuals with TBI admitted for inpatient rehabilitation, enrolled into the TBI Model Systems national database, and followed up at either the first or second year postinjury. Not applicable. Multidimensional Assessment of Fatigue. The initial analysis, using items 1 to 14, which are based on a 10-point rating scale, found that only 1 item ("walking") misfit the overall construct of fatigue in this TBI population. However, this 10-point rating scale was found to have disordered thresholds. When ratings were collapsed into 4 response categories, all MAF items used to calculate the Global Fatigue Index formed a unidimensional scale. Findings generally support the unidimensionality of the MAF when used in a TBI population but call into question the use of a 10-point rating scale for items 1 to 14. Further study is needed to investigate the use of a 4-category rating scale across all items and the fit of the "walking" item for a measure of fatigue among individuals with TBI.
How Can Multivariate Item Response Theory Be Used in Reporting of Subscores? Research Report. ETS RR-10-09

ERIC Educational Resources Information Center

Haberman, Shelby J.; Sinharay, Sandip

2010-01-01

Recently, there has been increasing interest in reporting diagnostic scores. This paper examines reporting of subscores using multidimensional item response theory (MIRT) models. An MIRT model is fitted using a stabilized Newton-Raphson algorithm (Haberman, 1974, 1988) with adaptive Gauss-Hermite quadrature (Haberman, von Davier, & Lee, 2008).…
Direct Estimation of Correlation as a Measure of Association Strength Using Multidimensional Item Response Models

ERIC Educational Resources Information Center

Wang, Wen-Chung

2004-01-01

The Pearson correlation is used to depict effect sizes in the context of item response theory. A multidimensional Rasch model is used to directly estimate the correlation between latent traits. Monte Carlo simulations were conducted to investigate whether the population correlation could be accurately estimated and whether the bootstrap method…
Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores with Item Exposure Control and Content Constraints

ERIC Educational Resources Information Center

Yao, Lihua

2014-01-01

The intent of this research was to find an item selection procedure in the multidimensional computer adaptive testing (CAT) framework that yielded higher precision for both the domain and composite abilities, had a higher usage of the item pool, and controlled the exposure rate. Five multidimensional CAT item selection procedures (minimum angle;…
A Standardized Generalized Dimensionality Discrepancy Measure and a Standardized Model-Based Covariance for Dimensionality Assessment for Multidimensional Models

ERIC Educational Resources Information Center

Levy, Roy; Xu, Yuning; Yel, Nedim; Svetina, Dubravka

2015-01-01

The standardized generalized dimensionality discrepancy measure and the standardized model-based covariance are introduced as tools to critique dimensionality assumptions in multidimensional item response models. These tools are grounded in a covariance theory perspective and associated connections between dimensionality and local independence.…
Introducing Multidimensional Item Response Modeling in Health Behavior and Health Education Research

ERIC Educational Resources Information Center

Allen, Diane D.; Wilson, Mark

2006-01-01

When measuring participant-reported attitudes and outcomes in the behavioral sciences, there are many instances when the common measurement assumption of unidimensionality does not hold. In these cases, the application of a multidimensional measurement model is both technically appropriate and potentially advantageous in substance. In this paper,…

Multidimensional Item Response Theory Models in Vocational Interest Measurement: An Illustration Using the AIST-R

ERIC Educational Resources Information Center

Wetzel, Eunike; Hell, Benedikt

2014-01-01

Vocational interest inventories are commonly analyzed using a unidimensional approach, that is, each subscale is analyzed separately. However, the theories on which these inventories are based often postulate specific relationships between the interest traits. This article presents a multidimensional approach to the analysis of vocational interest…
Multidimensional student skills with collaborative filtering

NASA Astrophysics Data System (ADS)

Bergner, Yoav; Rayyan, Saif; Seaton, Daniel; Pritchard, David E.

2013-01-01

Despite the fact that a physics course typically culminates in one final grade for the student, many instructors and researchers believe that there are multiple skills that students acquire to achieve mastery. Assessment validation and data analysis in general may thus benefit from extension to multidimensional ability. This paper introduces an approach for model determination and dimensionality analysis using collaborative filtering (CF), which is related to factor analysis and item response theory (IRT). Model selection is guided by machine learning perspectives, seeking to maximize the accuracy in predicting which students will answer which items correctly. We apply the CF to response data for the Mechanics Baseline Test and combine the results with prior analysis using unidimensional IRT.
A General Program for Item-Response Analysis That Employs the Stabilized Newton-Raphson Algorithm. Research Report. ETS RR-13-32

ERIC Educational Resources Information Center

Haberman, Shelby J.

2013-01-01

A general program for item-response analysis is described that uses the stabilized Newton-Raphson algorithm. This program is written to be compliant with Fortran 2003 standards and is sufficiently general to handle independent variables, multidimensional ability parameters, and matrix sampling. The ability variables may be either polytomous or…
Robust Measurement via A Fused Latent and Graphical Item Response Theory Model.

PubMed

Chen, Yunxiao; Li, Xiaoou; Liu, Jingchen; Ying, Zhiliang

2018-03-12

Item response theory (IRT) plays an important role in psychological and educational measurement. Unlike the classical testing theory, IRT models aggregate the item level information, yielding more accurate measurements. Most IRT models assume local independence, an assumption not likely to be satisfied in practice, especially when the number of items is large. Results in the literature and simulation studies in this paper reveal that misspecifying the local independence assumption may result in inaccurate measurements and differential item functioning. To provide more robust measurements, we propose an integrated approach by adding a graphical component to a multidimensional IRT model that can offset the effect of unknown local dependence. The new model contains a confirmatory latent variable component, which measures the targeted latent traits, and a graphical component, which captures the local dependence. An efficient proximal algorithm is proposed for the parameter estimation and structure learning of the local dependence. This approach can substantially improve the measurement, given no prior information on the local dependence structure. The model can be applied to measure both a unidimensional latent trait and multidimensional latent traits.
Students' proficiency scores within multitrait item response theory

NASA Astrophysics Data System (ADS)

Scott, Terry F.; Schumayer, Daniel

2015-12-01

In this paper we present a series of item response models of data collected using the Force Concept Inventory. The Force Concept Inventory (FCI) was designed to poll the Newtonian conception of force viewed as a multidimensional concept, that is, as a complex of distinguishable conceptual dimensions. Several previous studies have developed single-trait item response models of FCI data; however, we feel that multidimensional models are also appropriate given the explicitly multidimensional design of the inventory. The models employed in the research reported here vary in both the number of fitting parameters and the number of underlying latent traits assumed. We calculate several model information statistics to ensure adequate model fit and to determine which of the models provides the optimal balance of information and parsimony. Our analysis indicates that all item response models tested, from the single-trait Rasch model through to a model with ten latent traits, satisfy the standard requirements of fit. However, analysis of model information criteria indicates that the five-trait model is optimal. We note that an earlier factor analysis of the same FCI data also led to a five-factor model. Furthermore the factors in our previous study and the traits identified in the current work match each other well. The optimal five-trait model assigns proficiency scores to all respondents for each of the five traits. We construct a correlation matrix between the proficiencies in each of these traits. This correlation matrix shows strong correlations between some proficiencies, and strong anticorrelations between others. We present an interpretation of this correlation matrix.
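Model selection by information criteria, as described in this abstract, trades fit against parameter count. The abstract does not say which criteria were used, so the sketch below assumes the two most common ones (AIC and BIC) with their standard definitions; the function names are hypothetical.

```python
import math


def aic(log_likelihood, n_params):
    """Akaike information criterion: 2k - 2 ln L."""
    return 2.0 * n_params - 2.0 * log_likelihood


def bic(log_likelihood, n_params, n_obs):
    """Bayesian information criterion: k ln n - 2 ln L."""
    return n_params * math.log(n_obs) - 2.0 * log_likelihood


def best_by_aic(fits):
    """Return the name of the model with the lowest AIC.

    fits: dict mapping model name -> (log_likelihood, n_params).
    """
    return min(fits, key=lambda name: aic(*fits[name]))
```

Lower values are preferred under both criteria; a model with more traits must improve the log-likelihood by enough to pay for its extra parameters.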
PROC IRT: A SAS Procedure for Item Response Theory

PubMed Central

Matlock Cole, Ki; Paek, Insu

2017-01-01

This article reviews the item response theory procedure (PROC IRT) in SAS/STAT 14.1 for conducting item response theory (IRT) analyses of dichotomous and polytomous datasets that are unidimensional or multidimensional. The review provides an overview of available features, including models, estimation procedures, interfacing, input, and output files. A small-scale simulation study evaluates the IRT model parameter recovery of PROC IRT. The use of the IRT procedure in Statistical Analysis Software (SAS) may be useful for researchers who frequently utilize SAS for analyses, research, and teaching.
Comparison of Factor Simplicity Indices for Dichotomous Data: DETECT R, Bentler's Simplicity Index, and the Loading Simplicity Index

ERIC Educational Resources Information Center

Finch, Holmes; Stage, Alan Kirk; Monahan, Patrick

2008-01-01

A primary assumption underlying several of the common methods for modeling item response data is unidimensionality, that is, test items tap into only one latent trait. This assumption can be assessed several ways, using nonlinear factor analysis and DETECT, a method based on the item conditional covariances. When multidimensionality is identified,…
Modeling Local Item Dependence Due to Common Test Format with a Multidimensional Rasch Model

ERIC Educational Resources Information Center

Baghaei, Purya; Aryadoust, Vahid

2015-01-01

Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…
Assessment of Computer and Information Literacy in ICILS 2013: Do Different Item Types Measure the Same Construct?

ERIC Educational Resources Information Center

Ihme, Jan Marten; Senkbeil, Martin; Goldhammer, Frank; Gerick, Julia

2017-01-01

The combination of different item formats is found quite often in large scale assessments, and analyses on the dimensionality often indicate multi-dimensionality of tests regarding the task format. In ICILS 2013, three different item types (information-based response tasks, simulation tasks, and authoring tasks) were used to measure computer and…
An alternative to Rasch analysis using triadic comparisons and multi-dimensional scaling

NASA Astrophysics Data System (ADS)

Bradley, C.; Massof, R. W.

2016-11-01

Rasch analysis is a principled approach for estimating the magnitude of some shared property of a set of items when a group of people assign ordinal ratings to them. In the general case, Rasch analysis not only estimates person and item measures on the same invariant scale, but also estimates the average thresholds used by the population to define rating categories. However, Rasch analysis fails when there is insufficient variance in the observed responses because it assumes a probabilistic relationship between person measures, item measures and the rating assigned by a person to an item. When only a single person is rating all items, there may be cases where the person assigns the same rating to many items no matter how many times he rates them. We introduce an alternative to Rasch analysis for precisely these situations. Our approach leverages multi-dimensional scaling (MDS) and requires only rank orderings of items and rank orderings of pairs of distances between items to work. Simulations show one variant of this approach - triadic comparisons with non-metric MDS - provides highly accurate estimates of item measures in realistic situations.
Multidimensional Assessment of Spirituality/Religion in Patients with HIV: Conceptual Framework and Empirical Refinement

PubMed Central

Kudel, Ian; Cotton, Sian; Leonard, Anthony C.; Tsevat, Joel; Ritchey, P. Neal

2011-01-01

A decade ago, an expert panel developed a framework for measuring spirituality/religion in health research (Brief Multidimensional Measure of Religiousness/Spirituality), but empirical testing of this framework has been limited. The purpose of this study was to determine whether responses to items across multiple measures assessing spirituality/religion by 450 patients with HIV replicate this model. We hypothesized a six-factor model underlying a collective of 56 items, but results of confirmatory factor analyses suggested eight dimensions: Meaning/Peace, Tangible Connection to the Divine, Positive Religious Coping, Love/Appreciation, Negative Religious Coping, Positive Congregational Support, Negative Congregational Support, and Cultural Practices. This study corroborates parts of the factor structure underlying the Brief Multidimensional Measure of Religiousness/Spirituality and some recent refinements of the original framework. PMID:21136166
Multidimensional assessment of spirituality/religion in patients with HIV: conceptual framework and empirical refinement.

PubMed

Szaflarski, Magdalena; Kudel, Ian; Cotton, Sian; Leonard, Anthony C; Tsevat, Joel; Ritchey, P Neal

2012-12-01

A decade ago, an expert panel developed a framework for measuring spirituality/religion in health research (Brief Multidimensional Measure of Religiousness/Spirituality), but empirical testing of this framework has been limited. The purpose of this study was to determine whether responses to items across multiple measures assessing spirituality/religion by 450 patients with HIV replicate this model. We hypothesized a six-factor model underlying a collective of 56 items, but results of confirmatory factor analyses suggested eight dimensions: Meaning/Peace, Tangible Connection to the Divine, Positive Religious Coping, Love/Appreciation, Negative Religious Coping, Positive Congregational Support, Negative Congregational Support, and Cultural Practices. This study corroborates parts of the factor structure underlying the Brief Multidimensional Measure of Religiousness/Spirituality and some recent refinements of the original framework.
The Nature of Science Instrument-Elementary (NOSI-E): Using Rasch principles to develop a theoretically grounded scale to measure elementary student understanding of the nature of science

NASA Astrophysics Data System (ADS)

Peoples, Shelagh

The purpose of this study was to determine which of three competing models will provide reliable, interpretable, and responsive measures of elementary students' understanding of the nature of science (NOS). The Nature of Science Instrument-Elementary (NOSI-E), a 28-item Rasch-based instrument, was used to assess students' NOS understanding. The NOS construct was conceptualized using five construct dimensions (Empirical, Inventive, Theory-laden, Certainty and Socially & Culturally Embedded). The competing models represent three internal models for the NOS construct. One postulate is that the NOS construct is unidimensional, where one latent construct explains the relationship between the 28 items of the NOSI-E. Alternatively, the NOS construct is composed of five independent unidimensional constructs (the consecutive approach). Lastly, the NOS construct is multidimensional and composed of five inter-related but separate dimensions. A validity argument was developed that hypothesized that the internal structure of the NOS construct is best represented by the multidimensional Rasch model. Four sets of analyses were performed in which the three representations were compared. These analyses addressed five validity aspects (content, substantive, generalizability, structural and external) of construct validity. The vast body of evidence supported the claim that the NOS construct is composed of five separate but inter-related dimensions and is best represented by the multidimensional Rasch model. The results of the multidimensional analyses indicated that the items of the five subscales were of excellent technical quality, exhibited no differential item functioning (based on gender), had an item hierarchy that conformed to theoretical expectations, and together formed subscales of reasonable reliability (> 0.7 on each subscale) that were responsive to change in the construct.
Theory-laden scores from the multidimensional model predicted students' science achievement with scores from all five NOS dimensions significantly predicting students' perceptions of the constructivist nature of their classroom learning environment. The NOSI-E instrument is a theoretically grounded scale that can measure elementary students' NOS understanding and appears suitable for use in science education research.
A Multidimensional Computerized Adaptive Short-Form Quality of Life Questionnaire Developed and Validated for Multiple Sclerosis: The MusiQoL-MCAT.

PubMed

Michel, Pierre; Baumstarck, Karine; Ghattas, Badih; Pelletier, Jean; Loundou, Anderson; Boucekine, Mohamed; Auquier, Pascal; Boyer, Laurent

2016-04-01

The aim was to develop a multidimensional computerized adaptive short-form questionnaire, the MusiQoL-MCAT, from a fixed-length QoL questionnaire for multiple sclerosis. A total of 1992 patients were enrolled in this international cross-sectional study. The development of the MusiQoL-MCAT was based on the assessment of between-items MIRT model fit followed by real-data simulations. The MCAT algorithm was based on Bayesian maximum a posteriori estimation of latent traits and Kullback-Leibler information item selection. We examined several simulations based on a fixed number of items. Accuracy was assessed using correlations (r) between initial IRT scores and MCAT scores. Precision was assessed using the standard error measurement (SEM) and the root mean square error (RMSE). The multidimensional graded response model was used to estimate item parameters and IRT scores. Among the MCAT simulations, the 16-item version of the MusiQoL-MCAT was selected because the accuracy and precision became stable with 16 items with satisfactory levels (r ≥ 0.9, SEM ≤ 0.55, and RMSE ≤ 0.3). External validity of the MusiQoL-MCAT was satisfactory. The MusiQoL-MCAT presents satisfactory properties and can individually tailor QoL assessment to each patient, making it less burdensome to patients and better adapted for use in clinical practice.
A Multidimensional Computerized Adaptive Short-Form Quality of Life Questionnaire Developed and Validated for Multiple Sclerosis

PubMed Central

Michel, Pierre; Baumstarck, Karine; Ghattas, Badih; Pelletier, Jean; Loundou, Anderson; Boucekine, Mohamed; Auquier, Pascal; Boyer, Laurent

2016-01-01

The aim was to develop a multidimensional computerized adaptive short-form questionnaire, the MusiQoL-MCAT, from a fixed-length QoL questionnaire for multiple sclerosis. A total of 1992 patients were enrolled in this international cross-sectional study. The development of the MusiQoL-MCAT was based on the assessment of between-items MIRT model fit followed by real-data simulations. The MCAT algorithm was based on Bayesian maximum a posteriori estimation of latent traits and Kullback–Leibler information item selection. We examined several simulations based on a fixed number of items. Accuracy was assessed using correlations (r) between initial IRT scores and MCAT scores. Precision was assessed using the standard error measurement (SEM) and the root mean square error (RMSE). The multidimensional graded response model was used to estimate item parameters and IRT scores. Among the MCAT simulations, the 16-item version of the MusiQoL-MCAT was selected because the accuracy and precision became stable with 16 items with satisfactory levels (r ≥ 0.9, SEM ≤ 0.55, and RMSE ≤ 0.3). External validity of the MusiQoL-MCAT was satisfactory. The MusiQoL-MCAT presents satisfactory properties and can individually tailor QoL assessment to each patient, making it less burdensome to patients and better adapted for use in clinical practice. PMID:27057832
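The two agreement measures named in the abstract above, the correlation (r) between full-length IRT scores and MCAT scores and the root mean square error (RMSE), can be sketched in a few lines. This is only an illustration of the standard formulas, assuming two equal-length lists of scores for the same respondents. The function names are hypothetical and the study's own computations may differ in detail.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length score lists."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def rmse(x, y):
    """Root mean square error between paired scores."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, y)) / len(x))
```

A high r with a low RMSE indicates that the shortened adaptive test reproduces the full-length scores both in rank order and in absolute value.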
Item Response Modeling of Forced-Choice Questionnaires

ERIC Educational Resources Information Center

Brown, Anna; Maydeu-Olivares, Alberto

2011-01-01

Multidimensional forced-choice formats can significantly reduce the impact of numerous response biases typically associated with rating scales. However, if scored with classical methodology, these questionnaires produce ipsative data, which lead to distorted scale relationships and make comparisons between individuals problematic. This research…
Probing Lexical Representations: Simultaneous Modeling of Word and Reader Contributions to Multidimensional Lexical Representations

ERIC Educational Resources Information Center

Goodwin, Amanda P.; Gilbert, Jennifer K.; Cho, Sun-Joo; Kearns, Devin M.

2014-01-01

The current study models reader, item, and word contributions to the lexical representations of 39 morphologically complex words for 172 middle school students using a crossed random-effects item response model with multiple outcomes. We report 3 findings. First, results suggest that lexical representations can be characterized by separate but…
Selecting Soldiers and Civilians into the U.S. Army Officer Candidate School : Developing Empirical Selection Composites

DTIC Science & Technology

2014-07-01

a biographical instrument measuring personality ; (b) a Work Values instrument representing work preferences investigated in prior officer and...items used in SelectOCS Phase 2 (see Table 2.5). TAPAS uses multidimensional pairwise preference (MDPP) personality items scored using item response...presented respondents with a list of 30 traits and 30 skills (derived from leadership and personality literature) and instructed them to rate the
Evaluation of Student Performance through a Multidimensional Finite Mixture IRT Model.

PubMed

Bacci, Silvia; Bartolucci, Francesco; Grilli, Leonardo; Rampichini, Carla

2017-01-01

In the Italian academic system, a student can enroll for an exam immediately after the end of the teaching period or can postpone it; in this second case the exam result is missing. We propose an approach for the evaluation of a student performance throughout the course of study, accounting also for nonattempted exams. The approach is based on an item response theory model that includes two discrete latent variables representing student performance and priority in selecting the exams to take. We explicitly account for nonignorable missing observations as the indicators of attempted exams also contribute to measure the performance (within-item multidimensionality). The model also allows for individual covariates in its structural part.
Lord-Wingersky Algorithm Version 2.0 for Hierarchical Item Factor Models with Applications in Test Scoring, Scale Alignment, and Model Fit Testing. CRESST Report 830

ERIC Educational Resources Information Center

Cai, Li

2013-01-01

Lord and Wingersky's (1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined…
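For orientation, the classic unidimensional recursion for dichotomous items that the abstract above builds on can be sketched as follows. This is not the multidimensional "version 2.0" extension described in the report. It assumes a single fixed trait value, conditional independence of items given that value, and a hypothetical function name.

```python
def summed_score_likelihoods(probs):
    """Return a list L where L[s] = P(summed score = s | theta).

    probs: per-item probabilities of a correct response at one fixed theta,
    assumed conditionally independent given theta.
    """
    likelihoods = [1.0]  # with zero items, a score of 0 has probability 1
    for p in probs:
        updated = [0.0] * (len(likelihoods) + 1)
        for score, mass in enumerate(likelihoods):
            updated[score] += mass * (1.0 - p)  # item answered incorrectly
            updated[score + 1] += mass * p      # item answered correctly
        likelihoods = updated
    return likelihoods
```

Items are added one at a time, so the cost grows quadratically with test length instead of enumerating every response pattern. With fixed quadrature, the same recursion would be repeated at each quadrature point and the results weighted by the prior.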

Advancing the efficiency and efficacy of patient reported outcomes with multivariate computer adaptive testing.

PubMed

Morris, Scott; Bass, Mike; Lee, Mirinae; Neapolitan, Richard E

2017-09-01

The Patient Reported Outcomes Measurement Information System (PROMIS) initiative developed an array of patient reported outcome (PRO) measures. To reduce the number of questions administered, PROMIS utilizes unidimensional item response theory and unidimensional computer adaptive testing (UCAT), which means a separate set of questions is administered for each measured trait. Multidimensional item response theory (MIRT) and multidimensional computer adaptive testing (MCAT) simultaneously assess correlated traits. The objective was to investigate the extent to which MCAT reduces patient burden relative to UCAT in the case of PROs. One MIRT and 3 unidimensional item response theory models were developed using the related traits anxiety, depression, and anger. Using these models, MCAT and UCAT performance was compared with simulated individuals. Surprisingly, the root mean squared error for both methods increased with the number of items. These results were driven by large errors for individuals with low trait levels. A second analysis focused on individuals aligned with item content. For these individuals, both MCAT and UCAT accuracies improved with additional items. Furthermore, MCAT reduced the test length by 50%. For the PROMIS Emotional Distress banks, neither UCAT nor MCAT provided accurate estimates for individuals at low trait levels. Because the items in these banks were designed to detect clinical levels of distress, there is little information for individuals with low trait values. However, trait estimates for individuals targeted by the banks were accurate and MCAT asked substantially fewer questions. By reducing the number of items administered, MCAT can allow clinicians and researchers to assess a wider range of PROs with less patient burden. © The Author 2017. Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved. For Permissions, please email: journals.permissions@oup.com
The Multidimensional Structure of Verbal Comprehension Test Items.

ERIC Educational Resources Information Center

Peled, Zimra

1984-01-01

The multidimensional structure of verbal comprehension test items was investigated. Empirical evidence was provided to support the theory that item tasks are multivariate-multiordered composites of faceted components: language, contextual knowledge, and cognitive operation. Linear and circular properties of cylindrical manifestation were…
Construct validity of the Swedish version of the revised piper fatigue scale in an oncology sample--a Rasch analysis.

PubMed

Lundgren-Nilsson, Asa; Dencker, Anna; Jakobsson, Sofie; Taft, Charles; Tennant, Alan

2014-06-01

Fatigue is a common and distressing symptom in cancer patients due to both the disease and its treatments. The concept of fatigue is multidimensional and includes both physical and mental components. The 22-item Revised Piper Fatigue Scale (RPFS) is a multidimensional instrument developed to assess cancer-related fatigue. This study reports on the construct validity of the Swedish version of the RPFS from the perspective of Rasch measurement. The Swedish version of the RPFS was answered by 196 cancer patients fatigued after 4 to 5 weeks of curative radiation therapy. Data from the scale were fitted to the Rasch measurement model. This involved testing a series of assumptions, including the stochastic ordering of items, local response dependency, and unidimensionality. A series of fit statistics were computed, differential item functioning (DIF) was tested, and local response dependency was accommodated through testlets. The Behavioral, Affective and Sensory domains all satisfied the Rasch model expectations. No DIF was observed, and all domains were found to be unidimensional. The Mood/Cognitive scale failed to fit the model, and substantial multidimensionality was found. Splitting the scale between Mood and Cognitive items resolved fit to the Rasch model, and new domains were unidimensional without DIF. The current Rasch analyses add to the evidence of measurement properties of the scale and show that the RPFS has good psychometric properties and works well to measure fatigue. The original four-factor structure, however, was not supported. Copyright © 2014 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
A Simulation Study on Methods of Correcting for the Effects of Extreme Response Style

ERIC Educational Resources Information Center

Wetzel, Eunike; Böhnke, Jan R.; Rose, Norman

2016-01-01

The impact of response styles such as extreme response style (ERS) on trait estimation has long been a matter of concern to researchers and practitioners. This simulation study investigated three methods that have been proposed for the correction of trait estimates for ERS effects: (a) mixed Rasch models, (b) multidimensional item response models,…
Assessment of fatigue in rheumatoid arthritis: a psychometric comparison of single-item, multiitem, and multidimensional measures.

PubMed

Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Bode, Christina; Vonkeman, Harald E; Glas, Cees A W; Jansen, Tim; van Albada-Kuipers, Iet; van Riel, Piet L C M; van de Laar, Mart A F J

2015-03-01

To compare the psychometric functioning of multidimensional disease-specific, multiitem generic, and single-item measures of fatigue in patients with rheumatoid arthritis (RA). Confirmatory factor analysis (CFA) and longitudinal item response theory (IRT) modeling were used to evaluate the measurement structure and local reliability of the Bristol RA Fatigue Multi-Dimensional Questionnaire (BRAF-MDQ), the Medical Outcomes Study Short Form-36 (SF-36) vitality scale, and the BRAF Numerical Rating Scales (BRAF-NRS) in a sample of 588 patients with RA. A 1-factor CFA model yielded a similar fit to a 5-factor model with subscale-specific dimensions, and the items from the different instruments adequately fit the IRT model, suggesting essential unidimensionality in measurement. The SF-36 vitality scale outperformed the BRAF-MDQ at lower levels of fatigue, but was less precise at moderate to higher levels of fatigue. At these levels of fatigue, the living, cognition, and emotion subscales of the BRAF-MDQ provide additional precision. The BRAF-NRS showed a limited measurement range with its highest precision centered on average levels of fatigue. The different instruments appear to access a common underlying domain of fatigue severity, but differ considerably in their measurement precision along the continuum. The SF-36 vitality scale can be used to measure fatigue severity in samples with relatively mild fatigue. For samples expected to have higher levels of fatigue, the multidimensional BRAF-MDQ appears to be a better choice. The BRAF-NRS are not recommended if precise assessment is required, for instance in longitudinal settings.
The SIETTE Automatic Assessment Environment

ERIC Educational Resources Information Center

Conejo, Ricardo; Guzmán, Eduardo; Trella, Monica

2016-01-01

This article describes the evolution and current state of the domain-independent Siette assessment environment. Siette supports different assessment methods--including classical test theory, item response theory, and computer adaptive testing--and integrates them with multidimensional student models used by intelligent educational systems.…
Replication of Structure Findings regarding the Interpersonal Reactivity Index.

ERIC Educational Resources Information Center

Carey, John C.; And Others

1988-01-01

Attempted to verify multidimensional nature and item composition of Interpersonal Reactivity Index (IRI) subscales through factor analysis. IRI responses from 365 female clinical dieticians and dietetic interns supported contention that IRI subscales measure four discernibly different empathy dimensions. (NB)
Umyuangcaryaraq "Reflecting": multidimensional assessment of reflective processes on the consequences of alcohol use among rural Yup'ik Alaska Native youth.

PubMed

Allen, James; Fok, Carlotta Ching Ting; Henry, David; Skewes, Monica

2012-09-01

Concerns in some settings regarding the accuracy and ethics of employing direct questions about alcohol use suggest need for alternative assessment approaches with youth. Umyuangcaryaraq is a Yup'ik Alaska Native word meaning "Reflecting." The Reflective Processes Scale was developed as a youth measure tapping awareness and thinking over potential negative consequences of alcohol misuse as a protective factor that includes cultural elements often shared by many other Alaska Native and American Indian cultures. This study assessed multidimensional structure, item functioning, and validity. Responses from 284 rural Alaska Native youth allowed bifactor analysis to assess structure, estimates of location and discrimination parameters, and convergent and discriminant validity. A bifactor model of the scale items with three content factors provided excellent fit to observed data. Item response theory analysis suggested a binary response format as optimal. Evidence of convergent and discriminant validity was established. The measure provides an assessment of reflective processes about alcohol that Alaska Native youth engage in when thinking about reasons not to drink. The concept of reflective processes has potential to extend understandings of cultural variation in mindfulness, alcohol expectancies research, and culturally mediated protective factors in Alaska Native and American Indian youth.
Conditional Covariance-Based Nonparametric Multidimensionality Assessment.

ERIC Educational Resources Information Center

Stout, William; And Others

1996-01-01

Three nonparametric procedures that use estimates of covariances of item-pair responses conditioned on examinee trait level for assessing dimensionality of a test are described. The HCA/CCPROX, DIMTEST, and DETECT are applied to a dimensionality study of the Law School Admission Test. (SLD)
Evaluation of the Parent-Report Inventory of Callous-Unemotional Traits in a Sample of Children Recruited from Intimate Partner Violence Services: A Multidimensional Rasch Analysis.

PubMed

McDonald, Shelby Elaine; Ma, Lin; Green, Kathy E; Hitti, Stephanie A; Cody, Anna M; Donovan, Courtney; Williams, James Herbert; Ascione, Frank R

2018-03-01

Our study applied multidimensional item response theory (MIRT) to compare structural models of the parent-report version of the Inventory of Callous and Unemotional Traits (ICU; English and North American Spanish translations). A total of 291 maternal caregivers were recruited from community-based domestic violence services and reported on their children (77.9% ethnic minority; 47% female), who ranged in age from 7 to 12 years (mean = 9.07, standard deviation = 1.64). We compared 9 models that were based on prior psychometric evaluations of the ICU. MIRT analyses indicated that a revised 18-item version comprising 2 factors (callous-unemotional and empathic-prosocial) was more suitable for our sample. Differential item functioning was found for several items across ethnic and language groups, but not for child gender or age. Evidence of construct validity was found. We recommend continued research and revisions to the ICU to better assess the presence of callous-unemotional traits in community samples of school-age children. © 2017 Wiley Periodicals, Inc.
Item response theory and structural equation modelling for ordinal data: Describing the relationship between KIDSCREEN and Life-H.

PubMed

Titman, Andrew C; Lancaster, Gillian A; Colver, Allan F

2016-10-01

Both item response theory and structural equation models are useful in the analysis of ordered categorical responses from health assessment questionnaires. We highlight the advantages and disadvantages of the item response theory and structural equation modelling approaches to modelling ordinal data, from within a community health setting. Using data from the SPARCLE project focussing on children with cerebral palsy, this paper investigates the relationship between two ordinal rating scales, the KIDSCREEN, which measures quality-of-life, and Life-H, which measures participation. Practical issues relating to fitting models, such as non-positive definite observed or fitted correlation matrices, and approaches to assessing model fit are discussed. Item response theory models allow properties such as the conditional independence of particular domains of a measurement instrument to be assessed. When, as with the SPARCLE data, the latent traits are multidimensional, structural equation models generally provide a much more convenient modelling framework. © The Author(s) 2013.
Modernizing quality of life assessment: development of a multidimensional computerized adaptive questionnaire for patients with schizophrenia.

PubMed

Michel, Pierre; Baumstarck, Karine; Lancon, Christophe; Ghattas, Badih; Loundou, Anderson; Auquier, Pascal; Boyer, Laurent

2018-04-01

Quality of life (QoL) is still assessed using paper-based and fixed-length questionnaires, which is one reason why QoL measurements have not been routinely implemented in clinical practice. Providing new QoL measures that combine computer technology with modern measurement theory may enhance their clinical use. The aim of this study was to develop a QoL multidimensional computerized adaptive test (MCAT), the SQoL-MCAT, from the fixed-length SQoL questionnaire for patients with schizophrenia. In this multicentre cross-sectional study, we collected sociodemographic information, clinical characteristics (i.e., duration of illness, the PANSS, and the Calgary Depression Scale), and quality of life (i.e., SQoL). The development of the SQoL-MCAT was divided into three stages: (1) multidimensional item response theory (MIRT) analysis, (2) multidimensional computerized adaptive test (MCAT) simulations with analyses of accuracy and precision, and (3) external validity. Five hundred and seventeen patients participated in this study. The MIRT analysis found that all items displayed good fit with the multidimensional graded response model, with satisfactory reliability for each dimension. The SQoL-MCAT was 39% shorter than the fixed-length SQoL questionnaire and had satisfactory accuracy (levels of correlation >0.9) and precision (standard error of measurement <0.55 and root mean square error <0.3). External validity was confirmed via correlations between the SQoL-MCAT dimension scores and symptomatology scores. The SQoL-MCAT is the first computerized adaptive QoL questionnaire for patients with schizophrenia. Tailored for patient characteristics and significantly shorter than the paper-based version, the SQoL-MCAT may improve the feasibility of assessing QoL in clinical practice.
Item response theory - A first approach

NASA Astrophysics Data System (ADS)

Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

2017-07-01

Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Response Models available for measurement analysis has increased considerably in the last fifteen years due to increasing computer power and a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related to developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also led to numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (van der Linden & Hambleton, 1997). As stated before, Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.
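The logistic models with one, two and three parameters mentioned in the abstract above share one functional form, which the following minimal sketch writes out. The parameter names (a for discrimination, b for difficulty, c for the lower asymptote) follow common textbook usage and are an assumption here, as is the omission of the scaling constant D = 1.7 that some formulations include.

```python
import math


def irf_3pl(theta, a=1.0, b=0.0, c=0.0):
    """Probability of a correct response under the three-parameter logistic model.

    theta: latent trait level of the respondent
    a: item discrimination, b: item difficulty, c: lower asymptote (guessing)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


def irf_2pl(theta, a, b):
    """Two-parameter model: the 3PL with no guessing (c = 0)."""
    return irf_3pl(theta, a=a, b=b, c=0.0)


def irf_1pl(theta, b):
    """One-parameter model: the 2PL with discrimination fixed at 1."""
    return irf_3pl(theta, a=1.0, b=b, c=0.0)
```

When the trait level equals the item difficulty, the 1PL and 2PL give a probability of 0.5, while the 3PL gives c + (1 - c) / 2, which shows how the guessing parameter raises the curve's floor.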
The Piper Fatigue Scale-12 (PFS-12): psychometric findings and item reduction in a cohort of breast cancer survivors.

PubMed

Reeve, Bryce B; Stover, Angela M; Alfano, Catherine M; Smith, Ashley Wilder; Ballard-Barbash, Rachel; Bernstein, Leslie; McTiernan, Anne; Baumgartner, Kathy B; Piper, Barbara F

2012-11-01

Brief, valid measures of fatigue, a prevalent and distressing cancer symptom, are needed for use in research. This study's primary aim was to create a shortened version of the revised Piper Fatigue Scale (PFS-R) based on data from a diverse cohort of breast cancer survivors. A secondary aim was to determine whether the PFS captured multiple distinct aspects of fatigue (a multidimensional model) or a single overall fatigue factor (a unidimensional model). Breast cancer survivors (n = 799; stages in situ through IIIa; ages 29-86 years) were recruited through three SEER registries (New Mexico, Western Washington, and Los Angeles, CA) as part of the Health, Eating, Activity, and Lifestyle (HEAL) study. Fatigue was measured approximately 3 years post-diagnosis using the 22-item PFS-R that has four subscales (Behavior, Affect, Sensory, and Cognition). Confirmatory factor analysis was used to compare unidimensional and multidimensional models. Six criteria were used to make item selections to shorten the PFS-R: scale's content validity, items' relationship with fatigue, content redundancy, differential item functioning by race and/or education, scale reliability, and literacy demand. Factor analyses supported the original 4-factor structure. There was also evidence from the bi-factor model for a dominant underlying fatigue factor. Six items tested positive for differential item functioning between African-American and Caucasian survivors. Four additional items either showed poor association, local dependence, or content validity concerns. After removing these 10 items, the reliability of the PFS-12 subscales ranged from 0.87 to 0.89, compared to 0.90-0.94 prior to item removal. The newly developed PFS-12 can be used to assess fatigue in African-American and Caucasian breast cancer survivors and reduces response burden without compromising reliability or validity. 
This is the first study to determine PFS literacy demand and to compare PFS-R responses in African-Americans and Caucasian breast cancer survivors. Further testing in diverse populations is warranted.
Measuring Response Styles Across the Big Five: A Multiscale Extension of an Approach Using Multinomial Processing Trees.

PubMed

Khorramdel, Lale; von Davier, Matthias

2014-01-01

This study shows how to address the problem of trait-unrelated response styles (RS) in rating scales using multidimensional item response theory. The aim is to test and correct data for RS in order to provide fair assessments of personality. Expanding on an approach presented by Böckenholt (2012), observed rating data are decomposed into multiple response processes based on a multinomial processing tree. The data come from a questionnaire consisting of 50 items of the International Personality Item Pool measuring the Big Five dimensions administered to 2,026 U.S. students with a 5-point rating scale. It is shown that this approach can be used to test if RS exist in the data and that RS can be differentiated from trait-related responses. Although the extreme RS appear to be unidimensional after exclusion of only 1 item, a unidimensional measure for the midpoint RS is obtained only after exclusion of 10 items. Both RS measurements show high cross-scale correlations and item response theory-based (marginal) reliabilities. Cultural differences could be found in giving extreme responses. Moreover, it is shown how to score rating data to correct for RS after being proved to exist in the data.
Factorial invariance of pediatric patient self-reported fatigue across age and gender: a multigroup confirmatory factor analysis approach utilizing the PedsQL™ Multidimensional Fatigue Scale.

PubMed

Varni, James W; Beaujean, A Alexander; Limbers, Christine A

2013-11-01

In order to compare multidimensional fatigue research findings across age and gender subpopulations, it is important to demonstrate measurement invariance, that is, that the items from an instrument have equivalent meaning across the groups studied. This study examined the factorial invariance of the 18-item PedsQL™ Multidimensional Fatigue Scale items across age and gender and tested a bifactor model. Multigroup confirmatory factor analysis (MG-CFA) was performed specifying a three-factor model across three age groups (5-7, 8-12, and 13-18 years) and gender. MG-CFA models were proposed in order to compare the factor structure, metric, scalar, and error variance across age groups and gender. The analyses were based on 837 children and adolescents recruited from general pediatric clinics, subspecialty clinics, and hospitals in which children were being seen for well-child checks, mild acute illness, or chronic illness care. A bifactor model of the items with one general factor influencing all the items and three domain-specific factors representing the General, Sleep/Rest, and Cognitive Fatigue domains fit the data better than oblique factor models. Based on the multiple measures of model fit, configural, metric, and scalar invariance were found for almost all items across the age and gender groups, as was invariance in the factor covariances. The PedsQL™ Multidimensional Fatigue Scale demonstrated strict factorial invariance for child and adolescent self-report across gender and strong factorial invariance across age subpopulations. The findings support an equivalent three-factor structure across the age and gender groups studied. Based on these data, it can be concluded that pediatric patients across the groups interpreted the items in a similar manner regardless of their age or gender, supporting the multidimensional factor structure interpretation of the PedsQL™ Multidimensional Fatigue Scale.
Differential item functioning analysis of the Vanderbilt Expertise Test for cars.

PubMed

Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W; Van Gulick, Ana Beth; Gauthier, Isabel

2015-01-01

The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.
An Aggregate IRT Procedure for Exploratory Factor Analysis

ERIC Educational Resources Information Center

Camilli, Gregory; Fox, Jean-Paul

2015-01-01

An aggregation strategy is proposed to potentially address practical limitations related to computing resources for two-level multidimensional item response theory (MIRT) models with large data sets. The aggregate model is derived by integration of the normal ogive model, and an adaptation of the stochastic approximation expectation maximization…
Using Unidimensional IRT Models for Dichotomous Classification via Computerized Classification Testing with Multidimensional Data.

ERIC Educational Resources Information Center

Lau, Che-Ming Allen; And Others

This study focused on the robustness of unidimensional item response theory (UIRT) models in computerized classification testing against violation of the unidimensionality assumption. The study addressed whether UIRT models remain acceptable under various testing conditions and dimensionality strengths. Monte Carlo simulation techniques were used…
Multidimensional and Hierarchical Assessment of School Motivation: Cross-Cultural Validation

ERIC Educational Resources Information Center

McInerney, Dennis M.; Ali, Jinnat

2006-01-01

This study examines the multidimensional and hierarchical structure of achievement goal orientation measured by the Inventory of School Motivation. The instrument consists of eight different scales with 43 survey items (ranging from three to seven items each). Each scale reflects one of eight specific dimensions: task, effort, competition, social…

Generalized Full-Information Item Bifactor Analysis

PubMed Central

Cai, Li; Yang, Ji Seung; Hansen, Mark

2011-01-01

Full-information item bifactor analysis is an important statistical method in psychological and educational measurement. Current methods are limited to single group analysis and inflexible in the types of item response models supported. We propose a flexible multiple-group item bifactor analysis framework that supports a variety of multidimensional item response theory models for an arbitrary mixing of dichotomous, ordinal, and nominal items. The extended item bifactor model also enables the estimation of latent variable means and variances when data from more than one group are present. Generalized user-defined parameter restrictions are permitted within or across groups. We derive an efficient full-information maximum marginal likelihood estimator. Our estimation method achieves substantial computational savings by extending Gibbons and Hedeker’s (1992) bifactor dimension reduction method so that the optimization of the marginal log-likelihood only requires two-dimensional integration regardless of the dimensionality of the latent variables. We use simulation studies to demonstrate the flexibility and accuracy of the proposed methods. We apply the model to study cross-country differences, including differential item functioning, using data from a large international education survey on mathematics literacy. PMID:21534682
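The dimension reduction mentioned in this abstract can be sketched as a worked equation. The notation is an assumption of this note rather than the authors': \(\theta_0\) is the general factor, \(\theta_s\) the specific factor for item cluster \(\mathcal{I}_s\) (\(s = 1, \dots, S\)), \(\phi\) a standard normal density, and every item is assumed to load on the general factor and on exactly one specific factor, with the specific factors mutually independent given \(\theta_0\).

```latex
\[
P(\mathbf{y})
  = \int \Biggl[\, \prod_{s=1}^{S} \int
      \prod_{j \in \mathcal{I}_s} P\bigl(y_j \mid \theta_0, \theta_s\bigr)\,
      \phi(\theta_s)\, d\theta_s \Biggr]
    \phi(\theta_0)\, d\theta_0
\]
```

Under these assumptions the \((S+1)\)-dimensional integral factors into an outer integral over \(\theta_0\) and \(S\) inner one-dimensional integrals, so only pairs \((\theta_0, \theta_s)\) are ever integrated jointly, whatever the value of \(S\).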
Dimensionality Assessment for Dichotomously Scored Items Using Multidimensional Scaling.

ERIC Educational Resources Information Center

Jones, Patricia B.; And Others

In order to determine the effectiveness of multidimensional scaling (MDS) in recovering the dimensionality of a set of dichotomously-scored items, data were simulated in one, two, and three dimensions for a variety of correlations with the underlying latent trait. Similarity matrices were constructed from these data using three margin-sensitive…
Comparing the Performance of Five Multidimensional CAT Selection Procedures with Different Stopping Rules

ERIC Educational Resources Information Center

Yao, Lihua

2013-01-01

Through simulated data, five multidimensional computerized adaptive testing (MCAT) selection procedures with varying test lengths are examined and compared using different stopping rules. Fixed item exposure rates are used for all the items, and the Priority Index (PI) method is used for the content constraints. Two stopping rules, standard error…
Calibration of the Test of Relational Reasoning.

PubMed

Dumas, Denis; Alexander, Patricia A

2016-10-01

Relational reasoning, or the ability to discern meaningful patterns within a stream of information, is a critical cognitive ability associated with academic and professional success. Importantly, relational reasoning has been described as taking multiple forms, depending on the type of higher order relations being drawn between and among concepts. However, the reliable and valid measurement of such a multidimensional construct of relational reasoning has been elusive. The Test of Relational Reasoning (TORR) was designed to tap 4 forms of relational reasoning (i.e., analogy, anomaly, antinomy, and antithesis). In this investigation, the TORR was calibrated and scored using multidimensional item response theory in a large, representative undergraduate sample. The bifactor model was identified as the best-fitting model, and used to estimate item parameters and construct reliability. To improve the usefulness of the TORR to educators, scaled scores were also calculated and presented. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Multidimensional daily diary of fatigue-fibromyalgia-17 items (MDF-fibro-17). part 1: development and content validity.

PubMed

Morris, S; Li, Y; Smith, J A M; Dube', S; Burbridge, C; Symonds, T

2017-05-16

Fibromyalgia (FM), a disorder characterized by chronic widespread pain and tenderness, affects greater than five million individuals in the United States alone. Patients experience multiple symptoms in addition to pain, and among them, fatigue is one of the most bothersome and disabling. There is a growing body of literature suggesting that fatigue is a multidimensional concept. Currently, to our knowledge, no multidimensional Patient Reported Outcome (PRO) measure of FM-related fatigue meets Food and Drug Administration (FDA) requirements to support a product label claim. Therefore, the objective of this research was to evaluate qualitative and quantitative data previously gathered to inform the development of a comprehensive, multidimensional, PRO measure to assess FM-related fatigue in FM clinical trials. Existing qualitative and quantitative data from three previously conducted studies in patients with FM were reviewed to inform the initial development of a multidimensional PRO measure of FM-related fatigue: 1) a concept elicitation study involving in-depth, open-ended interviews with patients with FM in the United States (US) (N = 20), Germany (N = 10), and France (N = 10); 2) a cognitive debriefing and pilot study of a preliminary pool of 23 items (N = 20 US patients with FM); and 3) a methodology study that explored initial psychometrics of the item pool (N = 145 US patients with FM). Five domains were identified that intend to capture the broad experience of FM-related fatigue reported in the qualitative research: the Global Fatigue Experience, Cognitive Fatigue, Physical Fatigue, Motivation, and Impact on Function. Seventeen of the original pool of 23 items were selected to best capture these five dimensions. These 17 items formed the basis of a newly developed multidimensional PRO measure to assess FM-related fatigue in clinical trials: the Multidimensional Daily Diary of Fatigue-Fibromyalgia-17 (MDF-Fibro-17). 
Qualitative analysis, and preliminary quantitative item level data, confirmed that FM-related fatigue is multidimensional and provided strong support for the content validity of the MDF-Fibro-17. The next stage was to quantitatively evaluate the measure to confirm the factor structure, psychometric properties, sensitivity to change, and meaningful change. This has been conducted and is being reported separately.
A mixed-effects regression model for longitudinal multivariate ordinal data.

PubMed

Liu, Li C; Hedeker, Donald

2006-03-01

A mixed-effects item response theory model that allows for three-level multivariate ordinal outcomes and accommodates multiple random subject effects is proposed for analysis of multivariate ordinal outcomes in longitudinal studies. This model allows for the estimation of different item factor loadings (item discrimination parameters) for the multiple outcomes. The covariates in the model do not have to follow the proportional odds assumption and can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is proposed utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher scoring solution, which provides standard errors for all model parameters, is used. An analysis of a longitudinal substance use data set, where four items of substance use behavior (cigarette use, alcohol use, marijuana use, and getting drunk or high) are repeatedly measured over time, is used to illustrate application of the proposed model.
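Gauss-Hermite quadrature for integrating out random effects, as mentioned in this abstract, can be illustrated with a deliberately simplified sketch: a single normally distributed random subject effect and binary (not ordinal) items under a logistic response function. The function name `marginal_pattern_prob` and its parameters are hypothetical and are not taken from the paper, which treats the multidimensional ordinal case.

```python
import numpy as np


def marginal_pattern_prob(y, a, b, sigma=1.0, n_points=21):
    """Marginal probability of a binary response pattern `y`.

    Integrates prod_j P(y_j | theta) over theta ~ N(0, sigma^2) using
    Gauss-Hermite quadrature, with P(y_j = 1 | theta) logistic in
    a_j * (theta - b_j). Simplified illustration: one dimension only.
    """
    y = np.asarray(y, dtype=float)
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)

    # Nodes and weights for integrals of the form  int f(x) exp(-x^2) dx.
    nodes, weights = np.polynomial.hermite.hermgauss(n_points)

    # Change of variables theta = sqrt(2) * sigma * x turns the normal
    # density into the exp(-x^2) weight, leaving a factor 1 / sqrt(pi).
    theta = np.sqrt(2.0) * sigma * nodes

    # Response probabilities at each node: shape (n_points, n_items).
    p = 1.0 / (1.0 + np.exp(-a * (theta[:, None] - b)))
    likelihood = np.prod(np.where(y == 1.0, p, 1.0 - p), axis=1)

    return float(np.sum(weights * likelihood) / np.sqrt(np.pi))
```

With several random effects the same idea applies on a grid formed from the one-dimensional nodes, which is why the number of quadrature points grows quickly with the number of dimensions.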
Performance of the S-χ² Statistic for Full-Information Bifactor Models

ERIC Educational Resources Information Center

Li, Ying; Rupp, Andre A.

2011-01-01

This study investigated the Type I error rate and power of the multivariate extension of the S-χ² statistic using unidimensional and multidimensional item response theory (UIRT and MIRT, respectively) models as well as full-information bifactor (FI-bifactor) models through simulation. Manipulated factors included test length, sample…
Short-Term Memory Scanning Viewed as Exemplar-Based Categorization

ERIC Educational Resources Information Center

Nosofsky, Robert M.; Little, Daniel R.; Donkin, Christopher; Fific, Mario

2011-01-01

Exemplar-similarity models such as the exemplar-based random walk (EBRW) model (Nosofsky & Palmeri, 1997b) were designed to provide a formal account of multidimensional classification choice probabilities and response times (RTs). At the same time, a recurring theme has been to use exemplar models to account for old-new item recognition and to…
Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

ERIC Educational Resources Information Center

Lee, Guemin; Lee, Won-Chan

2016-01-01

The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…
The Robustness of IRT-Based Vertical Scaling Methods to Violation of Unidimensionality

ERIC Educational Resources Information Center

Yin, Liqun

2013-01-01

In recent years, many states have adopted Item Response Theory (IRT) based vertically scaled tests due to their compelling features in a growth-based accountability context. However, selection of a practical and effective calibration/scaling method and proper understanding of issues with possible multidimensionality in the test data is critical to…
Calibration of Response Data Using MIRT Models with Simple and Mixed Structures

ERIC Educational Resources Information Center

Zhang, Jinming

2012-01-01

It is common to assume during a statistical analysis of a multiscale assessment that the assessment is composed of several unidimensional subtests or that it has simple structure. Under this assumption, the unidimensional and multidimensional approaches can be used to estimate item parameters. These two approaches are equivalent in parameter…
Exploring Unidimensional Proficiency Classification Accuracy from Multidimensional Data in a Vertical Scaling Context

ERIC Educational Resources Information Center

Kroopnick, Marc Howard

2010-01-01

When Item Response Theory (IRT) is operationally applied for large scale assessments, unidimensionality is typically assumed. This assumption requires that the test measures a single latent trait. Furthermore, when tests are vertically scaled using IRT, the assumption of unidimensionality would require that the battery of tests across grades…
The Dimensionality of Cognitive Structure: A MIRT Approach and the Use of Subscores

ERIC Educational Resources Information Center

Cheng, Yi-Ling

2016-01-01

The present study explored the dimensionality of cognitive structure from two approaches. The first approach used a famous relation between Visual Spatial Working Memory (VSWM) and calculation to demonstrate the multidimensional item response analyses when true dimensions are unknown. The second approach explored the detectability of dimensions by…
A model for incomplete longitudinal multivariate ordinal data.

PubMed

Liu, Li C

2008-12-30

In studies where multiple outcome items are repeatedly measured over time, missing data often occur. A longitudinal item response theory model is proposed for analysis of multivariate ordinal outcomes that are repeatedly measured. Under the MAR assumption, this model accommodates missing data at any level (missing item at any time point and/or missing time point). It allows for multiple random subject effects and the estimation of item discrimination parameters for the multiple outcome items. The covariates in the model can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is described utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher-scoring solution, which provides standard errors for all model parameters, is used. A data set from a longitudinal prevention study is used to motivate the application of the proposed model. In this study, multiple ordinal items of health behavior are repeatedly measured over time. Because of a planned missing design, subjects answered only two-thirds of all items at a given point. Copyright 2008 John Wiley & Sons, Ltd.
The PedsQL Multidimensional Fatigue Scale in pediatric rheumatology: reliability and validity.

PubMed

Varni, James W; Burwinkle, Tasha M; Szer, Ilona S

2004-12-01

The PedsQL (Pediatric Quality of Life Inventory) is a modular instrument designed to measure health related quality of life (HRQOL) in children and adolescents ages 2-18 years. The recently developed 18-item PedsQL Multidimensional Fatigue Scale was designed to measure fatigue in pediatric patients and comprises the General Fatigue Scale (6 items), Sleep/Rest Fatigue Scale (6 items), and Cognitive Fatigue Scale (6 items). The PedsQL 4.0 Generic Core Scales were developed as the generic core measure to be integrated with the PedsQL Disease-Specific Modules. The PedsQL 3.0 Rheumatology Module was designed to measure pediatric rheumatology-specific HRQOL. Methods. The PedsQL Multidimensional Fatigue Scale, Generic Core Scales, and Rheumatology Module were administered to 163 children and 154 parents (183 families accrued overall) recruited from a pediatric rheumatology clinic. Results. Internal consistency reliability for the PedsQL Multidimensional Fatigue Scale Total Score (α = 0.95 child, 0.95 parent report), General Fatigue Scale (α = 0.93 child, 0.92 parent), Sleep/Rest Fatigue Scale (α = 0.88 child, 0.90 parent), and Cognitive Fatigue Scale (α = 0.93 child, 0.96 parent) were excellent for group and individual comparisons. The validity of the PedsQL Multidimensional Fatigue Scale was confirmed through hypothesized intercorrelations with dimensions of generic and rheumatology-specific HRQOL. The PedsQL Multidimensional Fatigue Scale distinguished between healthy children and children with rheumatic diseases as a group, and was associated with greater disease severity. Children with fibromyalgia manifested greater fatigue than children with other rheumatic diseases. The results confirm the initial reliability and validity of the PedsQL Multidimensional Fatigue Scale in pediatric rheumatology.
Assessing Psycho-social Barriers to Rehabilitation in Injured Workers with Chronic Musculoskeletal Pain: Development and Item Properties of the Yellow Flag Questionnaire (YFQ).

PubMed

Salathé, Cornelia Rolli; Trippolini, Maurizio Alen; Terribilini, Livio Claudio; Oliveri, Michael; Elfering, Achim

2018-06-01

Purpose To develop a multidimensional scale to assess psychosocial beliefs, the Yellow Flag Questionnaire (YFQ), aimed at guiding interventions for workers with chronic musculoskeletal (MSK) pain. Methods Phase 1 consisted of item selection based on literature search, item development and expert consensus rounds. In phase 2, items were reduced by calculating a quality score per item, using structural equation modeling and confirmatory factor analysis on data from 666 workers. In phase 3, Cronbach's α and Pearson correlation coefficients were computed to compare YFQ with disability, anxiety, depression and self-efficacy and the YFQ score based on data from 253 injured workers. Regressions of YFQ total score on disability, anxiety, depression and self-efficacy were calculated. Results After phase 1, the YFQ included 116 items and 15 domains. Further reductions of items in phase 2 by applying the item quality criteria reduced the total to 48 items. Phase factor analysis with structural equation modeling confirmed 32 items in seven domains: activity, work, emotions, harm & blame, diagnosis beliefs, co-morbidity and control. Cronbach's α was 0.91 for the total score and between 0.49 and 0.81 for the seven domain scores. Correlations of the YFQ total score with disability, anxiety, depression and self-efficacy were .58, .66, .73 and -.51, respectively. After controlling for age and gender, the YFQ total score explained between 27% and 53% of the variance (R²) in disability, anxiety, depression and self-efficacy. Conclusions The YFQ, a multidimensional screening scale, is recommended for assessing psychosocial beliefs of workers with chronic MSK pain. Further evaluation of measurement properties such as test-retest reliability, responsiveness and prognostic validity is warranted.
Multidimensional CAT Item Selection Methods for Domain Scores and Composite Scores: Theory and Applications

ERIC Educational Resources Information Center

Yao, Lihua

2012-01-01

Multidimensional computer adaptive testing (MCAT) can provide higher precision and reliability or reduce test length when compared with unidimensional CAT or with the paper-and-pencil test. This study compared five item selection procedures in the MCAT framework for both domain scores and overall scores through simulation by varying the structure…
Dimensionality of the Latent Structure and Item Selection via Latent Class Multidimensional IRT Models

ERIC Educational Resources Information Center

Bartolucci, F.; Montanari, G. E.; Pandolfi, S.

2012-01-01

With reference to a questionnaire aimed at assessing the performance of Italian nursing homes on the basis of the health conditions of their patients, we investigate two relevant issues: dimensionality of the latent structure and discriminating power of the items composing the questionnaire. The approach is based on a multidimensional item…
Speeded Old-New Recognition of Multidimensional Perceptual Stimuli: Modeling Performance at the Individual-Participant and Individual-Item Levels

ERIC Educational Resources Information Center

Nosofsky, Robert M.; Stanton, Roger D.

2006-01-01

Observers made speeded old-new recognition judgments of color stimuli embedded in a multidimensional similarity space. The paradigm used multiple lists but with the underlying similarity structures repeated across lists, to allow for quantitative modeling of the data at the individual-participant and individual-item levels. Correct rejection…
Differential item functioning analysis of the Vanderbilt Expertise Test for cars

PubMed Central

Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W.; Van Gulick, Ana Beth; Gauthier, Isabel

2015-01-01

The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge. PMID:26418499

Item response theory detects differential item functioning between healthy and ill children in QoL measures

PubMed Central

Langer, Michelle M.; Hill, Cheryl D.; Thissen, David; Burwinkle, Tasha M.; Varni, James W.; DeWalt, Darren A.

2008-01-01

Objective To demonstrate the value of item response theory (IRT) and differential item functioning (DIF) methods in examining a health-related quality of life (HRQOL) measure in children and adolescents. Study Design and Setting This illustration uses data from 5,429 children using the four subscales of the PedsQL™ 4.0 Generic Core Scales. The IRT model-based likelihood ratio test was used to detect and evaluate DIF between healthy children and children with a chronic condition. Results DIF was detected for a majority of items but cancelled out at the total test score level due to opposing directions of DIF. Post-hoc analysis indicated that this pattern of results may be due to multidimensionality. We discuss issues in detecting and handling DIF. Conclusion This paper describes how to perform DIF analyses in validating a questionnaire to ensure that scores have equivalent meaning across subgroups. It offers insight into ways information gained through the analysis can be used to evaluate an existing scale. PMID:18226750
Development and Validation of a Short Form for the Multidimensional Work Ethic Profile

ERIC Educational Resources Information Center

Meriac, John P.; Woehr, David J.; Gorman, C. Allen; Thomas, Amanda L. E.

2013-01-01

The multidimensional work ethic profile (MWEP) has become one of the most widely-used inventories for measuring the work ethic construct. However, its length has been a potential barrier to even more widespread use. We developed a short form of the MWEP, the MWEP-SF. A subset of items from the original measure was identified, using item response…
Criteria of Career Success among Chinese Employees: Developing a Multidimensional Scale with Qualitative and Quantitative Approaches

ERIC Educational Resources Information Center

Zhou, Wenxia; Sun, Jianmin; Guan, Yanjun; Li, Yuhui; Pan, Jingzhou

2013-01-01

The current research aimed to develop a multidimensional measure on the criteria of career success in a Chinese context. Items on the criteria of career success were obtained using a qualitative approach among 30 Chinese employees; exploratory factor analysis was conducted to select items and determine the factor structure among a new sample of…
Using a Multidimensional IRT Framework to Better Understand Differential Item Functioning (DIF): A Tale of Three DIF Detection Procedures

ERIC Educational Resources Information Center

Walker, Cindy M.; Gocer Sahin, Sakine

2017-01-01

The theoretical reason for the presence of differential item functioning (DIF) is that data are multidimensional and two groups of examinees differ in their underlying ability distribution for the secondary dimension(s). Therefore, the purpose of this study was to determine how much the secondary ability distributions must differ before DIF is…
Multilevel Multidimensional Item Response Model with a Multilevel Latent Covariate

ERIC Educational Resources Information Center

Cho, Sun-Joo; Bottge, Brian A.

2015-01-01

In a pretest-posttest cluster-randomized trial, one of the methods commonly used to detect an intervention effect involves controlling for pre-test scores and other related covariates while estimating an intervention effect at post-test. In many applications in education, the total post-test and pre-test scores that ignore measurement error in the…
Validity Study in Multidimensional Latent Space and Efficient Computerized Adaptive Testing. Final Report.

ERIC Educational Resources Information Center

Samejima, Fumiko

This paper is the final report of a multi-year project sponsored by the Office of Naval Research (ONR) in 1987 through 1990. The main objectives of the research summarized were to: investigate the non-parametric approach to the estimation of the operating characteristics of discrete item responses; revise and strengthen the package computer…
Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

ERIC Educational Resources Information Center

de la Torre, Jimmy; Patz, Richard J.

2005-01-01

This article proposes a practical method that capitalizes on the availability of information from multiple tests measuring correlated abilities given in a single test administration. By simultaneously estimating different abilities with the use of a hierarchical Bayesian framework, more precise estimates for each ability dimension are obtained.…
Bayesian Estimation of Multidimensional Item Response Models. A Comparison of Analytic and Simulation Algorithms

ERIC Educational Resources Information Center

Martin-Fernandez, Manuel; Revuelta, Javier

2017-01-01

This study compares the performance of two estimation algorithms of new usage, the Metropolis-Hastings Robbins-Monro (MHRM) and the Hamiltonian MCMC (HMC), with two consolidated algorithms in the psychometric literature, the marginal likelihood via EM algorithm (MML-EM) and the Markov chain Monte Carlo (MCMC), in the estimation of multidimensional…
Some New Dimensions of Student Attitudes Toward Basic School Subjects.

ERIC Educational Resources Information Center

Hogan, Thomas P.

To investigate whether student attitudes toward basic school subjects were multidimensional, responses of 876 students in grade 6 to a preliminary pool of 72 items from the Survey of School Attitudes (SSA) were factor analyzed. If attitudes are unidimensional, as suggested by the four scores yielded by the SSA (one each for reading/language arts,…
Development and assessment of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE-16).

PubMed

Goodwin, Shane W; Ferro, Mark A; Speechley, Kathy N

2018-03-01

The aim of this study was to develop and validate a brief version of the Quality of Life in Childhood Epilepsy Questionnaire (QOLCE). A secondary aim was to compare the results described in previously published studies using the QOLCE-55 with those obtained using the new brief version. Data come from 373 children involved in the Health-related Quality of Life in Children with Epilepsy Study, a multicenter prospective cohort study. Item response theory (IRT) methods were used to assess dimensionality and item properties and to guide the selection of items. Replication of results using the brief measure was conducted with multiple regression, multinomial regression, and latent mixture modeling techniques. IRT methods identified a bi-factor graded response model that best fits the data. Thirty-nine items were removed, resulting in a 16-item QOLCE (QOLCE-16) with an equal number of items in all 4 domains of functioning (Cognitive, Emotional, Social, and Physical). Model fit was excellent: Comparative Fit Index = 0.99; Tucker-Lewis Index = 0.99; root mean square error of approximation = 0.052 (90% confidence interval [CI] 0.041-0.064); weighted root mean square = 0.76. Results that were reported previously using the QOLCE-55 and QOLCE-76 were comparable to those generated using the QOLCE-16. The QOLCE-16 is a multidimensional measure of health-related quality of life (HRQoL) with good psychometric properties and a short-estimated completion time. It is notable that the items were calibrated using multidimensional IRT methods to create a measure that conforms to conventional definitions of HRQoL. The QOLCE-16 is an appropriate measure for both clinicians and researchers wanting to record HRQoL information in children with epilepsy. Wiley Periodicals, Inc. © 2018 International League Against Epilepsy.
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the ‘Claim Evaluation Tools’ database using Rasch modelling

PubMed Central

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-01-01

Background The Claim Evaluation Tools database contains multiple-choice items for measuring people’s ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. Objectives To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. Participants We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Results Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Conclusion Most of the items conformed well to the Rasch model’s expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. PMID:28550019
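As a rough illustration of the dichotomous Rasch model referred to in the abstract above, the following Python sketch computes the probability of a correct response from a person ability and an item difficulty. The function name and the example values are illustrative assumptions; they are not taken from the study, which used the RUMM2030 package.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """Probability of a response scored 1 under the dichotomous Rasch model.

    theta: person ability in logits (hypothetical value)
    b: item difficulty in logits (hypothetical value)
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


# When ability equals difficulty, the chance of success is one half.
p_equal = rasch_probability(0.0, 0.0)
# A harder item (difficulty above ability) lowers the chance of success.
p_hard = rasch_probability(0.0, 1.5)
```

Under this model, items described as having "a high level of difficulty" would be those whose difficulty sits above most respondents' abilities, so that success probabilities fall below one half for much of the sample.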
The Piper Fatigue Scale-12 (PFS-12): Psychometric Findings and Item Reduction in a Cohort of Breast Cancer Survivors

PubMed Central

Reeve, Bryce B.; Stover, Angela M.; Alfano, Catherine M.; Smith, Ashley Wilder; Ballard-Barbash, Rachel; Bernstein, Leslie; McTiernan, Anne; Baumgartner, Kathy B.; Piper, Barbara F.

2013-01-01

Purpose Brief, valid measures of fatigue, a prevalent and distressing cancer symptom, are needed for use in research. This study’s primary aim was to create a shortened version of the revised Piper Fatigue Scale (PFS-R) based on data from a diverse cohort of breast cancer survivors. A secondary aim was to determine whether the PFS captured multiple distinct aspects of fatigue (a multidimensional model) or a single overall fatigue factor (a unidimensional model). Methods Breast cancer survivors (n=799; stages in situ through IIIa; ages 29–86 yrs) were recruited through 3 SEER registries (New Mexico, Western Washington, and Los Angeles, CA) as part of the Health, Eating, Activity, and Lifestyle (HEAL) study. Fatigue was measured approximately 3 years post-diagnosis using the 22-item PFS-R that has 4 subscales (Behavior, Affect, Sensory, and Cognition). Confirmatory factor analysis was used to compare unidimensional and multidimensional models. Six criteria were used to make item selections to shorten the PFS-R: scale’s content validity, items’ relationship with fatigue, content redundancy, differential item functioning by race and/or education, scale reliability, and literacy demand. Results Factor analyses supported the original 4-factor structure. There was also evidence from the bi-factor model for a dominant underlying fatigue factor. Six items tested positive for differential item functioning between African-American and Caucasian survivors. Four additional items either showed poor association, local dependence, or content validity concerns. After removing these 10 items, the reliability of the PFS-12 subscales ranged from 0.87–0.89, compared to 0.90–0.94 prior to item removal. Conclusion The newly developed PFS-12 can be used to assess fatigue in African-American and Caucasian breast cancer survivors and reduces response burden without compromising reliability or validity. This is the first study to determine PFS literacy demand and to compare PFS-R responses in African-Americans and Caucasian breast cancer survivors. Further testing in diverse populations is warranted. PMID:22933027
Unidimensional Vertical Scaling in Multidimensional Space. Research Report. ETS RR-17-29

ERIC Educational Resources Information Center

Carlson, James E.

2017-01-01

In this paper, I consider a set of test items that are located in a multidimensional space, S[subscript M], but are located along a curved line in S[subscript M] and can be scaled unidimensionally. Furthermore, I am demonstrating a case in which the test items are administered across 6 levels, such as occurs in K-12 assessment across 6 grade…
What Do You Think You Are Measuring? A Mixed-Methods Procedure for Assessing the Content Validity of Test Items and Theory-Based Scaling

PubMed Central

Koller, Ingrid; Levenson, Michael R.; Glück, Judith

2017-01-01

The valid measurement of latent constructs is crucial for psychological research. Here, we present a mixed-methods procedure for improving the precision of construct definitions, determining the content validity of items, evaluating the representativeness of items for the target construct, generating test items, and analyzing items on a theoretical basis. To illustrate the mixed-methods content-scaling-structure (CSS) procedure, we analyze the Adult Self-Transcendence Inventory, a self-report measure of wisdom (ASTI, Levenson et al., 2005). A content-validity analysis of the ASTI items was used as the basis of psychometric analyses using multidimensional item response models (N = 1215). We found that the new procedure produced important suggestions concerning five subdimensions of the ASTI that were not identifiable using exploratory methods. The study shows that the application of the suggested procedure leads to a deeper understanding of latent constructs. It also demonstrates the advantages of theory-based item analysis. PMID:28270777
Multidimensional Computerized Adaptive Testing for Indonesia Junior High School Biology

ERIC Educational Resources Information Center

Kuo, Bor-Chen; Daud, Muslem; Yang, Chih-Wei

2015-01-01

This paper describes a curriculum-based multidimensional computerized adaptive test that was developed for Indonesia junior high school Biology. In adherence to the Indonesian curriculum of different Biology dimensions, 300 items were constructed and then administered to 2238 students. A multidimensional random coefficients multinomial logit model was…
[Measuring job satisfaction: development of a multidimensional scale].

PubMed

Faraci, Palmira; Valenti, Giusy

2016-01-01

Although numerous studies have been done on the topic of job satisfaction, as regards the Italian research, the construction of specific psychometric instruments is lacking. The present paper aims to develop a scale to measure job satisfaction referring to our cultural context. Participants were 222 workers (36.5% males, 63.5% females) with an average age of 38.39 years (SD = 10.91). The formulated items were selected from a large item pool on the basis of the evaluation by a group of expert judges, and the item analysis procedure. In order to establish test validity, the following instruments were also administered: Occupational Stress Indicator, Satisfaction With Life Scale, Rosenberg Self-Esteem Scale, Multidimensional Scale of Perceived Social Support, and Beck Depression Inventory. Both exploratory and confirmatory factor analyses highlighted a 6-factor structure. Those factors were responsible for 51.30% of the total variance. Reliability analyses indicated satisfying internal consistency (ranging from alpha = .73 to alpha = .86). Construct validity was supported by results obtained calculating correlations with the theoretically associated variables. Our findings suggest promising psychometric properties for the presented measure. The instrument could be used in specific programs developed to promote well-being conditions in work settings.
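The internal-consistency figures quoted above are reported simply as "alpha". Assuming this refers to Cronbach's alpha (the abstract does not say so explicitly), a minimal Python sketch of that coefficient is given below; the function name and the toy score matrix are hypothetical and unrelated to the study's data.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])  # number of items
    # Variance of each item across respondents
    item_vars = [pvariance([row[i] for row in scores]) for i in range(k)]
    # Variance of the total (summed) score across respondents
    total_var = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Toy data: three respondents, three perfectly consistent items.
toy_scores = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
alpha = cronbach_alpha(toy_scores)
```

Population variance is used for both the item and total-score terms; using sample variance for both would give the same ratio, so the choice does not affect the coefficient.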
A note on measuring apprehension about writing.

PubMed

Rechtien, J G; Dizinno, G

1997-06-01

Having revised Daly and Miller's 1975 unidimensional Writing Apprehension Test, Riffe and Stacks in 1992 proposed eight multidimensional factors derived from responses to 56 items in their Mass Communication Writing Apprehension Measure, administered to communication students to identify the various dimensions of apprehension about writing shared with business writers and specific to their major. The current authors administered the questionnaire at the beginning of an academic year to 419 freshmen from all undergraduate schools and majors at a private liberal arts university. It was hypothesized that the factors found among the homogeneous population of communication majors would not be replicated among the more heterogeneous student population. The hypothesis was partially upheld. Seven factors were identified. Two duplicated most items found by Riffe and Stacks (1992), four added items, and one was new. The results of this study suggest that, although the general population of students differs from students in mass communication, as Riffe and Stacks remarked, the groups also share similar content in their writing apprehension, that writing apprehension is multidimensional, that caution must be exercised when administering any instrument for the diagnostic and counseling purposes suggested by Riffe and Stacks, and that writing apprehension should also be investigated from the perspective of locus of control.
A Typology of Marital Quality of Enduring Marriages in Israel

ERIC Educational Resources Information Center

Cohen, Orna; Geron, Yael; Farchi, Alva

2010-01-01

This article presents a typology of enduring marriages of Israeli couples married for at least 40 years. Based on the view that marital quality is a multidimensional phenomenon, the typology is derived from a cluster analysis of responses of husbands and wives in 51 couples to the ENRICH scale items. Three types of enduring marriages were found:…
The Reliability and Construct Validity of American College Students' Responses to the WHOQOL-BREF

ERIC Educational Resources Information Center

D'Abundo, Michelle; Orsini, M. M.; Milroy, J. J.; Sidman, C. L.

2011-01-01

The World Health Organization Quality of Life (WHOQOL-100) instrument was developed to assess quality of life from a multi-dimensional perspective. A shorter 26-item version of the instrument was created called the WHOQOL-BREF, which is the focus of this study. Based on previous research, it is unclear if the WHOQOL-BREF instrument is appropriate…
Psychometric properties of the Triarchic Psychopathy Measure: An item response theory approach.

PubMed

Shou, Yiyun; Sellbom, Martin; Xu, Jing

2018-05-01

There is cumulative evidence for the cross-cultural validity of the Triarchic Psychopathy Measure (TriPM; Patrick, 2010) among non-Western populations. Recent studies using correlational and regression analyses show promising construct validity of the TriPM in Chinese samples. However, little is known about the efficiency of items in TriPM in assessing the proposed latent traits. The current study evaluated the psychometric properties of the Chinese TriPM at the item level using item response theory analyses. It also examined the measurement invariance of the TriPM between the Chinese and the U.S. student samples by applying differential item functioning analyses under the item response theory framework. The results supported the unidimensional nature of the Disinhibition and Meanness scales. Both scales had a greater level of precision in the respective underlying constructs at the positive ends. The two scales, however, had several items that were weakly associated with their respective latent traits in the Chinese student sample. Boldness, on the other hand, was found to be multidimensional, and reflected a more normally distributed range of variation. The examination of measurement bias via differential item functioning analyses revealed that a number of items of the TriPM were not equivalent across the Chinese and the U.S. Some modification and adaptation of items might be considered for improving the precision of the TriPM for Chinese participants. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

The positive mental health instrument: development and validation of a culturally relevant scale in a multi-ethnic Asian population.

PubMed

Vaingankar, Janhavi Ajit; Subramaniam, Mythily; Chong, Siow Ann; Abdin, Edimansyah; Orlando Edelen, Maria; Picco, Louisa; Lim, Yee Wei; Phua, Mei Yen; Chua, Boon Yiang; Tee, Joseph Y S; Sherbourne, Cathy

2011-10-31

Instruments to measure mental health and well-being are largely developed and often used within Western populations and this compromises their validity in other cultures. A previous qualitative study in Singapore demonstrated the relevance of spiritual and religious practices to mental health, a dimension currently not included in existing multi-dimensional measures. The objective of this study was to develop a self-administered measure that covers all key and culturally appropriate domains of mental health, which can be applied to compare levels of mental health across different age, gender and ethnic groups. We present the item reduction and validation of the Positive Mental Health (PMH) instrument in a community-based adult sample in Singapore. Surveys were conducted among adult (21-65 years) residents belonging to Chinese, Malay and Indian ethnicities. Exploratory and confirmatory factor analysis (EFA, CFA) were conducted and items were reduced using item response theory tests (IRT). The final version of the PMH instrument was tested for internal consistency and criterion validity. Items were tested for differential item functioning (DIF) to check if items functioned in the same way across all subgroups. EFA and CFA identified a structure of six first-order factors (General coping, Personal growth and autonomy, Spirituality, Interpersonal skills, Emotional support, and Global affect) under one higher-order dimension of Positive Mental Health (RMSEA=0.05, CFI=0.96, TLI=0.96). A 47-item self-administered multi-dimensional instrument with a six-point Likert response scale was constructed. The slope estimates and strength of the relation to theta for all items in each of the six PMH subscales were high (range: 1.39 to 5.69), suggesting good discrimination properties. The threshold estimates for the instrument ranged from -3.45 to 1.61, indicating that the instrument covers the entire spectrum for each of the six dimensions. The instrument demonstrated high internal consistency and had significant and expected correlations with other well-being measures. Results confirmed absence of DIF. The PMH instrument is a reliable and valid instrument that can be used to measure and compare level of mental health across different age, gender and ethnic groups in Singapore.
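To make the roles of slope and threshold estimates concrete, the sketch below computes response-category probabilities under Samejima's graded response model. This is only an assumption about the kind of polytomous IRT model involved, since the abstract does not name one. The slope 1.39 and the outer thresholds -3.45 and 1.61 are borrowed from the reported ranges purely for illustration; the intermediate thresholds and the function name are invented.

```python
import math


def grm_category_probs(theta, slope, thresholds):
    """Category probabilities under a graded response model.

    thresholds must be increasing; K thresholds define K + 1 categories.
    """
    # Cumulative probabilities P(X >= k) for k = 1..K
    cum = [1.0 / (1.0 + math.exp(-slope * (theta - b))) for b in thresholds]
    # Pad with P(X >= 0) = 1 and P(X >= K + 1) = 0, then take differences
    bounds = [1.0] + cum + [0.0]
    return [bounds[k] - bounds[k + 1] for k in range(len(bounds) - 1)]


# Hypothetical item: a six-point response scale needs five thresholds.
probs = grm_category_probs(
    theta=0.0, slope=1.39, thresholds=[-3.45, -2.0, -1.0, 0.5, 1.61]
)
```

A larger slope makes the cumulative curves steeper, so the item separates respondents near each threshold more sharply, which is the sense in which high slopes suggest good discrimination.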
An evaluation of computerized adaptive testing for general psychological distress: combining GHQ-12 and Affectometer-2 in an item bank for public mental health research.

PubMed

Stochl, Jan; Böhnke, Jan R; Pickett, Kate E; Croudace, Tim J

2016-05-20

Recent developments in psychometric modeling and technology allow pooling well-validated items from existing instruments into larger item banks and their deployment through methods of computerized adaptive testing (CAT). Use of item response theory-based bifactor methods and integrative data analysis overcomes barriers in cross-instrument comparison. This paper presents the joint calibration of an item bank for researchers keen to investigate population variations in general psychological distress (GPD). Multidimensional item response theory was used on existing health survey data from the Scottish Health Education Population Survey (n = 766) to calibrate an item bank consisting of pooled items from the short common mental disorder screen (GHQ-12) and the Affectometer-2 (a measure of "general happiness"). Computer simulation was used to evaluate usefulness and efficacy of its adaptive administration. A bifactor model capturing variation across a continuum of population distress (while controlling for artefacts due to item wording) was supported. The numbers of items for different required reliabilities in adaptive administration demonstrated promising efficacy of the proposed item bank. Psychometric modeling of the common dimension captured by more than one instrument offers the potential of adaptive testing for GPD using individually sequenced combinations of existing survey items. The potential for linking other item sets with alternative candidate measures of positive mental health is discussed since an optimal item bank may require even more items than these.
Measuring ability to assess claims about treatment effects: a latent trait analysis of items from the 'Claim Evaluation Tools' database using Rasch modelling.

PubMed

Austvoll-Dahlgren, Astrid; Guttersrud, Øystein; Nsangi, Allen; Semakula, Daniel; Oxman, Andrew D

2017-05-25

The Claim Evaluation Tools database contains multiple-choice items for measuring people's ability to apply the key concepts they need to know to be able to assess treatment claims. We assessed items from the database using Rasch analysis to develop an outcome measure to be used in two randomised trials in Uganda. Rasch analysis is a form of psychometric testing relying on Item Response Theory. It is a dynamic way of developing outcome measures that are valid and reliable. To assess the validity, reliability and responsiveness of 88 items addressing 22 key concepts using Rasch analysis. We administrated four sets of multiple-choice items in English to 1114 people in Uganda and Norway, of which 685 were children and 429 were adults (including 171 health professionals). We scored all items dichotomously. We explored summary and individual fit statistics using the RUMM2030 analysis package. We used SPSS to perform distractor analysis. Most items conformed well to the Rasch model, but some items needed revision. Overall, the four item sets had satisfactory reliability. We did not identify significant response dependence between any pairs of items and, overall, the magnitude of multidimensionality in the data was acceptable. The items had a high level of difficulty. Most of the items conformed well to the Rasch model's expectations. Following revision of some items, we concluded that most of the items were suitable for use in an outcome measure for evaluating the ability of children or adults to assess treatment claims. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Alzheimer's Disease Assessment: A Review and Illustrations Focusing on Item Response Theory Techniques.

PubMed

Balsis, Steve; Choudhury, Tabina K; Geraci, Lisa; Benge, Jared F; Patrick, Christopher J

2018-04-01

Alzheimer's disease (AD) affects neurological, cognitive, and behavioral processes. Thus, to accurately assess this disease, researchers and clinicians need to combine and incorporate data across these domains. This presents not only distinct methodological and statistical challenges but also unique opportunities for the development and advancement of psychometric techniques. In this article, we describe relatively recent research using item response theory (IRT) that has been used to make progress in assessing the disease across its various symptomatic and pathological manifestations. We focus on applications of IRT to improve scoring, test development (including cross-validation and adaptation), and linking and calibration. We conclude by describing potential future multidimensional applications of IRT techniques that may improve the precision with which AD is measured.
The Role of Content and Context in PISA Interest Scales: A study of the embedded interest items in the PISA 2006 science assessment

NASA Astrophysics Data System (ADS)

Drechsel, Barbara; Carstensen, Claus; Prenzel, Manfred

2011-01-01

This paper focuses on interest in science as one of the attitudinal aspects of scientific literacy. Large-scale data from the Programme for International Student Assessment (PISA) 2006 are analysed in order to describe student interest more precisely. So far the analyses have provided a general indicator of interest, aggregated over all contexts and contents in the science test. With its innovative approach PISA embeds interest items within the cognitive test unit and its contents and contexts. The main difference from conventional interest measures is that in most questionnaires, a relatively small number of interest items cover broad fields of contents and contexts. The science units represent a number of systematically differentiated scientific contexts and contents. The units' stimulus texts allow for concrete descriptions of relevant content aspects, applications, and contexts. In the analyses, multidimensional item response models are applied in order to disentangle student interest. The results indicate that multidimensional models fit the data. A two-dimensional model separating interest into two different knowledge of science dimensions described in the PISA science framework is further analysed with respect to gender, performance differences, and country. The findings give a comprehensive description of students' interest in science. The paper deals with methodological problems and describes requirements of the test construction for further assessments. The results are discussed with regard to their significance for science education.
Evaluation of Internal Construct Validity and Unidimensionality of the Brachial Assessment Tool, A Patient-Reported Outcome Measure for Brachial Plexus Injury.

PubMed

Hill, Bridget; Pallant, Julie; Williams, Gavin; Olver, John; Ferris, Scott; Bialocerkowski, Andrea

2016-12-01

To evaluate the internal construct validity and dimensionality of a new patient-reported outcome measure for people with traumatic brachial plexus injury (BPI) based on the International Classification of Functioning, Disability and Health definition of activity. Cross-sectional study. Outpatient clinics. Adults (age range, 18-82y) with a traumatic BPI (N=106). There were 106 people with BPI who completed a 51-item 5-response questionnaire. Responses were analyzed in 4 phases (missing responses, item correlations, exploratory factor analysis, and Rasch analysis) to evaluate the properties of fit to the Rasch model, threshold response, local dependency, dimensionality, differential item functioning, and targeting. Not applicable, as this study addresses the development of an outcome measure. Six items were deleted for missing responses, and 10 were deleted for high interitem correlations >.81. The remaining 35 items, while demonstrating fit to the Rasch model, showed evidence of local dependency and multidimensionality. Items were divided into 3 subscales: dressing and grooming (8 items), arm and hand (17 items), and no hand (6 items). All 3 subscales demonstrated fit to the model with no local dependency, minimal disordered thresholds, no unidimensionality or differential item functioning for age, time postinjury, or self-selected dominance. Subscales were combined into 3 subtests and demonstrated fit to the model, no misfit, and unidimensionality, allowing calculation of a summary score. This preliminary analysis supports the internal construct validity of the Brachial Assessment Tool, a unidimensional targeted 4-response patient-reported outcome measure designed to solely assess activity after traumatic BPI regardless of level of injury, age at recruitment, premorbid limb dominance, and time postinjury. Further examination is required to determine test-retest reliability and responsiveness. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Variance Estimation for NAEP Data Using a Resampling-Based Approach: An Application of Cognitive Diagnostic Models. Research Report. ETS RR-10-26

ERIC Educational Resources Information Center

Hsieh, Chueh-an; Xu, Xueli; von Davier, Matthias

2010-01-01

This paper presents an application of a jackknifing approach to variance estimation of ability inferences for groups of students, using a multidimensional discrete model for item response data. The data utilized to demonstrate the approach come from the National Assessment of Educational Progress (NAEP). In contrast to the operational approach…
How True Is Grit? Assessing Its Relations to High School and College Students' Personality Characteristics, Self-Regulation, Engagement, and Achievement

ERIC Educational Resources Information Center

Muenks, Katherine; Wigfield, Allan; Yang, Ji Seung; O'Neal, Colleen R.

2017-01-01

Duckworth, Peterson, Matthews, and Kelly (2007) defined "grit" as one's passion and perseverance toward long-term goals. They proposed that it consists of 2 components: consistency of interests and perseverance of effort. In a high school and college student sample, we used a multidimensional item response theory approach to examine (a)…
National Reading Tests in Denmark, Norway, and Sweden: A Comparison of Construct Definitions, Cognitive Targets, and Response Formats

ERIC Educational Resources Information Center

Tengberg, Michael

2017-01-01

Reading comprehension tests are often assumed to measure the same, or at least similar, constructs. Yet, reading is not a single but a multidimensional form of processing, which means that variations in terms of reading material and item design may emphasize one aspect of the construct at the cost of another. The educational systems in Denmark,…
Examining the Inseparability of Content Knowledge from LSP Reading Ability: An Approach Combining Bifactor-Multidimensional Item Response Theory and Structural Equation Modeling

ERIC Educational Resources Information Center

Cai, Yuyang; Kunnan, Antony John

2018-01-01

This study examined the separability of domain-general and domain-specific content knowledge from Language for Specific Purposes (LSP) reading ability. A pool of 1,491 nursing students in China participated by responding to a nursing English test and a nursing knowledge test. Primary data analysis involved four steps: (a) conducting a…
The Evidence for a Subscore Structure in a Test of English Language Competency for English Language Learners

ERIC Educational Resources Information Center

Reckase, Mark D.; Xu, Jing-Ru

2015-01-01

How to compute and report subscores for a test that was originally designed for reporting scores on a unidimensional scale has been a topic of interest in recent years. In the research reported here, we describe an application of multidimensional item response theory to identify a subscore structure in a test designed for reporting results using a…
The Vulvar Pain Assessment Questionnaire inventory.

PubMed

Dargie, Emma; Holden, Ronald R; Pukall, Caroline F

2016-12-01

Millions suffer from chronic vulvar pain (ie, vulvodynia). Vulvodynia represents the intersection of 2 difficult subjects for health care professionals to tackle: sexuality and chronic pain. Those with chronic vulvar pain are often uncomfortable seeking help, and many who do so fail to receive proper diagnoses. The current research developed a multidimensional assessment questionnaire, the Vulvar Pain Assessment Questionnaire (VPAQ) inventory, to assist in the assessment and diagnosis of those with vulvar pain. A large pool of items was created to capture pain characteristics, emotional/cognitive functioning, physical functioning, coping skills, and partner factors. The item pool was subsequently administered online to 288 participants with chronic vulvar pain. Of those, 248 participants also completed previously established questionnaires that were used to evaluate the convergent and discriminant validity of the VPAQ. Exploratory factor analyses of the item pool established 6 primary scales: Pain Severity, Emotional Response, Cognitive Response, and Interference with Life, Sexual Function, and Self-Stimulation/Penetration. A brief screening version accompanies a more detailed version. In addition, 3 supplementary scales address pain quality characteristics, coping skills, and the impact on one's romantic relationship. When relationships among VPAQ scales and previously researched scales were examined, evidence of convergent and discriminant validity was observed. These patterns of findings are consistent with the literature on the multidimensional nature of vulvodynia. The VPAQ can be used for assessment, diagnosis, treatment formulation, and treatment monitoring. In addition, the VPAQ could potentially be used to promote communication between patients and providers, and point toward helpful treatment options and/or referrals.
Motivation and Engagement in the Workplace: Examining a Multidimensional Framework and Instrument from a Measurement and Evaluation Perspective

ERIC Educational Resources Information Center

Martin, Andrew J.

2009-01-01

This investigation conducts measurement and evaluation of a multidimensional model of workplace motivation and engagement from a construct validation perspective. Two studies were conducted, one using the multi-item multidimensional Motivation and Engagement Scale-Work (N = 637 school personnel) and one using a parallel short form (N = 574 school…
Calibrating well-being, quality of life and common mental disorder items: psychometric epidemiology in public mental health research.

PubMed

Böhnke, Jan R; Croudace, Tim J

2016-08-01

The assessment of 'general health and well-being' in public mental health research stimulates debates around relative merits of questionnaire instruments and their items. Little evidence regarding alignment or differential advantages of instruments or items has appeared to date. Population-based psychometric study of items employed in public mental health narratives. Multidimensional item response theory was applied to General Health Questionnaire (GHQ-12), Warwick-Edinburgh Mental Well-being Scale (WEMWBS) and EQ-5D items (Health Survey for England, 2010-2012; n = 19 290). A bifactor model provided the best account of the data and showed that the GHQ-12 and WEMWBS items assess mainly the same construct. Only one item of the EQ-5D showed relevant overlap with this dimension (anxiety/depression). Findings were corroborated by comparisons with alternative models and cross-validation analyses. The consequences of this lack of differentiation (GHQ-12 v. WEMWBS) for mental health and well-being narratives deserve discussion to enrich debates on priorities in public mental health and its assessment. © The Royal College of Psychiatrists 2015.
Is Going Beyond Rasch Analysis Necessary to Assess the Construct Validity of a Motor Function Scale?

PubMed

Guillot, Tiffanie; Roche, Sylvain; Rippert, Pascal; Hamroun, Dalil; Iwaz, Jean; Ecochard, René; Vuillerot, Carole

2018-04-03

To examine whether a Rasch analysis is sufficient to establish the construct validity of the Motor Function Measure (MFM) and discuss whether weighting the MFM item scores would improve the MFM construct validity. Observational cross-sectional multicenter study. Twenty-three physical medicine departments, neurology departments, or reference centers for neuromuscular diseases. Patients (N=911) aged 6 to 60 years with Charcot-Marie-Tooth disease (CMT), facioscapulohumeral dystrophy (FSHD), or myotonic dystrophy type 1 (DM1). None. Comparison of the goodness-of-fit of the confirmatory factor analysis (CFA) model vs that of a modified multidimensional Rasch model on MFM item scores in each considered disease. The CFA model showed good fit to the data and significantly better goodness of fit than the modified multidimensional Rasch model regardless of the disease (P<.001). Statistically significant differences in item standardized factor loadings were found between DM1, CMT, and FSHD in only 6 of 32 items (items 6, 27, 2, 7, 9 and 17). For multidimensional scales designed to measure patient abilities in various diseases, a Rasch analysis might not be the most convenient, whereas a CFA is able to establish the scale construct validity and provide weights to adapt the item scores to a specific disease. Copyright © 2018 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Beyond factor analysis: Multidimensionality and the Parkinson's Disease Sleep Scale-Revised.

PubMed

Pushpanathan, Maria E; Loftus, Andrea M; Gasson, Natalie; Thomas, Meghan G; Timms, Caitlin F; Olaithe, Michelle; Bucks, Romola S

2018-01-01

Many studies have sought to describe the relationship between sleep disturbance and cognition in Parkinson's disease (PD). The Parkinson's Disease Sleep Scale (PDSS) and its variants (the Parkinson's Disease Sleep Scale-Revised; PDSS-R, and the Parkinson's Disease Sleep Scale-2; PDSS-2) quantify a range of symptoms impacting sleep in only 15 items. However, data from these scales may be problematic as included items have considerable conceptual breadth, and there may be overlap in the constructs assessed. Multidimensional measurement models, which account for the tendency of items to measure multiple constructs, may model variance more accurately than traditional confirmatory factor analysis. In the present study, we tested the hypothesis that a multidimensional model (a bifactor model) is more appropriate than traditional factor analysis for data generated by these types of scales, using data collected with the PDSS-R as an exemplar. A total of 166 participants diagnosed with idiopathic PD participated in this study. Using PDSS-R data, we compared three models: a unidimensional model; a 3-factor model consisting of sub-factors measuring insomnia, motor symptoms and obstructive sleep apnoea (OSA) and REM sleep behaviour disorder (RBD) symptoms; and, a confirmatory bifactor model with both a general factor and the same three sub-factors. Only the confirmatory bifactor model achieved satisfactory model fit, suggesting that PDSS-R data are multidimensional. There were differential associations between factor scores and patient characteristics, suggesting that some PDSS-R items, but not others, are influenced by mood and personality in addition to sleep symptoms. Multidimensional measurement models may also be a helpful tool in the PDSS and the PDSS-2 scales and may improve the sensitivity of these instruments.
The PedsQL multidimensional fatigue scale in pediatric obesity: feasibility, reliability and validity.

PubMed

Varni, James W; Limbers, Christine A; Bryant, William P; Wilson, Don P

2010-01-01

The PedsQL (Pediatric Quality of Life Inventory) is a modular instrument designed to measure health-related quality of life (HRQOL) and disease-specific symptoms in children and adolescents. The PedsQL Multidimensional Fatigue Scale was designed as a child self-report and parent proxy-report generic symptom-specific instrument to measure fatigue in pediatric patients. The objective of the present study was to determine the feasibility, reliability, and validity of the PedsQL Multidimensional Fatigue Scale in pediatric obesity. The 18-item PedsQL Multidimensional Fatigue Scale (General Fatigue, Sleep/Rest Fatigue, and Cognitive Fatigue domains) and the PedsQL 4.0 Generic Core Scales were completed by 41 pediatric patients with a physician-diagnosis of obesity and 43 parents from a hospital-based Pediatric Endocrinology Clinic. The PedsQL Multidimensional Fatigue Scale evidenced minimal missing responses (1.6%, child report; 0.5%, parent report), achieved excellent reliability for the Total Fatigue Scale Score (alpha = 0.90 child report, 0.90 parent report), distinguished between pediatric patients with obesity and healthy children, and was significantly correlated with the PedsQL 4.0 Generic Core Scales supporting construct validity. Pediatric patients with obesity experienced fatigue comparable with pediatric patients receiving cancer treatment, demonstrating the relative severity of their fatigue symptoms. The results demonstrate the measurement properties of the PedsQL Multidimensional Fatigue Scale in pediatric obesity. The findings suggest that the PedsQL Multidimensional Fatigue Scale may be utilized in the standardized evaluation of fatigue in pediatric patients with obesity.
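The reliability figures quoted in this record (alpha = 0.90) are Cronbach's alpha coefficients. As a rough illustration of what that statistic computes, here is a minimal sketch; the function name and the toy data are assumptions made for this example and have nothing to do with the PedsQL data set itself.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    Sample variances (n - 1 denominator) are used throughout.
    """
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Variance of each item across respondents.
    item_vars = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Variance of the summed scale score across respondents.
    total_var = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical toy data: 4 respondents answering 2 items.
toy = [[1, 2], [2, 1], [3, 3], [4, 5]]
alpha = cronbach_alpha(toy)
```

Alpha approaches 1 as the items covary more strongly relative to their individual variances, which is why it is read as an internal-consistency index for a total score.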
Introduction to bifactor polytomous item response theory analysis.

PubMed

Toland, Michael D; Sulis, Isabella; Giambona, Francesca; Porcu, Mariano; Campbell, Jonathan M

2017-02-01

A bifactor item response theory model can be used to aid in the interpretation of the dimensionality of a multifaceted questionnaire that assumes continuous latent variables underlying the propensity to respond to items. This model can be used to describe the locations of people on a general continuous latent variable as well as on continuous orthogonal specific traits that characterize responses to groups of items. The bifactor graded response (bifac-GR) model is presented in contrast to a correlated traits (or multidimensional GR model) and unidimensional GR model. Bifac-GR model specification, assumptions, estimation, and interpretation are demonstrated with a reanalysis of data (Campbell, 2008) on the Shared Activities Questionnaire. We also show the importance of marginalizing the slopes for interpretation purposes and we extend the concept to the interpretation of the information function. To go along with the illustrative example analyses, we have made available supplementary files that include command file (syntax) examples and outputs from flexMIRT, IRTPRO, R, Mplus, and STATA. Supplementary data to this article can be found online at http://dx.doi.org/10.1016/j.jsp.2016.11.001. Data needed to reproduce analyses in this article are available as supplemental materials (online only) in the Appendix of this article. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
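For readers unfamiliar with the bifac-GR model named in this record, one common way to write it is the slope-intercept form below. The notation is an assumption of this sketch (it is not taken from the article): item j has K_j ordered categories scored 0, ..., K_j - 1, loads on the general trait with slope a_{jG}, and loads on exactly one specific trait s(j) with slope a_{js}.

```latex
% Bifactor graded response model, slope-intercept form (notation assumed here).
% Cumulative probability that person i responds in category k or higher on item j:
P(X_{ij} \ge k \mid \boldsymbol{\theta}_i)
  = \frac{1}{1 + \exp\!\left[-\left(a_{jG}\,\theta_{iG}
      + a_{js}\,\theta_{i,s(j)} + c_{jk}\right)\right]},
  \qquad k = 1, \dots, K_j - 1,

% with the conventions P(X_{ij} \ge 0) = 1 and P(X_{ij} \ge K_j) = 0.
% Category probabilities are differences of adjacent cumulative probabilities:
P(X_{ij} = k \mid \boldsymbol{\theta}_i)
  = P(X_{ij} \ge k \mid \boldsymbol{\theta}_i)
  - P(X_{ij} \ge k + 1 \mid \boldsymbol{\theta}_i),
  \qquad k = 0, \dots, K_j - 1.

% Bifactor restriction: the general and specific traits are mutually orthogonal,
\operatorname{Cov}(\theta_{G}, \theta_{s}) = 0, \qquad
\operatorname{Cov}(\theta_{s}, \theta_{s'}) = 0 \quad (s \ne s').
```

Setting every a_{js} to zero recovers a unidimensional GR model, which is one way to see the contrast the abstract draws.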
Rewards of bridging the divide between measurement and clinical theory: demonstration of a bifactor model for the Brief Symptom Inventory.

PubMed

Thomas, Michael L

2012-03-01

There is growing evidence that psychiatric disorders maintain hierarchical associations where general and domain-specific factors play prominent roles (see D. Watson, 2005). Standard, unidimensional measurement models can fail to capture the meaningful nuances of such complex latent variable structures. The present study examined the ability of the multidimensional item response theory bifactor model (see R. D. Gibbons & D. R. Hedeker, 1992) to improve construct validity by serving as a bridge between measurement and clinical theories. Archival data consisting of 688 outpatients' psychiatric diagnoses and item-level responses to the Brief Symptom Inventory (BSI; L. R. Derogatis, 1993) were extracted from files at a university mental health clinic. The bifactor model demonstrated superior fit for the internal structure of the BSI and improved overall diagnostic accuracy in the sample (73%) compared with unidimensional (61%) and oblique simple structure (65%) models. Consistent with clinical theory, multiple sources of item variance were drawn from individual test items. Test developers and clinical researchers are encouraged to consider model-based measurement in the assessment of psychiatric distress.
Ability evaluation by binary tests: Problems, challenges & recent advances

NASA Astrophysics Data System (ADS)

Bashkansky, E.; Turetsky, V.

2016-11-01

Binary tests designed to measure abilities of objects under test (OUTs) are widely used in different fields of measurement theory and practice. The number of test items in such tests is usually very limited. The response to each test item provides only one bit of information per OUT. The problem of correct ability assessment is even more complicated when the levels of difficulty of the test items are unknown beforehand. This fact makes the search for effective ways of planning and processing the results of such tests highly relevant. In recent years, there has been some progress in this direction, generated by both the development of computational tools and the emergence of new ideas. The latter are associated with the use of so-called “scale invariant item response models”. Together with the maximum likelihood estimation (MLE) approach, they helped to solve some problems of engineering and proficiency testing. However, several issues related to the assessment of uncertainties, replications scheduling, the use of placebo, as well as evaluation of multidimensional abilities still present a challenge for researchers. The authors attempt to outline the ways to solve the above problems.
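To make the MLE step concrete, the sketch below estimates one ability from binary responses by Newton-Raphson. It assumes a Rasch (one-parameter logistic) model with item difficulties that are already known; the abstract specifies neither, and in fact stresses the harder case of unknown difficulties, so this is an illustration of the general idea and not the authors' method. The function name is hypothetical.

```python
import math


def rasch_mle(responses, difficulties, max_iter=50, tol=1e-10):
    """Maximum likelihood ability estimate under an assumed Rasch model.

    responses    : list of 0/1 outcomes, one per item
    difficulties : list of known item difficulties (same length)
    """
    raw_score = sum(responses)
    if raw_score == 0 or raw_score == len(responses):
        # The likelihood has no finite maximum for all-0 or all-1 patterns.
        raise ValueError("MLE is not finite for a perfect or zero score")
    theta = 0.0
    for _ in range(max_iter):
        probs = [1.0 / (1.0 + math.exp(-(theta - b))) for b in difficulties]
        gradient = raw_score - sum(probs)              # d logL / d theta
        information = sum(p * (1.0 - p) for p in probs)  # -d2 logL / d theta2
        step = gradient / information
        theta += step
        if abs(step) < tol:
            break
    return theta


# Hypothetical example: 4 items of equal difficulty, 3 passed.
theta_hat = rasch_mle([1, 1, 1, 0], [0.0, 0.0, 0.0, 0.0])
```

With only a handful of items the information term is small, so the standard error 1 / sqrt(information) is large; this is the "very limited number of items" problem the abstract points to.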

Development of a Multidimensional Functional Health Scale for Older Adults in China.

PubMed

Mao, Fanzhen; Han, Yaofeng; Chen, Junze; Chen, Wei; Yuan, Manqiong; Alicia Hong, Y; Fang, Ya

2016-05-01

A first step toward successful aging is assessing the functional wellbeing of older adults. This study reports the development of a culturally appropriate brief scale (the Multidimensional Functional Health Scale for Chinese Elderly, MFHSCE) to assess the functional health of Chinese elderly. Using a systematic literature review, the Delphi method, cultural adaptation, synthetic statistical item selection, Cronbach's alpha, and confirmatory factor analysis, we developed an item pool, carried out two rounds of item selection, and performed a psychometric evaluation. Synthetic statistical item selection and psychometric evaluation were conducted among 539 and 2032 older adults, respectively. The MFHSCE consists of 30 items, covering activities of daily living, social relationships, physical health, mental health, cognitive function, and economic resources. The Cronbach's alpha was 0.92, and the comparative fit index was 0.917. The MFHSCE has good internal consistency and construct validity; it is also concise and easy to use in general practice, especially in communities in China.
A Procedure To Detect Test Bias Present Simultaneously in Several Items.

ERIC Educational Resources Information Center

Shealy, Robin; Stout, William

A statistical procedure is presented that is designed to test for unidirectional test bias existing simultaneously in several items of an ability test, based on the assumption that test bias is incipient within the two groups' ability differences. The proposed procedure--Simultaneous Item Bias (SIB)--is based on a multidimensional item response…
Using existing questionnaires in latent class analysis: should we use summary scores or single items as input? A methodological study using a cohort of patients with low back pain.

PubMed

Nielsen, Anne Molgaard; Vach, Werner; Kent, Peter; Hestbaek, Lise; Kongsted, Alice

2016-01-01

Latent class analysis (LCA) is increasingly being used in health research, but optimal approaches to handling complex clinical data are unclear. One issue is that commonly used questionnaires are multidimensional, but expressed as summary scores. Using the example of low back pain (LBP), the aim of this study was to explore and descriptively compare the application of LCA when using questionnaire summary scores and when using single items for subgrouping patients based on multidimensional data. Baseline data from 928 LBP patients in an observational study were classified into four health domains (psychology, pain, activity, and participation) using the World Health Organization's International Classification of Functioning, Disability, and Health framework. LCA was performed within each health domain using the strategies of summary-score and single-item analyses. The resulting subgroups were descriptively compared using statistical measures and clinical interpretability. For each health domain, the preferred model solution ranged from five to seven subgroups for the summary-score strategy and seven to eight subgroups for the single-item strategy. There was considerable overlap between the results of the two strategies, indicating that they were reflecting the same underlying data structure. However, in three of the four health domains, the single-item strategy resulted in a more nuanced description, in terms of more subgroups and more distinct clinical characteristics. In these data, application of both the summary-score strategy and the single-item strategy in the LCA subgrouping resulted in clinically interpretable subgroups, but the single-item strategy generally revealed more distinguishing characteristics. These results 1) warrant further analyses in other data sets to determine the consistency of this finding, and 2) warrant investigation in longitudinal data to test whether the finer detail provided by the single-item strategy results in improved prediction of outcomes and treatment response.
A Conditional Exposure Control Method for Multidimensional Adaptive Testing

ERIC Educational Resources Information Center

Finkelman, Matthew; Nering, Michael L.; Roussos, Louis A.

2009-01-01

In computerized adaptive testing (CAT), ensuring the security of test items is a crucial practical consideration. A common approach to reducing item theft is to define maximum item exposure rates, i.e., to limit the proportion of examinees to whom a given item can be administered. Numerous methods for controlling exposure rates have been proposed…
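The maximum-exposure-rate idea described here can be illustrated with a few lines of bookkeeping. The sketch below only computes observed exposure rates and filters out items that have reached a cap; it is not the conditional method the record's title refers to (the abstract is truncated before describing it), and the function names and the cap value are assumptions made for this example.

```python
from collections import Counter


def exposure_rates(administrations, n_items):
    """Proportion of examinees who saw each item.

    administrations : one list of administered item indices per examinee
    n_items         : size of the item pool (items are indexed 0..n_items-1)
    """
    n_examinees = len(administrations)
    # set() guards against counting an item twice for the same examinee.
    counts = Counter(item for test in administrations for item in set(test))
    return [counts[i] / n_examinees for i in range(n_items)]


def eligible_items(rates, r_max):
    """Items whose observed exposure rate is still below the cap r_max."""
    return [i for i, rate in enumerate(rates) if rate < r_max]


# Hypothetical record of four short adaptive tests drawn from a 3-item pool.
history = [[0, 1], [0, 2], [0, 1], [1, 2]]
rates = exposure_rates(history, n_items=3)
allowed = eligible_items(rates, r_max=0.6)
```

An item selection rule would then choose the most informative item from the eligible set only, trading a little measurement precision for item security.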
An Alternative Approach for the Analyses and Interpretation of Attachment Sort Items

ERIC Educational Resources Information Center

Kirkland, John; Bimler, David; Drawneek, Andrew; McKim, Margaret; Scholmerich, Axel

2004-01-01

Attachment Q-Sort (AQS) is a tool for quantifying observations about toddler/caregiver relationships. Previous studies have applied factor analysis to the full 90 AQS item set to explore the structure underlying them. Here we explore that structure by applying multidimensional scaling (MDS) to judgements of inter-item similarity. AQS items are…
Developing Multidimensional Likert Scales Using Item Factor Analysis: The Case of Four-Point Items

ERIC Educational Resources Information Center

Asún, Rodrigo A.; Rdz-Navarro, Karina; Alvarado, Jesús M.

2016-01-01

This study compares the performance of two approaches in analysing four-point Likert rating scales with a factorial model: the classical factor analysis (FA) and the item factor analysis (IFA). For FA, maximum likelihood and weighted least squares estimations using Pearson correlation matrices among items are compared. For IFA, diagonally weighted…
Development of the Computer-Adaptive Version of the Late-Life Function and Disability Instrument

PubMed Central

Tian, Feng; Kopits, Ilona M.; Moed, Richard; Pardasaney, Poonam K.; Jette, Alan M.

2012-01-01

Background. Having psychometrically strong disability measures that minimize response burden is important in the assessment of older adults. Methods. Using the original 48 items from the Late-Life Function and Disability Instrument and newly developed items, a 158-item Activity Limitation and a 62-item Participation Restriction item pool were developed. The item pools were administered to a convenience sample of 520 community-dwelling adults 60 years or older. Confirmatory factor analysis and item response theory were employed to identify content structure, calibrate items, and build the computer-adaptive tests (CATs). We evaluated real-data simulations of 10-item CAT subscales. We collected data from 102 older adults to validate the 10-item CATs against the Veteran’s Short Form-36 and assessed test–retest reliability in a subsample of 57 subjects. Results. Confirmatory factor analysis revealed a bifactor structure, and multi-dimensional item response theory was used to calibrate an overall Activity Limitation Scale (141 items) and an overall Participation Restriction Scale (55 items). Fit statistics were acceptable (Activity Limitation: comparative fit index = 0.95, Tucker Lewis Index = 0.95, root mean square error approximation = 0.03; Participation Restriction: comparative fit index = 0.95, Tucker Lewis Index = 0.95, root mean square error approximation = 0.05). Correlations of 10-item CATs with full item banks were substantial (Activity Limitation: r = .90; Participation Restriction: r = .95). Test–retest reliability estimates were high (Activity Limitation: r = .85; Participation Restriction: r = .80). Strength and pattern of correlations with Veteran’s Short Form-36 subscales were as hypothesized. Each CAT, on average, took 3.56 minutes to administer. Conclusions. The Late-Life Function and Disability Instrument CATs demonstrated strong reliability, validity, accuracy, and precision. The Late-Life Function and Disability Instrument CAT can achieve psychometrically sound disability assessment in older persons while reducing respondent burden. Further research is needed to assess their ability to measure change in older adults. PMID:22546960
Measuring the Perception of the Teachers' Autonomy-Supportive Behavior in Physical Education: Development and Initial Validation of a Multi-Dimensional Instrument

ERIC Educational Resources Information Center

Tilga, Henri; Hein, Vello; Koka, Andre

2017-01-01

This research aimed to develop and validate an instrument to assess the students' perceptions of the teachers' autonomy-supportive behavior by the multi-dimensional scale (Multi-Dimensional Perceived Autonomy Support Scale for Physical Education). The participants were 1,476 students aged 12- to 15-years-old. In Study 1, a pool of 37 items was…
A Comparison Study of Item Exposure Control Strategies in MCAT

ERIC Educational Resources Information Center

Mao, Xiuzhen; Ozdemir, Burhanettin; Wang, Yating; Xiu, Tao

2016-01-01

Four item selection indexes with and without exposure control are evaluated and compared in multidimensional computerized adaptive testing (CAT). The four item selection indices are D-optimality, Posterior expectation Kullback-Leibler information (KLP), the minimized error variance of the linear combination score with equal weight (V1), and the…
Three-dimensional structural representation of the sleep-wake adaptability.

PubMed

Putilov, Arcady A

2016-01-01

Various characteristics of the sleep-wake cycle can determine the success or failure of individual adjustment to certain temporal conditions of today's society. However, it remains to be explored how many such characteristics can be self-assessed and how they are inter-related. The aim of the present report was to apply a three-dimensional structural representation of the sleep-wake adaptability in the form of "rugby cake" (scalene or triaxial ellipsoid) to explain the results of analysis of the pattern of correlations of the responses to the initial 320-item list of a new inventory with scores on the six scales designed for multidimensional self-assessment of the sleep-wake adaptability (Morning and Evening Lateness, Anytime and Nighttime Sleepability, and Anytime and Daytime Wakeability). The results obtained for a sample consisting of 149 respondents were confirmed by the results of similar analysis of earlier collected responses of 139 respondents to the same list of 320 items and responses of 1213 respondents to the 72 items of one of the earlier established questionnaire tools. Empirical evidence was provided in support of the model-driven prediction of the possibility to identify items linked to as many as 36 narrow (6 core and 30 mixed) adaptabilities of the sleep-wake cycle. The results enabled the selection of 168 items for self-assessment of all these adaptabilities predicted by the rugby cake model.
Development and initial evaluation of the SCI-FI/AT

PubMed Central

Jette, Alan M.; Slavin, Mary D.; Ni, Pengsheng; Kisala, Pamela A.; Tulsky, David S.; Heinemann, Allen W.; Charlifue, Susie; Tate, Denise G.; Fyffe, Denise; Morse, Leslie; Marino, Ralph; Smith, Ian; Williams, Steve

2015-01-01

Objectives: To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain. Design: Cross-sectional survey followed by computerized adaptive test (CAT) simulations. Setting: Inpatient and community settings. Participants: A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury. Interventions: None. Main outcome measure: SCI-FI/AT. Results: Confirmatory factor analysis (CFA) and item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items), Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT items in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks. Conclusion: With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI. PMID:26010975
Development and initial evaluation of the SCI-FI/AT.

PubMed

Jette, Alan M; Slavin, Mary D; Ni, Pengsheng; Kisala, Pamela A; Tulsky, David S; Heinemann, Allen W; Charlifue, Susie; Tate, Denise G; Fyffe, Denise; Morse, Leslie; Marino, Ralph; Smith, Ian; Williams, Steve

2015-05-01

To describe the domain structure and calibration of the Spinal Cord Injury Functional Index for samples using Assistive Technology (SCI-FI/AT) and report the initial psychometric properties of each domain. Cross-sectional survey followed by computerized adaptive test (CAT) simulations. Inpatient and community settings. A sample of 460 adults with traumatic spinal cord injury (SCI) stratified by level of injury, completeness of injury, and time since injury. None. SCI-FI/AT. Confirmatory factor analysis (CFA) and item response theory (IRT) analyses identified 4 unidimensional SCI-FI/AT domains: Basic Mobility (41 items), Self-care (71 items), Fine Motor Function (35 items), and Ambulation (29 items). High correlations of full item banks with 10-item simulated CATs indicated high accuracy of each CAT in estimating a person's function, and there was high measurement reliability for the simulated CAT scales compared with the full item bank. SCI-FI/AT items in the domains of Self-care, Fine Motor Function, and Ambulation were less difficult than the same items in the original SCI-FI item banks. With the development of the SCI-FI/AT, clinicians and investigators have available multidimensional assessment scales that evaluate function for users of AT to complement the scales available in the original SCI-FI.
The PedsQL Multidimensional Fatigue Scale in type 1 diabetes: feasibility, reliability, and validity.

PubMed

Varni, James W; Limbers, Christine A; Bryant, William P; Wilson, Don P

2009-08-01

The Pediatric Quality of Life Inventory (PedsQL, Mapi Research Trust, Lyon, France; www.pedsql.org) is a modular instrument designed to measure health-related quality of life and disease-specific symptoms in children and adolescents. The PedsQL Multidimensional Fatigue Scale was designed as a child self-report and parent proxy-report generic symptom-specific instrument to measure fatigue in pediatric patients. The objective of the present study was to determine the feasibility, reliability, and validity of the PedsQL Multidimensional Fatigue Scale in type 1 diabetes. The 18-item PedsQL Multidimensional Fatigue Scale (General Fatigue, Sleep/Rest Fatigue, and Cognitive Fatigue domains) and the PedsQL 4.0 Generic Core Scales were administered to 83 pediatric patients with type 1 diabetes and 84 parents. The PedsQL Multidimensional Fatigue Scale evidenced minimal missing responses (0.3% child report and 0.3% parent report), achieved excellent reliability for the Total Fatigue Scale score (alpha= 0.92 child report, 0.94 parent report), distinguished between pediatric patients with diabetes and healthy children, and was significantly correlated with the PedsQL 4.0 Generic Core Scales supporting construct validity. Pediatric patients with diabetes experienced fatigue that was comparable to pediatric patients with cancer on treatment, demonstrating the relative severity of their fatigue symptoms. The results demonstrate the measurement properties of the PedsQL Multidimensional Fatigue Scale in type 1 diabetes. The findings suggest that the PedsQL Multidimensional Fatigue Scale may be utilized in the standardized evaluation of fatigue in pediatric patients with type 1 diabetes.
The PedsQL Multidimensional Fatigue Scale in young adults: feasibility, reliability and validity in a University student population.

PubMed

Varni, James W; Limbers, Christine A

2008-02-01

The PedsQL (Pediatric Quality of Life Inventory) is a modular instrument designed to measure health-related quality of life (HRQOL) and disease-specific symptoms in children and adolescents ages 2-18. The PedsQL Multidimensional Fatigue Scale was designed as a generic symptom-specific instrument to measure fatigue in pediatric patients ages 2-18. Since a sizeable number of pediatric patients prefer to remain with their pediatric providers after age 18, the objective of the present study was to determine the feasibility, reliability, and validity of the PedsQL Multidimensional Fatigue Scale in young adults. The 18-item PedsQL Multidimensional Fatigue Scale (General Fatigue, Sleep/Rest Fatigue, and Cognitive Fatigue domains), the PedsQL 4.0 Generic Core Scales Young Adult Version, and the SF-8 Health Survey were completed by 423 university students ages 18-25. The PedsQL Multidimensional Fatigue Scale evidenced minimal missing responses, achieved excellent reliability for the Total Scale Score (alpha = 0.90), distinguished between healthy young adults and young adults with chronic health conditions, was significantly correlated with the relevant PedsQL 4.0 Generic Core Scales and the SF-8 standardized scores, and demonstrated a factor-derived structure largely consistent with the a priori conceptual model. The results demonstrate the measurement properties of the PedsQL Multidimensional Fatigue Scale in a convenience sample of young adult university students. The findings suggest that the PedsQL Multidimensional Fatigue Scale may be utilized in the evaluation of fatigue for a broad age range.
Multidimensional daily diary of fatigue-fibromyalgia-17 items (MDF-fibro-17): part 2 psychometric evaluation in fibromyalgia patients.

PubMed

Li, Y; Morris, S; Cole, J; Dube', S; Smith, J A M; Burbridge, C; Symonds, T; Hudgens, S; Wang, W

2017-05-18

The Multidimensional Daily Diary of Fatigue-Fibromyalgia-17 instrument (MDF-Fibro-17) has been developed for use in fibromyalgia (FM) clinical studies and includes 5 domains: Global Fatigue Experience, Cognitive Fatigue, Physical Fatigue, Motivation, and Impact on Function. Psychometric properties of the MDF-Fibro-17 needed to demonstrate the appropriateness of using this instrument in clinical studies are presented. Psychometric analyses were conducted to evaluate the factor structure, reliability, validity, and responsiveness of the MDF-Fibro-17 using data from a Phase 2 clinical study of FM patients (N = 381). Confirmatory factor analyses (CFA) were performed to ensure understanding of the multidimensional domain structure, and a secondary factor analysis of the domains examined the appropriateness of calculating a total score in addition to domain scores. Longitudinal psychometric analyses (test-retest reliability and responder analysis) were also conducted on the data from Baseline to Week 6. The CFA supported the 17-item, 5 domain structure of this instrument as the best fit of the data: comparative fit index (CFI) and non-normed fit index (NNFI) were 0.997 and 0.992 respectively, standardized root mean square residual (SRMR) was 0.010 and the root mean square error of approximation (RMSEA) was 0.06. In addition, total score (CFI and NNFI both 0.95) met required standards. For the total and 5 domain scores, reliability and validity data were acceptable: test-retest and internal consistency were above 0.9; correlations were as expected with the Global Fatigue Index (GFI) (0.62-0.75), Fibromyalgia Impact Questionnaire (FIQ) Total (0.59-0.71), and 36-Item Short Form Health Survey (SF-36) vitality (VT) (0.43-0.53); and discrimination was shown using quintile scores for the GFI, FIQ Total, and Pain Numeric Rating Scale (NRS) quartiles. In addition, sensitivity to change was demonstrated with an overall mean responder score of -2.59 using anchor-based methods. The MDF-Fibro-17 reliably measures 5 domains of FM-related fatigue and psychometric evaluation confirms that this measure meets or exceeds each of the predefined acceptable thresholds for evidence of reliability, validity, and responsiveness to changes in clinical status. This suggests that the MDF-Fibro-17 is an appropriate and responsive measure of FM-related fatigue in clinical studies.
Development of the Competitive Work Environment Scale: A Multidimensional Climate Construct

ERIC Educational Resources Information Center

Fletcher, Thomas D.; Nusbaum, David N.

2010-01-01

Recent research suggests that competitive work environments may influence individuals' attitudes, behaviors, stress, and performance. Unfortunately, adequate measures of competitive environments are lacking. This article traces the development of a new multidimensional competitive work environment scale. An initial 59-item pool covering five…
Dimensionality and DIF in a Licensure Examination.

ERIC Educational Resources Information Center

Sykes, Robert C.; And Others

The sources of multidimensionality found in several different forms of a licensure examination were studied. The relationship between one source of multidimensionality, differential item functioning (DIF) (or factors producing DIF), and content characteristics was explored in an attempt to isolate aspects of training or curriculum that could…
Similarity from multi-dimensional scaling: solving the accuracy and diversity dilemma in information filtering.

PubMed

Zeng, Wei; Zeng, An; Liu, Hao; Shang, Ming-Sheng; Zhang, Yi-Cheng

2014-01-01

Recommender systems are designed to help individual users navigate the rapidly growing amount of information. One of the most successful recommendation techniques is collaborative filtering, which has been extensively investigated and has already found wide application in e-commerce. One of the challenges in this algorithm is how to accurately quantify the similarities of user pairs and item pairs. In this paper, we employ the multidimensional scaling (MDS) method to measure the similarities between nodes in user-item bipartite networks. The MDS method can extract the essential similarity information from the networks by smoothing out noise, which provides a graphical display of the structure of the networks. With the similarity measured from MDS, we find that the item-based collaborative filtering algorithm can outperform the diffusion-based recommendation algorithms. Moreover, we show that this method tends to recommend unpopular items and increase the global diversification of the networks in the long term.
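The abstract above does not spell out how the MDS-based similarity is computed. As a minimal sketch only, assuming classical (Torgerson) MDS on Euclidean distances between item columns and a hypothetical 1/(1 + d) conversion from embedded distance to similarity, an item-based recommendation step could look like the following. The function names, the distance choice, and the toy matrix are illustrative assumptions, not the authors' method.

```python
import numpy as np

def classical_mds(dist, n_dims=2):
    """Embed points from a symmetric distance matrix via classical (Torgerson) MDS."""
    n = dist.shape[0]
    centering = np.eye(n) - np.ones((n, n)) / n
    # Double-centre the squared distances to obtain a Gram matrix.
    gram = -0.5 * centering @ (dist ** 2) @ centering
    eigvals, eigvecs = np.linalg.eigh(gram)
    order = np.argsort(eigvals)[::-1][:n_dims]
    # Clip small negative eigenvalues that arise from rounding.
    top_vals = np.clip(eigvals[order], 0.0, None)
    return eigvecs[:, order] * np.sqrt(top_vals)

def item_similarity_from_mds(ratings, n_dims=2):
    """Item-item similarity from distances in a low-dimensional MDS embedding.

    Assumption: items are compared by Euclidean distance between their
    user-collection columns, and similarity is 1 / (1 + embedded distance).
    """
    items = ratings.T.astype(float)
    diff = items[:, None, :] - items[None, :, :]
    dist = np.sqrt((diff ** 2).sum(axis=2))
    coords = classical_mds(dist, n_dims)
    emb = coords[:, None, :] - coords[None, :, :]
    emb_dist = np.sqrt((emb ** 2).sum(axis=2))
    return 1.0 / (1.0 + emb_dist)

def recommend(ratings, sim, user, top_k=2):
    """Item-based collaborative filtering: score items by similarity to collected ones."""
    scores = sim @ ratings[user].astype(float)
    scores[ratings[user] > 0] = -np.inf  # never re-recommend collected items
    return [int(i) for i in np.argsort(scores)[::-1][:top_k]]

# Toy user-item matrix (rows: users, columns: items); purely illustrative.
ratings = np.array([[1, 1, 0, 0],
                    [1, 1, 1, 0],
                    [0, 0, 1, 1],
                    [0, 1, 1, 1]])
sim = item_similarity_from_mds(ratings)
recs = recommend(ratings, sim, user=0, top_k=2)
```

Truncating the embedding to a few dimensions is what discards the smaller components of the distance structure, which is one plausible reading of "smoothing out noise" in the abstract.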
Using Multidimensional Scaling To Assess the Dimensionality of Dichotomous Item Data.

ERIC Educational Resources Information Center

Meara, Kevin; Robin, Frederic; Sireci, Stephen G.

2000-01-01

Investigated the usefulness of multidimensional scaling (MDS) for assessing the dimensionality of dichotomous test data. Focused on two MDS proximity measures, one based on the PC statistic (T. Chen and M. Davidson, 1996) and the other on interitem Euclidean distances. Simulation results show that both MDS procedures correctly identify…
Evaluating the Dimensionality of Pornography.

PubMed

Busby, Dean M; Chiu, Hsin-Yao; Olsen, Joseph A; Willoughby, Brian J

2017-08-01

Pornography may be a construct with a single trait or one with many traits. Research in the past was inconsistent in this regard with most researchers assuming that pornography was unidimensional (with one single trait of pornography). However, the considerable amounts of residual variation found in these studies beyond that explained by the single trait hints at what might be a multidimensional construct (with multiple traits such as sensitization and differentiation). Consequently, in this study, we intended to address the question of whether pornography consisted of a single trait or if it was multidimensional. Using MTurk, 2173 participants from the United States and the Commonwealth of Nations (in which pornography is not strictly illegal) were recruited and asked to rate how pornographic they thought a list of different depictions were. The data were analyzed utilizing the cross-validation procedure in which two subsamples were created from the main sample and one was used to establish the model building and the other to validate the model. Various models, including first-order and higher-order exploratory and confirmatory factor models, were tested. Results indicated that a bi-factor (multidimensional) model generated the best model fit, and that it was most appropriate to consider pornography multidimensional. The final model contained two dimensions ("Sensitization" and "Differentiation"). While sensitization revealed the participants' general tendency to rate all items to be more or less pornographic, differentiation revealed the participants' tendency to differentiate highly pornographic items from less pornographic items. Based on the findings of this study, we suggest that future research on the usage and effects of pornography be conducted while taking into consideration the multidimensional nature of pornography.

Item Response Theory Modeling and Categorical Regression Analyses of the Five-Factor Model Rating Form: A Study on Italian Community-Dwelling Adolescent Participants and Adult Participants.

PubMed

Fossati, Andrea; Widiger, Thomas A; Borroni, Serena; Maffei, Cesare; Somma, Antonella

2017-06-01

To extend the evidence on the reliability and construct validity of the Five-Factor Model Rating Form (FFMRF) in its self-report version, two independent samples of Italian participants, which were composed of 510 adolescent high school students and 457 community-dwelling adults, respectively, were administered the FFMRF in its Italian translation. Adolescent participants were also administered the Italian translation of the Borderline Personality Features Scale for Children-11 (BPFSC-11), whereas adult participants were administered the Italian translation of the Triarchic Psychopathy Measure (TriPM). Cronbach α values were consistent with previous findings; in both samples, average interitem r values indicated acceptable internal consistency for all FFMRF scales. A multidimensional graded item response theory model indicated that the majority of FFMRF items had adequate discrimination parameters; information indices supported the reliability of the FFMRF scales. Both categorical (i.e., item-level) and scale-level regression analyses suggested that the FFMRF scores may predict a nonnegligible amount of variance in the BPFSC-11 total score in adolescent participants, and in the TriPM scale scores in adult participants.
The Academic Resilience Scale (ARS-30): A New Multidimensional Construct Measure.

PubMed

Cassidy, Simon

2016-01-01

Resilience is a psychological construct observed in some individuals that accounts for success despite adversity. Resilience reflects the ability to bounce back, to beat the odds and is considered an asset in human characteristic terms. Academic resilience contextualizes the resilience construct and reflects an increased likelihood of educational success despite adversity. The paper provides an account of the development of a new multidimensional construct measure of academic resilience. The 30-item Academic Resilience Scale (ARS-30) explores process (as opposed to outcome) aspects of resilience, providing a measure of academic resilience based on students' specific adaptive cognitive-affective and behavioral responses to academic adversity. Findings from the study involving a sample of undergraduate students (N = 532) demonstrate that the ARS-30 has good internal reliability and construct validity. It is suggested that a measure such as the ARS-30, which is based on adaptive responses, aligns more closely with the conceptualisation of resilience and provides a valid construct measure of academic resilience relevant for research and practice in university student populations.
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14).

PubMed

Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi

2015-01-01

Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach's alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice.
Modeling the World Health Organization Disability Assessment Schedule II using non-parametric item response models.

PubMed

Galindo-Garre, Francisca; Hidalgo, María Dolores; Guilera, Georgina; Pino, Oscar; Rojo, J Emilio; Gómez-Benito, Juana

2015-03-01

The World Health Organization Disability Assessment Schedule II (WHO-DAS II) is a multidimensional instrument developed for measuring disability. It comprises six domains (getting around, self-care, getting along with others, life activities and participation in society). The main purpose of this paper is the evaluation of the psychometric properties for each domain of the WHO-DAS II with parametric and non-parametric Item Response Theory (IRT) models. A secondary objective is to assess whether the WHO-DAS II items within each domain form a hierarchy of invariantly ordered severity indicators of disability. A sample of 352 patients with a schizophrenia spectrum disorder is used in this study. The 36-item WHO-DAS II was administered during the consultation. Partial Credit and Mokken scale models are used to study the psychometric properties of the questionnaire. The psychometric properties of the WHO-DAS II scale are satisfactory for all the domains. However, we identify a few items that do not discriminate satisfactorily between different levels of disability and cannot be invariantly ordered in the scale. In conclusion, the WHO-DAS II can be used to assess overall disability in patients with schizophrenia, but some domains are too general to assess functionality in these patients because they contain items that are not applicable to this pathology. Copyright © 2014 John Wiley & Sons, Ltd.
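For readers unfamiliar with the Partial Credit Model named in the abstract above, the following is a minimal sketch of its category probabilities for a single polytomous item. The function names and the step-difficulty values are hypothetical and are not estimates from the WHO-DAS II study.

```python
import math

def pcm_category_probs(theta, thresholds):
    """Category probabilities under the partial credit model.

    theta: person location on the latent trait.
    thresholds: step difficulties delta_1..delta_m (hypothetical values here),
    giving m + 1 response categories scored 0..m.
    """
    # Cumulative sums of (theta - delta_k); category 0 has an empty sum of 0.
    cumulative = [0.0]
    for delta in thresholds:
        cumulative.append(cumulative[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(cumulative)
    weights = [math.exp(c - peak) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]

def expected_score(theta, thresholds):
    """Expected item score, i.e. the probability-weighted category index."""
    probs = pcm_category_probs(theta, thresholds)
    return sum(category * p for category, p in enumerate(probs))

# Illustrative four-category item with made-up step difficulties.
probs = pcm_category_probs(0.5, [-1.0, 0.0, 1.0])
```

With a single step difficulty the model reduces to the dichotomous Rasch model, which is a quick sanity check on any implementation.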
The Australian Racism, Acceptance, and Cultural-Ethnocentrism Scale (RACES): item response theory findings.

PubMed

Grigg, Kaine; Manderson, Lenore

2016-03-17

Racism and associated discrimination are pervasive and persistent challenges with multiple cumulative deleterious effects contributing to inequities in various health outcomes. Globally, research over the past decade has shown consistent associations between racism and negative health concerns. Such research confirms that race endures as one of the strongest predictors of poor health. Due to the lack of validated Australian measures of racist attitudes, RACES (Racism, Acceptance, and Cultural-Ethnocentrism Scale) was developed. Here, we examine RACES' psychometric properties, including the latent structure, utilising Item Response Theory (IRT). Unidimensional and Multidimensional Rating Scale Model (RSM) Rasch analyses were utilised with 296 Victorian primary school students and 182 adolescents and 220 adults from the Australian community. RACES was demonstrated to be a robust 24-item three-dimensional scale of Accepting Attitudes (12 items), Racist Attitudes (8 items), and Ethnocentric Attitudes (4 items). RSM Rasch analyses provide strong support for the instrument as a robust measure of racist attitudes in the Australian context, and for the overall factorial and construct validity of RACES across primary school children, adolescents, and adults. RACES provides a reliable and valid measure that can be utilised across the lifespan to evaluate attitudes towards all racial, ethnic, cultural, and religious groups. A core function of RACES is to assess the effectiveness of interventions to reduce community levels of racism and in turn inequities in health outcomes within Australia.
Development and validation of the Multidimensional Home Environment Scale (MHES) for adolescents and their mothers.

PubMed

Tabbakh, Tamara; Freeland-Graves, Jeanne

2016-08-01

The home environment is an important setting for the development of weight status in adolescence. At present a limited number of valid and reliable tools are available to evaluate the weight-related comprehensive home environment of this population. The goal of this research was to develop the Multidimensional Home Environment Scale which measures multiple components of the home. It includes psychological, social, and environmental domains from the perspective of an adolescent and the mother. Items were generated based on a literature review and then assessed for content validity by an expert panel and focus group in the target population. Internal consistency reliability was determined using Cronbach's α. Principal components analysis with varimax rotation was employed for assessment of construct validity. Temporal stability was evaluated using paired sample t-tests and bivariate correlations between responses at two different times, 1-2 weeks apart. Associations between adolescent and mother responses were utilized for convergent validity. The final versions contained 32 items for adolescents and 36 items for mothers; these were administered to 218 adolescents and mothers. The subscales on the questionnaires exhibited high construct validity, internal consistency reliability (adolescent: α=0.82, mother: α=0.83) and test-retest reliability (adolescent: r=0.90, p<0.01; mother: r=0.91, p<0.01). Total home environment scores were computed, with greater scores reflecting a better health environment. These results verify the utility of the MHES as a valid and reliable instrument. This promising tool can be utilized to capture the comprehensive home environment of young adolescents (11-14 years old). Copyright © 2016 Elsevier Ltd. All rights reserved.
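Several records in this listing report Cronbach's α as the internal consistency index. As a minimal sketch, assuming a complete respondents-by-items score matrix with no missing data, the standard formula can be computed as follows. The function name and the toy data are illustrative and unrelated to the MHES results.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents x items score matrix.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of total score)
    """
    scores = np.asarray(scores, dtype=float)
    n_items = scores.shape[1]
    item_vars = scores.var(axis=0, ddof=1)       # sample variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # sample variance of sum scores
    return n_items / (n_items - 1) * (1.0 - item_vars.sum() / total_var)

# Toy data: four respondents, two items (made-up values).
toy_scores = [[1, 2],
              [2, 1],
              [3, 4],
              [4, 3]]
alpha = cronbach_alpha(toy_scores)
```

The function divides by the variance of the total score, so it is undefined when every respondent has the same sum score.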
Personality in general and clinical samples: Measurement invariance of the Multidimensional Personality Questionnaire.

PubMed

Eigenhuis, Annemarie; Kamphuis, Jan H; Noordhof, Arjen

2017-09-01

A growing body of research suggests that the same general dimensions can describe normal and pathological personality, but most of the supporting evidence is exploratory. We aim to determine in a confirmatory framework the extent to which responses on the Multidimensional Personality Questionnaire (MPQ) are identical across general and clinical samples. We tested the Dutch brief form of the MPQ (MPQ-BF-NL) for measurement invariance across a general population subsample (N = 365) and a clinical sample (N = 365), using Multiple Group Confirmatory Factor Analysis (MGCFA) and Multiple Group Exploratory Structural Equation Modeling (MGESEM). As an omnibus personality test, the MPQ-BF-NL revealed strict invariance, indicating absence of bias. Unidimensional per scale tests for measurement invariance revealed that 10% of items appeared to contain bias across samples. Item bias only affected the scale interpretation of Achievement, with individuals from the clinical sample more readily admitting to put high demands on themselves than individuals from the general sample, regardless of trait level. This formal test of equivalence provides strong evidence for the common structure of normal and pathological personality and lends further support to the clinical utility of the MPQ. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Conceptualizing Interprofessional Teams as Multi-Team Systems-Implications for Assessment and Training.

PubMed

West, Courtney; Landry, Karen; Graham, Anna; Graham, Lori; Cianciolo, Anna T; Kalet, Adina; Rosen, Michael; Sherman, Deborah Witt

2015-01-01

SGEA 2015 CONFERENCE ABSTRACT (EDITED). Evaluating Interprofessional Teamwork During a Large-Scale Simulation. Courtney West, Karen Landry, Anna Graham, and Lori Graham. CONSTRUCT: This study investigated the multidimensional measurement of interprofessional (IPE) teamwork as part of large-scale simulation training. Healthcare team function has a direct impact on patient safety and quality of care. However, IPE team training has not been the norm. Recognizing the importance of developing team-based collaborative care, our College of Nursing implemented an IPE simulation activity called Disaster Day and invited other professions to participate. The exercise consists of two sessions: one in the morning and another in the afternoon. The disaster scenario is announced just prior to each session, which consists of team building, a 90-minute simulation, and debriefing. Approximately 300 Nursing, Medicine, Pharmacy, Emergency Medical Technicians, and Radiology students and over 500 standardized and volunteer patients participated in the Disaster Day event. To improve student learning outcomes, we created 3 competency-based instruments to evaluate collaborative practice in multidimensional fashion during this exercise. A 20-item IPE Team Observation Instrument designed to assess interprofessional team's attainment of Interprofessional Education Collaborative (IPEC) competencies was completed by 20 faculty and staff observing the Disaster Day simulation. One hundred sixty-six standardized patients completed a 10-item Standardized Patient IPE Team Evaluation Instrument developed from the IPEC competencies and adapted items from the 2014 Henry et al. PIVOT Questionnaire. This instrument assessed the standardized or volunteer patient's perception of the team's collaborative performance. 
A 29-item IPE Team's Perception of Collaborative Care Questionnaire, also created from the IPEC competencies and divided into 5 categories of Values/Ethics, Roles and Responsibilities, Communication, Teamwork, and Self-Evaluation, was completed by 188 students including 99 from Nursing, 43 from Medicine, 6 from Pharmacy, and 40 participants who belonged to more than one component, were students at another institution, or did not indicate their institution. The team instrument was designed to assess each team member's perception of how well the team and him- or herself met the competencies. Five of the items on the team perceptions questionnaire mirrored items on the standardized patient evaluation: demonstrated leadership practices that led to effective teamwork, discussed care and decisions about that care with patient, described roles and responsibilities clearly, worked well together to coordinate care, and good/effective communication. Internal consistency reliability of the IPE Team Observation Instrument was 0.80. In 18 of the 20 items, more than 50% of observers indicated the item was demonstrated. Of those, 6 of the items were observed by 50% to 75% of the observers, and the remaining 12 were observed by more than 80% of the observers. Internal consistency reliability of the IPE Team's Perception of Collaborative Care Instrument was 0.95. The mean response score, from 1 (strongly disagree) to 4 (strongly agree), was calculated for each section of the instrument. The overall mean score was 3.57 (SD = .11). Internal consistency reliability of the Standardized Patient IPE Team Evaluation Instrument was 0.87. The overall mean score was 3.28 (SD = .17). The ratings for the 5 items shared by the standardized patient and team perception instruments were compared using independent sample t tests. 
Statistically significant differences (p < .05) were present in each case, with the students rating themselves higher on average than the standardized patients did (mean differences between 0.2 and 0.6 on a scale of 1-4). Multidimensional, competency-based instruments appear to provide a robust view of IPE teamwork; however, challenges remain. Due to the large scale of the simulation exercise, observation-based assessment did not function as well as self- and standardized patient-based assessment. To promote greater variation in observer assessments during future Disaster Day simulations, we plan to adjust the rating scale from "not observed," "observed," and "not applicable" to a 4-point scale and reexamine interrater reliability.
The Subjective Sexual Arousal Scale for Men (SSASM): preliminary development and psychometric validation of a multidimensional measure of subjective male sexual arousal.

PubMed

Althof, Stanley E; Perelman, Michael A; Rosen, Raymond C

2011-08-01

Sexual arousal is a multifaceted process that involves both mental and physical components. No instrument has been developed and validated to assess subjective aspects of male sexual arousal. To develop and psychometrically validate a self-administered scale for assessing subjective male sexual arousal. Using recommendations of the Food and Drug Administration (FDA) guidance on patient-reported outcome instruments, important aspects of male sexual arousal were identified via qualitative research (focus groups and interviews) of U.S. men with erectile dysfunction (ED) and healthy controls. After a preliminary questionnaire was developed by a panel of experts, a quantitative study of men with ED and controls was conducted to psychometrically validate the Subjective Sexual Arousal Scale for Men (SSASM). To develop a male sexual arousal scale and determine its factor structure, reliability, and construct validity. Five aspects of male sexual arousal were identified from the qualitative focus groups and cognitive interviews. Men's preferred language for describing sexual arousal and preferred response formats were incorporated into the questions. Factor analysis of data from the quantitative study of 304 men aged 21 to 70 years identified five domains with eigenvalues >1: sexual performance (six items), mental satisfaction (five items), sexual assertiveness (three items), partner communication (three items), and partner relationship (three items). The five domains had a high degree of internal consistency (Cronbach's alpha values 0.88-0.94). Test-retest reliability over a 2- to 4-week period was high-moderately high (r values 0.75-0.88) for the five domain scores. Correlations between SSASM domain scores and standardized scale scores for social desirability, general health, life satisfaction, and sexual function demonstrated the construct validity of the scale. 
Preliminary validation data suggest that the 20-item SSASM scale may be useful as a multidimensional, reliable, self-administered instrument for assessing subjective sexual arousal in men of different ages. © 2011 International Society for Sexual Medicine.
Extracting Unidimensional Chains from Multidimensional Datasets: A Graph Theory Approach.

ERIC Educational Resources Information Center

Yamomoto, Yoneo; Wise, Steven L.

An order-analysis procedure that uses graph theory to efficiently extract nonredundant, unidimensional chains of items from multidimensional data sets, with chain consistency as the criterion for chain membership, is outlined in this paper. The procedure is intended as an alternative to the Reynolds (1976) procedure which is described as being…
Finite Mixture Multilevel Multidimensional Ordinal IRT Models for Large Scale Cross-Cultural Research

ERIC Educational Resources Information Center

de Jong, Martijn G.; Steenkamp, Jan-Benedict E. M.

2010-01-01

We present a class of finite mixture multilevel multidimensional ordinal IRT models for large scale cross-cultural research. Our model is proposed for confirmatory research settings. Our prior for item parameters is a mixture distribution to accommodate situations where different groups of countries have different measurement operations, while…
Caregiver Appraisals of Functional Dependence in Individuals With Dementia and Associated Caregiver Upset: Psychometric Properties of a New Scale and Response Patterns by Caregiver and Care Recipient Characteristics

PubMed Central

GITLIN, LAURA N.; ROTH, DAVID L.; BURGIO, LOUIS D.; LOEWENSTEIN, DAVID A.; WINTER, LARAINE; NICHOLS, LINDA; ARGÜELLES, SOLEDAD; CORCORAN, MARY; BURNS, ROBERT; MARTINDALE, JENNIFER

2008-01-01

Objective: To evaluate psychometric properties and response patterns of the Caregiver Assessment of Function and Upset (CAFU), a 15-item multidimensional measure of dependence in dementia patients and caregiver reaction. Method: 640 families were administered the CAFU (53% White, 43% African American, and 4% mixed race and ethnicity). We created a random split of the sample and conducted exploratory factor analyses on Sample 1 and confirmatory factor analyses on Sample 2. Convergent and discriminant validity were evaluated using Spearman rank correlation coefficients. Results: A two-factor structure for functional items was derived, and excellent factorial validity was obtained. Convergent and discriminant validity were obtained for function and upset measures. Differential response patterns for dependence and caregiver upset were found for caregiver race, relationship, and care recipient gender but not for caregiver gender. Discussion: The CAFU is easily administered, reliable, and valid for evaluating appraisals of dependencies and upsetting care areas. PMID:15750049
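The random split described above (exploratory analysis on one half, confirmatory analysis on the other) can be sketched as follows. This is an illustration only: the function name, the seed, and the equal-halves rule are assumptions, since the abstract does not report how the split was drawn.

```python
import numpy as np

def random_split(n_cases, seed=0):
    """Randomly partition case indices into two non-overlapping halves.

    Assumption: an (approximately) equal split. The first half would be used
    for exploratory factor analysis and the second for confirmatory analysis.
    """
    rng = np.random.default_rng(seed)
    order = rng.permutation(n_cases)
    half = n_cases // 2
    return np.sort(order[:half]), np.sort(order[half:])

# 640 is the number of families reported in the abstract.
sample_1, sample_2 = random_split(640)
```

Fixing the seed keeps the partition reproducible, so the same cases can be assigned to the exploratory and confirmatory samples on every run.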
Construct Validation of a Multidimensional Computerized Adaptive Test for Fatigue in Rheumatoid Arthritis

PubMed Central

Nikolaus, Stephanie; Bode, Christina; Taal, Erik; Vonkeman, Harald E.; Glas, Cees A. W.; van de Laar, Mart A. F. J.

2015-01-01

Objective: Multidimensional computerized adaptive testing enables precise measurements of patient-reported outcomes at an individual level across different dimensions. This study examined the construct validity of a multidimensional computerized adaptive test (CAT) for fatigue in rheumatoid arthritis (RA). Methods: The ‘CAT Fatigue RA’ was constructed based on a previously calibrated item bank. It contains 196 items and three dimensions: ‘severity’, ‘impact’ and ‘variability’ of fatigue. The CAT was administered to 166 patients with RA. They also completed a traditional, multidimensional fatigue questionnaire (BRAF-MDQ) and the SF-36 in order to examine the CAT’s construct validity. A priori criterion for construct validity was that 75% of the correlations between the CAT dimensions and the subscales of the other questionnaires were as expected. Furthermore, comprehensive use of the item bank, measurement precision and score distribution were investigated. Results: The a priori criterion for construct validity was supported for two of the three CAT dimensions (severity and impact but not for variability). For severity and impact, 87% of the correlations with the subscales of the well-established questionnaires were as expected but for variability, 53% of the hypothesised relations were found. Eighty-nine percent of the items were selected between one and 137 times for CAT administrations. Measurement precision was excellent for the severity and impact dimensions, with more than 90% of the CAT administrations reaching a standard error below 0.32. The variability dimension showed good measurement precision with 90% of the CAT administrations reaching a standard error below 0.44. No floor- or ceiling-effects were found for the three dimensions. Conclusion: The CAT Fatigue RA showed good construct validity and excellent measurement precision on the dimensions severity and impact. 
The dimension variability had less ideal measurement characteristics, pointing to the need to recalibrate the CAT item bank with a two-dimensional model, solely consisting of severity and impact. PMID:26710104
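The standard-error cut-offs quoted in this abstract (0.32 and 0.44) can be read as approximate reliabilities. As a hedged sketch, assuming the latent trait is scaled to unit variance (an assumption, since the abstract does not state the metric), reliability is roughly one minus the squared standard error. The function names and the list of standard errors below are hypothetical.

```python
def reliability_from_se(se, trait_variance=1.0):
    """Approximate reliability implied by a standard error of measurement.

    Assumes the latent trait has variance `trait_variance` (1.0 by default).
    """
    return 1.0 - (se ** 2) / trait_variance

def share_reaching(se_values, threshold):
    """Fraction of CAT administrations whose final standard error is below threshold."""
    return sum(se < threshold for se in se_values) / len(se_values)

rel_excellent = reliability_from_se(0.32)  # about 0.90 under the unit-variance assumption
rel_good = reliability_from_se(0.44)       # about 0.81 under the unit-variance assumption

# Made-up final standard errors from four hypothetical administrations.
share = share_reaching([0.30, 0.31, 0.35, 0.29], 0.32)
```

Under that assumption the two cut-offs correspond to reliabilities of about 0.90 and 0.80, which matches the "excellent" and "good" labels used in the abstract.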
Development of multi-dimensional body image scale for Malaysian female adolescents

PubMed Central

Taib, Mohd Nasir Mohd; Shariff, Zalilah Mohd; Khor, Geok Lin

2008-01-01

The present study was conducted to develop a Multi-dimensional Body Image Scale for Malaysian female adolescents. Data were collected among 328 female adolescents from a secondary school in Kuantan district, state of Pahang, Malaysia by using a self-administered questionnaire and anthropometric measurements. The self-administered questionnaire comprised multiple measures of body image, Eating Attitude Test (EAT-26; Garner & Garfinkel, 1979) and Rosenberg Self-esteem Inventory (Rosenberg, 1965). The 152 items from selected multiple measures of body image were examined through factor analysis and for internal consistency. Correlations between Multi-dimensional Body Image Scale and body mass index (BMI), risk of eating disorders and self-esteem were assessed for construct validity. A seven factor model of a 62-item Multi-dimensional Body Image Scale for Malaysian female adolescents with construct validity and good internal consistency was developed. The scale encompasses 1) preoccupation with thinness and dieting behavior, 2) appearance and body satisfaction, 3) body importance, 4) muscle increasing behavior, 5) extreme dieting behavior, 6) appearance importance, and 7) perception of size and shape dimensions. Besides, a multidimensional body image composite score was proposed to screen negative body image risk in female adolescents. The result found body image was correlated with BMI, risk of eating disorders and self-esteem in female adolescents. In short, the present study supports a multi-dimensional concept for body image and provides a new insight into its multi-dimensionality in Malaysian female adolescents with preliminary validity and reliability of the scale. The Multi-dimensional Body Image Scale can be used to identify female adolescents who are potentially at risk of developing body image disturbance through future intervention programs. PMID:20126371
Development of multi-dimensional body image scale for Malaysian female adolescents.

PubMed

Chin, Yit Siew; Taib, Mohd Nasir Mohd; Shariff, Zalilah Mohd; Khor, Geok Lin

2008-01-01

The present study was conducted to develop a Multi-dimensional Body Image Scale for Malaysian female adolescents. Data were collected among 328 female adolescents from a secondary school in Kuantan district, state of Pahang, Malaysia by using a self-administered questionnaire and anthropometric measurements. The self-administered questionnaire comprised multiple measures of body image, Eating Attitude Test (EAT-26; Garner & Garfinkel, 1979) and Rosenberg Self-esteem Inventory (Rosenberg, 1965). The 152 items from selected multiple measures of body image were examined through factor analysis and for internal consistency. Correlations between Multi-dimensional Body Image Scale and body mass index (BMI), risk of eating disorders and self-esteem were assessed for construct validity. A seven factor model of a 62-item Multi-dimensional Body Image Scale for Malaysian female adolescents with construct validity and good internal consistency was developed. The scale encompasses 1) preoccupation with thinness and dieting behavior, 2) appearance and body satisfaction, 3) body importance, 4) muscle increasing behavior, 5) extreme dieting behavior, 6) appearance importance, and 7) perception of size and shape dimensions. Besides, a multidimensional body image composite score was proposed to screen negative body image risk in female adolescents. The result found body image was correlated with BMI, risk of eating disorders and self-esteem in female adolescents. In short, the present study supports a multi-dimensional concept for body image and provides a new insight into its multi-dimensionality in Malaysian female adolescents with preliminary validity and reliability of the scale. The Multi-dimensional Body Image Scale can be used to identify female adolescents who are potentially at risk of developing body image disturbance through future intervention programs.
Development and initial validation of a brief self-report measure of cognitive dysfunction in fibromyalgia.

PubMed

Kratz, Anna L; Schilling, Stephen G; Goesling, Jenna; Williams, David A

2015-06-01

Pain is often the focus of research and clinical care in fibromyalgia (FM); however, cognitive dysfunction is also a common, distressing, and disabling symptom in FM. Current efforts to address this problem are limited by the lack of a comprehensive, valid measure of subjective cognitive dysfunction in FM that is easily interpretable, accessible, and brief. The purpose of this study was to leverage cognitive functioning item banks that were developed as part of the Patient Reported Outcomes Measurement Information System (PROMIS) to devise a 10-item short form measure of cognitive functioning for use in FM. In study 1, a nationwide (U.S.) sample of 1,035 adults with FM (age range = 18-82, 95.2% female) completed 2 cognitive item pools. Factor analyses and item response theory analyses were used to identify dimensionality and optimally performing items. A recommended 10-item measure, called the Multidimensional Inventory of Subjective Cognitive Impairment (MISCI) was created. In study 2, 232 adults with FM completed the MISCI and a legacy measure of cognitive functioning that is used in FM clinical trials, the Multiple Ability Self-Report Questionnaire (MASQ). The MISCI showed excellent internal reliability, low ceiling/floor effects, and good convergent validity with the MASQ (r = -.82). This paper presents the MISCI, a 10-item measure of cognitive dysfunction in FM, developed through classical test theory and item response theory. This brief but comprehensive measure shows evidence of excellent construct validity through large correlations with a lengthy legacy measure of cognitive functioning. Copyright © 2015 American Pain Society. Published by Elsevier Inc. All rights reserved.
A natural language screening measure for motivation to change.

PubMed

Miller, William R; Johnson, Wendy R

2008-09-01

Client motivation for change, a topic of high interest to addiction clinicians, is multidimensional and complex, and many different approaches to measurement have been tried. The current effort drew on psycholinguistic research on natural language that is used by clients to describe their own motivation. Seven addiction treatment sites participated in the development of a simple scale to measure client motivation. Twelve items were drafted to represent six potential dimensions of motivation for change that occur in natural discourse. The maximum self-rating of motivation (10 on a 0-10 scale) was the median score on all items, and 43% of respondents rated 10 on all 12 items - a substantial ceiling effect. From 1035 responses, three factors emerged representing importance, ability, and commitment - constructs that are also reflected in several theoretical models of motivation. A 3-item version of the scale, with one marker item for each of these constructs, accounted for 81% of variance in the full scale. The three items are: 1. It is important for me to . . . 2. I could . . . and 3. I am trying to . . . This offers a quick (1-minute) assessment of clients' self-reported motivation for change.
Design and validation of a questionnaire to assess organizational culture in French hospital wards.

PubMed

Saillour-Glénisson, F; Domecq, S; Kret, M; Sibe, M; Dumond, J P; Michel, P

2016-09-17

Although many organizational culture questionnaires have been developed, there is a lack of any validated multidimensional questionnaire assessing organizational culture at hospital ward level and adapted to health care context. Facing the lack of an appropriate tool, a multidisciplinary team designed and validated a dimensional organizational culture questionnaire for healthcare settings to be administered at ward level. A database of organizational culture items and themes was created after extensive literature review. Items were regrouped into dimensions and subdimensions (classification validated by experts). Pre-test and face validation was conducted with 15 health care professionals. In a stratified cluster random sample of hospitals, the psychometric validation was conducted in three phases on a sample of 859 healthcare professionals from 36 multidisciplinary medicine services: 1) the exploratory phase included a description of responses' saturation levels, factor and correlations analyses and an internal consistency analysis (Cronbach's alpha coefficient); 2) confirmatory phase used the Structural Equation Modeling (SEM); 3) reproducibility was studied by a test-retest. The overall response rate was 80 %; the completion average was 97 %. The metrological results were: a global Cronbach's alpha coefficient of 0.93, higher than 0.70 for 12 sub-dimensions; all Dillon-Goldstein's rho coefficients higher than 0.70; an excellent quality of external model with a Goodness of Fitness (GoF) criterion of 0.99. Seventy percent of the items had a reproducibility ranging from moderate (Intra-Class Coefficient between 50 and 70 % for 25 items) to good (ICC higher than 70 % for 33 items). COMEt (Contexte Organisationnel et Managérial en Etablissement de Santé) questionnaire is a validated multidimensional organizational culture questionnaire made of 6 dimensions, 21 sub-dimensions and 83 items. 
It is the first dimensional organizational culture questionnaire, specific to healthcare context, for a unit level assessment showing robust psychometric properties (validity and reliability). This tool is suited for research purposes, especially for assessing organizational context in research analysing the effectiveness of hospital quality improvement strategies. Our tool is also suited for an overall assessment of ward culture and could be a powerful trigger to improve management and clinical performance. Its psychometric properties in other health systems need to be tested.
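The internal consistency figures reported above rest on Cronbach's alpha. The abstract gives no computational detail, so the following is only a minimal sketch of the standard formula, alpha = k/(k-1) * (1 - sum of item variances / variance of total scores); the function name `cronbach_alpha` and the response matrix are hypothetical and are not taken from the COMEt study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondent rows, each holding k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("need at least two items")
    # Sample variance of each item (column) across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's total score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical data: 5 respondents x 3 items on a 1-5 scale (illustration only).
responses = [
    [4, 5, 4],
    [2, 3, 2],
    [5, 5, 4],
    [1, 2, 1],
    [3, 3, 3],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

In practice the same function would be applied once to the full item set and once per sub-dimension, which is how a global coefficient and per-sub-dimension coefficients such as those quoted above are usually obtained.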
The Development of a Multiple-Item Annoyance Scale (MIAS) for Transportation Noise Annoyance

PubMed Central

Belke, Christin; Spilski, Jan

2018-01-01

In 2001, Team#6 of the International Commission on Biological Effects of Noise (ICBEN) recommended the use of two single international standardised questions and response scales. This recommendation has been widely accepted in the scientific community. Nevertheless, annoyance can be regarded as a multidimensional construct comprising the three elements: (1) experience of an often repeated noise-related disturbance and the behavioural response to cope with it, (2) an emotional/attitudinal response to the sound and its disturbing impact, and (3) the perceived control or coping capacity with regard to the noise situation. The psychometric properties of items reflecting these three elements have been explored for aircraft noise annoyance. Analyses were conducted using data of the NORAH-Study (Noise-Related Annoyance, Cognition, and Health), and a multi-item noise annoyance scale (MIAS) has been developed and tested post hoc by using a stepwise process (exploratory and confirmatory factor analyses). Preliminary results were presented to the 12th ICBEN Congress in 2017. In this study, the validation of MIAS is done for aircraft noise and extended to railway and road traffic noise. The results largely confirm the concept of MIAS as a second-order construct of annoyance for all of the investigated transportation noise sources; however, improvements can be made, in particular with regard to items addressing the perceived coping capacity. PMID:29757228
The development and validation of a multidimensional sum-scaling questionnaire to measure patient-reported outcomes in acute respiratory tract infections in primary care: the acute respiratory tract infection questionnaire.

PubMed

Aabenhus, Rune; Thorsen, Hanne; Siersma, Volkert; Brodersen, John

2013-01-01

Patient-reported outcomes are seldom validated measures in clinical trials of acute respiratory tract infections (ARTIs) in primary care. We developed and validated a patient-reported outcome sum-scaling measure to assess the severity and functional impacts of ARTIs. Qualitative interviews and field testing among adults with an ARTI were conducted to ascertain a high degree of face and content validity of the questionnaire. Subsequently, a draft version of the Acute Respiratory Tract Infection Questionnaire (ARTIQ) was statistically validated by using the partial credit Rasch model to test dimensionality, objectivity, and reliability of items. Test of known groups' validity was conducted by comparing participants with and without an ARTI. The final version of the ARTIQ consisted of 38 items covering five dimensions (Physical-upper, Physical-lower, Psychological, Sleep, and Medicine) and five single items. All final dimensions were confirmed to fit the Rasch model, thus enabling sum-scaling of responses. The ARTIQ scores in participants with an ARTI were significantly higher than in those without ARTI (known groups' validity). A self-administered, multidimensional, sum-scaling questionnaire with high face and content validity and adequate psychometric properties for assessing severity and functional impacts from ARTIs in adults is available to clinical trials and audits in primary care. Copyright © 2013, International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc.

Using Multidimensional Rasch Analysis to Validate the Chinese Version of the Motivated Strategies for Learning Questionnaire (MSLQ-CV)

ERIC Educational Resources Information Center

Lee, John Chi-Kin; Zhang, Zhonghua; Yin, Hongbiao

2010-01-01

This article used the multidimensional random coefficients multinomial logit model to examine the construct validity and detect the substantial differential item functioning (DIF) of the Chinese version of motivated strategies for learning questionnaire (MSLQ-CV). A total of 1,354 Hong Kong junior high school students were administered the…
Career Locus of Control and Career Success among Chinese Employees: A Multidimensional Approach

ERIC Educational Resources Information Center

Guan, Yanjun; Wang, Zhen; Dong, Zhilin; Liu, Yukun; Yue, Yumeng; Liu, Haiyang; Zhang, Yuqing; Zhou, Wenxia; Liu, Haihua

2013-01-01

The current research aimed to develop a multidimensional measure of career locus of control (LOC) and examine its predictive validity on objective and subjective career success among Chinese employees. Items of career LOC were generated based on literature review of the significant predictors of career success, as well as the open-ended responses…
Comparison of Unidimensional and Multidimensional Approaches to IRT Parameter Estimation. Research Report. ETS RR-04-44

ERIC Educational Resources Information Center

Zhang, Jinming

2004-01-01

It is common to assume during statistical analysis of a multiscale assessment that the assessment has simple structure or that it is composed of several unidimensional subtests. Under this assumption, both the unidimensional and multidimensional approaches can be used to estimate item parameters. This paper theoretically demonstrates that these…
Similarity from Multi-Dimensional Scaling: Solving the Accuracy and Diversity Dilemma in Information Filtering

PubMed Central

Zeng, Wei; Zeng, An; Liu, Hao; Shang, Ming-Sheng; Zhang, Yi-Cheng

2014-01-01

Recommender systems are designed to assist individual users to navigate through the rapidly growing amount of information. One of the most successful recommendation techniques is collaborative filtering, which has been extensively investigated and has already found wide applications in e-commerce. One of the challenges in this algorithm is how to accurately quantify the similarities of user pairs and item pairs. In this paper, we employ the multidimensional scaling (MDS) method to measure the similarities between nodes in user-item bipartite networks. The MDS method can extract the essential similarity information from the networks by smoothing out noise, which provides a graphical display of the structure of the networks. With the similarity measured from MDS, we find that the item-based collaborative filtering algorithm can outperform the diffusion-based recommendation algorithms. Moreover, we show that this method tends to recommend unpopular items and increase the global diversification of the networks in the long term. PMID:25343243
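To make the item-based collaborative filtering step concrete, here is a minimal sketch on a tiny user-item bipartite network. It is not the authors' method: cosine similarity stands in for the MDS-derived similarity described in the abstract, and the adjacency matrix and the function names (`item_similarity`, `recommend`) are hypothetical.

```python
from math import sqrt


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def item_similarity(adj):
    """Item-item similarity from a user-item matrix (adj[u][i] = 1 if u collected i)."""
    n_items = len(adj[0])
    cols = [[row[i] for row in adj] for i in range(n_items)]
    # Stand-in similarity: cosine between item columns; self-similarity set to 0.
    return [
        [cosine(cols[i], cols[j]) if i != j else 0.0 for j in range(n_items)]
        for i in range(n_items)
    ]


def recommend(adj, sim, user, top_n=2):
    """Rank uncollected items by summed similarity to the user's collected items."""
    row = adj[user]
    scores = {
        i: sum(sim[i][j] * row[j] for j in range(len(row)))
        for i in range(len(row))
        if not row[i]
    }
    return sorted(scores, key=lambda i: (-scores[i], i))[:top_n]


# Hypothetical network: 4 users x 4 items.
adj = [
    [1, 1, 0, 0],
    [1, 1, 1, 0],
    [0, 0, 1, 1],
    [1, 0, 0, 0],
]
sim = item_similarity(adj)
print(recommend(adj, sim, user=3))
```

Swapping in a different similarity matrix (for example, one derived from distances in an MDS embedding) only requires replacing `item_similarity`; the scoring step stays the same.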
Development of a questionnaire for assessing the childbirth experience (QACE).

PubMed

Carquillat, Pierre; Vendittelli, Françoise; Perneger, Thomas; Guittier, Marie-Julia

2017-08-30

Due to its potential impact on women's psychological health, assessing perceptions of their childbirth experience is important. The aim of this study was to develop a multidimensional self-reporting questionnaire to evaluate the childbirth experience. Factors influencing the childbirth experience were identified from a literature review and the results of a previous qualitative study. A total of 25 items were combined from existing instruments or were created de novo. A draft version was pilot tested for face validity with 30 women and submitted for evaluation of its construct validity to 477 primiparous women at one-month post-partum. The recruitment took place in two obstetric clinics from Swiss and French university hospitals. To evaluate the content validity, we compared item responses to general childbirth experience assessments on a numeric, 0 to 10 rating scale. We dichotomized two group assessment scores: "0 to 7" and "8 to 10". We performed an exploratory factor analysis to identify underlying dimensions. In total, 291 women completed the questionnaire (response rate = 61%). The responses to 22 items were statistically significant between the 0 to 7 and 8 to 10 groups for the general childbirth experience assessments. An exploratory factor analysis yielded four sub-scales, which were labelled "relationship with staff" (4 items), "emotional status" (3 items), "first moments with the new born," (3 items) and "feelings at one month postpartum" (3 items). All 4 scales had satisfactory internal consistency levels (alpha coefficients from 0.70 to 0.85). The full 25-item version can be used to analyse each item by itself, and the short 4-dimension version can be scored to summarize the general assessment of the childbirth experience. The Questionnaire for Assessing the Childbirth Experience (QACE) could be useful as a screening instrument to identify women with negative childbirth experiences. 
It can be used as both a research instrument in its short version and a questionnaire for use in clinical practice in its full version.
Development of the Assessment of Belief Conflict in Relationship-14 (ABCR-14)

PubMed Central

Kyougoku, Makoto; Teraoka, Mutsumi; Masuda, Noriko; Ooura, Mariko; Abe, Yasushi

2015-01-01

Purpose Nurses and other healthcare workers frequently experience belief conflict, one of the most important, new stress-related problems in both academic and clinical fields. Methods In this study, using a sample of 1,683 nursing practitioners, we developed The Assessment of Belief Conflict in Relationship-14 (ABCR-14), a new scale that assesses belief conflict in the healthcare field. Standard psychometric procedures were used to develop and test the scale, including a qualitative framework concept and item-pool development, item reduction, and scale development. We analyzed the psychometric properties of ABCR-14 according to entropy, polyserial correlation coefficient, exploratory factor analysis, confirmatory factor analysis, average variance extracted, Cronbach’s alpha, Pearson product-moment correlation coefficient, and multidimensional item response theory (MIRT). Results The results of the analysis supported a three-factor model consisting of 14 items. The validity and reliability of ABCR-14 was suggested by evidence from high construct validity, structural validity, hypothesis testing, internal consistency reliability, and concurrent validity. The result of the MIRT offered strong support for good item response of item slope parameters and difficulty parameters. However, the ABCR-14 Likert scale might need to be explored from the MIRT point of view. Yet, as mentioned above, there is sufficient evidence to support that ABCR-14 has high validity and reliability. Conclusion The ABCR-14 demonstrates good psychometric properties for nursing belief conflict. Further studies are recommended to confirm its application in clinical practice. PMID:26247356
Measurement and control of bias in patient reported outcomes using multidimensional item response theory.

PubMed

Dowling, N Maritza; Bolt, Daniel M; Deng, Sien; Li, Chenxi

2016-05-26

Patient-reported outcome (PRO) measures play a key role in the advancement of patient-centered care research. The accuracy of inferences, relevance of predictions, and the true nature of the associations made with PRO data depend on the validity of these measures. Errors inherent to self-report measures can seriously bias the estimation of constructs assessed by the scale. A well-documented disadvantage of self-report measures is their sensitivity to response style (RS) effects such as the respondent's tendency to select the extremes of a rating scale. Although the biasing effect of extreme responding on constructs measured by self-reported tools has been widely acknowledged and studied across disciplines, little attention has been given to the development and systematic application of methodologies to assess and control for this effect in PRO measures. We review the methodological approaches that have been proposed to study extreme RS effects (ERS). We applied a multidimensional item response theory model to simultaneously estimate and correct for the impact of ERS on trait estimation in a PRO instrument. Model estimates were used to study the biasing effects of ERS on sum scores for individuals with the same amount of the targeted trait but different levels of ERS. We evaluated the effect of joint estimation of multiple scales and ERS on trait estimates and demonstrated the biasing effects of ERS on these trait estimates when used as explanatory variables. A four-dimensional model accounting for ERS bias provided a better fit to the response data. Increasing levels of ERS showed bias in total scores as a function of trait estimates. The effect of ERS was greater when the pattern of extreme responding was the same across multiple scales modeled jointly. The estimated item category intercepts provided evidence of content independent category selection. Uncorrected trait estimates used as explanatory variables in prediction models showed downward bias. 
A comprehensive evaluation of the psychometric quality and soundness of PRO assessment measures should incorporate the study of ERS as a potential nuisance dimension affecting the accuracy and validity of scores and the impact of PRO data in clinical research and decision making.
Performance of the likelihood ratio difference (G2 Diff) test for detecting unidimensionality in applications of the multidimensional Rasch model.

PubMed

Harrell-Williams, Leigh; Wolfe, Edward W

2014-01-01

Previous research has investigated the influence of sample size, model misspecification, test length, ability distribution offset, and generating model on the likelihood ratio difference test in applications of item response models. This study extended that research to the evaluation of dimensionality using the multidimensional random coefficients multinomial logit model (MRCMLM). Logistic regression analysis of simulated data reveals that sample size and test length have a large effect on the capacity of the LR difference test to correctly identify unidimensionality, with shorter tests and smaller sample sizes leading to smaller Type I error rates. Higher levels of simulated misfit resulted in fewer incorrect decisions than data with no or little misfit. However, Type I error rates indicate that the likelihood ratio difference test is not suitable under any of the simulated conditions for evaluating dimensionality in applications of the MRCMLM.
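For orientation, the likelihood ratio difference statistic is usually written as follows. The abstract does not state the formula, so this is the textbook form for nested models, with the unidimensional model treated as the restricted model; the symbols $L_{\text{uni}}$, $L_{\text{multi}}$, $p_{\text{uni}}$, and $p_{\text{multi}}$ (maximized likelihoods and numbers of estimated parameters) are notation assumed here.

```latex
G^2_{\text{diff}}
  = -2\left[\ln L_{\text{uni}} - \ln L_{\text{multi}}\right]
  = \left(-2\ln L_{\text{uni}}\right) - \left(-2\ln L_{\text{multi}}\right),
\qquad
\mathit{df} = p_{\text{multi}} - p_{\text{uni}}
```

The statistic is conventionally referred to a chi-square distribution with $\mathit{df}$ degrees of freedom; the Type I error rates reported above concern how well that reference distribution holds in the simulated conditions.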
Coping as Part of Motivational Resilience in School: A Multidimensional Measure of Families, Allocations, and Profiles of Academic Coping

ERIC Educational Resources Information Center

Skinner, Ellen; Pitzer, Jennifer; Steele, Joel

2013-01-01

A study was designed to examine a multidimensional measure of children's coping in the academic domain as part of a larger model of motivational resilience. Using items tapping multiple ways of dealing with academic problems, including five adaptive ways (strategizing, help-seeking, comfort-seeking, self-encouragement, and commitment) and six…
A hybrid heuristic for the multiple choice multidimensional knapsack problem

NASA Astrophysics Data System (ADS)

Mansi, Raïd; Alves, Cláudio; Valério de Carvalho, J. M.; Hanafi, Saïd

2013-08-01

In this article, a new solution approach for the multiple choice multidimensional knapsack problem is described. The problem is a variant of the multidimensional knapsack problem where items are divided into classes, and exactly one item per class has to be chosen. Both problems are NP-hard. However, the multiple choice multidimensional knapsack problem appears to be more difficult to solve in part because of its choice constraints. Many real applications lead to very large scale multiple choice multidimensional knapsack problems that can hardly be addressed using exact algorithms. A new hybrid heuristic is proposed that embeds several new procedures for this problem. The approach is based on the resolution of linear programming relaxations of the problem and reduced problems that are obtained by fixing some variables of the problem. The solutions of these problems are used to update the global lower and upper bounds for the optimal solution value. A new strategy for defining the reduced problems is explored, together with a new family of cuts and a reformulation procedure that is used at each iteration to improve the performance of the heuristic. An extensive set of computational experiments is reported for benchmark instances from the literature and for a large set of hard instances generated randomly. The results show that the approach outperforms other state-of-the-art methods described so far, providing the best known solution for a significant number of benchmark instances.
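The problem definition above (items grouped into classes, exactly one item chosen per class, several resource constraints) can be illustrated with a brute-force sketch. This is not the hybrid heuristic from the article; exhaustive enumeration is practical only for tiny instances, and the instance data and the function name `solve_mmkp` are hypothetical.

```python
from itertools import product


def solve_mmkp(classes, capacity):
    """Exhaustively solve a tiny multiple choice multidimensional knapsack instance.

    classes: list of classes; each class is a list of (value, weights) items,
             where weights has one entry per resource dimension.
    capacity: resource limits, one per dimension.
    Returns (best_value, choice), where choice[k] is the index of the item
    picked in class k, or (None, None) if no feasible selection exists.
    """
    best_value, best_choice = None, None
    # Enumerate every way of picking exactly one item from each class.
    for choice in product(*(range(len(c)) for c in classes)):
        picked = [classes[k][j] for k, j in enumerate(choice)]
        usage = [sum(w[d] for _, w in picked) for d in range(len(capacity))]
        # Keep the selection only if every resource constraint holds.
        if all(u <= c for u, c in zip(usage, capacity)):
            value = sum(v for v, _ in picked)
            if best_value is None or value > best_value:
                best_value, best_choice = value, choice
    return best_value, best_choice


# Hypothetical instance: 2 classes, 2 items each, 2 resource dimensions.
classes = [
    [(10, (3, 2)), (6, (1, 1))],
    [(8, (2, 3)), (5, (1, 1))],
]
capacity = (4, 4)
print(solve_mmkp(classes, capacity))
```

The number of selections grows as the product of the class sizes, which is why large instances call for heuristics such as the LP-relaxation-based approach the article describes.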
Precision of working memory for speech sounds.

PubMed

Joseph, Sabine; Iverson, Paul; Manohar, Sanjay; Fox, Zoe; Scott, Sophie K; Husain, Masud

2015-01-01

Memory for speech sounds is a key component of models of verbal working memory (WM). But how good is verbal WM? Most investigations assess this using binary report measures to derive a fixed number of items that can be stored. However, recent findings in visual WM have challenged such "quantized" views by employing measures of recall precision with an analogue response scale. WM for speech sounds might rely on both continuous and categorical storage mechanisms. Using a novel speech matching paradigm, we measured WM recall precision for phonemes. Vowel qualities were sampled from a formant space continuum. A probe vowel had to be adjusted to match the vowel quality of a target on a continuous, analogue response scale. Crucially, this provided an index of the variability of a memory representation around its true value and thus allowed us to estimate how memories were distorted from the original sounds. Memory load affected the quality of speech sound recall in two ways. First, there was a gradual decline in recall precision with increasing number of items, consistent with the view that WM representations of speech sounds become noisier with an increase in the number of items held in memory, just as for vision. Based on multidimensional scaling (MDS), the level of noise appeared to be reflected in distortions of the formant space. Second, as memory load increased, there was evidence of greater clustering of participants' responses around particular vowels. A mixture model captured both continuous and categorical responses, demonstrating a shift from continuous to categorical memory with increasing WM load. This suggests that direct acoustic storage can be used for single items, but when more items must be stored, categorical representations must be used.
The Academic Resilience Scale (ARS-30): A New Multidimensional Construct Measure

PubMed Central

Cassidy, Simon

2016-01-01

Resilience is a psychological construct observed in some individuals that accounts for success despite adversity. Resilience reflects the ability to bounce back, to beat the odds and is considered an asset in human characteristic terms. Academic resilience contextualizes the resilience construct and reflects an increased likelihood of educational success despite adversity. The paper provides an account of the development of a new multidimensional construct measure of academic resilience. The 30 item Academic Resilience Scale (ARS-30) explores process—as opposed to outcome—aspects of resilience, providing a measure of academic resilience based on students’ specific adaptive cognitive-affective and behavioral responses to academic adversity. Findings from the study involving a sample of undergraduate students (N = 532) demonstrate that the ARS-30 has good internal reliability and construct validity. It is suggested that a measure such as the ARS-30, which is based on adaptive responses, aligns more closely with the conceptualisation of resilience and provides a valid construct measure of academic resilience relevant for research and practice in university student populations. PMID:27917137
Dyspnoea-12: a translation and linguistic validation study in a Swedish setting

PubMed Central

Ekström, Magnus

2017-01-01

Background Dyspnoea consists of multiple dimensions including the intensity, unpleasantness, sensory qualities and emotional responses which may differ between patient groups, settings and in relation to treatment. The Dyspnoea-12 is a validated and convenient instrument for multidimensional measurement in English. We aimed to take forward a Swedish version of the Dyspnoea-12. Methods The linguistic validation of the Dyspnoea-12 was performed (Mapi Language Services, Lyon, France). The standardised procedure involved forward and backward translations by three independent certified translators and revisions after feedback from an in-country linguistic consultant, the developer and three native physicians. The understanding and convenience of the translated version was evaluated using qualitative in-depth interviews with five patients with dyspnoea. Results A Swedish version of the Dyspnoea-12 was elaborated and evaluated carefully according to international guidelines. The Swedish version, ‘Dyspné−12’, has the same layout as the original version, including 12 items distributed on seven physical and five affective items. The Dyspnoea-12 is copyrighted by the developer but can be used free of charge after permission for non-industry-funded research. Conclusion A Swedish version of the Dyspnoea-12 is now available for clinical validation and multidimensional measurement across diseases and settings with the aim of improved evaluation and management of dyspnoea. PMID:28592574
Factor Structure, Factorial Invariance, and Validity of the Multidimensional Shame-Related Response Inventory-21 (MSRI-21)

PubMed Central

Garcia, Antonio F.; Acosta, Melina; Pirani, Saifa; Edwards, Daniel; Osman, Augustine

2017-01-01

We describe 2 studies designed to evaluate scores on the Multidimensional Shame-related Response Inventory-21 (MSRI-21), a recently developed instrument that measures affective and behavioral responses to shame. The inventory assesses shame-related responses in 3 categories: negative self-evaluation, fear of social consequences, and maladaptive behavior tendency. For Study 1, (N = 743) undergraduates completed the MSRI-21. Confirmatory factor analysis supported the validity of the MSRI-21 3-factor structure. Latent variable modeling of coefficient-α provided strong evidence for the internal consistency of scores on each scale. In Study 2, (N = 540) undergraduates completed the instrument along with 5 concurrent measures chosen for clinical significance. Achievement of factorial invariance supported the use of MSRI-21 scale scores to make valid mean comparisons across gender. In addition, MSRI-21 scale scores were associated as expected with scores on measures of self-harm, suicide, and other risk factors. Taken together, results of 2 studies support the internal consistency reliability, factorial validity, factorial invariance, and convergent validity of scores on the MSRI-21. Further work is needed to assess the temporal stability of the MSRI-21 scale scores, invariance across clinical status and other groupings, item-level measurement properties, and viability in highly symptomatic samples. PMID:28182490
Development of multi-dimensional action checklist for promoting new approaches in participatory occupational safety and health in small and medium-sized enterprises.

PubMed

Nishikido, Noriko; Yuasa, Akiko; Motoki, Chiharu; Tanaka, Mika; Arai, Sumiko; Matsuda, Kazumi; Ikeda, Tomoko; Iijima, Miyoko; Hirata, Mamoru; Hojoh, Minoru; Tsutaki, Miho; Ito, Akiyoshi; Maeda, Kazutoshi; Miyoshi, Yukari; Mitsuhashi, Hiroyuki; Fukuda, Eiko; Kawakami, Yuko

2006-01-01

To meet diversified health needs in workplaces, especially in developed countries, occupational safety and health (OSH) activities should be extended. The objective of this study is to develop a new multi-dimensional action checklist that can support employers and workers in understanding a wide range of OSH activities and to promote participation in OSH in small and medium-sized enterprises (SMEs). The general structure of and specific items in the new action checklist were discussed in a focus group meeting with OSH specialists based upon the results of a literature review and our previous interviews with company employers and workers. To assure practicality and validity, several sessions were held to elicit the opinions of company members and, as a result, modifications were made. The new multi-dimensional action checklist was finally formulated consisting of 6 core areas, 9 technical areas, and 61 essential items. Each item was linked to a suitable section in the information guidebook that we developed concomitantly with the action checklist. Combined usage of the action checklist with the information guidebook would provide easily comprehended information and practical support. Intervention studies using this newly developed action checklist will clarify the effectiveness of the new approach to OSH in SMEs.
MM-MDS: a multidimensional scaling database with similarity ratings for 240 object categories from the Massive Memory picture database.

PubMed

Hout, Michael C; Goldinger, Stephen D; Brady, Kyle J

2014-01-01

Cognitive theories in visual attention and perception, categorization, and memory often critically rely on concepts of similarity among objects, and empirically require measures of "sameness" among their stimuli. For instance, a researcher may require similarity estimates among multiple exemplars of a target category in visual search, or targets and lures in recognition memory. Quantifying similarity, however, is challenging when everyday items are the desired stimulus set, particularly when researchers require several different pictures from the same category. In this article, we document a new multidimensional scaling database with similarity ratings for 240 categories, each containing color photographs of 16-17 exemplar objects. We collected similarity ratings using the spatial arrangement method. Reports include: the multidimensional scaling solutions for each category, up to five dimensions, stress and fit measures, coordinate locations for each stimulus, and two new classifications. For each picture, we categorized the item's prototypicality, indexed by its proximity to other items in the space. We also classified pairs of images along a continuum of similarity, by assessing the overall arrangement of each MDS space. These similarity ratings will be useful to any researcher that wishes to control the similarity of experimental stimuli according to an objective quantification of "sameness."
Measuring Global Physical Health in Children with Cerebral Palsy: Illustration of a Multidimensional Bi-factor Model and Computerized Adaptive Testing

PubMed Central

Haley, Stephen M.; Ni, Pengsheng; Dumas, Helene M.; Fragala-Pinkham, Maria A.; Hambleton, Ronald K.; Montpetit, Kathleen; Bilodeau, Nathalie; Gorton, George E.; Watson, Kyle; Tucker, Carole A

2009-01-01

Purpose The purpose of this study was to apply a bi-factor model for the determination of test dimensionality and a multidimensional CAT using computer simulations of real data for the assessment of a new global physical health measure for children with cerebral palsy (CP). Methods Parent respondents of 306 children with cerebral palsy were recruited from four pediatric rehabilitation hospitals and outpatient clinics. We compared confirmatory factor analysis results across four models: (1) one-factor unidimensional; (2) two-factor multidimensional (MIRT); (3) bi-factor MIRT with fixed slopes; and (4) bi-factor MIRT with varied slopes. We tested whether the general and content (fatigue and pain) person score estimates could discriminate across severity and types of CP, and whether score estimates from a simulated CAT were similar to estimates based on the total item bank, and whether they correlated as expected with external measures. Results Confirmatory factor analysis suggested separate pain and fatigue sub-factors; all 37 items were retained in the analyses. From the bi-factor MIRT model with fixed slopes, the full item bank scores discriminated across levels of severity and types of CP, and compared favorably to external instruments. CAT scores based on 10- and 15-item versions accurately captured the global physical health scores. Conclusions The bi-factor MIRT CAT application, especially the 10- and 15-item version, yielded accurate global physical health scores that discriminated across known severity groups and types of CP, and correlated as expected with concurrent measures. The CATs have potential for collecting complex data on the physical health of children with CP in an efficient manner. PMID:19221892
The Swiss Health Literacy Survey: development and psychometric properties of a multidimensional instrument to assess competencies for health

PubMed Central

Wang, Jen; Thombs, Brett D.; Schmid, Margareta R.

2012-01-01

Abstract Background Growing recognition of the role of citizens and patients in health and health care has placed a spotlight on health literacy and patient education. Objective To identify specific competencies for health in definitions of health literacy and patient‐centred concepts and empirically test their dimensionality in the general population. Methods A thorough review of the literature on health literacy, self‐management, patient empowerment, patient education and shared decision making revealed considerable conceptual overlap as competencies for health and identified a corpus of 30 generic competencies for health. A questionnaire containing 127 items covering the 30 competencies was fielded as a telephone interview in German, French and Italian among 1255 respondents randomly selected from the resident population in Switzerland. Findings Analyses with the software MPlus to model items with mixed response categories showed that the items do not load onto a single factor. Multifactorial models with good fit could be erected for each of five dimensions defined a priori and their corresponding competencies: information and knowledge (four competencies, 17 items), general cognitive skills (four competencies, 17 items), social roles (two competencies, seven items), medical management (four competencies, 27 items) and healthy lifestyle (two competencies, six items). Multiple indicators and multiple causes models identified problematic differential item functioning for only six items belonging to two competencies. Conclusions The psychometric analyses of this instrument support broader conceptualization of health literacy not as a single competence but rather as a package of competencies for health. PMID:22390287
Measuring quality of life in patients with stress urinary incontinence: is the ICIQ-UI-SF adequate?

PubMed

Kurzawa, Zuzanna; Sutherland, Jason M; Crump, Trafford; Liu, Guiping

2018-05-08

The International Consultation on Incontinence Questionnaire Short Form (ICIQ-UI-SF) is a widely used four-item patient-reported outcome (PRO) measure. Evaluations of this instrument are limited, which limits users' confidence in the instrument. This study conducts a comprehensive evaluation of the ICIQ-UI-SF on a sample of urological surgery patients in Canada. One hundred and seventy-seven surgical patients with stress urinary incontinence completed the ICIQ-UI-SF pre-operatively. Methods drawing from confirmatory factor analysis (CFA), measures of reliability, item response theory (IRT), and differential item functioning were applied. Ceiling effects were examined. Ceiling effects were identified. In the CFA, the factor loadings of items one and two differed significantly (p < 0.001) from item three, indicating possible multidimensionality. The first two items reflect symptom severity, not quality of life. Reliability was moderate as measured by Cronbach's alpha (0.63) and McDonald's coefficient (0.65). The IRT found the instrument does not discriminate between individuals with low incontinence-related quality of life. Due to low/moderate reliability, the ICIQ-UI-SF can be used as a complement to other data or used to report aggregated surgical outcomes among surgical patients. If the primary objective is to measure quality of life, other PROs should be considered.
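For readers unfamiliar with the reliability coefficient reported here, Cronbach's alpha can be computed from a respondents-by-items score table as k/(k-1) × (1 − sum of item variances / variance of total scores). The sketch below is a minimal illustration using only the Python standard library; the function name and the toy data are hypothetical and are not taken from the study.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)

# Hypothetical data: three respondents, three perfectly consistent items.
consistent = [[1, 1, 1], [2, 2, 2], [3, 3, 3]]
print(cronbach_alpha(consistent))
```

Perfectly consistent items give an alpha of 1; noisier item sets give lower values, such as the 0.63 reported above.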
Rasch analysis of the Chedoke-McMaster Attitudes towards Children with Handicaps scale.

PubMed

Armstrong, Megan; Morris, Christopher; Tarrant, Mark; Abraham, Charles; Horton, Mike C

2017-02-01

Aim To assess whether the Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) 36-item total scale and subscales fit the unidimensional Rasch model. Method The CATCH was administered to 1881 children, aged 7-16 years in a cross-sectional survey. Data were used from a random sample of 416 for the initial Rasch analysis. The analysis was performed on the 36-item scale and then separately for each subscale. The analysis explored fit to the Rasch model in terms of overall scale fit, individual item fit, item response categories, and unidimensionality. Item bias for gender and school level was also assessed. Revised scales were then tested on an independent second random sample of 415 children. Results Analyses indicated that the 36-item overall scale was not unidimensional and did not fit the Rasch model. Two scales of affective attitudes and behavioural intention were retained after four items were removed from each due to misfit to the Rasch model. Additionally, the scaling was improved when the two most negative response categories were aggregated. There was no item bias by gender or school level on the revised scales. Items assessing cognitive attitudes did not fit the Rasch model and had low internal consistency as a scale. Conclusion Affective attitudes and behavioural intention CATCH sub-scales should be treated separately. Caution should be exercised when using the cognitive subscale. Implications for Rehabilitation The 36-item Chedoke-McMaster Attitudes towards Children with Handicaps (CATCH) scale as a whole did not fit the Rasch model; thus indicating a multi-dimensional scale. Researchers should use two revised eight-item subscales of affective attitudes and behavioural intentions when exploring interventions aiming to improve children's attitudes towards disabled people or factors associated with those attitudes. Researchers should use the cognitive subscale with caution, as it did not create a unidimensional and internally consistent scale. 
Therefore, conclusions drawn from this scale may not accurately reflect children's attitudes.

Development of a fragile X syndrome (FXS) knowledge scale: towards a modified multidimensional measure of informed choice for FXS population carrier screening.

PubMed

Ames, Alice G; Jaques, Alice; Ukoumunne, Obioha C; Archibald, Alison D; Duncan, Rony E; Emery, Jon; Metcalfe, Sylvia A

2015-02-01

Genetic carrier screening is increasingly possible for many conditions, but it is important to ensure decisions are informed. The multidimensional measure of informed choice (MMIC) is a quantitative instrument developed to evaluate informed choice in prenatal screening for Down syndrome, measuring knowledge, attitudes and uptake. To apply the MMIC in other screening settings, the knowledge scale must be modified. To develop and validate a modified MMIC knowledge scale for use with women undergoing carrier screening for fragile X syndrome (FXS). Responses to MMIC items were collected through questionnaires as part of a FXS carrier screening pilot study in a preconception setting in Melbourne, Australia. Ten knowledge scale items were developed using a modified Delphi technique. Cronbach's alpha and factor analysis were used to validate the new FXS knowledge scale. We summarized the knowledge, attitudes and informed choice status based on the modified MMIC. Two hundred and eighty-five women were recruited, 241 eligible questionnaires were complete for analysis. The FXS knowledge scale items measured one salient construct and were internally consistent (alpha = 0.70). 71% (172/241) of participants were classified as having good knowledge, 70% (169/241) had positive attitudes and 27% (65/241) made an informed choice to accept or decline screening. We present the development of a knowledge scale as part of a MMIC to evaluate informed choice in population carrier screening for FXS. This can be used as a template by other researchers to develop knowledge scales for other conditions for use in the MMIC. © 2012 John Wiley & Sons Ltd.
Going to the source: creating a citizenship outcome measure by community-based participatory research methods.

PubMed

Rowe, Michael; Clayton, Ashley; Benedict, Patricia; Bellamy, Chyrell; Antunes, Kimberly; Miller, Rebecca; Pelletier, Jean-Francois; Stern, Erica; O'Connell, Maria J

2012-01-01

This study used participatory methods and concept-mapping techniques to develop a greater understanding of the construct of citizenship and an instrument to assess the degree to which individuals, particularly those with psychiatric disorders, perceive themselves to be citizens in a multifaceted sense (that is, not in a simply legal sense). Participants were persons with recent experience of receiving public mental health services, having criminal justice charges, having a serious general medical illness, or having more than one of these "life disruptions," along with persons who had not experienced any of these disruptions. Community-based participatory methods, including a co-researcher team of persons with experiences of mental illness and other life disruptions, were employed. Procedures included conducting focus groups with each life disruption (or no disruption) group to generate statements about the meaning of citizenship (N = 75 participants); reducing the generated statements to 100 items and holding concept-mapping sessions with participants from the five stakeholder groups (N = 66 participants) to categorize and rate each item in terms of importance and access; analyzing concept-mapping data to produce citizenship domains; and developing a pilot instrument of citizenship. Multidimensional scaling and hierarchical cluster analysis revealed seven primary domains of citizenship: personal responsibilities, government and infrastructure, caring for self and others, civil rights, legal rights, choices, and world stewardship. Forty-six items were identified for inclusion in the citizenship measure. Citizenship is a multidimensional construct encompassing the degree to which individuals with different life experiences perceive inclusion or involvement across a variety of activities and concepts.
Development of the insight scale for affective disorders (ISAD): modification from the scale to assess unawareness of mental disorder.

PubMed

Olaya, Beatriz; Marsà, Ferran; Ochoa, Susana; Balanzá-Martínez, Vicent; Barbeito, Sara; García-Portilla, Mari Paz; González-Pinto, Ana; Lobo, Antonio; López-Antón, Raúl; Usall, Judith; Arranz, Belén; Haro, Josep Maria

2012-12-15

Research on insight in patients with mood disorders has grown in recent years. Several instruments to assess insight have been used, but most of them have been specifically designed for psychosis and may not appear relevant to mood disorders. The aim of the present study is to develop a short, multidimensional, reliable and valid scale to measure insight in patients with mood disorders, based on the Amador's Scale to Assess Unawareness of Mental Disorders (SUMD). A Delphi method was used to facilitate expert participation and ensure face and content validity. The SUMD structure and items were used as a reference in the scale development. A new scale with 17 items was obtained. Internal consistency, test-retest and inter-rater reliability and validity were studied in a sample of 76 outpatients with a DSM-IV diagnosis of major depression or bipolar disorder (type I or II). Internal consistency of the general items was moderate, and high for the symptoms awareness subscale. Scores on ISAD correlated with other measures of insight and with some clinical measures, thus supporting its validity. The majority of the sample came from community services. Future studies should use inpatients or patients with severe symptoms to broaden the range of responses. Moreover, the rating of insight and other measures by the same clinician might introduce a methodological bias. The ISAD, with a multidimensional approach, appears as a short, reliable and valid measure of insight in mood disorders. Expert consensus ensures its face and content validity. Copyright © 2012 Elsevier B.V. All rights reserved.
Application of a Multidimensional Nested Logit Model to Multiple-Choice Test Items

ERIC Educational Resources Information Center

Bolt, Daniel M.; Wollack, James A.; Suh, Youngsuk

2012-01-01

Nested logit models have been presented as an alternative to multinomial logistic models for multiple-choice test items (Suh and Bolt in "Psychometrika" 75:454-473, 2010) and possess a mathematical structure that naturally lends itself to evaluating the incremental information provided by attending to distractor selection in scoring. One potential…
Development and validation of a patient-reported outcome measure for stroke patients.

PubMed

Luo, Yanhong; Yang, Jie; Zhang, Yanbo

2015-05-08

Family support and patient satisfaction with treatment are crucial for aiding in the recovery from stroke. However, current validated stroke-specific questionnaires may not adequately capture the impact of these two variables on patients undergoing clinical trials of new drugs. Therefore, the aim of this study was to develop and evaluate a new stroke patient-reported outcome measure (Stroke-PROM) instrument for capturing more comprehensive effects of stroke on patients participating in clinical trials of new drugs. A conceptual framework and a pool of items for the preliminary Stroke-PROM were generated by consulting the relevant literature and other questionnaires created in China and other countries, and interviewing 20 patients and 4 experts to ensure that all germane parameters were included. During the first item-selection phase, classical test theory and item response theory were applied to an initial scale completed by 133 patients with stroke. During the item-revaluation phase, classical test theory and item response theory were used again, this time with 475 patients with stroke and 104 healthy participants. During the scale assessment phase, confirmatory factor analysis was applied to the final scale of the Stroke-PROM using the same study population as in the second item-selection phase. Reliability, validity, responsiveness and feasibility of the final scale were tested. The final scale of Stroke-PROM contained 46 items describing four domains (physiology, psychology, society and treatment). These four domains were subdivided into 10 subdomains. Cronbach's α coefficients for the four domains ranged from 0.861 to 0.908. Confirmatory factor analysis supported the validity of the final scale, and the model fit index satisfied the criterion. Differences in the Stroke-PROM mean scores were significant between patients with stroke and healthy participants in nine subdomains (P < 0.001), indicating that the scale showed good responsiveness. 
The Stroke-PROM is a patient-reported outcome multidimensional questionnaire developed especially for clinical trials of new drugs and is focused on issues of family support and patient satisfaction with treatment. Extensive data analyses supported the validity, reliability and responsiveness of the Stroke-PROM.
A Multidimensional Tool Based on the eHealth Literacy Framework: Development and Initial Validity Testing of the eHealth Literacy Questionnaire (eHLQ)

PubMed Central

Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H

2018-01-01

Background For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user’s needs, resources, and competence. Objective The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Methods Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). Results CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior predictive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: −63.7 to 133.8).
The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. Conclusions The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people’s interaction with digital health services. PMID:29434011
The Test Anxiety Inventory for Children and Adolescents (TAICA): Examination of the Psychometric Properties of a New Multidimensional Measure of Test Anxiety among Elementary and Secondary School Students

ERIC Educational Resources Information Center

Lowe, Patricia A.; Lee, Steven W.; Witteborg, Kristin M.; Prichard, Keri W.; Luhr, Megan E.; Cullinan, Christopher M.; Mildren, Bethany A.; Raad, Jennifer M.; Cornelius, Rebecca A.; Janik, Melissa

2008-01-01

The Test Anxiety Inventory for Children and Adolescents (TAICA) is a new multidimensional measure used to assess test anxiety in elementary and secondary school students. The TAICA is a 45-item self-report measure consisting of a Total Test Anxiety scale, four debilitating test anxiety subscales (Cognitive Obstruction/Inattention, Physiological…
Testing the multidimensionality of the inventory of school motivation in a Dutch student sample.

PubMed

Korpershoek, Hanke; Xu, Kun; Mok, Magdalena Mo Ching; McInerney, Dennis M; van der Werf, Greetje

2015-01-01

A factor analytic and a Rasch measurement approach were applied to evaluate the multidimensional nature of the school motivation construct among more than 7,000 Dutch secondary school students. The Inventory of School Motivation (McInerney and Ali, 2006) was used, which intends to measure four motivation dimensions (mastery, performance, social, and extrinsic motivation), each comprising two first-order factors. One unidimensional model and three multidimensional models (4-factor, 8-factor, higher order) were fit to the data. Results of both approaches showed that the multidimensional models validly represented the school motivation among Dutch secondary school pupils, whereas model fit of the unidimensional model was poor. The differences in model fit between the three multidimensional models were small, although a different model was favoured by the two approaches. The need for improvement of some of the items and the need to increase measurement precision of several first-order factors are discussed.
Personality Measurement with Mentally Retarded and Other Sub-Cultural Adults. Final Report.

ERIC Educational Resources Information Center

Eber, Herbert W.

Two 160-item experimental forms of multidimensional personality test to assess vocational potential of clients of limited literacy (third grade reading level) were developed and administered to clients at rehabilitation centers and at centers for the retarded. Using the 16 Personality Factors Test as a model, items were constructed to do the…
Differences in autonomic physiological responses between good and poor inductive reasoners.

PubMed

Melis, C; van Boxtel, A

2001-11-01

We investigated individual- and task-related differences in autonomic physiological responses induced by time limited figural and verbal inductive reasoning tasks. In a group of 52 participants, the percentage of correctly responded task items was evaluated together with nine different autonomic physiological response measures and respiration rate (RR). Weighted multidimensional scaling analyses of the physiological responses revealed three underlying dimensions, primarily characterized by RR, parasympathetic, and sympathetic activity. RR and sympathetic activity appeared to be relatively more important response dimensions for poor reasoners, whereas parasympathetic responsivity was relatively more important for good reasoners. These results suggest that poor reasoners showed higher levels of cognitive processing intensity than good reasoners. Furthermore, for the good reasoners, the dimension of sympathetic activity was relatively more important during the figural than during the verbal reasoning task, which was explained in terms of hemispheric lateralization in autonomic function.
Psychometric properties of the Polish version of the Job-related Affective Well-being Scale.

PubMed

Basińska, Beata A; Gruszczyńska, Ewa; Schaufeli, Wilmar B

2014-12-01

The aim of this study was to verify psychometric properties of the Polish version of the Job-related Affective Well-being Scale (JAWS). Specifically, the theoretical 4-factor structure (based on the dimensions of pleasure and arousal) and reliability of the original 20-item JAWS (van Katwyk et al., 2000) and the shortened 12-item (Schaufeli and Van Rhenen, 2006) versions were tested. Two independent samples were analyzed (police officers, N = 395, and police recruits, N = 202). The Polish version of the original, 20-item, JAWS was used to measure job-related affective states across the past month (van Katwyk et al., 2000). This version of JAWS includes 2 dimensions, valence and arousal, which allow the assessment of 4 categories of emotions: low-arousal positive emotions, high-arousal positive emotions, low-arousal negative emotions and high-arousal negative emotions. The results of multidimensional scaling analysis showed that the theoretical circumplex model of emotions underlying JAWS was satisfactorily reproduced. The hypothesized 4-factor structure of the Polish version of JAWS was also confirmed. The 12-item version had better fit with the data than the original, 20-item, version, but the best fit was obtained for the even shorter, 8-item version. This version emerged from a multidimensional scaling of the 12-item version. Reliabilities of the 20- and 12-item versions were good, with lower values for the 8-item JAWS version. The findings confirmed satisfactory psychometric properties of both Polish versions of the Job-related Affective Well-being Scale. Thus, when both psychometric properties and relevance for cross-cultural comparisons are considered, the 12-item JAWS is recommended as a version of choice.
Subjective health literacy: Development of a brief instrument for school-aged children.

PubMed

Paakkari, Olli; Torppa, Minna; Kannas, Lasse; Paakkari, Leena

2016-12-01

The present paper focuses on the measurement of health literacy (HL), which is an important determinant of health and health behaviours. HL starts to develop in childhood and adolescence; hence, there is a need for instruments to monitor HL among younger age groups. These instruments are still rare. The aim of the project reported here was, therefore, to develop a brief, multidimensional, theory-based instrument to measure subjective HL among school-aged children. The development of the instrument covered four phases: item generation based on a conceptual framework; a pilot study ( n = 405); test-retest ( n = 117); and construction of the instrument ( n = 3853). All the samples were taken from Finnish 7th and 9th graders. Initially, 65 items were generated, of which 32 items were selected for the pilot study. After item reduction, the instrument contained 16 items. The test-retest phase produced estimates of stability. In the final phase a 10-item instrument was constructed, referred to as Health Literacy for School-Aged Children (HLSAC). The instrument exhibited a high Cronbach alpha (0.93), and included two items from each of the five predetermined theoretical components (theoretical knowledge, practical knowledge, critical thinking, self-awareness, citizenship). The iterative and validity-driven development process made it possible to construct a brief multidimensional HLSAC instrument. Such instruments are suitable for large-scale studies, and for use with children and adolescents. Validation will require further testing for use in other countries.
A cross-national study on the multidimensional characteristics of the five-item psychological demands scale of the Job Content Questionnaire.

PubMed

Choi, BongKyoo; Kawakami, Norito; Chang, SeiJin; Koh, SangBaek; Bjorner, Jakob; Punnett, Laura; Karasek, Robert

2008-01-01

The five-item psychological demands scale of the Job Content Questionnaire (JCQ) has been assumed to be one-dimensional in practice. To examine whether the scale has sufficient internal consistency and external validity to be treated as a single scale, using the cross-national JCQ datasets from the United States, Korea, and Japan. Exploratory factor analyses with 22 JCQ items, confirmatory factor analyses with the five psychological demands items, and correlations analyses with mental health indexes. Generally, exploratory factor analyses displayed the predicted demand/control/support structure with three and four factors extracted. However, at more detailed levels of exploratory and confirmatory factor analyses, the demands scale showed clear evidence of multi-factor structure. The correlations of items and subscales of the demands scale with mental health indexes were similar to those of the full scale in the Korean and Japanese datasets, but not in the U.S. data. In 4 out of 16 sub-samples of the U.S. data, several significant correlations of the components of the demands scale with job dissatisfaction and life dissatisfaction were obscured by the full scale. The multidimensionality of the psychological demands scale should be considered in psychometric analysis and interpretation, occupational epidemiologic studies, and future scale extension.
Testlet-Based Multidimensional Adaptive Testing.

PubMed

Frey, Andreas; Seitz, Nicki-Nils; Brandt, Steffen

2016-01-01

Multidimensional adaptive testing (MAT) is a highly efficient method for the simultaneous measurement of several latent traits. Currently, no psychometrically sound approach is available for the use of MAT in testlet-based tests. Testlets are sets of items sharing a common stimulus such as a graph or a text. They are frequently used in large operational testing programs like TOEFL, PISA, PIRLS, or NAEP. To make MAT accessible for such testing programs, we present a novel combination of MAT with a multidimensional generalization of the random effects testlet model (MAT-MTIRT). MAT-MTIRT compared to non-adaptive testing is examined for several combinations of testlet effect variances (0.0, 0.5, 1.0, and 1.5) and testlet sizes (3, 6, and 9 items) with a simulation study considering three ability dimensions with simple loading structure. MAT-MTIRT outperformed non-adaptive testing regarding the measurement precision of the ability estimates. Further, the measurement precision decreased when testlet effect variances and testlet sizes increased. The suggested combination of MAT with the MTIRT model therefore provides a solution to the substantial problems of testlet-based tests while keeping the length of the test within an acceptable range.
The Four Faces of Competition: The Development of the Multidimensional Competitive Orientation Inventory

PubMed Central

Orosz, Gábor; Tóth-Király, István; Büki, Noémi; Ivaskevics, Krisztián; Bőthe, Beáta; Fülöp, Márta

2018-01-01

To date, no short scale exists with established factor structure that can assess individual differences in competition. The aim of the present study was to uncover and operationalize the facets of competitive orientations with theoretical underpinning and strong psychometric properties. A total of 2676 respondents were recruited for four studies. The items were constructed based on qualitative research in different cultural contexts. A combined method of exploratory structural equation modeling (ESEM) and confirmatory factor analysis (CFA) was employed. ESEM resulted in a four-factor structure of the competitive orientations and this structure was supported by a series of CFAs on different comprehensive samples. The Multidimensional Competitive Orientation Inventory (MCOI) included 12 items and four factors: hypercompetitive orientation, self-developmental competitive orientation, anxiety-driven competition avoidance, and lack of interest toward competition. Strong gender invariance was established. The four facets of competition have differentiated relationship patterns with adaptive and maladaptive personality and motivational constructs. The MCOI can assess the adaptive and maladaptive facets of competitive orientations with a short, reliable, valid and theoretically underlined multidimensional measure. PMID:29872415
Validation of the French version of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in cats.

PubMed

Steagall, Paulo V M; Monteiro, Beatriz P; Lavoie, Anne-Marie; Frank, Diane; Troncy, Eric; Luna, Stelio P L; Brondani, Juliana T

2017-01-01

The aim of this study was to validate the French version of the UNESP-Botucatu multidimensional composite pain scale (MCPS-Fr) to assess postoperative pain in cats. Two veterinarians and one DVM student identified three domains of behavior based on video analyses: "psychomotor change", "protection of the painful area" and "physiological variables". Internal consistency was excellent (Cronbach's alpha coefficient of 0.94, 0.90 and 0.61, respectively). Criterion validity was good to very good when evaluations from the three observers were compared with a "gold standard". Inter- and intra-rater reliability for each scale item were good to very good. The optimal cut-off point identified with a ROC curve was > 7 (scale range 0-30 points), with a sensitivity of 97.8% and specificity of 99.1%. The MCPS-Fr is a valid, reliable and responsive instrument for assessing acute pain in cats undergoing ovariohysterectomy. (Translated by Dr. Beatriz Monteiro).
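As a rough illustration of how a cut-off such as "> 7" translates into sensitivity and specificity, the sketch below classifies a score above the cut-off as "in pain" and compares that with a reference label. The function name and the example scores are hypothetical and are not data from the study.

```python
def sensitivity_specificity(scores, in_pain, cutoff):
    """Treat score > cutoff as a positive result and compare with reference labels."""
    pairs = list(zip(scores, in_pain))
    tp = sum(1 for s, y in pairs if s > cutoff and y)        # true positives
    fn = sum(1 for s, y in pairs if s <= cutoff and y)       # false negatives
    tn = sum(1 for s, y in pairs if s <= cutoff and not y)   # true negatives
    fp = sum(1 for s, y in pairs if s > cutoff and not y)    # false positives
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scores on a 0-30 scale with made-up reference labels.
scores = [2, 5, 7, 8, 12, 20, 3, 9]
in_pain = [False, False, False, True, True, True, False, False]
sens, spec = sensitivity_specificity(scores, in_pain, cutoff=7)
```

A ROC analysis repeats this calculation for every candidate cut-off and picks the one with the best trade-off between the two quantities.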
An examination of the psychometric structure of the Multidimensional Pain Inventory in temporomandibular disorder patients: a confirmatory factor analysis

PubMed Central

Andreu, Yolanda; Galdon, Maria J; Durá, Estrella; Ferrando, Maite; Pascual, Juan; Turk, Dennis C; Jiménez, Yolanda; Poveda, Rafael

2006-01-01

Background This paper seeks to analyse the psychometric and structural properties of the Multidimensional Pain Inventory (MPI) in a sample of temporomandibular disorder patients. Methods The internal consistency of the scales was obtained. Confirmatory Factor Analysis was carried out to test the MPI structure section by section in a sample of 114 temporomandibular disorder patients. Results Nearly all scales obtained good reliability indexes. The original structure could not be totally confirmed. However, with a few adjustments we obtained a satisfactory structural model of the MPI which was slightly different from the original: certain items and the Self control scale were eliminated; in two cases, two original scales were grouped in one factor, Solicitous and Distracting responses on the one hand, and Social activities and Away from home activities, on the other. Conclusion The MPI has been demonstrated to be a reliable tool for the assessment of pain in temporomandibular disorder patients. Some divergences to be taken into account have been clarified. PMID:17169143
Factors affecting construction performance: exploratory factor analysis

NASA Astrophysics Data System (ADS)

Soewin, E.; Chinda, T.

2018-04-01

The present work attempts to develop a multidimensional performance evaluation framework for a construction company by considering all relevant measures of performance. Based on previous studies, this study hypothesizes nine key factors, with a total of 57 associated items. The hypothesized factors, with their associated items, are then used to develop a questionnaire survey to gather data. Exploratory factor analysis (EFA) was applied to the collected data, which gave rise to 10 factors with 57 items affecting construction performance. The findings further reveal that the items constitute ten key performance factors (KPIs), namely: 1) Time, 2) Cost, 3) Quality, 4) Safety & Health, 5) Internal Stakeholder, 6) External Stakeholder, 7) Client Satisfaction, 8) Financial Performance, 9) Environment, and 10) Information, Technology & Innovation. The analysis helps to develop a multidimensional performance evaluation framework for effective measurement of construction performance. The 10 key performance factors can be broadly categorized into economic, social, environmental, and technology aspects. It is important to understand a multidimensional performance evaluation framework that includes all key factors affecting the construction performance of a company, so that management can plan and implement an effective performance development plan that matches the mission and vision of the company.
Emotional vitality in caregivers: application of Rasch Measurement Theory with secondary data to development and test a new measure.

PubMed

Barbic, Skye P; Bartlett, Susan J; Mayo, Nancy E

2015-07-01

To describe the practical steps in identifying items and evaluating scoring strategies for a new measure of emotional vitality in informal caregivers of individuals who have experienced a significant health event. The psychometric properties of responses to selected items from validated health-related quality of life and other psychosocial questionnaires administered four times over a one-year period were evaluated using Rasch Measurement Theory. Community. A total of 409 individuals providing informal care at home to older adults who had experienced a recent stroke. Rasch Measurement Theory was used to test the ordering of response option thresholds, fit, spread of the item locations, residual correlations, person separation index, and stability across time. Based on a theoretical framework developed in earlier work, we identified 22 candidate items from a pool of relevant psychosocial measures available. Of these, additional evaluation resulted in 19 items that could be used to assess the five core domains. The overall model fit was reasonable (χ² = 202.26, df = 117, p = 0.06), stable across time, with borderline evidence of multidimensionality (10%). Items and people covered a continuum ranging from -3.7 to +2.7 logits, reflecting coverage of the measurement continuum, with a person separation index of 0.85. Mean fit of caregivers was lower than expected (-1.31 ± 1.10 logits). Established methods from the Rasch Measurement Theory were applied to develop a prototype measure of emotional vitality that is acceptable, reliable, and can be used to obtain an interval level score for use in future research and clinical settings. © The Author(s) 2014.
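For readers unfamiliar with the logit scale used above, the following is a minimal sketch of the dichotomous Rasch model, in which the probability of endorsing an item depends only on the difference between the person location and the item location. The study itself examined ordered response thresholds, so this two-category form is a simplification for illustration only, and the function name is hypothetical.

```python
import math


def rasch_probability(theta, b):
    """P(endorse) under the dichotomous Rasch model.

    theta: person location in logits; b: item location in logits.
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


# A person located exactly at the item has a 50% chance of endorsing it;
# the chance rises as the person moves up the continuum.
p_equal = rasch_probability(0.0, 0.0)
p_high = rasch_probability(2.7, 0.0)
p_low = rasch_probability(-3.7, 0.0)
```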
A Multidimensional Tool Based on the eHealth Literacy Framework: Development and Initial Validity Testing of the eHealth Literacy Questionnaire (eHLQ).

PubMed

Kayser, Lars; Karnoe, Astrid; Furstrand, Dorthe; Batterham, Roy; Christensen, Karl Bang; Elsworth, Gerald; Osborne, Richard H

2018-02-12

For people to be able to access, understand, and benefit from the increasing digitalization of health services, it is critical that services are provided in a way that meets the user's needs, resources, and competence. The objective of the study was to develop a questionnaire that captures the 7-dimensional eHealth Literacy Framework (eHLF). Draft items were created in parallel in English and Danish. The items were generated from 450 statements collected during the conceptual development of eHLF. In all, 57 items (7 to 9 items per scale) were generated and adjusted after cognitive testing. Items were tested in 475 people recruited from settings in which the scale was intended to be used (community and health care settings) and including people with a range of chronic conditions. Measurement properties were assessed using approaches from item response theory (IRT) and classical test theory (CTT) such as confirmatory factor analysis (CFA) and reliability using composite scale reliability (CSR); potential bias due to age and sex was evaluated using differential item functioning (DIF). CFA confirmed the presence of the 7 a priori dimensions of eHLF. Following item analysis, a 35-item 7-scale questionnaire was constructed, covering (1) using technology to process health information (5 items, CSR=.84), (2) understanding of health concepts and language (5 items, CSR=.75), (3) ability to actively engage with digital services (5 items, CSR=.86), (4) feel safe and in control (5 items, CSR=.87), (5) motivated to engage with digital services (5 items, CSR=.84), (6) access to digital services that work (6 items, CSR=.77), and (7) digital services that suit individual needs (4 items, CSR=.85). A 7-factor CFA model, using small-variance priors for cross-loadings and residual correlations, had a satisfactory fit (posterior predictive P value: .27, 95% CI for the difference between the observed and replicated chi-square values: -63.7 to 133.8).
The CFA showed that all items loaded strongly on their respective factors. The IRT analysis showed that no items were found to have disordered thresholds. For most scales, discriminant validity was acceptable; however, 2 pairs of dimensions were highly correlated; dimensions 1 and 5 (r=.95), and dimensions 6 and 7 (r=.96). All dimensions were retained because of strong content differentiation and potential causal relationships between these dimensions. There is no evidence of DIF. The eHealth Literacy Questionnaire (eHLQ) is a multidimensional tool based on a well-defined a priori eHLF framework with robust properties. It has satisfactory evidence of construct validity and reliable measurement across a broad range of concepts (using both CTT and IRT traditions) in various groups. It is designed to be used to understand and evaluate people's interaction with digital health services. ©Lars Kayser, Astrid Karnoe, Dorthe Furstrand, Roy Batterham, Karl Bang Christensen, Gerald Elsworth, Richard H Osborne. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 12.02.2018.

Evaluating the Dimensionality of Self-Determination Theory's Relative Autonomy Continuum.

PubMed

Sheldon, Kennon M; Osin, Evgeny N; Gordeeva, Tamara O; Suchkov, Dmitry D; Sychev, Oleg A

2017-09-01

We conducted a theoretical and psychometric evaluation of self-determination theory's "relative autonomy continuum" (RAC), an important aspect of the theory whose validity has recently been questioned. We first derived a Comprehensive Relative Autonomy Index (C-RAI) containing six subscales and 24 items, by conducting a paired paraphrase content analysis of existing RAI measures. We administered the C-RAI to multiple U.S. and Russian samples, assessing motivation to attend class, study a major, and take responsibility. Item-level and scale-level multidimensional scaling analyses, confirmatory factor analyses, and simplex/circumplex modeling analyses reaffirmed the validity of the RAC, across multiple samples, stems, and studies. Validation analyses predicting subjective well-being and trait autonomy from the six separate subscales, in combination with various higher order composites (weighted and unweighted), showed that an aggregate unweighted RAI score provides the most unbiased and efficient indicator of the overall quality of motivation within the behavioral domain being assessed.
"Even 'Daily' is Not Enough": How Well Do We Measure Domestic Violence and Abuse?-A Think-Aloud Study of a Commonly Used Self-Report Scale.

PubMed

Evans, Maggie; Gregory, Alison; Feder, Gene; Howarth, Emma; Hegarty, Kelsey

2016-01-01

This article explores the challenges of providing a quantitative measure of domestic violence and abuse (DVA), illustrated by the Composite Abuse Scale, a validated multidimensional measure of frequency and severity of abuse, used worldwide for prevalence studies and intervention trials. Cognitive "think-aloud" and qualitative interviewing with a sample of women who had experienced DVA revealed a tendency toward underreporting their experience of abuse, particularly of coercive control, threatening behavior, restrictions to freedom, and sexual abuse. Underreporting was linked to inconsistency and uncertainty in item interpretation and response, fear of answering truthfully, and unwillingness to identify with certain forms of abuse. Suggestions are made for rewording or reconceptualizing items and the inclusion of a distress scale to measure the individual impact of abuse. The importance of including qualitative methods in questionnaire design and in the interpretation of quantitative findings is highlighted.
The development of scientific thinking in elementary school: a comprehensive inventory.

PubMed

Koerber, Susanne; Mayer, Daniela; Osterhaus, Christopher; Schwippert, Knut; Sodian, Beate

2015-01-01

The development of scientific thinking was assessed in 1,581 second, third, and fourth graders (8-, 9-, 10-year-olds) based on a conceptual model that posits developmental progression from naïve to more advanced conceptions. Using a 66-item scale, five components of scientific thinking were addressed, including experimental design, data interpretation, and understanding the nature of science. Unidimensional and multidimensional item response theory analyses supported the instrument's reliability and validity and suggested that the multiple components of scientific thinking form a unitary construct, independent of verbal or reasoning skills. A partial credit model gave evidence for a hierarchical developmental progression. Across each grade transition, advanced conceptions increased while naïve conceptions decreased. Independent effects of intelligence, schooling, and parental education on scientific thinking are discussed. © 2014 The Authors. Child Development © 2014 Society for Research in Child Development, Inc.
An R package for analyzing and modeling ranking data

PubMed Central

2013-01-01

Background In medical informatics, psychology, market research and many other fields, researchers often need to analyze and model ranking data. However, there is no statistical software that provides tools for the comprehensive analysis of ranking data. Here, we present pmr, an R package for analyzing and modeling ranking data with a bundle of tools. The pmr package enables descriptive statistics (mean rank, pairwise frequencies, and marginal matrix), Analytic Hierarchy Process models (with Saaty’s and Koczkodaj’s inconsistencies), probability models (Luce model, distance-based model, and rank-ordered logit model), and the visualization of ranking data with multidimensional preference analysis. Results Examples of the use of package pmr are given using a real ranking dataset from medical informatics, in which 566 Hong Kong physicians ranked the top five incentives (1: competitive pressures; 2: increased savings; 3: government regulation; 4: improved efficiency; 5: improved quality care; 6: patient demand; 7: financial incentives) to the computerization of clinical practice. The mean rank showed that item 4 is the most preferred item and item 3 is the least preferred item, and a significant difference was found between physicians’ preferences with respect to their monthly income. A multidimensional preference analysis identified two dimensions that explain 42% of the total variance. The first can be interpreted as the overall preference of the seven items (labeled as “internal/external”), and the second dimension can be interpreted as their overall variance (labeled as “push/pull factors”). Various statistical models were fitted, and the best were found to be weighted distance-based models with Spearman’s footrule distance. Conclusions In this paper, we presented the R package pmr, the first package for analyzing and modeling ranking data. The package provides insight to users through descriptive statistics of ranking data.
Users can also visualize ranking data by applying multidimensional preference analysis. Various probability models for ranking data are also included, allowing users to choose the one most suitable to their specific situation. PMID:23672645
An R package for analyzing and modeling ranking data.

PubMed

Lee, Paul H; Yu, Philip L H

2013-05-14

In medical informatics, psychology, market research and many other fields, researchers often need to analyze and model ranking data. However, there is no statistical software that provides tools for the comprehensive analysis of ranking data. Here, we present pmr, an R package for analyzing and modeling ranking data with a bundle of tools. The pmr package enables descriptive statistics (mean rank, pairwise frequencies, and marginal matrix), Analytic Hierarchy Process models (with Saaty's and Koczkodaj's inconsistencies), probability models (Luce model, distance-based model, and rank-ordered logit model), and the visualization of ranking data with multidimensional preference analysis. Examples of the use of package pmr are given using a real ranking dataset from medical informatics, in which 566 Hong Kong physicians ranked the top five incentives (1: competitive pressures; 2: increased savings; 3: government regulation; 4: improved efficiency; 5: improved quality care; 6: patient demand; 7: financial incentives) to the computerization of clinical practice. The mean rank showed that item 4 is the most preferred item and item 3 is the least preferred item, and a significant difference was found between physicians' preferences with respect to their monthly income. A multidimensional preference analysis identified two dimensions that explain 42% of the total variance. The first can be interpreted as the overall preference of the seven items (labeled as "internal/external"), and the second dimension can be interpreted as their overall variance (labeled as "push/pull factors"). Various statistical models were fitted, and the best were found to be weighted distance-based models with Spearman's footrule distance. In this paper, we presented the R package pmr, the first package for analyzing and modeling ranking data. The package provides insight to users through descriptive statistics of ranking data.
Users can also visualize ranking data by applying multidimensional preference analysis. Various probability models for ranking data are also included, allowing users to choose the one most suitable to their specific situation.
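Two of the quantities named in this abstract, the mean rank of each item and Spearman's footrule distance between two rankings, are simple enough to sketch directly. The code below is written in Python for illustration and does not use or reproduce the pmr package's API; the function names and the toy rankings are hypothetical, and the footrule shown is the plain unweighted version rather than the weighted variant the authors found to fit best.

```python
def mean_ranks(rankings):
    """Average rank given to each item; rankings[j][i] is the rank judge j gave item i."""
    n_judges = len(rankings)
    n_items = len(rankings[0])
    return [sum(r[i] for r in rankings) / n_judges for i in range(n_items)]


def footrule_distance(r1, r2):
    """Spearman's footrule: sum of absolute rank differences across items."""
    return sum(abs(a - b) for a, b in zip(r1, r2))


# Three hypothetical judges ranking three items (1 = most preferred).
rankings = [[1, 2, 3], [2, 1, 3], [1, 3, 2]]
averages = mean_ranks(rankings)
distance = footrule_distance([1, 2, 3], [3, 2, 1])
```

The item with the smallest mean rank is the most preferred overall, which is how "item 4 is the most preferred item" would be read off such a summary.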
Development of the life impact burn recovery evaluation (LIBRE) profile: assessing burn survivors' social participation.

PubMed

Kazis, Lewis E; Marino, Molly; Ni, Pengsheng; Soley Bori, Marina; Amaya, Flor; Dore, Emily; Ryan, Colleen M; Schneider, Jeff C; Shie, Vivian; Acton, Amy; Jette, Alan M

2017-10-01

Measuring the impact burn injuries have on social participation is integral to understanding and improving survivors' quality of life, yet there are no existing instruments that comprehensively measure the social participation of burn survivors. This project aimed to develop the Life Impact Burn Recovery Evaluation Profile (LIBRE), a patient-reported multidimensional assessment for understanding the social participation after burn injuries. 192 questions representing multiple social participation areas were administered to a convenience sample of 601 burn survivors. Exploratory factor analysis and confirmatory factor analysis (CFA) were used to identify the underlying structure of the data. Using item response theory methods, a Graded Response Model was applied for each identified sub-domain. The resultant multidimensional LIBRE Profile can be administered via Computerized Adaptive Testing (CAT) or fixed short forms. The study sample included 54.7% women with a mean age of 44.6 (SD 15.9) years. The average time since burn injury was 15.4 years (0-74 years) and the average total body surface area burned was 40% (1-97%). The CFA indicated acceptable fit statistics (CFI range 0.913-0.977, TLI range 0.904-0.974, RMSEA range 0.06-0.096). The six unidimensional scales were named: relationships with family and friends, social interactions, social activities, work and employment, romantic relationships, and sexual relationships. The marginal reliability of the full item bank and CATs ranged from 0.84 to 0.93, with ceiling effects less than 15% for all scales. The LIBRE Profile is a promising new measure of social participation following a burn injury that enables burn survivors and their care providers to measure social participation.
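The Graded Response Model mentioned above assigns a probability to each ordered response category of an item. A minimal sketch, assuming the standard logistic form with one discrimination parameter and increasing thresholds, is shown below; the parameter values are invented for illustration and are not LIBRE item parameters.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under a graded response model.

    theta: latent trait value; a: item discrimination;
    thresholds: strictly increasing boundary locations.
    Returns len(thresholds) + 1 probabilities, lowest category first.
    """
    # Cumulative probabilities of responding in category k or higher.
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical four-category item evaluated at theta = 0.
probs = grm_category_probs(0.0, 1.2, [-1.0, 0.0, 1.5])
```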
A poverty-related quality of life questionnaire can help to detect health inequalities in emergency departments.

PubMed

Boyer, Laurent; Baumstarck, Karine; Iordanova, Teodora; Fernandez, Jessica; Jean, Philippe; Auquier, Pascal

2014-03-01

This study aimed to develop a self-administered, multidimensional, poverty-related quality of life (PQoL) questionnaire for individuals seeking care in emergency departments (EDs): the PQoL-17. The development of the PQoL was undertaken in three steps: item generation, item reduction, and validation. The content of the PQoL was derived from 80 interviews with patients seeking care in EDs. Using item response and classical test theories, item reduction was performed in 3 EDs on 300 patients and validation was completed in 10 EDs on 619 patients. The PQoL contains 17 items describing seven dimensions (self-esteem/vitality, psychological well-being, relationships with family, relationships with friends, autonomy, physical well-being/access to care, and future perception). The seven-factor structure accounted for 75.1% of the total variance. This model showed a good fit (indices from the LISREL model: root mean square error of approximation, 0.055; comparative fit index, 0.97; general fit index, 0.96; standardized root mean square residual, 0.058). Each item achieved the 0.40 standard for item internal consistency, and Cronbach α coefficients were >0.70. Significant associations with socioeconomic and clinical indicators showed good discriminant and external validity. Infit statistics ranged from 0.82 to 1.16. The PQoL-17 presents satisfactory psychometric properties and can be completed quickly, thereby fulfilling the goal of brevity sought in EDs. Copyright © 2014 Elsevier Inc. All rights reserved.
Preliminary development and psychometric evaluation of an unmet needs measure for adolescents and young adults with cancer: the Cancer Needs Questionnaire - Young People (CNQ-YP).

PubMed

Clinton-McHarg, Tara; Carey, Mariko; Sanson-Fisher, Rob; D'Este, Catherine; Shakeshaft, Anthony

2012-01-30

Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. 
Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken.
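Cronbach's alpha, the internal consistency statistic reported above, can be computed from the item variances and the variance of the total score. The sketch below uses the usual formula with sample variances; the function name and the five-respondent data set are hypothetical and unrelated to the CNQ-YP data.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha; rows holds one list of item scores per respondent."""
    k = len(rows[0])  # number of items
    item_vars = [variance([r[i] for r in rows]) for i in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Five hypothetical respondents answering three items.
rows = [[2, 3, 3], [4, 4, 5], [1, 2, 1], [5, 4, 5], [3, 3, 4]]
alpha = cronbach_alpha(rows)
```

Values near 1 indicate that the items vary together across respondents; the thresholds quoted in these abstracts (such as 0.70 or 0.80) are conventions rather than properties of the formula.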
Preliminary development and psychometric evaluation of an unmet needs measure for adolescents and young adults with cancer: the Cancer Needs Questionnaire - Young People (CNQ-YP)

PubMed Central

2012-01-01

Background Adolescents and young adult (AYA) cancer survivors may have unique physical, psychological and social needs due to their cancer occurring at a critical phase of development. The aim of this study was to develop a psychometrically rigorous measure of unmet need to capture the specific needs of this group. Methods Items were developed following a comprehensive literature review, focus groups with AYAs, and feedback from health care providers, researchers and other professionals. The measure was pilot tested with 32 AYA cancer survivors recruited through a state-based cancer registry to establish face and content validity. A main sample of 139 AYA cancer patients and survivors were recruited through seven treatment centres and invited to complete the questionnaire. To establish test-retest reliability, a sub-sample of 34 participants completed the measure a second time. Exploratory factor analysis was performed and the measure was assessed for internal consistency, discriminative validity, potential responsiveness and acceptability. Results The Cancer Needs Questionnaire - Young People (CNQ-YP) has established face and content validity, and acceptability. The final measure has 70 items and six factors: Treatment Environment and Care (33 items); Feelings and Relationships (14 items); Daily Life (12 items); Information and Activities (5 items); Education (3 items); and Work (3 items). All domains achieved Cronbach's alpha values greater than 0.80. Item-to-item test-retest reliability was also high, with all but four items reaching weighted kappa values above 0.60. Conclusions The CNQ-YP is the first multi-dimensional measure of unmet need which has been developed specifically for AYA cancer patients and survivors. The measure displays a strong factor structure, and excellent internal consistency and test-retest reliability. 
However, the small sample size has implications for the reliability of the statistical analyses undertaken, particularly the exploratory factor analysis. Future studies with a larger sample are recommended to confirm the factor structure of the measure. Longitudinal studies to establish responsiveness and predictive validity should also be undertaken. PMID:22284545
Students' Beliefs about Mobile Devices vs. Desktop Computers in South Korea and the United States

ERIC Educational Resources Information Center

Sung, Eunmo; Mayer, Richard E.

2012-01-01

College students in the United States and in South Korea completed a 28-item multidimensional scaling (MDS) questionnaire in which they rated the similarity of 28 pairs of multimedia learning materials on a 10-point scale (e.g., narrated animation on a mobile device vs. movie clip on a desktop computer) and a 56-item semantic differential…
Development and validation of a vision-specific quality-of-life questionnaire for Timor-Leste.

PubMed

du Toit, Rènée; Palagyi, Anna; Ramke, Jacqueline; Brian, Garry; Lamoureux, Ecosse L

2008-10-01

To develop and determine the reliability and validity of a vision-specific quality-of-life instrument (TL-VSQOL) designed to assess the impact of distance and near vision impairment in adults living in Timor-Leste. A vision-specific quality-of-life questionnaire was developed, piloted, and administered to 704 Timorese aged ≥40 years during a population-based eye health rapid assessment. Rasch analysis was performed on the data of 457 participants with presenting near vision worse than N8 (78.5%) and/or distance vision worse than 6/18 (69.8%). Unidimensionality, item fit to the model, response category performance, differential item functioning, and targeting of items to participants were assessed. Initially, the questionnaire lacked fit to the Rasch model. Removal of two items concerning emotional well-being resulted in a fit of the data (overall item-trait interaction: χ² (df) = 81 (51); mean (SD) person and item fit residual values: -0.30 (1.02) and -0.32 (1.46)), and good targeting of person ability and item difficulty was evident. Poorer distance and near visual acuities were significantly associated with worse quality-of-life scores (P < 0.001). Person separation reliability was substantial (0.93), indicating that the instrument can discriminate between groups with normal and impaired vision. All 17 items were free of differential item functioning, and there was no evidence of multidimensionality. This 17-item TL-VSQOL has high reliability, construct, and criterion validity and effective targeting. It can effectively assess the impact on quality of life of adult Timorese with distance and near vision impairment. The TL-VSQOL could be adapted for use in other low-resource settings.
Emotional competencies in geriatric nursing: empirical evidence from a computer based large scale assessment calibration study.

PubMed

Kaspar, Roman; Hartig, Johannes

2016-03-01

The care of older people was described as involving substantial emotion-related affordances. Scholars in vocational training and nursing disagree whether emotion-related skills could be conceptualized and assessed as a professional competence. Studies on emotion work and empathy regularly neglect the multidimensionality of these phenomena and their relation to the care process, and are rarely conclusive with respect to nursing behavior in practice. To test the status of emotion-related skills as a facet of client-directed geriatric nursing competence, 402 final-year nursing students from 24 German schools responded to a 62-item computer-based test. 14 items were developed to represent emotion-related affordances. Multi-dimensional IRT modeling was employed to assess a potential subdomain structure. Emotion-related test items did not form a separate subdomain, and were found to be discriminating across the whole competence continuum. Tasks concerning emotion work and empathy are reliable indicators for various levels of client-directed nursing competence. Claims for a distinct emotion-related competence in geriatric nursing, however, appear excessive with a process-oriented perspective.
Testlet-Based Multidimensional Adaptive Testing

PubMed Central

Frey, Andreas; Seitz, Nicki-Nils; Brandt, Steffen

2016-01-01

Multidimensional adaptive testing (MAT) is a highly efficient method for the simultaneous measurement of several latent traits. Currently, no psychometrically sound approach is available for the use of MAT in testlet-based tests. Testlets are sets of items sharing a common stimulus such as a graph or a text. They are frequently used in large operational testing programs like TOEFL, PISA, PIRLS, or NAEP. To make MAT accessible for such testing programs, we present a novel combination of MAT with a multidimensional generalization of the random effects testlet model (MAT-MTIRT). MAT-MTIRT compared to non-adaptive testing is examined for several combinations of testlet effect variances (0.0, 0.5, 1.0, and 1.5) and testlet sizes (3, 6, and 9 items) with a simulation study considering three ability dimensions with simple loading structure. MAT-MTIRT outperformed non-adaptive testing regarding the measurement precision of the ability estimates. Further, the measurement precision decreased when testlet effect variances and testlet sizes increased. The suggested combination of the MTIRT model therefore provides a solution to the substantial problems of testlet-based tests while keeping the length of the test within an acceptable range. PMID:27917132
Development and Validation of a Computerized-Adaptive Test for PTSD (P-CAT).

PubMed

Eisen, Susan V; Schultz, Mark R; Ni, Pengsheng; Haley, Stephen M; Smith, Eric G; Spiro, Avron; Osei-Bonsu, Princess E; Nordberg, Sam; Jette, Alan M

2016-10-01

The primary purpose was to develop, field test, and validate a computerized-adaptive test (CAT) for posttraumatic stress disorder (PTSD) to enhance PTSD assessment and decrease the burden of symptom monitoring. Data sources included self-report and interviewer-administered diagnostic interviews. The sample included 1,288 veterans. In phase 1, 89 items from a previously developed PTSD item pool were administered to a national sample of 1,085 veterans. A multidimensional graded-response item response theory model was used to calibrate items for incorporation into a CAT for PTSD (P-CAT). In phase 2, in a separate sample of 203 veterans, the P-CAT was validated against three other self-report measures (PTSD Checklist, Civilian Version; Mississippi Scale for Combat-Related PTSD; and Primary Care PTSD Screen) and the PTSD module of the Structured Clinical Interview for DSM-IV. A bifactor model with one general PTSD factor and four subfactors consistent with DSM-5 (reexperiencing, avoidance, negative mood-cognitions, and arousal), yielded good fit. The P-CAT discriminated veterans with PTSD from those with other mental health conditions and those with no mental health conditions (Cohen's d effect sizes >.90). The P-CAT also discriminated those with and without a PTSD diagnosis and those who screened positive versus negative for PTSD. Concurrent validity was supported by high correlations (r=.85-.89) with the validation measures. The P-CAT appears to be a promising tool for efficient and accurate assessment of PTSD symptomatology. Further testing is needed to evaluate its responsiveness to change. With increasing availability of computers and other technologies, CAT may be a viable and efficient assessment method.
Dyspnoea-12: a translation and linguistic validation study in a Swedish setting.

PubMed

Sundh, Josefin; Ekström, Magnus

2017-06-06

Dyspnoea consists of multiple dimensions including the intensity, unpleasantness, sensory qualities and emotional responses which may differ between patient groups, settings and in relation to treatment. The Dyspnoea-12 is a validated and convenient instrument for multidimensional measurement in English. We aimed to take forward a Swedish version of the Dyspnoea-12. The linguistic validation of the Dyspnoea-12 was performed (Mapi Language Services, Lyon, France). The standardised procedure involved forward and backward translations by three independent certified translators and revisions after feedback from an in-country linguistic consultant, the developer and three native physicians. The understanding and convenience of the translated version was evaluated using qualitative in-depth interviews with five patients with dyspnoea. A Swedish version of the Dyspnoea-12 was elaborated and evaluated carefully according to international guidelines. The Swedish version, 'Dyspné-12', has the same layout as the original version, including 12 items distributed on seven physical and five affective items. The Dyspnoea-12 is copyrighted by the developer but can be used free of charge after permission for non-industry-funded research. A Swedish version of the Dyspnoea-12 is now available for clinical validation and multidimensional measurement across diseases and settings with the aim of improved evaluation and management of dyspnoea. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Holland in Iceland revisited: an emic approach to evaluating U.S. vocational interest models.

PubMed

Einarsdóttir, Sif; Rounds, James; Su, Rong

2010-07-01

An emic approach was used to test the structural validity and applicability of Holland's (1997) RIASEC (Realistic, Investigative, Artistic, Social, Enterprising, Conventional) model in Iceland. Archival data from the development of the Icelandic Interest Inventory (Einarsdóttir & Rounds, 2007) were used in the present investigation. The data included an indigenous pool of occupations and work-task items representing Iceland's world of work that had been administered to a sample of 597 upper secondary school students. Multidimensional scaling analysis and property vector fitting using Prediger's (1981) work-task dimensions were applied to the item responses to test if the RIASEC model could be identified. The results indicated that a 4-dimensional solution better explains the interest space in Iceland than Holland's 2-dimensional RIASEC representation. The work-task dimension of People-Things and the Sex-Type and Prestige dimensions were located in the 1st and 2nd dimensions of the multidimensional scaling solution, but Data-Ideas, a dimension critical to the RIASEC model, was not. The 3rd and 4th dimensions did not correspond to any dimensions previously detected in structural studies in the United States and seem to be related to specific ecological, cultural, and political forces in Iceland. These results demonstrate the importance of selecting representative indigenous occupations and work tasks when evaluating the RIASEC model. The present study is an example of the next step in a comprehensive cross-cultural research program on vocational interests, an emic investigation. (c) 2010 APA, all rights reserved.
Punishment insensitivity in early childhood: A developmental, dimensional approach

PubMed Central

Nichols, Sara R.; Briggs-Gowan, Margaret; Estabrook, Ryne; Burns, James; Kestler, Jacqueline; Berman, Grace; Henry, David; Wakschlag, Lauren

2014-01-01

Impairment in learning from punishment ("punishment insensitivity") is an established feature of severe antisocial behavior in adults and youth but it has not been well studied as a developmental phenomenon. In early childhood, differentiating a normal:abnormal spectrum of punishment insensitivity is key for distinguishing normative misbehavior from atypical manifestations. This study employed a novel measure, the Multidimensional Assessment Profile of Disruptive Behavior (MAPDB), to examine the distribution, dimensionality, and external validity of punishment insensitivity in a large, demographically diverse community sample of preschoolers (three-five years) recruited from pediatric clinics (N=1,855). Caregivers completed surveys from which a seven-item Punishment Insensitivity scale was derived. Findings indicated that Punishment Insensitivity behaviors are relatively common in young children, with at least 50% of preschoolers exhibiting them sometimes. Item response theory analyses revealed a Punishment Insensitivity spectrum. Items varied along a severity continuum: most items needed to occur "Often" in order to be severe and behaviors that were qualitatively atypical or intense were more severe. Although there were item-level differences across sociodemographic groups, these were small. Construct, convergent, and divergent validity were demonstrated via association to low concern for others and noncompliance, motivational regulation, and a disruptive family context. Incremental clinical utility was demonstrated in relation to impairment. Early childhood punishment insensitivity varies along a severity continuum and is atypical when it predominates. Implications for understanding the phenomenology of emergent disruptive behavior are discussed. PMID:25425187
Development and psychometric evaluation of a health-related quality of life instrument for individuals with adult-onset hearing loss.

PubMed

Stika, Carren J; Hays, Ron D

2015-07-01

Self-reports of 'hearing handicap' are available, but a comprehensive measure of health-related quality of life (HRQOL) for individuals with adult-onset hearing loss (AOHL) does not exist. Our objective was to develop and evaluate a multidimensional HRQOL instrument for individuals with AOHL. The Impact of Hearing Loss Inventory Tool (IHEAR-IT) was developed using results of focus groups, a literature review, advisory expert panel input, and cognitive interviews. The 73-item field-test instrument was completed by 409 adults (22-91 years old) with varying degrees of AOHL and from different areas of the USA. Multitrait scaling analysis supported four multi-item scales and five individual items. Internal consistency reliabilities ranged from 0.93 to 0.96 for the scales. Construct validity was supported by correlations between the IHEAR-IT scales and scores on the 36-item Short Form Health Survey, version 2.0 (SF-36v2) mental composite summary (r = 0.32-0.64) and the Hearing Handicap Inventory for the Elderly/Adults (HHIE/HHIA) (r ≥ -0.70). The field test provides initial support for the reliability and construct validity of the IHEAR-IT for evaluating HRQOL of individuals with AOHL. Further research is needed to evaluate the responsiveness to change of the IHEAR-IT scales and identify items for a short-form.
Development of a clinician-administered National Institutes of Health-Brief Fatigue Inventory: A measure of fatigue in the context of depressive disorders.

PubMed

Saligan, Leorey N; Luckenbaugh, David A; Slonena, Elizabeth E; Machado-Vieira, Rodrigo; Zarate, Carlos A

2015-09-01

Fatigue is a complex, multidimensional condition. Although it is often associated with depression, it is not known whether it has a distinct network from depression or whether it can be clinically evaluated separately. This study describes preliminary findings in the development of a brief, clinician-administered instrument to measure fatigue in the context of depressive disorders using items from existing clinician-administered depression and mania scales. Based on items from prior fatigue measurements, items were selected from the Hamilton Depression Rating Scale (HDRS), Montgomery-Asberg Depression Rating Scale (MADRS), Young Mania Rating Scale, and Structured Interview Guide for HDRS with Atypical Depression. The final items composed the NIH-Brief Fatigue Inventory (NIH-BFI). Responses from 89 depressed adults collected pre- and post-antidepressant therapy (ADT) determined the reliability and consistency of the NIH-BFI using Cronbach's alpha and principal components analysis (PCA). Correlations of the NIH-BFI and fatigue items from other scales before and after ADT explored validity. The 7-item NIH-BFI had Cronbach alphas ranging from 0.81 to 0.88 and PCA indicating a single dimension. The NIH-BFI score was strongly correlated (r = 0.73, p < 0.001) with fatigue items from Beck Depression Index, with MADRS without fatigue items (r = 0.77, p < 0.001), and HDRS without fatigue items (pre: r = 0.69, p < 0.001). Preliminary findings show support for internal consistency reliability and validity of the NIH-BFI, a clinician-administered measure of fatigue. Further testing in other clinical populations is recommended to obtain additional information on reliability and validity. The NIH-BFI provides a method for clinician-rated fatigue that may be separate from depression. Published by Elsevier Ltd.
Validity and reliability of the multidimensional assessment of fatigue scale in Iranian patients with relapsing-remitting subtype of multiple sclerosis.

PubMed

Behrangrad, Shabnam; Kordi Yoosefinejad, Amin

2018-03-01

The purpose of this study is to investigate the validity and reliability of the Persian version of the Multidimensional Assessment of Fatigue Scale (MAFS) in an Iranian population with multiple sclerosis. A self-reported survey on fatigue including the MAFS, Fatigue Impact Scale and demographic measures was completed by 130 patients with multiple sclerosis and 60 healthy persons sampled with a convenience method. Test-retest reliability and validity were evaluated 3 days apart. Construct validity of the MAFS was assessed with the Fatigue Impact Scale. The MAFS had high internal consistency (Cronbach's alpha >0.9) and 3-d test-retest reliability (intraclass correlation coefficient = 0.99). Correlation between the Fatigue Impact Scale and MAFS was high (r = 0.99). Correlation between MAFS scores and the Expanded Disability Status Scale was also strong (r = 0.85). Questionnaire items showed acceptable item-scale correlation (0.968-0.993). The Persian version of the MAFS appears to be a valid and reliable questionnaire. It is an appropriate short multidimensional instrument to assess fatigue in patients with multiple sclerosis in clinical practice and research. Implications for Rehabilitation: The Persian version of the Multidimensional Assessment of Fatigue is a valid and reliable instrument for assessing and monitoring fatigue in Persian-speaking patients with multiple sclerosis. It is very easy to administer and is time-efficient compared with other instruments evaluating fatigue in patients with multiple sclerosis.

Dimensions of the South Oaks Gambling Screen in Finland: A cross-sectional population study.

PubMed

Salonen, Anne H; Rosenström, Tom; Edgren, Robert; Volberg, Rachel; Alho, Hannu; Castrén, Sari

2017-06-01

The underlying structure of problematic gambling behaviors, such as those assessed by the South Oaks Gambling Screen (SOGS), remains unknown: Can problem gambling be assessed unidimensionally or should multiple qualitatively different dimensions be taken into account, and if so, what do these qualitative dimensions indicate? How significant are the deviations from unidimensionality in practice? A cross-sectional random sample of Finns aged 15-74 (n = 4,484) was drawn from the Population Information Registry and surveyed in 2011-2012. Analyses were conducted using descriptive statistics, confirmatory factor analysis (CFA) and multidimensional item response theory (MIRT) models. Altogether, 14.9% of the population endorsed at least one of the 20 SOGS items, but nine items had low endorsement rates (≤ 0.2%). CFA and MIRT techniques suggested that individuals differed from each other in two positively correlated (r = 0.70) underlying dimensions: "impact on self primarily" and "impact on others also". This two-factor correlated-factors model can be reinterpreted as a bifactor model with one general gambling-problem factor and two specific factors with similar interpretation as in the correlated-factors model but with non-overlapping items. The two specific factors may provide clinically useful information without extra costs of assessment. © 2017 Scandinavian Psychological Associations and John Wiley & Sons Ltd.
Psychometric analyses and internal consistency of the PHEEM questionnaire to measure the clinical learning environment in the clerkship of a Medical School in Chile.

PubMed

Riquelme, Arnoldo; Herrera, Cristian; Aranis, Carolina; Oporto, Jorge; Padilla, Oslando

2009-06-01

The Spanish version of the Postgraduate Hospital Educational Environment Measure (PHEEM) was evaluated in this study to determine its psychometric properties, validity and internal consistency to measure the clinical learning environment in the hospital setting of Pontificia Universidad Católica de Chile Medical School's Internship. The 40-item PHEEM questionnaire was translated from English to Spanish and retranslated to English. Content validity was tested by a focus group and minor differences in meaning were adjusted. The PHEEM was administered to clerks in years 6 and 7. Construct validity was carried out using exploratory factor analysis followed by a Varimax rotation. Internal consistency was measured using Cronbach's alpha. A total of 125 out of 220 students responded to the PHEEM. The overall response rate was 56.8% and compliance with each item ranged from 99.2% to 100%. Analyses indicated a five-factor instrument accounting for 58% of the variance, and the internal consistency of the 40-item questionnaire was 0.955 (Cronbach's alpha). The 40-item questionnaire had a mean score of 98.21 +/- 21.2 (maximum score of 160). The Spanish version of PHEEM is a multidimensional, valid and highly reliable instrument measuring the educational environment among undergraduate medical students working in hospital-based clerkships.
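Several records in this list report internal consistency as Cronbach's alpha. As a minimal sketch of how that coefficient is computed from raw item scores, the function below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The function name and the toy data are illustrative assumptions and are not taken from any of the studies.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([row[i] for row in scores]) for i in range(k)]
    # Variance of the respondents' total (sum) scores.
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical data: 4 respondents answering 3 items.
    toy = [[1, 2, 1], [2, 3, 2], [3, 3, 4], [4, 5, 4]]
    print(round(cronbach_alpha(toy), 3))
```

Population variances are used for both numerator and denominator; using sample variances throughout gives the same ratio, so the choice does not affect alpha.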
Measuring genetic knowledge: a brief survey instrument for adolescents and adults.

PubMed

Fitzgerald-Butt, S M; Bodine, A; Fry, K M; Ash, J; Zaidi, A N; Garg, V; Gerhardt, C A; McBride, K L

2016-02-01

Basic knowledge of genetics is essential for understanding genetic testing and counseling. The lack of a written, English language, validated, published measure has limited our ability to evaluate genetic knowledge of patients and families. Here, we begin the psychometric analysis of a true/false genetic knowledge measure. The 18-item measure was completed by parents of children with congenital heart defects (CHD) (n = 465) and adolescents and young adults with CHD (age: 15-25, n = 196) with a mean total correct score of 12.6 [standard deviation (SD) = 3.5, range: 0-18]. Utilizing exploratory factor analysis, we determined that one to three correlated factors, or abilities, were captured by our measure. Through confirmatory factor analysis, we determined that the two factor model was the best fit. Although it was necessary to remove two items, the remaining items exhibited adequate psychometric properties in a multidimensional item response theory analysis. Scores for each factor were computed, and a sum-score conversion table was derived. We conclude that this genetic knowledge measure discriminates best at low knowledge levels and is therefore well suited to determine a minimum adequate amount of genetic knowledge. However, further reliability testing and validation in diverse research and clinical settings is needed. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Construct validity of the Heart Failure Screening Tool (Heart-FaST) to identify heart failure patients at risk of poor self-care: Rasch analysis.

PubMed

Reynolds, Nicholas A; Ski, Chantal F; McEvedy, Samantha M; Thompson, David R; Cameron, Jan

2018-02-14

The aim of this study was to psychometrically evaluate the Heart Failure Screening Tool (Heart-FaST) via: (1) examination of internal construct validity; (2) testing of scale function in accordance with design; and (3) recommendation for change/s, if items are not well adjusted, to improve psychometric credential. Self-care is vital to the management of heart failure. The Heart-FaST may provide a prospective assessment of risk, regarding the likelihood that patients with heart failure will engage in self-care. Psychometric validation of the Heart-FaST using Rasch analysis. The Heart-FaST was administered to 135 patients (median age = 68, IQR = 59-78 years; 105 males) enrolled in a multidisciplinary heart failure management program. The Heart-FaST is a nurse-administered tool for screening patients with HF at risk of poor self-care. A Rasch analysis of responses was conducted which tested data against Rasch model expectations, including whether items serve as unbiased, non-redundant indicators of risk and measure a single construct and that rating scales operate as intended. The results showed that data met Rasch model expectations after rescoring or deleting items due to poor discrimination, disordered thresholds, differential item functioning, or response dependence. There was no evidence of multidimensionality which supports the use of total scores from Heart-FaST as indicators of risk. Aggregate scores from this modified screening tool rank heart failure patients according to their "risk of poor self-care" demonstrating that the Heart-FaST items constitute a meaningful scale to identify heart failure patients at risk of poor engagement in heart failure self-care. © 2018 John Wiley & Sons Ltd.
Development and testing of the Multidimensional Trust in Health Care Systems Scale.

PubMed

Egede, Leonard E; Ellis, Charles

2008-06-01

To describe the development and psychometric testing of the Multidimensional Trust in Health Care Systems Scale (MTHCSS). Scale development occurred in 2 phases. In phase 1, a pilot instrument with 70 items was generated from the review of the trust literature, focus groups, and expert opinion. The 70 items were pilot tested in a sample of 256 students. Exploratory factor analysis was used to derive an orthogonal set of correlated factors. In phase 2, the final scale was administered to 301 primary care patients to assess reliability and validity. Phase 2 participants also completed validated measures of patient-centered care, health locus of control, medication nonadherence, social support, and patient satisfaction. In phase 1, a 17-item scale (MTHCSS) was developed with 10 items measuring trust in health care providers, 4 items measuring trust in health care payers, and 3 items measuring trust in health care institutions. In phase 2, the 17-item MTHCSS had a mean score of 63.0 (SD 8.8); the provider subscale had a mean of 40.0 (SD 6.2); the payers subscale had a mean of 12.8 (SD 3.0); and the institutions subscale had a mean of 10.3 (SD 2.1). Cronbach's alpha for the MTHCSS was 0.89 and 0.92, 0.74, and 0.64 for the 3 subscales. The MTHCSS was significantly correlated with patient-centered care (r = .22 to .62), locus of control-chance (r = .42), medication nonadherence (r = -.22), social support (r = .25), and patient satisfaction (r = .67). The MTHCSS is a valid and reliable instrument for measuring the 3 objects of trust in health care and is correlated with patient-level health outcomes.
Head and neck cancer-specific quality of life: instrument validation.

PubMed

Terrell, J E; Nanavati, K A; Esclamado, R M; Bishop, J K; Bradford, C R; Wolf, G T

1997-10-01

The disfigurement and dysfunction associated with head and neck cancer affect emotional well-being and some of the most basic functions of life. Most cancer-specific quality-of-life assessments give a single composite score for head and neck cancer-related quality of life. To develop and evaluate an improved multidimensional instrument to assess head and neck cancer-related functional status and well-being. The item selection process included literature review, interviews with health care workers, and patient surveys. A survey with 37 disease-specific questions and the SF-12 survey were administered to 253 patients in 3 large medical centers. Factor analysis was performed to identify disease-specific domains. Domain scores were calculated as the standardized score of the component items. These domains were assessed for construct validity based on clinical hypotheses and test-retest reliability. Four relevant domains were identified: Eating (6 items), Communication (4 items), Pain (4 items), and Emotion (6 items). Each had an internal consistency (Cronbach alpha value) of greater than 0.80. Construct validity was demonstrated by moderate correlations with the SF-12 Physical and Mental component scores (r=0.43-0.60). Test-retest reliability for each domain demonstrated strong reliability between the 2 time points. Correlations were strong for each individual question, ranging from 0.53 to 0.93. Construct validity testing demonstrated that the direction of differences for each domain was as hypothesized. The Head and Neck Quality of Life questionnaire is a promising multidimensional tool with which to assess head and neck cancer-specific quality of life.
Development and Validation of Triarchic Psychopathy Scales from the Multidimensional Personality Questionnaire

PubMed Central

Brislin, Sarah J.; Drislane, Laura E.; Smith, Shannon Toney; Edens, John F.; Patrick, Christopher J.

2015-01-01

Psychopathy is conceptualized by the triarchic model as encompassing three distinct phenotypic constructs: boldness, meanness, and disinhibition. In the current study, the Multidimensional Personality Questionnaire (MPQ), a normal-range personality measure, was evaluated for representation of these three constructs. Consensus ratings were used to identify MPQ items most related to each triarchic (Tri) construct. Scale measures were developed from items indicative of each construct, and scores for these scales were evaluated for convergent and discriminant validity in community (N = 176) and incarcerated samples (N = 240). Across the two samples, MPQ-Tri scale scores demonstrated good internal consistencies and relationships with criterion measures of various types consistent with predictions based on the triarchic model. Findings are discussed in terms of their implications for further investigation of the triarchic model constructs in preexisting datasets that include the MPQ, in particular longitudinal and genetically informative datasets. PMID:25642934
An evaluation of the Quick Inventory of Depressive Symptomatology and the Hamilton Rating Scale for Depression: a Sequenced Treatment Alternatives to Relieve Depression trial report.

PubMed

Rush, A John; Bernstein, Ira H; Trivedi, Madhukar H; Carmody, Thomas J; Wisniewski, Stephen; Mundt, James C; Shores-Wilson, Kathy; Biggs, Melanie M; Woo, Ada; Nierenberg, Andrew A; Fava, Maurizio

2006-03-15

Nine DSM-IV-TR criterion symptom domains are evaluated to diagnose major depressive disorder (MDD). The Quick Inventory of Depressive Symptomatology (QIDS) provides an efficient assessment of these domains and is available as a clinician rating (QIDS-C16), a self-report (QIDS-SR16), and in an automated, interactive voice response (IVR) (QIDS-IVR16) telephone system. This report compares the performance of these three versions of the QIDS and the 17-item Hamilton Rating Scale for Depression (HRSD17). Data were acquired at baseline and exit from the first treatment step (citalopram) in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) trial. Outpatients with nonpsychotic MDD who completed all four ratings within +/-2 days were identified from the first 1500 STAR*D subjects. Both item response theory and classical test theory analyses were conducted. The three methods for obtaining QIDS data produced consistent findings regarding relationships between the nine symptom domains and overall depression, demonstrating interchangeability among the three methods. The HRSD17, while generally satisfactory, rarely utilized the full range of item scores, and evidence suggested multidimensional measurement properties. In nonpsychotic MDD outpatients without overt cognitive impairment, clinician assessment of depression severity using either the QIDS-C16 or HRSD17 may be successfully replaced by either the self-report or IVR version of the QIDS.
Further Validation of the Multidimensional Fatigue Symptom Inventory-Short Form

PubMed Central

Stein, Kevin D.; Jacobsen, Paul B.; Blanchard, Chris M.; Thors, Christina

2008-01-01

A growing body of evidence is documenting the multidimensional nature of cancer-related fatigue. Although several multidimensional measures of fatigue have been developed, further validation of these scales is needed. To this end, the current study sought to evaluate the factorial and construct validity of the 30-item Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF). A heterogeneous sample of 304 cancer patients (mean age 55 years) completed the MFSI-SF, along with several other measures of psychosocial functioning including the MOS-SF-36 and Fatigue Symptom Inventory, following the fourth cycle of chemotherapy treatment. The results of a confirmatory factor analysis indicated the 5-factor model provided a good fit to the data as evidenced by commonly used goodness of fit indices (CFI 0.90 and IFI 0.90). Additional evidence for the validity of the MFSI-SF was provided via correlations with other relevant instruments (range −0.21 to 0.82). In sum, the current study provides support for the MFSI-SF as a valuable tool for the multidimensional assessment of cancer-related fatigue. PMID:14711465
Development, validity and reliability of the short multidimensional positive mental health instrument.

PubMed

Vaingankar, Janhavi Ajit; Subramaniam, Mythily; Abdin, Edimansyah; Picco, Louisa; Chua, Boon Yiang; Eng, Goi Khia; Sambasivam, Rajeswari; Shafie, Saleha; Zhang, Yunjue; Chong, Siow Ann

2014-06-01

The 47-item positive mental health (PMH) instrument measures the level of PMH in multiethnic adult Asian populations. This study aimed to (1) develop a short PMH instrument and (2) establish its validity and reliability among the adult Singapore population. Two separate studies were conducted among adult community-dwelling Singapore residents of Chinese, Malay or Indian ethnicity where participants completed self-administered questionnaires. In the first study, secondary data analysis was conducted using confirmatory factor analysis (CFA) to shorten the PMH instrument. In the second study, the newly developed short PMH instrument and other scales were administered to 201 residents to establish its factor structure, validity and reliability. A 20-item short PMH instrument fulfilling a higher-order six-factor structure was developed following secondary analysis. The mean age of the participants in the second study was 41 years and about 53% were women. One item with poor factor loading was further removed to generate a 19-item version of the PMH instrument. CFA demonstrated a first-order six-factor model of the short PMH instrument. The PMH-19 instrument and its subscales fulfilled criterion validity hypotheses. Internal consistency and test-retest reliability of the PMH-19 instrument were high (Cronbach's α coefficient = 0.87; intraclass correlation coefficient = 0.93, respectively). The 19-item PMH instrument is multidimensional, valid and reliable, and most importantly, with its reduced administration time, the short PMH instrument can be used to measure and evaluate PMH in Asian communities.
The patient satisfaction questionnaire of EUprimecare project: measurement properties.

PubMed

Cimas, Marta; Ayala, Alba; García-Pérez, Sonia; Sarria-Santamera, Antonio; Forjaz, Maria João

2016-06-01

The measurement of patient satisfaction is considered an essential outcome indicator to evaluate health care quality. Patient satisfaction is considered a multi-dimensional construct, which would include a variety of domains. Although a large number of studies have proposed scales to measure patient satisfaction, there is a lack of psychometric information on them. This study aims to describe the psychometric properties of the Primary Care Satisfaction Scale (PCSS) of the EUprimecare project. A cross-sectional survey of patient satisfaction with primary care was carried out by telephone interview. Primary care services of Estonia, Finland, Germany, Hungary, Lithuania, Italy and Spain. A total of 3020 adult patients aged 18-65 years old attending primary care services. Classic psychometric properties were analysed and Rasch analysis was used to assess the following measurement properties: fit to the Rasch model; uni-dimensionality; reliability; differential item functioning (DIF) by gender, age, civil status, area of residency and country; local independency; adequacy of response scale; and scale targeting. To achieve good fit to the Rasch model, the original response scales of three items (1, 2 and 6) were rescored and Item 3 (waiting time in the room) was removed. The scale was uni-dimensional and Person Separation Index was 0.79, indicating a good reliability. All items were free from bias. PCSS linear measure displayed satisfactory convergent validity with overall satisfaction with primary care. PCSS, as a reliable and valid scale, could be used to measure patient satisfaction in primary care in Europe. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Dimensions of sensation assessed in urinary urgency: a systematic review.

PubMed

Das, Rebekah; Buckley, Jonathan; Williams, Marie

2013-10-01

Urinary urgency is an adverse sensory experience. Confirmation of the multidimensional nature of other adverse sensory experiences such as pain and dyspnea has improved the understanding of neurophysiological and perceptual mechanisms leading to innovations in assessment and treatment. It has been suggested that the sensation of urgency may include multiple dimensions such as intensity, suddenness and unpleasantness. In this systematic review we determine which dimensions of sensation have been assessed by instruments used to measure urinary urgency. A systematic search was undertaken of MEDLINE, Embase, AMED, CINAHL, Ageline, Web of Science, InformIT Health and Scopus databases to identify studies that included assessments of urinary urge or urgency. Articles were included in the analysis if they were primary studies that described the method used to measure urge/urgency in adults and published in English in peer reviewed publications since January 1, 2000. Articles were excluded from study if urgency was measured only in conjunction with other symptoms (eg frequency or incontinence) or if there was no English version of the instrument. Secondary analyses and systematic reviews were retained to hand search references for additional primary studies. Data were extracted for the instruments used to measure urge/urgency. For each instrument the items specific to urinary urgency were reviewed using a prospectively developed categorization process for the sensory dimension and the measurement metric. Items used to assess urinary urgency were collated in a matrix (sensory dimensions vs assessment metric). The most frequently used dimensions, metrics and combinations were descriptively analyzed. After removal of duplicate articles 1,048 full text articles were screened and 411 were excluded, leaving 637 eligible articles from which data were extracted. 
A total of 216 instruments were identified which were 1 of 6 types, namely 1) wider symptom questionnaires, 2) urgency specific questionnaires, 3) ordinal scales, 4) visual analog scales, 5) event records or 6) body maps. These 216 instruments contained a total of 309 urgency specific items. Of the instruments 51% did not define a dimension of sensation and 26% did not define the metric used. From the remaining instruments 8 dimensions of sensation and 5 types of metrics were identified. From most common to least common, the sensory dimensions assessed were behavioral response, intensity, suddenness, bother, affective response, unpleasantness, quality (descriptors) and problems associated with sensation. Metrics were magnitude, frequency, presence, time frame or location. The most common sensory dimension/metric combinations were frequency of a behavioral response (14% of items) and magnitude of bother caused by the sensation (8% of items). The hypothesis that urinary urgency is multidimensional is supported by the range of dimensions assessed with available instruments. To clarify the nature of urinary urgency compared with the normal desire to void, prospective studies are required to determine whether sensory dimensions are distinct, and which may delineate between normal and pathological sensation. Copyright © 2013 American Urological Association Education and Research, Inc. Published by Elsevier Inc. All rights reserved.
The development of an instrument to assess chemistry perceptions

NASA Astrophysics Data System (ADS)

Wells, Raymond R.

The instrument, developed in this study, attempted to correct the deficiencies of previous instruments. Statements of belief and opinion can be validly included under the construct of chemistry perceptions. Further, statements that might be better characterized as science attitudes, math attitudes, or attitudes toward a specific course or program were not included. Eliminating statements of math anxiety and test anxiety ensured that responses to statements of anxiety were perceptions of anxiety solely related to chemistry. The results of the expert judges' responses to the Validation of Proposed Perception Statements forms were detailed to establish construct and content validity. The nature of Likert scale construction and calculation of internal consistency also supported the validity of the instrument. A pilot Chemistry Perception Questionnaire (CPQ) was then constructed based on agreement of the appropriate subscale and mean importance of the perception statements. The pilot CPQ results were subjected to an item analysis based on three sets of statistics: the frequency of each response and the percentage of respondents making each response for each perception statement, the mean and standard deviations for each item, and the item discrimination index which correlated the item scores with the subscale scores. With no zero or negative correlations to the subscale scores, it was not necessary to replace any of the perception statements contained in the pilot instrument. Therefore, the piloted Chemistry Perception Questionnaire became the final instrument. Factor analysis confirmed the multidimensionality of the instrument. The instrument was administered twice with a separation interval of approximately one month in order to perform a test-retest reliability analysis. One hundred and forty-one pairs were matched and results detailed. The correlation between forms, for the total instrument, was 0.9342. 
The mean coefficient alpha, for the total instrument, was 0.9495. With test-retest correlations and alphas exceeding 0.70 for all seven subscales and the total instrument, it was determined that the Chemistry Perception Questionnaire instrument achieved reasonably high reliability estimations.
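The record above reports coefficient alpha as its internal-consistency estimate. As a minimal sketch of how that statistic is usually computed (the function name `cronbach_alpha` and the response data are hypothetical and not taken from the study), assuming every respondent answered every item:

```python
from statistics import variance


def cronbach_alpha(scores):
    """Coefficient alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(scores[0])                      # number of items
    items = list(zip(*scores))              # transpose: one tuple per item
    item_variance_sum = sum(variance(col) for col in items)
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical 5-point Likert responses: 5 respondents x 4 items.
responses = [
    [4, 5, 4, 5],
    [2, 2, 3, 2],
    [3, 3, 3, 4],
    [5, 5, 4, 5],
    [1, 2, 1, 2],
]
print(round(cronbach_alpha(responses), 3))
```

Sample variances are used throughout; the ratio is unchanged if population variances are used consistently for both the items and the totals.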
Assessing Children's Homework Performance: Development of Multi-Dimensional, Multi-Informant Rating Scales.

PubMed

Power, Thomas J; Dombrowski, Stefan C; Watkins, Marley W; Mautone, Jennifer A; Eagle, John W

2007-06-01

Efforts to develop interventions to improve homework performance have been impeded by limitations in the measurement of homework performance. This study was conducted to develop rating scales for assessing homework performance among students in elementary and middle school. Items on the scales were intended to assess student strengths as well as deficits in homework performance. The sample included 163 students attending two school districts in the Northeast. Parents completed the 36-item Homework Performance Questionnaire - Parent Scale (HPQ-PS). Teachers completed the 22-item teacher scale (HPQ-TS) for each student for whom the HPQ-PS had been completed. A common factor analysis with principal axis extraction and promax rotation was used to analyze the findings. The results of the factor analysis of the HPQ-PS revealed three salient and meaningful factors: student task orientation/efficiency, student competence, and teacher support. The factor analysis of the HPQ-TS uncovered two salient and substantive factors: student responsibility and student competence. The findings of this study suggest that the HPQ is a promising set of measures for assessing student homework functioning and contextual factors that may influence performance. Directions for future research are presented.
Assessing Children’s Homework Performance: Development of Multi-Dimensional, Multi-Informant Rating Scales

PubMed Central

Power, Thomas J.; Dombrowski, Stefan C.; Watkins, Marley W.; Mautone, Jennifer A.; Eagle, John W.

2007-01-01

Efforts to develop interventions to improve homework performance have been impeded by limitations in the measurement of homework performance. This study was conducted to develop rating scales for assessing homework performance among students in elementary and middle school. Items on the scales were intended to assess student strengths as well as deficits in homework performance. The sample included 163 students attending two school districts in the Northeast. Parents completed the 36-item Homework Performance Questionnaire – Parent Scale (HPQ-PS). Teachers completed the 22-item teacher scale (HPQ-TS) for each student for whom the HPQ-PS had been completed. A common factor analysis with principal axis extraction and promax rotation was used to analyze the findings. The results of the factor analysis of the HPQ-PS revealed three salient and meaningful factors: student task orientation/efficiency, student competence, and teacher support. The factor analysis of the HPQ-TS uncovered two salient and substantive factors: student responsibility and student competence. The findings of this study suggest that the HPQ is a promising set of measures for assessing student homework functioning and contextual factors that may influence performance. Directions for future research are presented. PMID:18516211
Developing Tools for Identifying Employer and Employee Satisfaction of Nursing New Graduates in China

PubMed Central

Fan, Yuying; Li, Qiujie; Yang, Shufen; Guo, Ying; Yang, Libin; Zhao, Shibin

2014-01-01

Purpose. Researchers developed evaluation tools measuring employment relevant satisfaction for nursing new graduates. The evaluation tools were designed to be relevant to nursing managers who make employment decisions and nursing new graduates who were just employed. Methods. In-depth interviews and an expert panel were established to review the activities that evaluate the employee and employer satisfaction of nursing new graduates. Based on individual interviews and literature review, evaluation items were selected. A two-round Delphi study was then conducted from September 2008 to May 2009 with a panel of experts from a range of nursing colleges in China. Results. The response rate was 100% and Kendall's W was 0.73 in the second round of Delphi study. After two rounds of Delphi surveys, a list of 5 employee satisfaction items and 4 employer satisfaction items was identified for nursing new graduates. Conclusions. The findings of this study identified a different but multidimensional set of factors for employment relevant satisfaction, which confirmed the importance of certain fundamental aspects of practice. We developed the evaluation tools to assess the employer and employee satisfaction of nursing new graduates, which provided a database for further study. PMID:25097876
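The Delphi record above summarizes panel agreement with Kendall's W (0.73 in the second round). A minimal sketch of the coefficient of concordance follows, assuming each expert supplies a complete ranking without ties; the function name `kendalls_w` and the rankings are hypothetical, and the study may have applied a tie correction that is not shown here.

```python
def kendalls_w(rankings):
    """Kendall's coefficient of concordance for untied rankings.

    rankings: list of m raters, each a list of ranks 1..n over the same n items.
    W = 12 * S / (m^2 * (n^3 - n)), with S the sum of squared deviations of
    the per-item rank sums from their mean.
    """
    m = len(rankings)
    n = len(rankings[0])
    rank_sums = [sum(rater[i] for rater in rankings) for i in range(n)]
    mean_rank_sum = m * (n + 1) / 2
    s = sum((rs - mean_rank_sum) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Hypothetical panel: 4 experts ranking 5 candidate items by importance.
panel = [
    [1, 2, 3, 4, 5],
    [2, 1, 3, 5, 4],
    [1, 3, 2, 4, 5],
    [1, 2, 4, 3, 5],
]
print(round(kendalls_w(panel), 3))
```

W ranges from 0 (no agreement) to 1 (identical rankings across all raters).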
Developing tools for identifying employer and employee satisfaction of nursing new graduates in China.

PubMed

Fan, Yuying; Li, Qiujie; Yang, Shufen; Guo, Ying; Yang, Libin; Zhao, Shibin

2014-01-01

Researchers developed evaluation tools measuring employment relevant satisfaction for nursing new graduates. The evaluation tools were designed to be relevant to nursing managers who make employment decisions and nursing new graduates who were just employed. In-depth interviews and an expert panel were established to review the activities that evaluate the employee and employer satisfaction of nursing new graduates. Based on individual interviews and literature review, evaluation items were selected. A two-round Delphi study was then conducted from September 2008 to May 2009 with a panel of experts from a range of nursing colleges in China. The response rate was 100% and Kendall's W was 0.73 in the second round of Delphi study. After two rounds of Delphi surveys, a list of 5 employee satisfaction items and 4 employer satisfaction items was identified for nursing new graduates. The findings of this study identified a different but multidimensional set of factors for employment relevant satisfaction, which confirmed the importance of certain fundamental aspects of practice. We developed the evaluation tools to assess the employer and employee satisfaction of nursing new graduates, which provided a database for further study.
The horizontal and vertical attributes of individualism and collectivism in a Spanish population.

PubMed

Gouveia, Valdiney V; Clemente, Miguel; Espinosa, Pablo

2003-02-01

The authors examined the dimensionality and factorial structure of individualism and collectivism in Spanish participants (N = 526). A series of confirmatory factor analyses were performed on responses to the 32-item individualism-collectivism measure reported by T. M. Singelis, H. C. Triandis, D. S. Bhawuk, and M. Gelfand (1995). Consistent with earlier data, the best fitting model was multidimensional: a vertical versus a horizontal attribute crossed with individualism and collectivism dimensions. Whereas the overall fit of the data to a LISREL model was moderate, additional self-report data on respondents' interpersonal experiences supported the construct validity of the 4 factors. The authors suggest that the additional complexity is useful in explaining Spanish social behavior.
Ethical behaviour in clinical practice: a multidimensional Rasch analysis from a survey of primary health care professionals of Barcelona (Catalonia, Spain).

PubMed

González-de Paz, Luis; Kostov, Belchin; López-Pina, Jose A; Zabalegui-Yárnoz, Adelaida; Navarro-Rubio, M Dolores; Sisó-Almirall, Antoni

2014-12-01

Normative ethics includes ethical behaviour health care professionals should uphold in daily practice. This study assessed the degree to which primary health care (PHC) professionals endorse a set of ethical standards from these norms. Health care professionals from an urban area participated in a cross-sectional study. Data were collected using an anonymous, self-administered questionnaire. We examined the level of ethical endorsement of the items and the ethical performance of health care professionals using a Rasch multidimensional model. We analysed differences in ethical performance between groups according to sex, profession and knowledge of ethical norms. A total of 452 professionals from 56 PHC centres participated. The level of ethical performance was lower in items related to patient autonomy and respecting patient choices. The item estimate across all dimensions showed that professionals found it most difficult to endorse avoiding interruptions when seeing patients. We found significant differences in two groups: nurses had greater ethical performance than family physicians (p < 0.05), and professionals who reported having effective knowledge of ethical norms had a higher level of ethical performance (p < 0.01). Paternalistic behaviour persists in PHC. Lesser endorsement of items suggests that patient-centred care and patient autonomy are not fully considered by professionals. Ethical sensitivity could improve if patients are cared for by multidisciplinary teams.
The Swedish version of the multidimensional scale of perceived social support (MSPSS)--a psychometric evaluation study in women with hirsutism and nursing students.

PubMed

Ekbäck, Maria; Benzein, Eva; Lindberg, Magnus; Arestedt, Kristofer

2013-10-10

The Multidimensional Scale of Perceived Social Support (MSPSS) is a short instrument, developed to assess perceived social support. The original English version has been widely used. The original scale has demonstrated satisfactory psychometric properties in different settings, but no validated Swedish version has been available. The aim was therefore to translate, adapt and psychometrically evaluate the Multidimensional Scale of Perceived Social Support for use in a Swedish context. In total, 281 participants agreed to join the study: a main sample of 127 women with hirsutism and a reference sample of 154 nursing students. The MSPSS was translated and culturally adapted according to the rigorous official process approved by WHO. The psychometric evaluation included item analysis, evaluation of factor structure, known-group validity, internal consistency and reproducibility. The original three-factor structure was reproduced in the main sample of women with hirsutism. An equivalent factor structure was demonstrated in a cross-validation, based on the reference sample of nursing students. Known-group validity was supported and internal consistency was good for all scales (α = 0.91-0.95). The test-retest showed acceptable to very good reproducibility for the items (κw = 0.58-0.85) and the scales (ICC = 0.89-0.92; CCC = 0.89-0.92). The Swedish version of the MSPSS is a multidimensional scale with sound psychometric properties in the present study sample. The simple and short format makes it a useful tool for measuring perceived social support.

Development, Content Validity, and User Review of a Web-based Multidimensional Pain Diary for Adolescent and Young Adults With Sickle Cell Disease.

PubMed

Bakshi, Nitya; Stinson, Jennifer N; Ross, Diana; Lukombo, Ines; Mittal, Nonita; Joshi, Saumya V; Belfer, Inna; Krishnamurti, Lakshmanan

2015-06-01

Vaso-occlusive pain, the hallmark of sickle cell disease (SCD), is a major contributor to morbidity, poor health-related quality of life, and health care utilization associated with this disease. There is wide variation in the burden, frequency, and severity of pain experienced by patients with SCD. As compared with health care utilization for pain, a daily pain diary captures the breadth of the pain experience and is a superior measure of pain burden and its impact on patients. Electronic pain diaries based on real-time data capture methods overcome methodological barriers and limitations of paper pain diaries, but their psychometric properties have not been formally established in patients with SCD. To develop and establish the content validity of a web-based multidimensional pain diary for adolescents and young adults with SCD and conduct an end-user review to refine the prototype. Following identification of items, a conceptual model was developed. Interviews with adolescents and young adults with SCD were conducted. Subsequently, end-user review with use of the electronic pain diary prototype was conducted. Two iterative cycles of in-depth cognitive interviews in adolescents and young adults with SCD informed the design and guided the addition, removal, and modification of items in the multidimensional pain diary. Potential end-users provided positive feedback on the design and prototype of the electronic diary. A multidimensional web-based electronic pain diary for adolescents and young adults with SCD has been developed and content validity and initial end-user reviews have been completed.
Factor and Rasch analysis of the Fonseca anamnestic index for the diagnosis of myogenous temporomandibular disorder.

PubMed

Rodrigues-Bigaton, Delaine; de Castro, Ester M; Pires, Paulo F

Rasch analysis has been used in recent studies to test the psychometric properties of a questionnaire. The conditions for use of the Rasch model are one-dimensionality (assessed via prior factor analysis) and local independence (the probability of getting a particular item right or wrong should not be conditioned upon success or failure in another). To evaluate the dimensionality and the psychometric properties of the Fonseca anamnestic index (FAI), such as the fit of the data to the model, the degree of difficulty of the items, and the ability to respond in patients with myogenous temporomandibular disorder (TMD). The sample consisted of 94 women with myogenous TMD, diagnosed by the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD), who answered the FAI. For the factor analysis, we applied the Kaiser-Meyer-Olkin test, Bartlett's sphericity, Spearman's correlation, and the determinant of the correlation matrix. For extraction of the factors/dimensions, an eigenvalue >1.0 was used, followed by oblique oblimin rotation. The Rasch analysis was conducted on the dimension that showed the highest proportion of variance explained. Adequate sample "n" and FAI multidimensionality were observed. Dimension 1 (primary) consisted of items 1, 2, 3, 6, and 7. All items of dimension 1 showed adequate fit to the model, being observed according to the degree of difficulty (from most difficult to easiest), respectively, items 2, 1, 3, 6, and 7. The FAI presented multidimensionality with its main dimension consisting of five reliable items with adequate fit to the composition of its structure. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
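The record above orders the items of dimension 1 by Rasch difficulty (items 2, 1, 3, 6, 7, from most difficult to easiest). As an illustrative sketch of the dichotomous Rasch item response function, the difficulty values below are hypothetical placeholders chosen only to respect that ordering; the abstract does not report the estimates, and the function name `rasch_probability` is likewise an assumption.

```python
import math


def rasch_probability(theta, difficulty):
    """Probability of endorsing an item under the dichotomous Rasch model.

    P(X = 1 | theta, b) = exp(theta - b) / (1 + exp(theta - b))
    """
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


# Hypothetical difficulties (logits), ordered as in the abstract:
# item 2 hardest, item 7 easiest. The real estimates are not reported there.
hypothetical_difficulty = {2: 1.0, 1: 0.5, 3: 0.0, 6: -0.5, 7: -1.0}

theta = 0.0  # a respondent at the middle of the latent scale
for item, b in hypothetical_difficulty.items():
    print(f"item {item}: P(endorse) = {rasch_probability(theta, b):.3f}")
```

For a fixed person location, the endorsement probability falls as item difficulty rises, which is what ordering items "from most difficult to easiest" expresses.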
Development and Psychometric Evaluation of a Health-Related Quality of Life Instrument for Individuals with Adult-Onset Hearing Loss

PubMed Central

Stika, Carren J.; Hays, Ron D.

2016-01-01

Objective: Self-reports of “hearing handicap” are available, but a comprehensive measure of health-related quality of life (HRQOL) for individuals with adult-onset hearing loss (AOHL) does not exist. Our objective was to develop and evaluate a multidimensional HRQOL instrument for individuals with AOHL. Design: The Impact of Hearing Loss Inventory Tool (IHEAR-IT) was developed using results of focus groups, a literature review, Advisory Expert Panel input, and cognitive interviews. Study Sample: The 73-item field-test instrument was completed by 409 adults (22-91 years old) with varying degrees of AOHL and from different areas of the US. Results: Multitrait scaling analysis supported four multi-item scales and five individual items. Internal consistency reliabilities ranged from 0.93 to 0.96 for the scales. Construct validity was supported by correlations between the IHEAR-IT scales and scores on the 36-Item Short Form Health Survey, Version 2.0 (SF-36v2) Mental Composite Summary (r’s = 0.32 – 0.64) and the Hearing Handicap Inventory for the Elderly/Adults (HHIE/HHIA) (r’s > −0.70). Conclusions: The field test provides initial support for the reliability and construct validity of the IHEAR-IT for evaluating HRQOL of individuals with AOHL. Further research is needed to evaluate the responsiveness to change of the IHEAR-IT scales and identify items for a short-form. PMID:27104754
Punishment Insensitivity in Early Childhood: A Developmental, Dimensional Approach.

PubMed

Nichols, Sara R; Briggs-Gowan, Margaret J; Estabrook, Ryne; Burns, James L; Kestler, Jacqueline; Berman, Grace; Henry, David B; Wakschlag, Lauren S

2015-08-01

Impairment in learning from punishment ("punishment insensitivity") is an established feature of severe antisocial behavior in adults and youth but it has not been well studied as a developmental phenomenon. In early childhood, differentiating a normal:abnormal spectrum of punishment insensitivity is key for distinguishing normative misbehavior from atypical manifestations. This study employed a novel measure, the Multidimensional Assessment Profile of Disruptive Behavior (MAP-DB), to examine the distribution, dimensionality, and external validity of punishment insensitivity in a large, demographically diverse community sample of preschoolers (3-5 years) recruited from pediatric clinics (N = 1,855). Caregivers completed surveys from which a seven-item Punishment Insensitivity scale was derived. Findings indicated that Punishment Insensitivity behaviors are relatively common in young children, with at least 50% of preschoolers exhibiting them sometimes. Item response theory analyses revealed a Punishment Insensitivity spectrum. Items varied along a severity continuum: most items needed to occur "Often" in order to be severe and behaviors that were qualitatively atypical or intense were more severe. Although there were item-level differences across sociodemographic groups, these were small. Construct, convergent, and divergent validity were demonstrated via association to low concern for others and noncompliance, motivational regulation, and a disruptive family context. Incremental clinical utility was demonstrated in relation to impairment. Early childhood punishment insensitivity varies along a severity continuum and is atypical when it predominates. Implications for understanding the phenomenology of emergent disruptive behavior are discussed.
Validation of the Multidimensional Acculturative Stress Inventory on adolescents of Mexican origin.

PubMed

Rodriguez, Norma; Flores, Thomas; Flores, Ramon T; Myers, Hector F; Vriesema, Christine Calderon

2015-12-01

The Multidimensional Acculturative Stress Inventory (MASI), a 36-item measure that assesses acculturative stress among people of Mexican origin living in the United States, was tested on 331 adolescent (14-20 years of age) high school students (204 female, 127 male) of Mexican origin. Exploratory factor analyses yielded 4 factors: bicultural practices conflict (9 items), Spanish competency pressures (8 items), English competency pressures (8 items), and bicultural self-consciousness (2 items). These factors accounted for 59.5% of the variance and correlated in the expected directions with criterion measures of acculturation and the Psychological General Well-Being Schedule. Bicultural practices conflict and bicultural self-consciousness emerged as the first and fourth factors for adolescents, which differed from the last 2 factors observed in a previous study of adults by Rodriguez, Myers, Mira, Flores, and Garcia-Hernandez (2002)--pressure to acculturate and pressure against acculturation. Comparisons of the MASI factor structures between adolescents and adults also revealed that English competency pressures and Spanish competency pressures played a prominent role for both adolescents in this study and adults in the study by Rodriguez et al. (2002). The congruence and difference in factor structure of the MASI between adolescents and adults indicates that both groups experience acculturative stress because of English- and Spanish-language competency pressures, but adolescents differentially experience difficulties in negotiating between American and Latino practices and identities. The results highlight the importance of assessing acculturative stress from both Latino and American culture and recognizing the varying levels of these sources of acculturative stress by generation. (c) 2015 APA, all rights reserved.
Validity and reliability of the Multidimensional Body Image Scale in Malaysian university students.

PubMed

Gan, W Y; Mohd, Nasir M T; Siti, Aishah H; Zalilah, M S

2012-12-01

This study aimed to evaluate the validity and reliability of the Multidimensional Body Image Scale (MBIS), a seven-factor, 62-item scale developed for Malaysian female adolescents. This scale was evaluated among male and female Malaysian university students. A total of 671 university students (52.2% women and 47.8% men) completed a self-administered questionnaire on MBIS, Eating Attitude Test-26, and Rosenberg Self-Esteem Scale. Their height and weight were measured. Results in confirmatory factor analysis showed that the 62-item MBIS reported poor fit to the data, chi2/df = 4.126, p < 0.001, CFI = 0.808, SRMR = 0.070, RMSEA = 0.068 (90% CI = 0.067, 0.070). After re-specification of the model, the model fit was improved with 46 items remaining, chi2/df = 3.346, p < 0.001, CFI = 0.903, SRMR = 0.053, RMSEA = 0.059 (90% CI = 0.057, 0.061), and the model showed good fit to the data for men and women separately. This 46-item MBIS had good internal consistency in both men (Cronbach's alpha = 0.88) and women (Cronbach's alpha = 0.92). In terms of construct validity, it showed positive correlations with disordered eating and body weight status, but negative correlation with self-esteem. Also, this scale discriminated well between participants with and without disordered eating. The MBIS-46 demonstrated good reliability and validity for the evaluation of body image among university students. Further studies need to be conducted to confirm the validation results of the 46-item MBIS.
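The fit indices in the record above can be cross-checked against one another. The sketch below uses a common point-estimate formula, RMSEA = sqrt(max(chi2/df - 1, 0) / (N - 1)), and assumes that all 671 students entered the confirmatory factor analysis; the abstract states neither the formula nor the analysis sample explicitly, and some software divides by N rather than N - 1. The function name `rmsea_from_ratio` is hypothetical.

```python
import math


def rmsea_from_ratio(chi2_over_df, n):
    """RMSEA point estimate from the chi-square/df ratio and sample size n.

    Uses sqrt(max(chi2/df - 1, 0) / (n - 1)); some programs use n instead
    of n - 1 in the denominator.
    """
    return math.sqrt(max(chi2_over_df - 1.0, 0.0) / (n - 1))


# Values reported in the abstract; n = 671 is assumed to be the CFA sample.
original_model = rmsea_from_ratio(4.126, 671)      # 62-item model
respecified_model = rmsea_from_ratio(3.346, 671)   # 46-item model
print(round(original_model, 3), round(respecified_model, 3))  # 0.068 0.059
```

Under these assumptions the computed values agree with the reported RMSEA of 0.068 and 0.059.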
Reliability of perceived neighbourhood conditions and the effects of measurement error on self-rated health across urban and rural neighbourhoods.

PubMed

Pruitt, Sandi L; Jeffe, Donna B; Yan, Yan; Schootman, Mario

2012-04-01

Limited psychometric research has examined the reliability of self-reported measures of neighbourhood conditions, the effect of measurement error on associations between neighbourhood conditions and health, and potential differences in the reliabilities between neighbourhood strata (urban vs rural and low vs high poverty). We assessed overall and stratified reliability of self-reported perceived neighbourhood conditions using five scales (social and physical disorder, social control, social cohesion, fear) and four single items (multidimensional neighbouring). We also assessed measurement error-corrected associations of these conditions with self-rated health. Using random-digit dialling, 367 women without breast cancer (matched controls from a larger study) were interviewed twice, 2-3 weeks apart. Test-retest (intraclass correlation coefficients (ICC)/weighted κ) and internal consistency reliability (Cronbach's α) were assessed. Differences in reliability across neighbourhood strata were tested using bootstrap methods. Regression calibration corrected estimates for measurement error. All measures demonstrated satisfactory internal consistency (α ≥ 0.70) and either moderate (ICC/κ=0.41-0.60) or substantial (ICC/κ=0.61-0.80) test-retest reliability in the full sample. Internal consistency did not differ by neighbourhood strata. Test-retest reliability was significantly lower among rural (vs urban) residents for two scales (social control, physical disorder) and two multidimensional neighbouring items; test-retest reliability was higher for physical disorder and lower for one multidimensional neighbouring item among the high (vs low) poverty strata. After measurement error correction, the magnitude of associations between neighbourhood conditions and self-rated health was larger, particularly in the rural population. Research is needed to develop and test reliable measures of perceived neighbourhood conditions relevant to the health of rural populations.
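The record above notes that associations grew larger after regression calibration. A deliberately simplified sketch of why follows: with a single error-prone predictor under classical measurement error, the observed slope is attenuated by the reliability ratio, so dividing by that ratio recovers a larger corrected slope. The function name `calibrate_slope` and the numbers are hypothetical, and the study's actual models (likely multivariable) are not described in the abstract.

```python
def calibrate_slope(observed_slope, reliability):
    """Correct a regression slope for classical measurement error.

    Single-predictor simplification: observed = true * reliability,
    so true = observed / reliability. Reliability must lie in (0, 1].
    """
    if not 0.0 < reliability <= 1.0:
        raise ValueError("reliability must be in the interval (0, 1]")
    return observed_slope / reliability


# Hypothetical observed slope with reliabilities in the "moderate" and
# "substantial" test-retest bands mentioned in the abstract.
for reliability in (0.50, 0.70):
    corrected = calibrate_slope(0.20, reliability)
    print(f"reliability {reliability:.2f}: corrected slope {corrected:.3f}")
```

The lower the reliability, the larger the correction, which is consistent with the bigger change reported for the rural population, where test-retest reliability was lower on several measures.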
An Evaluation of the Quick Inventory of Depressive Symptomatology and the Hamilton Rating Scale for Depression: A Sequenced Treatment Alternatives to Relieve Depression Trial Report

PubMed Central

Rush, A. John; Bernstein, Ira H.; Trivedi, Madhukar H.; Carmody, Thomas J.; Wisniewski, Stephen; Mundt, James C.; Shores-Wilson, Kathy; Biggs, Melanie M.; Woo, Ada; Nierenberg, Andrew A.; Fava, Maurizio

2010-01-01

Background: Nine DSM-IV-TR criterion symptom domains are evaluated to diagnose major depressive disorder (MDD). The Quick Inventory of Depressive Symptomatology (QIDS) provides an efficient assessment of these domains and is available as a clinician rating (QIDS-C16), a self-report (QIDS-SR16), and in an automated, interactive voice response (IVR) (QIDS-IVR16) telephone system. This report compares the performance of these three versions of the QIDS and the 17-item Hamilton Rating Scale for Depression (HRSD17). Methods: Data were acquired at baseline and exit from the first treatment step (citalopram) in the Sequenced Treatment Alternatives to Relieve Depression (STAR*D) trial. Outpatients with nonpsychotic MDD who completed all four ratings within ±2 days were identified from the first 1500 STAR*D subjects. Both item response theory and classical test theory analyses were conducted. Results: The three methods for obtaining QIDS data produced consistent findings regarding relationships between the nine symptom domains and overall depression, demonstrating interchangeability among the three methods. The HRSD17, while generally satisfactory, rarely utilized the full range of item scores, and evidence suggested multidimensional measurement properties. Conclusions: In nonpsychotic MDD outpatients without overt cognitive impairment, clinician assessment of depression severity using either the QIDS-C16 or HRSD17 may be successfully replaced by either the self-report or IVR version of the QIDS. PMID:16199008
Questionnaire to assess patient satisfaction with pharmaceutical care in Spanish language.

PubMed

Traverso, María Luz; Salamano, Mercedes; Botta, Carina; Colautti, Marisel; Palchik, Valeria; Pérez, Beatriz

2007-08-01

To develop and validate a questionnaire, in Spanish, for assessing patient satisfaction with pharmaceutical care received in community pharmacies. Selection and translation of questionnaire's items; definition of response scale and demographic questions. Evaluation of face and content validity, feasibility, factor structure, reliability and construct validity. Forty-one community pharmacies of the province of Santa Fe, Argentina. Questionnaire administered to patients receiving pharmaceutical care or traditional pharmacy services. Pilot test to assess feasibility. Factor analysis used principal components and varimax rotation. Reliability established using internal consistency with Cronbach's alpha. Construct validity determined with extreme group method. A self-administered questionnaire with 27 items, 5-point Likert response scale and demographic questions was designed considering multidimensional structure of patient satisfaction. Questionnaire evaluates cumulative experience of patients with comprehensive pharmaceutical care practice in community pharmacies. Two hundred and seventy-four complete questionnaires were obtained. Factor analysis resulted in three factors: Managing therapy, Interpersonal relationship and General satisfaction, with a cumulative variance of 62.51%. Cronbach's alpha for the whole questionnaire was 0.96, and 0.95, 0.88 and 0.76 for the three factors, respectively. Mann-Whitney test for construct validity did not show significant differences between pharmacies that provide pharmaceutical care and those that do not; however, 23 items showed significant differences between the two groups of pharmacies. The questionnaire developed can be a reliable and valid instrument to assess patient satisfaction with pharmaceutical care in community pharmacies in Spanish. Further research is needed to deepen the validation process.
Applying item response theory and computer adaptive testing: the challenges for health outcomes assessment.

PubMed

Fayers, Peter M

2007-01-01

We review the papers presented at the NCI/DIA conference, to identify areas of controversy and uncertainty, and to highlight those aspects of item response theory (IRT) and computer adaptive testing (CAT) that require theoretical or empirical research in order to justify their application to patient reported outcomes (PROs). IRT and CAT offer exciting potential for the development of a new generation of PRO instruments. However, most of the research into these techniques has been in non-healthcare settings, notably in education. Educational tests are very different from PRO instruments, and consequently problematic issues arise when adapting IRT and CAT to healthcare research. Clinical scales differ appreciably from educational tests, and symptoms have characteristics distinctly different from examination questions. This affects the transferring of IRT technology. Particular areas of concern when applying IRT to PROs include inadequate software, difficulties in selecting models and communicating results, insufficient testing of local independence and other assumptions, and a need for guidelines for estimating sample size requirements. Similar concerns apply to differential item functioning (DIF), which is an important application of IRT. Multidimensional IRT is likely to be advantageous only for closely related PRO dimensions. Although IRT and CAT provide appreciable potential benefits, there is a need for circumspection. Not all PRO scales are necessarily appropriate targets for this methodology. Traditional psychometric methods, and especially qualitative methods, continue to have an important role alongside IRT. Research should be funded to address the specific concerns that have been identified.
Depressive Mood and Social Maladjustment: Differential Effects on Academic Achievement

ERIC Educational Resources Information Center

Aluja, Anton; Blanch, Angel

2004-01-01

The Children Depression Inventory (CDI) is a multidimensional instrument that includes items of social withdrawal, anhedonia, asthenia, low self-esteem (internalized) and behavioral problems (externalized). Child depression has been related with low academic achievement, neurotic and introverted personality traits and social maladjustment defined…
Measuring Developmental Students' Mathematics Anxiety

ERIC Educational Resources Information Center

Ding, Yanqing

2016-01-01

This study conducted an item-level analysis of mathematics anxiety and examined the dimensionality of mathematics anxiety in a sample of developmental mathematics students (N = 162) by Multi-dimensional Random Coefficients Multinomial Logit Model (MRCMLM). The results indicate a moderately correlated factor structure of mathematics anxiety (r =…
Development of a multidimensional measure for recurrent abdominal pain in children: population-based studies in three settings.

PubMed

Malaty, Hoda M; Abudayyeh, Suhaib; O'Malley, Kimberly J; Wilsey, Michael J; Fraley, Ken; Gilger, Mark A; Hollier, David; Graham, David Y; Rabeneck, Linda

2005-02-01

Recurrent abdominal pain (RAP) is a common problem in children and adolescents. Evaluation and treatment of children with RAP continue to challenge physicians because of the lack of a psychometrically sound measure for RAP. A major obstacle to progress in research on RAP has been the lack of a biological marker for RAP and the lack of a reliable and valid clinical measure for RAP. The objectives of this study were (1) to develop and test a multidimensional measure for RAP (MM-RAP) in children to serve as a primary outcome measure for clinical trials, (2) to evaluate the reliability of the measure and compare its responses across different populations, and (3) to examine the reliabilities of the measure scales in relation to the demographic variables of the studied population. We conducted 3 cross-sectional studies. Two studies were clinic-based studies that enrolled children with RAP from 1 pediatric gastroenterology clinic and 6 primary care clinics. The third study was a community-based study in which children from 1 elementary and 2 middle schools were screened for frequent episodes of abdominal pain. The 3 studies were conducted in Houston, Texas. Inclusion criteria for the clinic-based studies were (1) age of 4 to 18 years; (2) abdominal pain that had persisted for 3 or more months; (3) abdominal pain that was moderate to severe and interfered with some or all regular activities; (4) abdominal pain that may or may not be accompanied by upper-gastrointestinal symptoms; and (5) children were accompanied by a parent or guardian who was capable of giving informed consent, and children over the age of 10 years were capable of giving informed assent. The community-based study used standardized questionnaires that were offered to 1080 children/parents from the 3 participating schools; 700 completed and returned the questionnaires (65% response rate). The questionnaire was designed to elicit data concerning the history of abdominal pain or discomfort. 
A total of 160 children met Apley's criteria and were classified as having RAP. Inclusion criteria were identical to those criteria for the clinic-based studies. Participating children in the 3 studies received a standardized questionnaire that asked about socioeconomic variables, abdominal pain (intensity; frequency; duration; nature of abdominal pain, if present, and possible relationships with school activities; and other upper gastrointestinal symptoms). We used 4 scales for the MM-RAP: pain intensity scale (3 items), nonpain symptoms scale (12 items), disability scale (3 items), and satisfaction scale (2 items). Age 7 was used as a cutoff point for the analysis as the 7-year-olds have been shown to exhibit more sophisticated knowledge of illness than younger children. A total of 295 children who were aged 4 to 18 years participated in the study: 155 children from the pediatric gastroenterology clinics, 82 from the primary care clinics, and 58 from the schools. The interitem consistencies (Cronbach's coefficient alpha) for the pain intensity items, nonpain symptoms items, disability items, and satisfaction items were 0.75, 0.81, 0.80, and 0.78, respectively, demonstrating good reliability of the measure. The internal consistencies of the 4 scales did not significantly differ between younger (≤7 years) and older (>7 years) children. There was also no significant variation in the coefficient alpha of each of the 4 scales in relation to gender or the level of the parent's education. Reliability was identical for the pain-intensity items (0.74) among children who sought medical attention from primary care or pediatric gastroenterology clinics. The intercorrelations of factor scores among the 4 scales showed a strong relationship among the factors but not high enough that correlations would be expected to be measuring the same items. The results of the factor analysis identified 5 components instead of 4 components representing the 4 scales.
The 12 items of the nonpain symptoms scale were classified into 2 components; 1 component included heartburn, burping, passing gas, bloating, problem with ingestion of milk, bad breath, and sour taste (nonpain symptoms I), and the other included nausea/vomiting, diarrhea, and constipation (nonpain symptoms II). The program ordered the 5 components on the basis of the percentage of the total variance explained by each component and consequently by the strength of each component in the following order: nonpain symptoms I, pain intensity, pain disability, satisfaction, and nonpain symptoms II. Of the 20 items that composed the MM-RAP, 17 met the inclusion criteria of having a correlation of ≥0.40 on the primary factor analyses. The 3 items that assessed pain intensity met the inclusion criteria as well as the 2 items that assessed satisfaction. Two of the 3 items that assessed disability met the inclusion criteria; however, the missed school item did not. The sleep problem and the loss of appetite items in the nonpain items also did not meet the inclusion criteria in both components of the nonpain symptoms scale. However, the loss of appetite item met the inclusion criteria in the disability scale with a correlation of 0.6. The 2 items that did not meet the inclusion criteria (missed school days and sour taste) will be eliminated in the revised measure for RAP. The MM-RAP demonstrated good reliability evidence in population samples. Children who have RAP and are seen at pediatric gastroenterology or primary care pediatric clinics have similar responses, showing that the measure performed well across several populations. Age did not affect the reliability of responses. The MM-RAP included 4 dimensions, each with several items that may identify disease-specific dimensions. In addition, dividing the nonpain symptoms scale into 2 components instead of 1 component could assist in creating a disease-specific measure.
The present study focused exclusively on developing the multidimensional measure for RAP in children that could assist physicians in evaluating the efficacy of RAP treatment independent of psychological evaluations. In addition, the measure was designed for use in clinical trials that evaluate the efficacy of RAP treatment and to allow comparison between intervention studies. In conclusion, we were able to identify 4 dimensions of RAP in children (pain intensity, nonpain symptoms, pain disability, and satisfaction with health). We demonstrated that these dimensions can be measured in a reliable manner that is applicable to children who experience RAP in various settings.
Psychometric properties of the multidimensional perfectionism scale of Hewitt in a Dutch-speaking sample: associations with the big five personality traits.

PubMed

De Cuyper, Kathleen; Claes, Laurence; Hermans, Dirk; Pieters, Guido; Smits, Dirk

2015-01-01

We administered the Dutch Multidimensional Perfectionism Scale of Hewitt and Flett (1991, 2004) in a large student sample (N = 959) and performed a confirmatory factor analysis to test the factorial structure proposed by the original authors. The existence of a method factor referring to the negatively keyed items in the questionnaire was investigated by including it in the tested models. Next, we investigated how the 3 perfectionism dimensions are associated with the Five-factor model (FFM) of personality. The 3-factor structure originally observed by the authors was confirmed, at least when a method factor that refers to the negatively keyed items was included in the model. Self-oriented and socially prescribed perfectionism were both distinguished by low extraversion and low emotional stability. Self-oriented perfectionism's positive relationship with both conscientiousness and openness to experience differentiated the 2 perfectionism dimensions from each other. Other-oriented perfectionism was not well-characterized by the Big Five personality traits.
A psychometric study of the multidimensional fatigue inventory to assess fatigue in patients with schizophrenia spectrum disorders.

PubMed

Hedlund, Lena; Gyllensten, Amanda Lundvik; Hansson, Lars

2015-04-01

Fatigue is frequently reported by patients with mental illness. The multidimensional fatigue inventory (MFI-20) is a self-assessment instrument with 20 items including five dimensions of fatigue. The purpose of this study was to examine the test-retest reliability, internal consistency, convergent construct validity and feasibility of using MFI-20 in patients with schizophrenia spectrum disorders. Patients completed two self-assessment instruments, MFI-20 (n = 93) and Visual Analogue Scale (n = 79), twice within 1 week ± 2 days. Fifty-three patients also rated the feasibility of responding to the MFI-20 with a Likert scale. The test-retest reliability and validity were analysed by using Spearman's correlations and internal consistency by calculating Cronbach's α. The test-retest showed a correlation between .66 and .91 for all subscales of MFI. The internal consistency was .92. The analysis of convergent construct validity showed a correlation of .68 (time 1) and .77 (time 2). No item was systematically identified as being difficult to answer.
Factoring handedness data: II. Geschwind's multidimensional hypothesis.

PubMed

Messinger, H B; Messinger, M I

1996-06-01

The challenge in this journal by Peters and Murphy to the validity of two published factor analyses of handedness data because of bimodality was dealt with in Part I by identifying measures to normalize the handedness item distributions. A new survey using Oldfield's questionnaire format had 38 bell-shaped (unimodal) handedness-item distributions and 11 that were only marginally bimodal out of the 55 items used in Geschwind's 1986 study. Yet they were still non-normal and the factor analysis was unsatisfactory; bimodality is not the only problem. By choosing a transformation for each item that was optimal as assessed by D'Agostino's K2 statistic, all but two items could be normalized. Seven factors were derived that showed high congruence between maximum likelihood and principal components extractions before and after varimax rotation. Geschwind's assertion that handedness is not unidimensional is therefore supported.
Detecting When “Quality of Life” Has Been “Enhanced”: Estimating Change in Quality of Life Ratings

PubMed Central

Tractenberg, Rochelle E.; Yumoto, Futoshi; Aisen, Paul S.

2015-01-01

Objective: To demonstrate challenges in the estimation of change in quality of life (QOL). Methods: Data were taken from a completed clinical trial with negative results. Responses to 13 QOL items were obtained 12 months apart from 258 persons with Alzheimer’s disease (AD) participating in a randomized, placebo-controlled clinical trial with two treatment arms. Two analyses to estimate whether “change” in QOL occurred over 12 months are described. A simple difference (later - earlier) was calculated from total scores (standard approach). A Qualified Change algorithm (novel approach) was applied to each item: differences in ratings were classified as either: improved, worsened, stayed poor, or stayed “positive” (fair, good, excellent). The strengths of evidence supporting a claim that “QOL changed”, derived from the two analyses, were compared by considering plausible alternative explanations for, and interpretations of, results obtained under each approach. Results: Total score approach: QOL total scores decreased, on average, in the two treatment (both −1.0, p < 0.05), but not the placebo (=−0.59, p > 0.3) groups. Qualified change approach: Roughly 60% of all change in QOL items was worsening in every arm; 17% - 42% of all subjects experienced change in each item. Conclusions: Totalling the subjective QOL item ratings collapses over items, and suggests a potentially misleading “overall” level of change (or no change, as in the placebo arm). Leaving the items as individual components of “quality” of life they were intended to capture, and qualifying the direction and amount of change in each, suggests that at least 17% of any group experienced change on every item, with 60% of all observed change being worsening. Discussion: Summarizing QOL item ratings as a total “score” collapses over the face-valid, multi-dimensional components of the construct “quality of life”. Qualified Change provides robust evidence of changes to QOL or “enhancements of” life quality.
PMID:26213645
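The item-level classification described in the abstract above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' algorithm: the four-level ordering poor < fair < good < excellent, the rule that any change in level counts as improved or worsened, and the names `qualify_change`, `summarize`, and `total_score_change` are all hypothetical.

```python
from collections import Counter

# Assumed ordinal coding of the rating scale (not specified in the abstract).
LEVELS = {"poor": 0, "fair": 1, "good": 2, "excellent": 3}


def qualify_change(earlier, later):
    """Classify one item's pair of ratings into one of four categories."""
    a, b = LEVELS[earlier], LEVELS[later]
    if b > a:
        return "improved"
    if b < a:
        return "worsened"
    # Unchanged rating: distinguish "poor" from the "positive" levels.
    return "stayed poor" if a == 0 else "stayed positive"


def summarize(pairs):
    """Count the categories over (earlier, later) rating pairs."""
    return Counter(qualify_change(e, l) for e, l in pairs)


def total_score_change(pairs):
    """Simple difference (later - earlier) of summed item codes."""
    return sum(LEVELS[l] - LEVELS[e] for e, l in pairs)


# Made-up ratings for one person on four items.
pairs = [("good", "fair"), ("fair", "good"),
         ("poor", "poor"), ("excellent", "excellent")]
net = total_score_change(pairs)
counts = summarize(pairs)
```

In this toy case the total-score difference is zero even though one item improved and one worsened, which is the kind of collapsing over items that the abstract cautions against.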
Markov blanket-based approach for learning multi-dimensional Bayesian network classifiers: an application to predict the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39).

PubMed

Borchani, Hanen; Bielza, Concha; Martínez-Martín, Pablo; Larrañaga, Pedro

2012-12-01

Multi-dimensional Bayesian network classifiers (MBCs) are probabilistic graphical models recently proposed to deal with multi-dimensional classification problems, where each instance in the data set has to be assigned to more than one class variable. In this paper, we propose a Markov blanket-based approach for learning MBCs from data. Basically, it consists of determining the Markov blanket around each class variable using the HITON algorithm, then specifying the directionality over the MBC subgraphs. Our approach is applied to the prediction problem of the European Quality of Life-5 Dimensions (EQ-5D) from the 39-item Parkinson's Disease Questionnaire (PDQ-39) in order to estimate the health-related quality of life of Parkinson's patients. Fivefold cross-validation experiments were carried out on randomly generated synthetic data sets, Yeast data set, as well as on a real-world Parkinson's disease data set containing 488 patients. The experimental study, including comparison with additional Bayesian network-based approaches, back propagation for multi-label learning, multi-label k-nearest neighbor, multinomial logistic regression, ordinary least squares, and censored least absolute deviations, shows encouraging results in terms of predictive accuracy as well as the identification of dependence relationships among class and feature variables. Copyright © 2012 Elsevier Inc. All rights reserved.
Invariance test of the Multidimensional Body Self-Relations Questionnaire: do women with breast cancer interpret this measure differently?

PubMed

Sabiston, Catherine M; Rusticus, Shayna; Brunet, Jennifer; McDonough, Meghan H; Hadd, Valerie; Hubley, Anita M; Crocker, Peter R E

2010-10-01

To examine whether the meaning and interpretation of body image are similar for breast cancer survivors and women without breast cancer. Women completed the Multidimensional Body Self-Relations Questionnaire--Appearance Scales as part of two studies. There were 469 women with breast cancer and 385 women without breast cancer. Invariance testing was conducted to examine whether the items assessing the body image dimensions were similar, whether the dimensions were interpreted similarly, whether the items were equally salient and meaningful, and whether there were mean differences on the body image dimensions across the two groups. The meaning and interpretation of body image dimensions related to appearance evaluation and appearance orientation were similar across the groups, yet some group differences were found for overweight preoccupation and body areas satisfaction (and not testable for self-classified weight). Breast cancer survivors reported a small yet significantly higher mean on appearance evaluation and lower mean on appearance orientation compared to the women without breast cancer. Meaningful comparisons in body image across cancer and non-cancer women can be made using two of the Multidimensional Body Self-Relations Questionnaire--Appearance Scales. The overweight preoccupation subscale could be used to assess body image but should not be used if group mean differences are desirable. Assessing satisfaction with body areas across these groups is not recommended and may introduce systematic bias.
Development and initial validation of a caffeine craving questionnaire.

PubMed

West, Oliver; Roderique-Davies, Gareth

2008-01-01

Craving for caffeine has received little empirical attention, despite considerable research into the potential for caffeine dependence. The main aim of this study was to develop, and initially validate, a multi-item, multidimensional instrument to measure cravings for caffeine. Participants were 189 caffeine consumers who completed the Questionnaire of Caffeine Cravings, which was based on the Questionnaire of Smoking Urges (QSU), in one of five naturally occurring periods of abstinence: 1-15 min; 16-120 min; 3-7 h; 12-48 h and +48 h. Exploratory factor analysis suggested a three-factor solution best described the data: Factor 1 reflected strong desires, intentions and positive reinforcement; Factor 2 reflected mild/general positive and negative reinforcement and Factor 3 reflected functional/mood-based negative reinforcement. Significantly higher Factor 1 and Factor 2 scores were recorded for high frequency users; significantly higher Factor 1 and Factor 3 scores were recorded as a function of increased levels of dependence. Duration of abstinence did not significantly affect cravings across all three factors. Regression analyses suggested level of dependence best predicted both current cravings and frequency of daily use. These findings suggest caffeine cravings may be conceptualized multidimensionally and further validate the use of multidimensional, multi-item instruments. Cravings for caffeine may manifest and be detected across varying levels of dependence and frequency of use, and independently of duration of abstinence.

The blood donor identity survey: a multidimensional measure of blood donor motivations.

PubMed

France, Christopher R; Kowalsky, Jennifer M; France, Janis L; Himawan, Lina K; Kessler, Debra A; Shaz, Beth H

2014-08-01

Evidence indicates that donor identity is an important predictor of donation behavior; however, prior studies have relied on diverse, unidimensional measures with limited psychometric support. The goals of this study were to examine the application of self-determination theory to blood donor motivations and to develop and validate a related multidimensional measure of donor identity. Items were developed and administered electronically to a sample of New York Blood Center (NYBC) donors (n=582) and then to a sample of Ohio University students (n=1005). Following initial confirmatory factor analysis (CFA) on the NYBC sample to identify key items related to self-determination theory's six motivational factors, a revised survey was administered to the university sample to reexamine model fit and to assess survey reliability and validity. Consistent with self-determination theory, for both samples CFAs indicated that the best fit to the data was provided by a six-motivational-factor model, including amotivation, external regulation, introjected regulation, identified regulation, integrated regulation, and intrinsic regulation. The Blood Donor Identity Survey provides a psychometrically sound, multidimensional measure of donor motivations (ranging from unmotivated to donate to increasing levels of autonomous motivation to donate) that is suitable for nondonors as well as donors with varying levels of experience. Future research is needed to examine longitudinal changes in donor identity and its relationship to actual donation behavior. © 2014 AABB.
Creating a test blueprint for a progress testing program: A paired-comparisons approach.

PubMed

von Bergmann, HsingChi; Childs, Ruth A

2018-03-01

Creating a new testing program requires the development of a test blueprint that will determine how the items on each test form are distributed across possible content areas and practice domains. To achieve validity, categories of a blueprint are typically based on the judgments of content experts. How experts' judgments are elicited and combined is important to the quality of resulting test blueprints. Content experts in dentistry participated in a day-long faculty-wide workshop to discuss, refine, and confirm the categories and their relative weights. After reaching agreement on categories and their definitions, experts judged the relative importance between category pairs, registering their judgments anonymously using iClicker, an audience response system. Judgments were combined in two ways: a simple calculation that could be performed during the workshop and a multidimensional scaling of the judgments performed later. Content experts were able to produce a set of relative weights using this approach. The multidimensional scaling yielded a three-dimensional model with the potential to provide deeper insights into the basis of the experts' judgments. The approach developed and demonstrated in this study can be applied across academic disciplines to elicit and combine content experts' judgments for the development of test blueprints.
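The abstract above does not say what the "simple calculation" was. One plausible way to turn paired-comparison votes into relative weights, shown purely as an assumption, is to give each category the share of all votes in which it was judged the more important of the pair. The function name `relative_weights` and the category labels "A", "B", "C" are hypothetical.

```python
from collections import defaultdict


def relative_weights(judgments):
    """Turn (preferred, other) votes into weights that sum to 1.

    Assumed scoring rule for illustration: weight = share of all votes
    in which the category was the preferred one.
    """
    wins = defaultdict(int)
    for preferred, other in judgments:
        wins[preferred] += 1
        wins[other] += 0  # keep never-preferred categories in the result
    total = sum(wins.values())
    return {category: count / total for category, count in wins.items()}


# Made-up votes from a panel comparing three blueprint categories.
votes = [("A", "B"), ("A", "B"), ("B", "A"), ("A", "C"), ("C", "B")]
weights = relative_weights(votes)
```

A count of this kind is easy to tally live in a workshop; it ignores how strongly each expert preferred one category, which a scaling analysis could explore further.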
Validity and reliability of the Malay version multidimensional scale of perceived social support (MSPSS-M) among teachers.

PubMed

Lee, Soo Cheng; Moy, Foong Ming; Hairi, Noran Naqiah

2017-01-01

The multidimensional scale of perceived social support (MSPSS) was developed to measure perceived social support. It has been translated and culturally adapted among natives literate in the Malay language. However, its psychometric properties among teachers, most of whom are female and married, have not been assessed. This was a cross-sectional study conducted among public secondary school teachers in the central region of Peninsular Malaysia from May to July 2013. A total of 150 and 203 teachers were recruited to perform exploratory factor analysis and confirmatory factor analysis (CFA), respectively. Reliability testing was evaluated on 141 teachers via internal consistency and two-week interval test-retest. The 12-item three-factor structure of MSPSS-M was revised to 8-item two-factor structure. The revised MSPSS-M demonstrated excellent fit in CFA with adequate divergent and convergent validity and good factor loadings (0.80-0.90). The revised MSPSS-M also displayed good internal consistency with Cronbach's alpha of 0.91, 0.93 and 0.92 and good test-retest reliability with intraclass correlation of 0.89, 0.88 and 0.88 in the total scale, family and friends factors, respectively. The revised 8-item MSPSS-M is a reliable and valid tool for assessment of perceived social support among teachers.
Measuring ambivalence to science

NASA Astrophysics Data System (ADS)

Gardner, P. L.

Ambivalence is a psychological state in which a person holds mixed feelings (positive and negative) towards some psychological object. Standard methods of attitude measurement, such as Likert and semantic differential scales, ignore the possibility of ambivalence; ambivalent responses cannot be distinguished from neutral ones. This neglect arises out of an assumption that positive and negative affects towards a particular psychological object are bipolar, i.e., unidimensional in opposite directions. This assumption is frequently untenable. Conventional item statistics and measures of test internal consistency are ineffective as checks on this assumption; it is possible for a scale to be multidimensional and still display apparent internal consistency. Factor analysis is a more effective procedure. Methods of measuring ambivalence are suggested, and implications for research are discussed.
Neural modulation of directed forgetting by valence and arousal: An event-related potential study.

PubMed

Gallant, Sara N; Dyson, Benjamin J

2016-10-01

Intentional forgetting benefits memory by removing no longer needed information and promoting processing of more relevant materials. This study sought to understand how the behavioural and neurophysiological representation of intentional forgetting would be impacted by emotion. We took a novel approach by examining the unique contribution of both valence and arousal on emotional directed forgetting. Participants completed an item directed forgetting task for positive, negative, and neutral words at high and lower levels of arousal while brain activity was recorded using electroencephalography (EEG). Behaviourally, recognition of to-be-remembered (TBR) and to-be-forgotten (TBF) items varied as a function of valence and arousal with reduced directed forgetting for high arousing negative and neutral words. In the brain, patterns of frontal and posterior activation in response to TBF and TBR cues respectively replicated prior EEG evidence to support involvement of inhibitory and selective rehearsal mechanisms in item directed forgetting. Interestingly, emotion only impacted cue-related posterior activity, which varied depending on specific interactions between valence and arousal. Together, results suggest that the brain handles valence and arousal differently and highlight the importance of considering in a collective manner the multidimensional nature of emotion in experimentation. Copyright © 2016 Elsevier B.V. All rights reserved.
Revision and psychometric testing of the City of Hope Quality of Life-Ostomy Questionnaire.

PubMed

Grant, Marcia; Ferrell, Betty; Dean, Grace; Uman, Gwen; Chu, David; Krouse, Robert

2004-10-01

Ostomies may be performed for bowel or urinary diversion, and occur in both cancer and non-cancer patients. Impact on physical, psychological, social and spiritual well-being is not unexpected, but has been minimally described in the literature. The City of Hope Quality of Life (COH-QOL)-Ostomy Questionnaire is an adult patient self-report instrument designed to assess quality of life. This report focuses on the revision and psychometric testing of this questionnaire. The revised COH-QOL-Ostomy Questionnaire involved in-depth patient interviews and expert panel review. The format consisted of a 13-item disease and demographic section, a 34-item forced-choice section, and a 41-item linear analogue scaled section. A mailed survey to California members of the United Ostomy Association resulted in a 62% response rate (n = 1513). Factor analysis was conducted to refine the instrument. Construct validity involved testing a number of hypotheses identifying contrasting groups. Factor analysis confirmed the conceptual framework. Reliability of subscales ranged from 0.77 to 0.90. The questionnaire discriminated between subpopulations with specific concerns. Overall, the analyses provide evidence for the validity and reliability of the COH-QOL-Ostomy Questionnaire as a comprehensive, multidimensional self-report questionnaire for measuring quality of life in patients with intestinal ostomies.
The Multidimensional Loss Scale: validating a cross-cultural instrument for measuring loss.

PubMed

Vromans, Lyn; Schweitzer, Robert D; Brough, Mark

2012-04-01

The Multidimensional Loss Scale (MLS) represents the first instrument designed specifically to index Experience of Loss Events and Loss Distress across multiple domains (cultural, social, material, and intrapersonal) relevant to refugee settlement. Recently settled Burmese adult refugees (N = 70) completed a questionnaire battery, including MLS items. Analyses explored MLS internal consistency, convergent and divergent validity, and factor structure. Cronbach alphas indicated satisfactory internal consistency for Experience of Loss Events (0.85) and Loss Distress (0.92), reflecting a unitary construct of multidimensional loss. Loss Distress did not correlate with depression or anxiety symptoms and correlated moderately with interpersonal grief and trauma symptoms, supporting divergent and convergent validity. Factor analysis provided preliminary support for a five-factor model: Loss of Symbolic Self, Loss of Interdependence, Loss of Home, Interpersonal Loss, and Loss of Intrapersonal Integrity. Received well by participants, the new scale shows promise for application in future research and practice.
A multidimensional approach to measuring well-being in students: Application of the PERMA framework

PubMed Central

Kern, Margaret L.; Waters, Lea E.; Adler, Alejandro; White, Mathew A.

2015-01-01

Seligman recently introduced the PERMA model with five core elements of psychological well-being: positive emotions, engagement, relationships, meaning, and accomplishment. We empirically tested this multidimensional theory with 516 Australian male students (age 13–18). From an extensive well-being assessment, we selected a subset of items theoretically relevant to PERMA. Factor analyses recovered four of the five PERMA elements, and two ill-being factors (depression and anxiety). We then explored the nomological net surrounding each factor by examining cross-sectional associations with life satisfaction, hope, gratitude, school engagement, growth mindset, spirituality, physical vitality, physical activity, somatic symptoms, and stressful life events. Factors differentially related to these correlates, offering support for the multidimensional approach to measuring well-being. Directly assessing subjective well-being across multiple domains offers the potential for schools to more systematically understand and promote well-being. PMID:25745508
Development and Psychometric Properties of a Tuberculosis-Specific Multidimensional Health-Related Quality-of-Life Measure for Patients with Pulmonary Tuberculosis.

PubMed

Abdulelah, Juman; Sulaiman, Syed Azhar Syed; Hassali, Mohamed A; Blebil, Ali Q; Awaisu, Ahmed; Bredle, Jason M

2015-05-01

Various generic instruments exist to assess health-related quality of life (HRQOL) in patients with tuberculosis (TB), but a psychometrically sound disease-specific instrument is lacking. The present study aimed to develop and psychometrically validate a multidimensional TB-specific HRQOL instrument relevant to the value of patients with pulmonary TB in Iraq with an eye toward cross-cultural application. The core general HRQOL questionnaire is composed of the Functional Assessment of Cancer Therapy-General items. A modular approach was followed for the development of the Functional Assessment of Chronic Illness Therapy-Tuberculosis (FACIT-TB) questionnaire in which a set of items assessing quality-of-life (QOL) issues not sufficiently covered by the core Functional Assessment of Cancer Therapy-General items, but considered to be relevant to the target population, was added. Moreover, principal-component analysis was used to determine the new subscale structure of the questionnaire. In addition to the 27 items of the core questionnaire, a set of 20 items referring to disease symptoms related to the site of infection, adverse effects, and additional QOL dimensions such as fatigue, social stigma, and economic burden of the illness was included. Factor analysis demonstrated that the FACIT-TB construct comprised five domains. A rigorous method was applied in the development of the FACIT-TB measure to fully understand the impact of TB on patients' QOL. The instrument is psychometrically sound and portrays multiple important dimensions of HRQOL. FACIT-TB is relatively brief, is easy to administer and score, and is appropriate for use in clinical trials and practice. Copyright © 2015 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.
Distinguishing Different Strategies of Across-Dimension Attentional Selection

ERIC Educational Resources Information Center

Huang, Liqiang; Pashler, Harold

2012-01-01

Selective attention in multidimensional displays has usually been examined using search tasks requiring the detection of a single target. We examined the ability to perceive a spatial structure in multi-item subsets of a display that were defined either conjunctively or disjunctively. Observers saw two adjacent displays and indicated whether the…
Modeling Age-Related Differences in Immediate Memory Using SIMPLE

ERIC Educational Resources Information Center

Surprenant, Aimee M.; Neath, Ian; Brown, Gordon D. A.

2006-01-01

In the SIMPLE model (Scale Invariant Memory and Perceptual Learning), performance on memory tasks is determined by the locations of items in multidimensional space, and better performance is associated with having fewer close neighbors. Unlike most previous simulations with SIMPLE, the ones reported here used measured, rather than assumed,…
Preliminary Development and Validation of the Mindful Student Questionnaire

ERIC Educational Resources Information Center

Renshaw, Tyler L.

2017-01-01

Research validating mindfulness-based interventions with youths and in schools is growing, yet research validating measures of youths' mindfulness in schools has received far less empirical attention. The present study makes the case for and reports on the preliminary development and validation of a new, 15-item, multidimensional, self-report…
Visualizing the Structure of Medical Informatics Using Term Co-Occurrence Analysis.

ERIC Educational Resources Information Center

Morris, Theodore Allan

2000-01-01

Examines the structure of medical informatics and the relationship between biomedicine and information science and information technology. Uses co-occurrence analysis of subject headings assigned to items indexed for MEDLINE as well as multidimensional scaling to show seven to eight broad multidisciplinary subject clusters. (Contains 28…
Group Trust, Communication Media, and Interactivity: Toward an Integrated Model of Online Collaborative Learning

ERIC Educational Resources Information Center

Du, Jianxia; Wang, Chuang; Zhou, Mingming; Xu, Jianzhong; Fan, Xitao; Lei, Saosan

2018-01-01

The present investigation examines the multidimensional relationships among several critical components in online collaborative learning, including group trust, communication media, and interactivity. Four hundred eleven university students from 103 groups in the United States responded to survey items on online collaboration, interactivity,…
Validation of the Adolescent Concerns Measure (ACM): evidence from exploratory and confirmatory factor analysis.

PubMed

Ang, Rebecca P; Chong, Wan Har; Huan, Vivien S; Yeo, Lay See

2007-01-01

This article reports the development and initial validation of scores obtained from the Adolescent Concerns Measure (ACM), a scale which assesses concerns of Asian adolescent students. In Study 1, findings from exploratory factor analysis using 619 adolescents suggested a 24-item scale with four correlated factors--Family Concerns (9 items), Peer Concerns (5 items), Personal Concerns (6 items), and School Concerns (4 items). Initial estimates of convergent validity for ACM scores were also reported. The four-factor structure of ACM scores derived from Study 1 was confirmed via confirmatory factor analysis in Study 2 using a two-fold cross-validation procedure with a separate sample of 811 adolescents. Support was found for both the multidimensional and hierarchical models of adolescent concerns using the ACM. Internal consistency and test-retest reliability estimates were adequate for research purposes. ACM scores show promise as a reliable and potentially valid measure of Asian adolescents' concerns.
Informed choice: understanding knowledge in the context of screening uptake.

PubMed

Michie, Susan; Dormandy, Elizabeth; Marteau, Theresa M

2003-07-01

This study evaluates a scale measuring knowledge about a screening test and investigates the association between knowledge, uptake and attitudes towards screening. One thousand four hundred ninety-nine pregnant women completed the knowledge scale of the multidimensional measure of informed choice (MMIC). Three hundred forty-five of these women and 152 professionals providing antenatal care also rated the importance of the knowledge items. Item characteristic curves show that, with one exception, the knowledge items reflect a spread of difficulty and are able to discriminate between people. All items were seen as essential or helpful by both women and health professionals, with two items seen as particularly important and one as unimportant. There were some differences between health professionals, women with low risk results and women with high risk results. Knowledge was not associated with uptake, attitude, or the extent to which uptake was consistent with women's attitudes towards undergoing the test.
PedsQL™ Multidimensional Fatigue Scale in sickle cell disease: feasibility, reliability, and validity.

PubMed

Panepinto, Julie A; Torres, Sylvia; Bendo, Cristiane B; McCavit, Timothy L; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W

2014-01-01

Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5-18 years and parent proxy-report for ages 2-18 years. This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77-0.84; parent proxy-report α = 0.90-0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effect sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. © 2013 Wiley Periodicals, Inc.
PedsQL™ Multidimensional Fatigue Scale in Sickle Cell Disease: Feasibility, Reliability and Validity

PubMed Central

Panepinto, Julie A.; Torres, Sylvia; Bendo, Cristiane B.; McCavit, Timothy L.; Dinu, Bogdan; Sherman-Bien, Sandra; Bemrich-Stolz, Christy; Varni, James W.

2013-01-01

Background: Sickle cell disease (SCD) is an inherited blood disorder characterized by a chronic hemolytic anemia that can contribute to fatigue and global cognitive impairment in patients. The study objective was to report on the feasibility, reliability, and validity of the PedsQL™ Multidimensional Fatigue Scale in SCD for pediatric patient self-report ages 5–18 years and parent proxy-report for ages 2–18 years. Procedure: This was a cross-sectional multi-site study whereby 240 pediatric patients with SCD and 303 parents completed the 18-item PedsQL™ Multidimensional Fatigue Scale. Participants also completed the PedsQL™ 4.0 Generic Core Scales. Results: The PedsQL™ Multidimensional Fatigue Scale evidenced excellent feasibility, excellent reliability for the Total Scale Scores (patient self-report α = 0.90; parent proxy-report α = 0.95), and acceptable reliability for the three individual scales (patient self-report α = 0.77–0.84; parent proxy-report α = 0.90–0.97). Intercorrelations of the PedsQL™ Multidimensional Fatigue Scale with the PedsQL™ Generic Core Scales were predominantly in the large (≥0.50) range, supporting construct validity. PedsQL™ Multidimensional Fatigue Scale Scores were significantly worse with large effect sizes (≥0.80) for patients with SCD than for a comparison sample of healthy children, supporting known-groups discriminant validity. Confirmatory factor analysis demonstrated an acceptable to excellent model fit in SCD. Conclusions: The PedsQL™ Multidimensional Fatigue Scale demonstrated acceptable to excellent measurement properties in SCD. The results demonstrate the relative severity of fatigue symptoms in pediatric patients with SCD, indicating the potential clinical utility of multidimensional assessment of fatigue in patients with SCD in clinical research and practice. PMID:24038960
Reliability and validity of the brief multidimensional measure of religiousness/spirituality among adolescents.

PubMed

Harris, Sion Kim; Sherritt, Lon R; Holder, David W; Kulig, John; Shrier, Lydia A; Knight, John R

2008-12-01

Developed for use in health research, the Brief Multidimensional Measure of Religiousness/Spirituality (BMMRS) consists of brief measures of a broad range of religiousness and spirituality (R/S) dimensions. It has established psychometric properties among adults, but little is known about its appropriateness for use with adolescents. We assessed the psychometric properties of the BMMRS among adolescents. We recruited a racially diverse (85% non-White) sample of 305 adolescents aged 12-18 years (median 16 yrs, IQR 14-17) from 3 urban medical clinics; 93 completed a retest 1 week later. We assessed internal consistency and test-retest reliability. We assessed construct validity by examining how well the measures discriminated groups expected to differ based on self-reported religious preference, and how they related to a hypothesized correlate, depressive symptoms. Religious preference was categorized into "No religion/Atheist" (11%), "Don't know/Confused" (9%), or "Named a religion" (80%). Responses to multi-item measures were generally internally consistent (alpha ≥ 0.70 for 12/16 measures) and stable over 1 week (intraclass correlation coefficients ≥ 0.70 for 14/16). Forgiveness, Negative R/S Coping, and Commitment items showed lower internal cohesiveness. Scores on most measures were higher (p < 0.05) among those who "Named a religion" compared to the "No religion/Atheist" group. Forgiveness, Commitment, and Anticipated Support from members of one's congregation were inversely correlated with depressive symptoms, while BMMRS measures assessing negative R/S experiences (Negative R/S Coping, Negative Interactions with others in congregation, Loss in Faith) were positively correlated with depressive symptoms. These findings suggest that most BMMRS measures are reliable and valid for use among adolescents.
Fatigue in children: reliability and validity of the Dutch PedsQL™ Multidimensional Fatigue Scale.

PubMed

Gordijn, M Suzanne; Cremers, Eline M P; Kaspers, Gertjan J L; Gemke, Reinoud J B J

2011-09-01

The aim of the study is to report on the feasibility, reliability, validity, and the norm-references of the Dutch version of the PedsQL™ Multidimensional Fatigue Scale. The study participants are four hundred and ninety-seven parents of children aged 2-18 years and 366 children aged 5-18 years from various day care facilities, elementary schools, and a high school who completed the Dutch version of the PedsQL™ Multidimensional Fatigue Scale. The number of missing items was minimal. All scales showed satisfactory internal consistency reliability, with Cronbach's coefficient alpha exceeding 0.70. Test-retest reliability was good to excellent (ICCs 0.68-0.84) and inter-observer reliability varied from moderate to excellent (ICCs 0.56-0.93) for total scores. Parent/child concordance for total scores was poor to good (ICCs 0.25-0.68). The PedsQL™ Multidimensional Fatigue Scale was able to distinguish between healthy children and children with an impaired health condition. The Dutch version of the PedsQL™ Multidimensional Fatigue Scale demonstrates an adequate feasibility, reliability, and validity in another sociocultural context. With the obtained norm-references, it can be utilized as a tool in the evaluation of fatigue in healthy and chronically ill children aged 2-18 years.
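Several records in this listing report Cronbach's coefficient alpha as the measure of internal consistency. As a minimal sketch of how that coefficient is computed, the Python below applies the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), to a small response matrix. The function name `cronbach_alpha` and the response data are invented for illustration and are not taken from any study listed here.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item (column) across respondents
    item_variances = [variance(column) for column in zip(*rows)]
    # Sample variance of each respondent's total score
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Made-up responses: 5 respondents x 4 items on a 1-5 scale
responses = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [4, 5, 4, 5],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.2f}")
```

A value above the 0.70 threshold cited in these abstracts is conventionally read as satisfactory internal consistency, although the coefficient also rises with the number of items.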

Authenticating concealed private data while maintaining concealment

DOEpatents

Thomas, Edward V [Albuquerque, NM]; Draelos, Timothy J [Albuquerque, NM]

2007-06-26

A method of and system for authenticating concealed and statistically varying multi-dimensional data comprising: acquiring an initial measurement of an item, wherein the initial measurement is subject to measurement error; applying a transformation to the initial measurement to generate reference template data; acquiring a subsequent measurement of an item, wherein the subsequent measurement is subject to measurement error; applying the transformation to the subsequent measurement; and calculating a Euclidean distance metric between the transformed measurements; wherein the calculated Euclidean distance metric is identical to a Euclidean distance metric between the measurements prior to transformation.
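The distance-preservation property described in this record can be illustrated with a short sketch. The abstract does not say which transformation the patent uses; the Python below assumes, purely for illustration, a seeded permutation of the coordinates combined with sign flips, which is one simple orthogonal map and therefore leaves Euclidean distances unchanged. The names `make_transform` and `euclidean`, the seed, and the measurement values are all hypothetical.

```python
import math
import random


def make_transform(dim, seed):
    """Build a hypothetical concealing transform: seeded permutation plus sign flips.

    This is an assumed stand-in, not the patented transformation. Any such map
    is orthogonal, so it preserves Euclidean distances between vectors.
    """
    rng = random.Random(seed)
    order = list(range(dim))
    rng.shuffle(order)
    signs = [rng.choice((-1.0, 1.0)) for _ in range(dim)]

    def transform(vec):
        return [signs[i] * vec[order[i]] for i in range(dim)]

    return transform


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Made-up measurements of the same item, each subject to measurement error
initial = [0.92, 1.40, 3.10, 2.25]
subsequent = [0.95, 1.38, 3.05, 2.31]

transform = make_transform(len(initial), seed=1234)
reference_template = transform(initial)   # stored instead of the raw data
candidate = transform(subsequent)         # transformed at authentication time

d_raw = euclidean(initial, subsequent)
d_concealed = euclidean(reference_template, candidate)
print(math.isclose(d_raw, d_concealed))
```

Because the two distances agree, an authentication threshold can be applied to the concealed vectors without the raw measurements being stored or compared directly.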
Cross-cultural validation of the National Eye Institute Visual Function Questionnaire.

PubMed

Mollazadegan, Kaziwe; Huang, Jinhai; Khadka, Jyoti; Wang, Qinmei; Yang, Feng; Gao, Rongrong; Pesudovs, Konrad

2014-05-01

To assess the native and the previously Rasch-modified National Eye Institute Visual Function Questionnaire (NEI VFQ) scales in a Chinese population. Eye Hospital of Wenzhou Medical University, Wenzhou, China. Questionnaire development. Patients on the waiting list for cataract surgery completed the 39-item NEI VFQ (NEI VFQ-39). Rasch analysis was performed in 3 steps as follows: (1) Assess the psychometric properties of the original NEI VFQ. (2) Reassess the previously proposed Rasch-modified NEI VFQ scales by Pesudovs et al. (2010) in Chinese populations. (3) Compare the scores of previously recommended scales of the NEI VFQ with new Rasch-modified scales of the same questionnaire using Bland-Altman plots. Four hundred thirty-five patients (median age 70 years; range 35 to 90 years) completed the NEI VFQ-39. Response categories for 4 question types were dysfunctional and therefore repaired. The original NEI VFQ-39 and NEI VFQ-25 showed good measurement precision. However, both versions showed multidimensionality, misfitting items, suboptimum targeting, and nonfunctioning subscales. Using the previously proposed Rasch-modified scales of the NEI VFQ yielded valid measurement of each construct in the 39-item and 25-item questionnaire. Comparison between the earlier proposed NEI VFQ scales and the new versions developed in this population showed good agreement. The original NEI VFQ was once again found to be flawed. The previously proposed Rasch-analyzed versions of the NEI VFQ and the new Chinese versions showed good agreement. Copyright © 2014 ASCRS and ESCRS. Published by Elsevier Inc. All rights reserved.
Refinement and partial validation of the UNESP-Botucatu multidimensional composite pain scale for assessing postoperative pain in horses.

PubMed

Taffarel, Marilda Onghero; Luna, Stelio Pacca Loureiro; de Oliveira, Flavia Augusta; Cardoso, Guilherme Schiess; Alonso, Juliana de Moura; Pantoja, Jose Carlos; Brondani, Juliana Tabarelli; Love, Emma; Taylor, Polly; White, Kate; Murrell, Joanna C

2015-04-01

Quantification of pain plays a vital role in the diagnosis and management of pain in animals. In order to refine and validate an acute pain scale for horses, a prospective, randomized, blinded study was conducted. Twenty-four client-owned adult horses were recruited and allocated to one of the following four groups: anaesthesia only (GA); pre-emptive analgesia and anaesthesia (GAA); anaesthesia, castration and postoperative analgesia (GC); or pre-emptive analgesia, anaesthesia and castration (GCA). One investigator, unaware of the treatment group, assessed all horses at time-points before and after intervention and completed the pain scale. Videos were also obtained at these time-points and were evaluated by a further four blinded evaluators who also completed the scale. The data were used to investigate the relevance, specificity, criterion validity and inter- and intra-observer reliability of each item on the pain scale, and to evaluate construct validity and responsiveness of the scale. Construct validity was demonstrated by the observed differences in scores between the groups, four hours after anaesthetic recovery and before administration of systemic analgesia in the GC group. Inter- and intra-observer reliability for the items was only satisfactory. Subsequently the pain scale was refined, based on results for relevance, specificity and total item correlation. Scale refinement and exclusion of items that did not meet predefined requirements generated a selection of relevant pain behaviours in horses. After further validation for reliability, these may be used to evaluate pain under clinical and experimental conditions.
Multidimensional fatigue inventory and post-polio syndrome - a Rasch analysis.

PubMed

Dencker, Anna; Sunnerhagen, Katharina S; Taft, Charles; Lundgren-Nilsson, Åsa

2015-02-12

Fatigue is a common symptom in post-polio syndrome (PPS) and can have a substantial impact on patients. There is a need for validated questionnaires to assess fatigue in PPS for use in clinical practice and research. The aim of this study was to assess the validity and reliability of the Swedish version of Multidimensional Fatigue Inventory (MFI-20) in patients with PPS using the Rasch model. A total of 231 patients diagnosed with PPS completed the Swedish MFI-20 questionnaire at post-polio out-patient clinics in Sweden. The mean age of participants was 62 years and 61% were females. Data were tested against assumptions of the Rasch measurement model (i.e. unidimensionality of the scale, good item fit, independency of items and absence of differential item functioning). Reliability was tested with the person separation index (PSI). A transformation of the ordinal total scale scores into an interval scale for use in parametric analysis was performed. Dummy cases with minimum and maximum scoring were used for the transformation table to achieve interval scores between 20 and 100, which are comprehensive limits for the MFI-20 scale. An initial Rasch analysis of the full scale with 20 items showed misfit to the Rasch model (p < 0.001). Seven items showed slightly disordered thresholds and person estimates were not significantly improved by rescoring items. Analysis of MFI-20 scale with the 5 MFI-20 subscales as testlets showed good fit with a non-significant χ² value (p = 0.089). PSI for the testlet solution was 0.86. Local dependency was present in all subscales and fit to the Rasch model was solved with testlets within each subscale. PSI ranged from 0.52 to 0.82 in the subscales. This study shows that the Swedish MFI-20 total scale and subscale scores yield valid and reliable measures of fatigue in persons with post-polio syndrome. The Rasch transformed total scores can be used for parametric statistical analyses in future clinical studies.
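The transformation to interval scores between 20 and 100 can be sketched as a linear rescaling of Rasch person estimates (logits), anchored at the estimates for the minimum- and maximum-score dummy cases. The abstract does not give those estimates or the exact procedure; the anchor values of -5.0 and 5.0 logits and the function name `rescale` below are assumptions chosen only to make the arithmetic concrete.

```python
def rescale(logit, logit_min, logit_max, lo=20.0, hi=100.0):
    """Map a logit estimate linearly onto the interval [lo, hi]."""
    if logit_max <= logit_min:
        raise ValueError("logit_max must exceed logit_min")
    fraction = (logit - logit_min) / (logit_max - logit_min)
    return lo + fraction * (hi - lo)


# Hypothetical logit estimates for the minimum- and maximum-score dummy cases
LOGIT_MIN, LOGIT_MAX = -5.0, 5.0

# Three illustrative person estimates: lowest, midpoint, highest
scores = [rescale(x, LOGIT_MIN, LOGIT_MAX) for x in (-5.0, 0.0, 5.0)]
print(scores)
```

With these assumed anchors the lowest estimate maps to 20, the midpoint to 60, and the highest to 100, so the rescaled values stay within the familiar MFI-20 range while keeping the interval spacing of the logit scale.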
Screening for Moral Injury: The Moral Injury Symptom Scale - Military Version Short Form.

PubMed

Koenig, Harold G; Ames, Donna; Youssef, Nagy A; Oliver, John P; Volk, Fred; Teng, Ellen J; Haynes, Kerry; Erickson, Zachary D; Arnold, Irina; O'Garo, Keisha; Pearce, Michelle

2018-03-26

To develop a short form (SF) of the 45-item multidimensional Moral Injury Symptom Scale - Military Version (MISS-M) to use when screening for moral injury and monitoring treatment response in veterans and active duty military with PTSD. A total of 427 veterans and active duty military with PTSD symptoms were recruited from VA Medical Centers in Augusta, GA; Los Angeles, CA; Durham, NC; Houston, TX; and San Antonio, TX; and from Liberty University, Lynchburg, Virginia. The sample was randomly split in two. In the first half (n = 214), exploratory factor analysis identified the highest loading item on each of the 10 MISS scales (guilt, shame, moral concerns, loss of meaning, difficulty forgiving, loss of trust, self-condemnation, religious struggle, and loss of religious faith) to form the 10-item MISS-M-SF; confirmatory factor analysis was then performed to replicate results in the second half of the sample (n = 213). Internal reliability, test-retest reliability, and convergent, discriminant, and concurrent validity were examined in the overall sample. The study was approved by the institutional review boards and the Research & Development (R&D) Committees at Veterans Administration medical centers in Durham, Los Angeles, Augusta, Houston, and San Antonio, and the Liberty University and Duke University Medical Center institutional review boards. The 10-item MISS-M-SF had a median of 50 and a range of 12-91 (possible range 10-100). Over 70% scored a 9 or 10 (highest possible) on at least one item. Cronbach's alpha was 0.73 (95% CI 0.69-0.76), and test-retest reliability was 0.87 (95% CI 0.79-0.92). Convergent validity with the 45-item MISS-M was r = 0.92. Discriminant validity was demonstrated by relatively weak correlations with social, religious, and physical health constructs (r = 0.21-0.35), and concurrent validity was indicated by strong correlations with PTSD, depression, and anxiety symptoms (r = 0.54-0.58). The MISS-M-SF is a reliable and valid measure of MI symptoms that can be used to screen for MI and monitor response to treatment in veterans and active duty military with PTSD.
A multidimensional assessment of the validity and utility of alcohol use disorder severity as determined by item response theory models.

PubMed

Dawson, Deborah A; Saha, Tulshi D; Grant, Bridget F

2010-02-01

The relative severity of the 11 DSM-IV alcohol use disorder (AUD) criteria is represented by their severity threshold scores, an item response theory (IRT) model parameter inversely proportional to their prevalence. These scores can be used to create a continuous severity measure comprising the total number of criteria endorsed, each weighted by its relative severity. This paper assesses the validity of the severity ranking of the 11 criteria and the overall severity score with respect to known AUD correlates, including alcohol consumption, psychological functioning, family history, antisociality, and early initiation of drinking, in a representative population sample of U.S. past-year drinkers (n=26,946). The unadjusted mean values for all validating measures increased steadily with the severity threshold score, except that legal problems, the criterion with the highest score, was associated with lower values than expected. After adjusting for the total number of criteria endorsed, this direct relationship was no longer evident. The overall severity score was no more highly correlated with the validating measures than a simple count of criteria endorsed, nor did the two measures yield different risk curves. This reflects both within-criterion variation in severity and the fact that the number of criteria endorsed and their severity are so highly correlated that severity is essentially redundant. Attempts to formulate a scalar measure of AUD will do as well by relying on simple counts of criteria or symptom items as by using scales weighted by IRT measures of severity. Published by Elsevier Ireland Ltd.
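The two scoring approaches compared in this record, a simple count of endorsed criteria and a count weighted by each criterion's severity threshold, can be sketched as follows. The threshold values are made up for illustration; the abstract does not report the estimated IRT parameters, and the function names are hypothetical.

```python
def simple_count(endorsed):
    """Number of criteria endorsed (1 = endorsed, 0 = not endorsed)."""
    return sum(endorsed)


def weighted_severity(endorsed, thresholds):
    """Sum of severity thresholds over the endorsed criteria."""
    if len(endorsed) != len(thresholds):
        raise ValueError("one threshold is needed per criterion")
    return sum(t for e, t in zip(endorsed, thresholds) if e)


# Hypothetical severity thresholds for 11 criteria, ordered least to most severe
thresholds = [0.6, 0.9, 1.1, 1.3, 1.4, 1.6, 1.8, 2.0, 2.2, 2.5, 3.0]

# One made-up respondent endorsing the 1st, 2nd, and 4th criteria
endorsed = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0, 0]

count = simple_count(endorsed)
severity = weighted_severity(endorsed, thresholds)
print(count, round(severity, 2))
```

When respondents who endorse more criteria also tend to endorse the more severe ones, the two scores rank people almost identically, which is consistent with the abstract's conclusion that weighting adds little over the simple count.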
Development of the Ghent Multidimensional Somatic Complaints Scale

ERIC Educational Resources Information Center

Beirens, Koen; Fontaine, Johnny R. J.

2010-01-01

The present study aimed at developing a new scale that operationalizes a hierarchical model of somatic complaints. First, 63 items representing a wide range of symptoms and sensations were compiled from somatic complaints scales and emotion literature. These complaints were rated by Belgian students (n = 307) and Belgian adults (n = 603).…
The Impact of Conditional Scores on the Performance of DETECT.

ERIC Educational Resources Information Center

Zhang, Yanwei Oliver; Yu, Feng; Nandakumar, Ratna

DETECT is a nonparametric, conditional covariance-based procedure to identify dimensional structure and the degree of multidimensionality of test data. The ability composite or conditional score used to estimate conditional covariance plays a significant role in the performance of DETECT. The number correct score of all items in the test (T) and…
Examining Sources of Gender DIF in Mathematics Assessments Using a Confirmatory Multidimensional Model Approach

ERIC Educational Resources Information Center

Mendes-Barnett, Sharon; Ercikan, Kadriye

2006-01-01

This study contributes to understanding sources of gender differential item functioning (DIF) on mathematics tests. This study focused on identifying sources of DIF and differential bundle functioning for boys and girls on the British Columbia Principles of Mathematics Exam (Grade 12) using a confirmatory SIBTEST approach based on a…
The Support Appraisal for Work Stressors Inventory: Construction and Initial Validation

ERIC Educational Resources Information Center

Lawrence, Sandra A.; Gardner, John; Callan, Victor J.

2007-01-01

In order to better understand the role of perceived available support in buffering the negative effects of workplace stressors, a new multidimensional measure of perceived available support, the SAWS, was developed. Initial item development and content validation were conducted, followed by scale evaluation and validation. Two samples of 190 and…
An Investigation of Sample Size Splitting on ATFIND and DIMTEST

ERIC Educational Resources Information Center

Socha, Alan; DeMars, Christine E.

2013-01-01

Modeling multidimensional test data with a unidimensional model can result in serious statistical errors, such as bias in item parameter estimates. Many methods exist for assessing the dimensionality of a test. The current study focused on DIMTEST. Using simulated data, the effects of sample size splitting for use with the ATFIND procedure for…
Robustness of Ability Estimation to Multidimensionality in CAST with Implications to Test Assembly

ERIC Educational Resources Information Center

Zhang, Yanwei; Nandakumar, Ratna

2006-01-01

Computer Adaptive Sequential Testing (CAST) is a test delivery model that combines features of the traditional conventional paper-and-pencil testing and item-based computerized adaptive testing (CAT). The basic structure of CAST is a panel composed of multiple testlets adaptively administered to examinees at different stages. Current applications…
Relationships between Organizations and Publics: Development of a Multi-Dimensional Organization-Public Relationship Scale.

ERIC Educational Resources Information Center

Bruning, Stephen D.; Ledingham, John A.

1999-01-01

Attempts to design a multiple-item, multiple-dimension organization/public relationship scale. Finds that organizations and key publics have three types of relationships: professional, personal, and community. Provides an instrument that can be used to measure the influence that perceptions of the organization/public relationship have on consumer…
Using Logistic Approximations of Marginal Trace Lines to Develop Short Assessments

ERIC Educational Resources Information Center

Stucky, Brian D.; Thissen, David; Edelen, Maria Orlando

2013-01-01

Test developers often need to create unidimensional scales from multidimensional data. For item analysis, "marginal trace lines" capture the relation with the general dimension while accounting for nuisance dimensions and may prove to be a useful technique for creating short-form tests. This article describes the computations needed to obtain…
Assessment of the dimensionality of the Wijma delivery expectancy/experience questionnaire using factor analysis and Rasch analysis.

PubMed

Pallant, J F; Haines, H M; Green, P; Toohill, J; Gamble, J; Creedy, D K; Fenwick, J

2016-11-21

Fear of childbirth has negative consequences for a woman's physical and emotional wellbeing. The most commonly used measurement tool for childbirth fear is the Wijma Delivery Expectancy Questionnaire (WDEQ-A). Although originally conceptualized as unidimensional, subsequent investigations have suggested it is multidimensional. This study aimed to undertake a detailed psychometric assessment of the WDEQ-A, exploring the dimensionality and identifying possible subscales that may have clinical and research utility. WDEQ-A was administered to a sample of 1410 Australian women in mid-pregnancy. The dimensionality of WDEQ-A was explored using exploratory (EFA) and confirmatory factor analysis (CFA), and Rasch analysis. EFA identified a four-factor solution. CFA failed to support the unidimensional structure of the original WDEQ-A, but confirmed the four-factor solution identified by EFA. Rasch analysis was used to refine the four subscales (Negative emotions: five items; Lack of positive emotions: five items; Social isolation: four items; Moment of birth: three items). Each WDEQ-A Revised subscale showed good fit to the Rasch model and adequate internal consistency reliability. The correlation between Negative emotions and Lack of positive emotions was strong; however, Moment of birth and Social isolation showed much lower intercorrelations, suggesting they should not be added to create a total score. This study supports the findings of other investigations that suggest the WDEQ-A is multidimensional and should not be used in its original form. The WDEQ-A Revised may provide researchers with a more refined, psychometrically sound tool to explore the differential impact of aspects of childbirth fear.
Validation of the Modified Fatigue Impact Scale in Parkinson's disease.

PubMed

Schiehser, Dawn M; Ayers, Catherine R; Liu, Lin; Lessig, Stephanie; Song, David S; Filoteo, J Vincent

2013-03-01

Fatigue is a common symptom in Parkinson's disease (PD); however, a multidimensional scale that measures the impact of fatigue on functioning has yet to be validated in this population. The aim of this study was to examine the validity of the Modified Fatigue Impact Scale (MFIS), a self-report measure that assesses the effects of fatigue on physical, cognitive, and psychosocial functioning, in a sample of nondemented PD patients. PD patients (N = 100) completed the MFIS, the Positive and Negative Affect Schedule (PANAS-X), and several additional measures of psychosocial, cognitive, and motor functioning. A Principal Component Analysis (PCA) and item analysis using Cronbach's alpha were conducted to determine structural validity and internal consistency of the MFIS. Correlational analyses were performed between the MFIS and the PANAS-X fatigue subscale to evaluate convergent validity and between the MFIS and measures of depression, anxiety, apathy, and disease-related symptoms to determine divergent validity. The PCA identified two viable MFIS subscales: a cognitive subscale and a combination of the original scale's physical and psychosocial subscales as one factor. Item analysis revealed high internal consistency of all 21 items and the items within the two subscales. The MFIS had strong convergent validity with the PANAS-X fatigue subscale and adequate divergent validity with measures of disease stage, motor function, and cognition. Overall, this study demonstrates that the MFIS is a valid multidimensional measure that can be used to evaluate the impact of fatigue on cognitive and physical/social functioning in PD patients without dementia. Published by Elsevier Ltd.
Multidimensional Profiling of Task Stress States for Human Factors: A Brief Review.

PubMed

Matthews, Gerald

2016-09-01

This article advocates multidimensional assessment of task stress in human factors and reviews the use of the Dundee Stress State Questionnaire (DSSQ) for evaluation of systems and operators. Contemporary stress research has progressed from an exclusive focus on environmental stressors to transactional perspectives on the stress process. Performance impacts of stress reflect the operator's dynamic attempts to understand and cope with task demands. Multidimensional stress assessments are necessary to gauge the different forms of system-operator interaction. This review discusses the theoretical and practical use of the DSSQ in evaluating multidimensional patterns of stress response. It presents psychometric evidence for the multidimensional perspective and illustrative profiles of subjective state response to task stressors and environments. Evidence is also presented on stress state correlations with related variables, including personality, stress process measures, psychophysiological response, and objective task performance. Evidence supports the validity of the DSSQ as a task stress measure. Studies of various simulated environments show that different tasks elicit different profiles of stress state response. Operator characteristics such as resilience predict individual differences in state response to stressors. Structural equation modeling may be used to understand performance impacts of stress states. Multidimensional assessment affords insight into the stress process in a variety of human factors contexts. Integrating subjective and psychophysiological assessment is a priority for future research. Stress state measurement contributes to evaluating system design, countermeasures to stress and fatigue, and performance vulnerabilities. It may also support personnel selection and diagnostic monitoring of operators. © 2016, Human Factors and Ergonomics Society.
Multidimensionality of the Zarit Burden Interview across the severity spectrum of cognitive impairment: an Asian perspective.

PubMed

Cheah, Wee Kooi; Han, Huey Charn; Chong, Mei Sian; Anthony, Philomena Vasantha; Lim, Wee Shiong

2012-11-01

We aimed to examine the multidimensionality of the Zarit Burden Interview (ZBI) beyond the conventional dual-factor structure among caregivers of persons with cognitive impairment in a predominantly Chinese multiethnic Asian population, and ascertain how these dimensions vary across the spectrum of disease severity. We studied 130 consecutive dyads of primary caregivers and patients attending a memory clinic over a six-month period. Caregiver burden was measured by the 22-item ZBI, and disease severity was staged via the Clinical Dementia Rating (CDR) scale. We performed principal component analysis (PCA) with varimax rotation to determine the factor structure of the ZBI. The magnitude of burden in each factor was expressed as the item to total ratio (ITR) and plotted against the stages of cognitive impairment. Descriptive and inferential statistics were applied to study the relationships between dimensions with disease and caregiver characteristics. We identified four factors: demands of care and social impact, control over the situation, psychological impact, and worry about caregiving performance. ITRs of the first three factors increased with severity of disease and were related to recipients' functional status and disease characteristics. ITR in the dimension of worry about performance was endorsed highest across the spectrum of disease severity, starting as early as the stage of mild cognitive impairment and peaking at CDR 1. Multidimensionality of ZBI was confirmed in our local setting. Each dimension of burden was unique and expressed differentially across disease severity. The dimension of worry about performance merits further study.
Rasch Model Analysis Gives New Insights Into the Structural Validity of the QuickDASH in Patients With Musculoskeletal Shoulder Pain.

PubMed

Jerosch-Herold, Christina; Chester, Rachel; Shepstone, Lee

2017-09-01

Study Design: Cross-sectional secondary analysis of a prospective cohort study. Background: The shortened version of the Disabilities of the Arm, Shoulder and Hand questionnaire (QuickDASH) is a widely used outcome measure that has been extensively evaluated using classical test theory. Rasch model analysis can identify strengths and weaknesses of rating scales and goes beyond classical test theory approaches. It uses a mathematical model to test the fit between the observed data and expected responses and converts ordinal-level scores into interval-level measurement. Objective: To test the structural validity of the QuickDASH using Rasch analysis. Methods: A prospective cohort study of 1030 patients with shoulder pain provided baseline data. Rasch analysis was conducted to (1) assess how the QuickDASH fits the Rasch model, (2) identify sources of misfit, and (3) explore potential solutions to these. Results: There was evidence of multidimensionality and significant misfit to the Rasch model (χ² = 331.09, P<.001). Two items had disordered threshold responses with strong floor effects. Response bias was detected in most items for age and sex. Rescoring resulted in ordered thresholds; however, the 11-item scale still did not meet the expectations of the Rasch model. Conclusion: Rasch model analysis on the QuickDASH has identified a number of problems that cannot be easily detected using traditional analyses. While revisions to the QuickDASH resulted in better fit, a "shoulder-specific" version is not advocated at present. Caution needs to be exercised when interpreting results of the QuickDASH outcome measure, as it does not meet the criteria for interval-level measurement and shows significant response bias by age and sex. J Orthop Sports Phys Ther 2017;47(9):664-672. Epub 13 Jul 2017. doi:10.2519/jospt.2017.7288.
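As a rough illustration of what "testing the fit between the observed data and expected responses" can mean, the sketch below uses the dichotomous Rasch model, in which the probability of a positive response depends only on the difference between person location and item difficulty. This is a simplification: the QuickDASH items are scored on ordered categories, and the abstract does not say which Rasch parameterisation or software was used. The function names `rasch_prob` and `standardized_residual` are hypothetical.

```python
import math


def rasch_prob(theta, b):
    """Probability of a positive (1) response under the dichotomous Rasch model.

    theta: person location (logits); b: item difficulty (logits).
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def standardized_residual(x, theta, b):
    """Observed minus expected response, scaled by the model standard deviation.

    x is the observed 0/1 response. Large absolute values flag responses
    that the model did not expect.
    """
    p = rasch_prob(theta, b)
    return (x - p) / math.sqrt(p * (1.0 - p))


if __name__ == "__main__":
    # A person located exactly at the item difficulty has a 50% chance.
    print(rasch_prob(0.0, 0.0))
    # An unexpected positive response from a low-scoring person on a hard item.
    print(standardized_residual(1, -2.0, 1.0))
```

Fit statistics in Rasch software are typically built by aggregating residuals of this kind over persons or items; the aggregation itself is not shown here.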
Development of a stress scale for pregnant women in the South Asian context: the A-Z Stress Scale.

PubMed

Kazi, A; Fatmi, Z; Hatcher, J; Niaz, U; Aziz, A

2009-01-01

Stress in pregnancy can lead to low-birth-weight and preterm babies and to psychological consequences such as anxiety and depression during pregnancy and the puerperium. Previous scales to measure stress contain items that overlap with the symptoms of pregnancy. A stress scale was developed based on in-depth interviews with pregnant women in Pakistan. Construct validity, test-retest reliability and inter-rater reliability were carried out. Cronbach alpha was 0.82 for the 30 short-listed items, with item-total correlations of 0.2-0.8. Multidimensional scaling determined 2 dimensions: socioenvironmental hassles and chronic illnesses. This was the first scale developed for pregnant women based on stressors in a developing country in South Asia.

Stress at work: development of the Stress Perception Questionnaire of Rome (SPQR), an ad hoc questionnaire for multidimensional assessment of work related stress.

PubMed

Cinti, M E; Cannavò, M; Fioravanti, M

2017-01-01

Stress is an emotional condition, mostly experienced as negative, initially identified and defined by Selye in the mid-thirties of the last century. Since the first definition, stress concerns the adaptation process mostly related to environmental changes. An application of stress focuses on the evaluation of its interference on work conditions, and the scientific evidence on work related stress is very ample and rich. We are proposing a new ad hoc questionnaire for the multidimensional assessment of work related stress, called Stress Perception Questionnaire of Rome (SPQR) composed of 50 items. The development of this questionnaire is based on a multi-step process: a) Identification of all the relevant topics to work related stress and areas in the scientific evidence and their transformation into specific contents of 60 tentative items; b) Exploratory factor analysis aimed to identify the best items (50) which could guarantee the maximum convergence on single scales (8), and the minimum redundancy between scales; c) Validation of the 8 scales' structure by a confirmatory factor analysis (fully achieved); d) Factor analysis for a second level factor resulting in a single factor identified as the questionnaire total score (Stress Score); e) Reliability analysis of the questionnaire total score and the single scale scores (at optimum level); f) Validation by external criteria of work related stress identified in the presence of personal violence episodes experienced by a group of health workers with different professional profiles and from two different hospitals in Rome. Our results show that the SPQR is a useful and sensitive tool for assessing the presence of emotional stress related problems identifiable in a work environment.
The advantage of this questionnaire is that it allows for a multidimensional description of the different components of this problematic area besides its ability to quantify the overall stress level of those who have been administered the SPQR.
Multidimensional structure of a questionnaire to assess barriers to and motivators of physical activity in recipients of solid organ transplantation.

PubMed

van Adrichem, Edwin J; Krijnen, Wim P; Dekker, Rienk; Ranchor, Adelita V; Dijkstra, Pieter U; van der Schans, Cees P

2017-11-01

To explore the underlying dimensions of the Barriers and Motivators Questionnaire that is used to assess barriers to and motivators of physical activity experienced by recipients of solid organ transplantation and thereby improve the application in research and clinical settings. A cross-sectional study was performed in recipients of solid organ transplantation (n = 591; median (IQR) age = 59 (49; 66); 56% male). The multidimensional structure of the questionnaire was analyzed by exploratory principal component analysis. Cronbach's α was calculated to determine internal consistency of the entire questionnaire and individual components. The barriers scale had a Cronbach's α of 0.86 and was subdivided into four components; α of the corresponding subscales varied between 0.80 and 0.66. The motivator scale had an α of 0.91 and was subdivided into four components with an α between 0.88 and 0.70. Nine of the original barrier items and two motivator items were not included in the component structure. A four-dimensional structure for both the barriers and motivators scale of the questionnaire is supported. The use of the indicated subscales increases the usability in research and clinical settings compared to the overall scores and provides opportunities to identify modifiable constructs to be targeted in interventions. Implications for rehabilitation: Organ transplant recipients are less active than the general population despite established health benefits of physical activity. A multidimensional structure is shown in the Barriers and Motivators Questionnaire; the use of the identified subscales increases applicability in research and clinical settings. The use of the questionnaire with its component structure in the clinical practice of a rehabilitation physician could result in a faster assessment of problem areas in daily practice and result in a higher degree of clarity as opposed to the use of the individual items of the questionnaire.
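Several records in this listing report Cronbach's α as the measure of internal consistency. For readers unfamiliar with the statistic, a minimal sketch follows, assuming complete data arranged as one row of item scores per respondent. It uses the standard formula α = k/(k − 1) × (1 − Σ item variances / variance of total score); the function name `cronbach_alpha` and the toy data are illustrative only and are not taken from any of the studies.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents' item-score rows.

    rows: list of equal-length lists, one list of k item scores per respondent.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(rows[0])
    # Variance of each item across respondents.
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    # Variance of the summed (total) score across respondents.
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


if __name__ == "__main__":
    # Hypothetical responses of four people to three items.
    toy = [[3, 4, 3], [2, 2, 1], [4, 5, 5], [1, 2, 2]]
    print(round(cronbach_alpha(toy), 3))
```

Alpha is computed per scale, so subscale values such as those reported above come from applying the same calculation to only the items in each component.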
Characterising the latent structure and organisation of self-reported thoughts, feelings and behaviours in adolescents and young adults

PubMed Central

Neufeld, Sharon; Jones, Peter B.; Fonagy, Peter; Bullmore, Edward T.; Dolan, Raymond J.; Moutoussis, Michael; Toseeb, Umar; Goodyer, Ian M.

2017-01-01

Little is known about the underlying relationships between self-reported mental health items measuring both positive and negative emotional and behavioural symptoms at the population level in young people. Improved measurement of the full range of mental well-being and mental illness may aid in understanding the aetiological substrates underlying the development of both mental wellness as well as specific psychiatric diagnoses. A general population sample aged 14 to 24 years completed self-report questionnaires on anxiety, depression, psychotic-like symptoms, obsessionality and well-being. Exploratory and confirmatory factor models for categorical data and latent profile analyses were used to evaluate the structure of both mental wellness and illness items. First order, second order and bifactor structures were evaluated on 118 self-reported items obtained from 2228 participants. A bifactor solution was the best fitting latent variable model with one general latent factor termed ‘distress’ and five ‘distress independent’ specific factors defined as self-confidence, antisocial behaviour, worry, aberrant thinking, and mood. Next, six distinct subgroups were derived from a person-centred latent profile analysis of the factor scores. Finally, concurrent validity was assessed using information on hazardous behaviours (alcohol use, substance misuse, self-harm) and treatment for mental ill health: both discriminated between the latent traits and latent profile subgroups. The findings suggest a complex, multidimensional mental health structure in the youth population rather than the previously assumed first or second order factor structure. Additionally, the analysis revealed a low hazardous behaviour/low mental illness risk subgroup not previously described. Population sub-groups show greater validity over single variable factors in revealing mental illness risks. 
In conclusion, our findings indicate that the structure of self-reported mental health is multidimensional in nature and that prediction of mental illness risk improves within person-centred subgroups derived from the multidimensional latent traits. PMID:28403164
Characterising the latent structure and organisation of self-reported thoughts, feelings and behaviours in adolescents and young adults.

PubMed

St Clair, Michelle C; Neufeld, Sharon; Jones, Peter B; Fonagy, Peter; Bullmore, Edward T; Dolan, Raymond J; Moutoussis, Michael; Toseeb, Umar; Goodyer, Ian M

2017-01-01

Little is known about the underlying relationships between self-reported mental health items measuring both positive and negative emotional and behavioural symptoms at the population level in young people. Improved measurement of the full range of mental well-being and mental illness may aid in understanding the aetiological substrates underlying the development of both mental wellness as well as specific psychiatric diagnoses. A general population sample aged 14 to 24 years completed self-report questionnaires on anxiety, depression, psychotic-like symptoms, obsessionality and well-being. Exploratory and confirmatory factor models for categorical data and latent profile analyses were used to evaluate the structure of both mental wellness and illness items. First order, second order and bifactor structures were evaluated on 118 self-reported items obtained from 2228 participants. A bifactor solution was the best fitting latent variable model with one general latent factor termed 'distress' and five 'distress independent' specific factors defined as self-confidence, antisocial behaviour, worry, aberrant thinking, and mood. Next, six distinct subgroups were derived from a person-centred latent profile analysis of the factor scores. Finally, concurrent validity was assessed using information on hazardous behaviours (alcohol use, substance misuse, self-harm) and treatment for mental ill health: both discriminated between the latent traits and latent profile subgroups. The findings suggest a complex, multidimensional mental health structure in the youth population rather than the previously assumed first or second order factor structure. Additionally, the analysis revealed a low hazardous behaviour/low mental illness risk subgroup not previously described. Population sub-groups show greater validity over single variable factors in revealing mental illness risks. 
In conclusion, our findings indicate that the structure of self-reported mental health is multidimensional in nature and that prediction of mental illness risk improves within person-centred subgroups derived from the multidimensional latent traits.
Reconsidering the Roland-Morris Disability Questionnaire: time for a multidimensional framework?

PubMed

Magnussen, Liv Heide; Lygren, Hildegunn; Strand, Liv Inger; Hagen, Eli Molde; Breivik, Kyrre

2015-02-15

Cross-sectional design. To explore (1) the factor structure of the Roland-Morris Disability Questionnaire (RMDQ), (2) whether there is a dominant factor, and (3) whether the potential factors are unique predictors of other aspects related to back pain. The RMDQ is one of the most recommended back-specific questionnaires assessing disability. The RMDQ is scored as a unidimensional scale summarizing answers to all 24 questions (Yes/No) regarding daily life functioning. However, there are indications that the scale is multidimensional. Patients (n = 457; age, 18-60 yr) with 8 to 12 weeks of back pain filled in questionnaires assessing subjective health complaints, emotional distress, instrumental and emotion-focused coping, and fear avoidance behavior at baseline. A total of 371 patients (81.7%) filled in the RMDQ. Exploratory factor analysis was used to examine the factor structure of RMDQ items. Multiple regression analyses were used to assess whether the derived factors predicted relevant problems in back pain differently. Exploratory factor analysis showed indices of model fit for a 3-factor solution after removing 2 items because of low prevalence (19 and 24). Two items were removed because of cross-loadings and low loadings (2 and 22). No support for a dominant factor was found as the 3 factors were only moderately correlated (r = 0.34-0.40), and the ratio between the first and second eigenvalue was 2.6, not supporting essential unidimensionality. "Symptoms" was the factor that most strongly predicted subjective health complaints, whereas "avoidance of activity and participation" predicted fear avoidance behavior, instrumental and emotional coping. "Limitation in daily activities" did not predict any of these variables. The main findings of our study are that the RMDQ consists of 3 independent factors, and not 1 dominant factor as suggested previously. We think the time is now ripe to start treating and scoring the RMDQ as a multidimensional scale.
Development of a new occupational balance-questionnaire: incorporating the perspectives of patients and healthy people in the design of a self-reported occupational balance outcome instrument.

PubMed

Dür, Mona; Steiner, Günter; Fialka-Moser, Veronika; Kautzky-Willer, Alexandra; Dejaco, Clemens; Prodinger, Birgit; Stoffer, Michaela Alexandra; Binder, Alexa; Smolen, Josef; Stamm, Tanja Alexandra

2014-04-05

Self-reported outcome instruments in health research have become increasingly important over the last decades. Occupational therapy interventions often focus on occupational balance. However, instruments to measure occupational balance are scarce. The aim of the study was therefore to develop a generic self-reported outcome instrument to assess occupational balance based on the experiences of patients and healthy people including an examination of its psychometric properties. We conducted a qualitative analysis of the life stories of 90 people with and without chronic autoimmune diseases to identify components of occupational balance. Based on these components, the Occupational Balance-Questionnaire (OB-Quest) was developed. Construct validity and internal consistency of the OB-Quest were examined in quantitative data. We used Rasch analyses to determine overall fit of the items to the Rasch model, person separation index and potential differential item functioning. Dimensionality testing was conducted by the use of t-tests and Cronbach's alpha. The following components emerged from the qualitative analyses: challenging and relaxing activities, activities with acknowledgement by the individual and by the sociocultural context, impact of health condition on activities, involvement in stressful activities and fewer stressing activities, rest and sleep, variety of activities, adaptation of activities according to changed living conditions and activities intended to care for oneself and for others. Based on these, the seven items of the questionnaire (OB-Quest) were developed. 251 people (132 with rheumatoid arthritis, 43 with systemic lupus erythematosus and 76 healthy) filled in the OB-Quest. Dimensionality testing indicated multidimensionality of the questionnaire (t = 0.58, and 1.66 after item reduction, non-significant). The item on the component rest and sleep showed differential item functioning (health condition and age). Person separation index was 0.51.
Cronbach's alpha changed from 0.38 to 0.57 after deleting two items. This questionnaire includes new items addressing components of occupational balance meaningful to patients and healthy people which have not been measured so far. The reduction of two items of the OB-Quest showed improved internal consistency. The multidimensionality of the questionnaire indicates the need for a summary of several components into subscales.
Development of a new occupational balance-questionnaire: incorporating the perspectives of patients and healthy people in the design of a self-reported occupational balance outcome instrument

PubMed Central

2014-01-01

Background: Self-reported outcome instruments in health research have become increasingly important over the last decades. Occupational therapy interventions often focus on occupational balance. However, instruments to measure occupational balance are scarce. The aim of the study was therefore to develop a generic self-reported outcome instrument to assess occupational balance based on the experiences of patients and healthy people including an examination of its psychometric properties. Methods: We conducted a qualitative analysis of the life stories of 90 people with and without chronic autoimmune diseases to identify components of occupational balance. Based on these components, the Occupational Balance-Questionnaire (OB-Quest) was developed. Construct validity and internal consistency of the OB-Quest were examined in quantitative data. We used Rasch analyses to determine overall fit of the items to the Rasch model, person separation index and potential differential item functioning. Dimensionality testing was conducted by the use of t-tests and Cronbach’s alpha. Results: The following components emerged from the qualitative analyses: challenging and relaxing activities, activities with acknowledgement by the individual and by the sociocultural context, impact of health condition on activities, involvement in stressful activities and fewer stressing activities, rest and sleep, variety of activities, adaptation of activities according to changed living conditions and activities intended to care for oneself and for others. Based on these, the seven items of the questionnaire (OB-Quest) were developed. 251 people (132 with rheumatoid arthritis, 43 with systemic lupus erythematosus and 76 healthy) filled in the OB-Quest. Dimensionality testing indicated multidimensionality of the questionnaire (t = 0.58, and 1.66 after item reduction, non-significant). The item on the component rest and sleep showed differential item functioning (health condition and age).
Person separation index was 0.51. Cronbach’s alpha changed from 0.38 to 0.57 after deleting two items. Conclusions: This questionnaire includes new items addressing components of occupational balance meaningful to patients and healthy people which have not been measured so far. The reduction of two items of the OB-Quest showed improved internal consistency. The multidimensionality of the questionnaire indicates the need for a summary of several components into subscales. PMID:24708642
The Multidimensional Aggression Scale for the Structured Doll Play Interview

ERIC Educational Resources Information Center

Abramson, Paul R.; And Others

1974-01-01

A multidimensional aggression scoring system for preschool children's responses to the structured doll play interview is described. The system, which incorporates previous investigator's findings, scales doll play responses along three dimensions of aggression: intensity, agent, and directionality. (Author)
Development of the Multidimensional Peer Victimization Scale-Revised (MPVS-R) and the Multidimensional Peer Bullying Scale (MPVS-RB).

PubMed

Betts, Lucy R; Houston, James E; Steer, Oonagh L

2015-01-01

Peer victimization is a frequent occurrence for many adolescents; however, some of the psychometric properties of self-report scales assessing these experiences remain unclear. Furthermore, with an increase in access to technology, electronic aggression should also be considered. The authors examined the psychometric properties of the Multidimensional Peer Victimization Scale (MPVS; Mynard & Joseph, 2000), and developed versions to include the assessment of electronic aggression according to whether the adolescent was the target or perpetrator of peer victimization. A total of 371 (191 girls and 180 boys; mean age = 13 years 4 months, SD = 1 year 2 months) adolescents in the United Kingdom completed the MPVS including five newly developed items assessing electronic aggression, a version of the MPVS designed to assess victimization perpetration, and a measure of self-esteem. Confirmatory factor analyses yielded a five-factor structure comprising physical, social manipulation, verbal, attacks on property, and electronic for both scales. Convergent validity was established through negative associations between the victimization scales and self-esteem. Sex differences also emerged. One revised scale and one new scale are subsequently proposed: the MPVS-Revised and the Multidimensional Peer Bullying Scale.
Representational constraints on children's suggestibility.

PubMed

Ceci, Stephen J; Papierno, Paul B; Kulkofsky, Sarah

2007-06-01

In a multistage experiment, twelve 4- and 9-year-old children participated in a triad rating task. Their ratings were mapped with multidimensional scaling, from which euclidean distances were computed to operationalize semantic distance between items in target pairs. These children and age-mates then participated in an experiment that employed these target pairs in a story, which was followed by a misinformation manipulation. Analyses linked individual and developmental differences in suggestibility to children's representations of the target items. Semantic proximity was a strong predictor of differences in suggestibility: The closer a suggested distractor was to the original item's representation, the greater was the distractor's suggestive influence. The triad participants' semantic proximity subsequently served as the basis for correctly predicting memory performance in the larger group. Semantic proximity enabled a priori counterintuitive predictions of reverse age-related trends to be confirmed whenever the distance between representations of items in a target pair was greater for younger than for older children.
The Brief Impairment Scale (BIS): A Multidimensional Scale of Functional Impairment for Children and Adolescents.

ERIC Educational Resources Information Center

Bird, Hector R.; Canino, Glorisa J.; Davies, Mark; Ramirez, Rafael; Chavez, Ligia; Duarte, Cristiane; Shen, Sa

2005-01-01

Objective: This article provides the results of the psychometric testing of the Brief Impairment Scale (BIS). The BIS is a 23-item instrument that evaluates three domains of functioning: interpersonal relations, school/work functioning, and self-care/self-fulfilment. It capitalizes on the strengths of existing global measures while addressing some…
Construct Validity of the Multidimensional Structure of Bullying and Victimization: An Application of Exploratory Structural Equation Modeling

ERIC Educational Resources Information Center

Marsh, Herbert W.; Nagengast, Benjamin; Morin, Alexandre J. S.; Parada, Roberto H.; Craven, Rhonda G.; Hamilton, Linda R.

2011-01-01

Existing research posits multiple dimensions of bullying and victimization but has not identified well-differentiated facets of these constructs that meet standards of good measurement: goodness of fit, measurement invariance, lack of differential item functioning, and well-differentiated factors that are not so highly correlated as to detract…
Differences in Student Evaluations of Limited-Term Lecturers and Full-Time Faculty

ERIC Educational Resources Information Center

Cho, Jeong-Il; Otani, Koichiro; Kim, B. Joon

2014-01-01

This study compared student evaluations of teaching (SET) for limited-term lecturers (LTLs) and full-time faculty (FTF) using a Likert-scaled survey administered to students (N = 1,410) at the end of university courses. Data were analyzed using a general linear regression model to investigate the influence of multi-dimensional evaluation items on…
Development and Validation of a Teaching Practice Scale (TISS) for Instructors of Introductory Statistics at the College Level

ERIC Educational Resources Information Center

Hassad, Rossi A.

2009-01-01

This study examined the teaching practices of 227 college instructors of introductory statistics (from the health and behavioral sciences). Using primarily multidimensional scaling (MDS) techniques, a two-dimensional, 10-item teaching practice scale, TISS (Teaching of Introductory Statistics Scale), was developed and validated. The two dimensions…
Predictive Validity of Career Decision-Making Profiles over Time among Chinese College Students

ERIC Educational Resources Information Center

Tian, Lin; Guan, Yanjun; Chen, Sylvia Xiaohua; Levin, Nimrod; Cai, Zijun; Chen, Pei; Zhu, Chengfeng; Fu, Ruchunyi; Wang, Yang; Zhang, Shu

2014-01-01

Two studies were conducted to validate the Chinese version of the Career Decision-Making Profiles (CDMP) questionnaire, a multidimensional measure of the way individuals make career decisions. Results of Study 1 showed that after dropping 1 item from the original CDMP scale, the 11-factor structure was supported among Chinese college students (N =…
Correlates of Perceived Social Support in Chinese Adult Child Caregivers of Parent Stroke Survivors.

PubMed

Pan, Yuqin; Jones, Patricia S

2017-10-01

Prevalence of stroke and traditional filial responsibility involve adult children in caregiving to their parent stroke survivors in China. Support resources are insufficient because of the shrinking size of family and the underdeveloped support system. The aim of this study was to identify the correlates of perceived social support among adult child caregivers of parent stroke survivors in China. A cross-sectional correlational design was used in this study. A nonproportional quota sample of 126 adult child caregivers was recruited from Zhejiang Province, China. Data were collected at either the hospital stroke units or the respondents' homes using structured questionnaires of caregiving dyadic demographics and caregiving characteristics, 14-item Activities of Daily Living, 15-item Mutuality Scale, and 12-item Multidimensional Scale of Perceived Social Support. SPSS 17.0 was used for analysis. Caregivers' mutuality, education, full employment or being retired, monthly income, having a co-carer, and having a father as the care receiver were significantly positively associated with caregivers' perceived social support. However, mutuality was not significantly associated with caregivers' perceived social support after the other factors were adjusted. Adult child caregivers with higher levels of mutuality, education, or monthly income; who are fully employed or are retired; who have a co-carer; or who are caring for a father perceived more social support. Nursing strategies and social policies need to be directed to enhance caregiver mutuality and support caregiving efforts.
Lord-Wingersky Algorithm Version 2.0 for Hierarchical Item Factor Models with Applications in Test Scoring, Scale Alignment, and Model Fit Testing.

PubMed

Cai, Li

2015-06-01

Lord and Wingersky's (Appl Psychol Meas 8:453-461, 1984) recursive algorithm for creating summed score based likelihoods and posteriors has a proven track record in unidimensional item response theory (IRT) applications. Extending the recursive algorithm to handle multidimensionality is relatively simple, especially with fixed quadrature because the recursions can be defined on a grid formed by direct products of quadrature points. However, the increase in computational burden remains exponential in the number of dimensions, making the implementation of the recursive algorithm cumbersome for truly high-dimensional models. In this paper, a dimension reduction method that is specific to the Lord-Wingersky recursions is developed. This method can take advantage of the restrictions implied by hierarchical item factor models, e.g., the bifactor model, the testlet model, or the two-tier model, such that a version of the Lord-Wingersky recursive algorithm can operate on a dramatically reduced set of quadrature points. For instance, in a bifactor model, the dimension of integration is always equal to 2, regardless of the number of factors. The new algorithm not only provides an effective mechanism to produce summed score to IRT scaled score translation tables properly adjusted for residual dependence, but leads to new applications in test scoring, linking, and model fit checking as well. Simulated and empirical examples are used to illustrate the new applications.
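For readers unfamiliar with the recursion this abstract builds on, the following is a minimal sketch of the classical unidimensional Lord-Wingersky recursion for dichotomous items on a fixed quadrature grid. It is not the dimension-reduced Version 2.0 algorithm. The function names, the 2PL parameterization (`a`, `b`), and the grid are illustrative assumptions, not taken from the paper.

```python
import numpy as np

def lord_wingersky(p):
    """Summed-score likelihoods at one quadrature point.

    p: correct-response probabilities of each dichotomous item at a fixed theta.
    Returns an array whose entry s is P(summed score = s | theta).
    """
    like = np.array([1.0])  # before any item, score 0 has probability 1
    for p_i in p:
        new = np.zeros(len(like) + 1)
        new[:-1] += like * (1.0 - p_i)  # item incorrect: score unchanged
        new[1:] += like * p_i           # item correct: score increases by 1
        like = new
    return like

def summed_score_posterior(a, b, nodes, weights):
    """Posterior over quadrature nodes for each summed score (assumed 2PL items)."""
    # Rows are quadrature nodes, columns are items.
    P = 1.0 / (1.0 + np.exp(-a * (nodes[:, None] - b)))
    L = np.array([lord_wingersky(row) for row in P])  # shape (nodes, scores)
    joint = L * weights[:, None]                      # weight by the prior
    return joint / joint.sum(axis=0)                  # normalize per score
```

The cost of the grid grows exponentially when `nodes` becomes a direct product over several dimensions, which is the burden the abstract's dimension reduction is meant to address.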
Psychometric properties and a latent class analysis of the 12-item World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) in a pooled dataset of community samples.

PubMed

MacLeod, Melissa A; Tremblay, Paul F; Graham, Kathryn; Bernards, Sharon; Rehm, Jürgen; Wells, Samantha

2016-12-01

The 12-item World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0) is a brief measurement tool used cross-culturally to capture the multi-dimensional nature of disablement through six domains, including: understanding and interacting with the world; moving and getting around; self-care; getting on with people; life activities; and participation in society. Previous psychometric research supports that the WHODAS 2.0 functions as a general factor of disablement. In a pooled dataset from community samples of adults (N = 447) we used confirmatory factor analysis to confirm a one-factor structure. Latent class analysis was used to identify subgroups of individuals based on their patterns of responses. We identified four distinct classes, or patterns of disablement: (1) pervasive disability; (2) physical disability; (3) emotional, cognitive, or interpersonal disability; (4) no/low disability. Convergent validity of the latent class subgroups was found with respect to socio-demographic characteristics, number of days affected by disabilities, stress, mental health, and substance use. These classes offer a simple and meaningful way to classify people with disabilities based on the 12-item WHODAS 2.0. Focusing on individuals with a high probability of being in the first three classes may help guide interventions. Copyright © 2016 John Wiley & Sons, Ltd.
Trajectories of Multidimensional Caregiver Burden in Chinese Informal Caregivers for Dementia: Evidence from Exploratory and Confirmatory Factor Analysis of the Zarit Burden Interview.

PubMed

Li, Dan; Hu, Nan; Yu, Yueyi; Zhou, Aihong; Li, Fangyu; Jia, Jianping

2017-01-01

Despite its popularity, the latent structure of the 22-item Zarit Burden Interview (ZBI) remains unclear. No study has explored how multidimensional caregiver burden changes. The aim of this work was to validate the latent structure of the ZBI and to investigate how multidimensional burden evolves with increasing global burden. We studied 1,132 dyads of dementia patients and their informal caregivers. The caregivers completed the ZBI and a questionnaire regarding caregiving. The total sample was randomly split into two equal subsamples. Exploratory factor analysis (EFA) was performed in the first subsample. In the second subsample, confirmatory factor analysis (CFA) was conducted to validate models generated from EFA. The mean weighted factor score was calculated to assess the change in each burden dimension against the increasing ZBI total score. The results of EFA and CFA indicated that a five-factor structure, comprising role strain, personal strain, incompetency, dependency, and guilt, had the best goodness-of-fit. The trajectories of multidimensional burden suggested that three different dimensions (guilt, role strain and personal strain) became the main subtype of burden in sequence as the ZBI total score increased from mild to moderate. The dependency factor contributed prominently to the total burden in the severe stage. The five-factor ZBI is a psychometrically robust measure for assessing multidimensional burden in Chinese caregivers. The changes in multidimensional burden deepen our understanding of the psychological characteristics of caregiving beyond a single total score and may be useful for developing interventions to reduce caregiver burden.
Validation of the Dutch version of the Swallowing Quality-of-Life Questionnaire (DSWAL-QoL) and the adjusted DSWAL-QoL (aDSWAL-QoL) using item analysis with the Rasch model: a pilot study.

PubMed

Simpelaere, Ingeborg S; Van Nuffelen, Gwen; De Bodt, Marc; Vanderwegen, Jan; Hansen, Tina

2017-04-07

The Swallowing Quality-of-Life Questionnaire (SWAL-QoL) is considered the gold standard for assessing health-related QoL in oropharyngeal dysphagia. The Dutch translation (DSWAL-QoL) and its adjusted version (aDSWAL-QoL) have been validated using classical test theory (CTT). However, these scales have not been tested against the Rasch measurement model, which is required to establish the structural validity and objectivity of the total scale and subscale scores. Thus, the purpose of this study was to examine the psychometric properties of these scales using item analysis according to the Rasch model. Item analysis with the Rasch model was performed using RUMM2030 software with previously collected data from a validation study of 108 patients. The assessment included evaluations of overall model fit, reliability, unidimensionality, threshold ordering, individual item and person fits, differential item functioning (DIF), local item dependency (LID) and targeting. The analysis could not establish the psychometric properties of either of the scales or their subscales because they did not fit the Rasch model, and multidimensionality, disordered thresholds, DIF, and/or LID were found. The reliability and power of fit were high for the total scales (PSI = 0.93) but low for most of the subscales (PSI < 0.70). The targeting of persons and items was suboptimal. The main source of misfit was disordered thresholds for both the total scales and subscales. Based on the results of the analysis, adjustments to improve the scales were implemented as follows: disordered thresholds were rescaled, misfit items were removed and items were split for DIF. However, the multidimensionality and LID could not be resolved. The reliability and power of fit remained low for most of the subscales. This study represents the first analyses of the DSWAL-QoL and aDSWAL-QoL with the Rasch model. Relying on the DSWAL-QoL and aDSWAL-QoL total and subscale scores to make conclusions regarding dysphagia-related HRQoL should be treated with caution before the structural validity and objectivity of both scales have been established. A larger and well-targeted sample is recommended to derive definitive conclusions about the items and scales. Solutions for the psychometric weaknesses suggested by the model and practical implications are discussed.
The Interaction with Disabled Persons scale: revisiting its internal consistency and factor structure, and examining item-level properties.

PubMed

Iacono, Teresa; Tracy, Jane; Keating, Jenny; Brown, Ted

2009-01-01

The Interaction with Disabled Persons scale (IDP) has been used in research into baseline attitudes and to evaluate whether a shift in attitudes towards people with developmental disabilities has occurred following some form of intervention. This research has been conducted on the assumption that the IDP measures attitudes as a multidimensional construct and has good internal consistency. Such assumptions about the IDP appear flawed, particularly in light of failures to replicate its underlying factor structure. The aim of this study was to evaluate the construct validity and dimensionality of the IDP. This study used a prospective survey approach. Participants were recruited from first and second year undergraduate university students enrolled in health sciences, occupational therapy, physiotherapy, community and emergency health, nursing, and combined degrees of nursing and midwifery, and health sciences and social work at a large Australian university (n=373). Students completed the IDP, a 20-item self-report scale of attitudes towards people with disabilities. The IDP data were analysed using a combination of factor analysis (Classical Test Theory approach) and Rasch analysis (Item Response Theory approach). The results indicated that the original IDP 6-factor solution was not supported. Instead, one factor consisting of five IDP items (9, 11, 12, 17, and 18), labelled Discomfort, met the four criteria for empirical validation of test quality: interval level scaling (scalability), unidimensionality, lack of DIF across the two participant groups and data collection occasions, and hierarchical ordering. Researchers should consider using the Discomfort subscale of the IDP in future attitude research since it exhibits sound measurement properties.
Development and validation of the patient evaluation scale (PES) for primary health care in Nigeria.

PubMed

Ogaji, Daprim S; Giles, Sally; Daker-White, Gavin; Bower, Peter

2017-03-01

Questionnaires developed for patient evaluation of the quality of primary care are often focussed on primary care systems in developed countries. Aim: To report the development and validation of the patient evaluation scale (PES) designed for use in the Nigerian primary health care context. An iterative process was used to develop and validate the questionnaire using patients attending 28 primary health centres across eight states in Nigeria. The development involved literature review, patient interviews, expert reviews, cognitive testing with patients and waves of quantitative cross-sectional surveys. The questionnaire's content validity, internal structure, acceptability, reliability and construct validity are reported. Findings: The full and shortened versions of the PES, with 27 and 18 items, respectively, were developed through this process. The low item non-response in the serial cross-sectional surveys indicates the questionnaire's acceptability among the local population. The PES-short form (SF) has a Cronbach's α of 0.87 and three domains (codenamed 'facility', 'organisation' and 'health care') with Cronbach's αs of 0.78, 0.79 and 0.81, respectively. Items in the multi-dimensional questionnaire demonstrated adequate convergent and discriminant properties. PES-SF scores show significant positive correlation with scores of the full PES and also discriminated population groups in support of a priori hypotheses. The PES and PES-SF contain items that are relevant to the needs of patients in Nigeria. The good measurement properties of the questionnaire demonstrate its potential usefulness for patient-focussed quality improvement activities in Nigeria. There is still a need to translate these questionnaires into the major languages of Nigeria and to assess their validity against external quality criteria.
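As a reminder of what the Cronbach's α values in this abstract summarize, here is a minimal sketch of the standard formula, α = k/(k−1) · (1 − Σ item variances / variance of the total score). The function name and the respondents-by-items layout are assumptions for illustration. This is not the authors' analysis code.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix."""
    X = np.asarray(scores, dtype=float)
    k = X.shape[1]                              # number of items
    item_var = X.var(axis=0, ddof=1).sum()      # sum of item sample variances
    total_var = X.sum(axis=1).var(ddof=1)       # variance of the summed score
    return k / (k - 1) * (1.0 - item_var / total_var)
```

Applied per domain (each domain's item columns passed separately), this would give one coefficient per subscale, analogous to the three domain values reported.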
Development and validation of the Bullying and Cyberbullying Scale for Adolescents: A multi-dimensional measurement model.

PubMed

Thomas, Hannah J; Scott, James G; Coates, Jason M; Connor, Jason P

2018-05-03

Intervention on adolescent bullying is reliant on valid and reliable measurement of victimization and perpetration experiences across different behavioural expressions. This study developed and validated a survey tool that integrates measurement of both traditional and cyber bullying to test a theoretically driven multi-dimensional model. Adolescents from 10 mainstream secondary schools completed a baseline and follow-up survey (N = 1,217; M age = 14 years; 66.2% male). The Bullying and Cyberbullying Scale for Adolescents (BCS-A) developed for this study comprised parallel victimization and perpetration subscales, each with 20 items. Additional measures of bullying (Olweus Global Bullying and the Forms of Bullying Scale [FBS]), as well as measures of internalizing and externalizing problems, school connectedness, social support, and personality, were used to further assess validity. Factor structure was determined, and then the suitability of items was assessed according to the following criteria: (1) factor interpretability, (2) item correlations, (3) model parsimony, and (4) measurement equivalence across victimization and perpetration experiences. The final models comprised four factors: physical, verbal, relational, and cyber. The final scale was revised to two 13-item subscales. The BCS-A demonstrated acceptable concurrent and convergent validity (internalizing and externalizing problems, school connectedness, social support, and personality), as well as predictive validity over 6 months. The BCS-A has sound psychometric properties. This tool establishes measurement equivalence across types of involvement and behavioural forms common among adolescents. An improved measurement method could add greater rigour to the evaluation of intervention programmes and also enable interventions to be tailored to subscale profiles. © 2018 The British Psychological Society.
Nurses' Attitudes Regarding the Safe Handling of Patients Who Are Morbidly Obese: Instrument Development and Psychometric Analysis.

PubMed

Bejciy-Spring, Susan; Vermillion, Brenda; Morgan, Sally; Newton, Cheryl; Chucta, Sheila; Gatens, Cindy; Zadvinskis, Inga; Holloman, Christopher; Chipps, Esther

2016-12-01

Nurses' attitudes play an important role in the consistent practice of safe patient handling behaviors. The purposes of this study were to develop and assess the psychometric properties of a newly developed instrument measuring attitudes of nurses related to the care and safe handling of patients who are obese. Phases of instrument development included (a) item generation, (b) content validity assessment, (c) reliability assessment, (d) cognitive interviewing, and (e) construct validity assessment through factor analysis. The final data from the exploratory factor analysis produced a 26-item multidimensional instrument that contains 9 subscales. Based on the factor analysis, a 26-item instrument can be used to examine nurses' attitudes regarding patients who are morbidly obese and related safe handling practices.
Validation of the partner version of the multidimensional vaginal penetration disorder questionnaire: A tool for clinical assessment of lifelong vaginismus in a sample of Iranian population.

PubMed

Molaeinezhad, Mitra; Khoei, Effat Merghati; Salehi, Mehrdad; Yousfy, Alireza; Roudsari, Robab Latifnejad

2014-01-01

The spousal response is believed to be an important factor in a woman's experience of pain during vaginal penetration attempts; however, studies in this area are rather limited. The aim of this study was to develop and investigate the psychometric indexes of the partner version of a multidimensional vaginal penetration disorder questionnaire (PV-MVPDQ), in order to make the clinical assessment of spousal psychosexual reactions to vaginismus easier for specialists. A mixed-methods sequential exploratory design was used, in which the findings from a thematic qualitative study of 20 unconsummated couples, followed by an extensive literature review, were used to develop the PV-MVPDQ. In a cross-sectional design, a consecutive sample of 214 men whose wives suffered from lifelong vaginismus (LLV), based on Diagnostic and Statistical Manual of Mental Disorders, 4th edition, text revision (DSM-IV-TR) criteria, completed the questionnaire and additional questions regarding their demographic and sexual history. Validity and reliability were assessed by exploratory factor analysis (EFA) and Cronbach's alpha coefficient using SPSS version 16 (SPSS Inc.; IBM Corporation, Armonk, USA). After EFA, the PV-MVPDQ emerged as having 40 items and 7 dimensions: helplessness, sexual information, vicious cycle of penetration, hypervigilance and solicitous, catastrophic cognitions, sexual and marital adjustment, and optimism. The PV-MVPDQ subscales showed significant reliability (0.71-0.85), and test-retest results were satisfactory. The present study shows that the PV-MVPDQ is a valid and reliable multidimensional self-report questionnaire for assessing cognitions and sexual and marital relations related to vaginal penetration in spouses of women with LLV. It may give specialists a basis for clinical judgment and for planning appropriate clinical management.
Validation of the partner version of the multidimensional vaginal penetration disorder questionnaire: A tool for clinical assessment of lifelong vaginismus in a sample of Iranian population

PubMed Central

Molaeinezhad, Mitra; Khoei, Effat Merghati; Salehi, Mehrdad; Yousfy, Alireza; Roudsari, Robab Latifnejad

2014-01-01

Background: The spousal response is believed to be an important factor in a woman's experience of pain during vaginal penetration attempts; however, studies in this area are rather limited. The aim of this study was to develop and investigate the psychometric indexes of the partner version of a multidimensional vaginal penetration disorder questionnaire (PV-MVPDQ), in order to make the clinical assessment of spousal psychosexual reactions to vaginismus easier for specialists. Materials and Methods: A mixed-methods sequential exploratory design was used, in which the findings from a thematic qualitative study of 20 unconsummated couples, followed by an extensive literature review, were used to develop the PV-MVPDQ. In a cross-sectional design, a consecutive sample of 214 men whose wives suffered from lifelong vaginismus (LLV), based on Diagnostic and Statistical Manual of Mental Disorders, 4th edition, text revision (DSM-IV-TR) criteria, completed the questionnaire and additional questions regarding their demographic and sexual history. Validity and reliability were assessed by exploratory factor analysis (EFA) and Cronbach's alpha coefficient using SPSS version 16 (SPSS Inc.; IBM Corporation, Armonk, USA). Results: After EFA, the PV-MVPDQ emerged as having 40 items and 7 dimensions: helplessness, sexual information, vicious cycle of penetration, hypervigilance and solicitous, catastrophic cognitions, sexual and marital adjustment, and optimism. The PV-MVPDQ subscales showed significant reliability (0.71-0.85), and test-retest results were satisfactory. Conclusion: The present study shows that the PV-MVPDQ is a valid and reliable multidimensional self-report questionnaire for assessing cognitions and sexual and marital relations related to vaginal penetration in spouses of women with LLV. It may give specialists a basis for clinical judgment and for planning appropriate clinical management. PMID:25540787
Multidimensional Scoring of Abilities: The Ordered Polytomous Response Case

ERIC Educational Resources Information Center

de la Torre, Jimmy

2008-01-01

Recent work has shown that multidimensionally scoring responses from different tests can provide better ability estimates. For educational assessment data, applications of this approach have been limited to binary scores. Of the different variants, the de la Torre and Patz model is considered more general because implementing the scoring procedure…
[Adaptation and validation of the CCAENA(©) scale for the measurement of continuity of care between healthcare levels in Colombia and Brazil].

PubMed

Garcia-Subirats, Irene; Aller, Marta Beatriz; Vargas Lorenzo, Ingrid; Vázquez Navarrete, María Luisa

2015-01-01

To adapt and validate the scale of the questionnaire Continuity of Care between Care Levels (CCAENA(©)) in the context of the Colombian and Brazilian health systems. The study consisted of two phases: 1) adaptation of the CCAENA(©) scale to the context of each country, which was tested by two pretests and a pilot test, and 2) validation by means of application of the scale in a population survey in Colombia and Brazil. The following psychometric properties were analyzed: construct validity (exploratory factor analysis), internal consistency (Cronbach's alpha and item-rest correlations), the multidimensionality of the scales (Spearman correlation coefficients), and known group validity (chi-square test). Of the 21 items of the original scale, 14 were selected and reformulated from a statement with agreement response options into a question with frequency response options. Factor analysis showed that items could be grouped into three factors: continuity across healthcare levels, the patient-primary care provider relationship, and the patient-secondary care provider relationship. Cronbach's alpha indicated good internal consistency (>0.80 in all the scales). The correlation coefficients suggest that the three factors could be interpreted as separate scales (<0.70), and the scales had adequate ability to differentiate between groups. The adapted version of the CCAENA(©) shows adequate validity and reliability in both countries, maintaining a high equivalence with the original version. It is a useful and feasible tool to assess the continuity of care between healthcare levels from the users' perspective in both contexts. Copyright © 2014 SESPAS. Published by Elsevier Espana. All rights reserved.
Towards Tailored Patient's Management Approach: Integrating the Modified 2010 ACR Criteria for Fibromyalgia in Multidimensional Patient Reported Outcome Measures Questionnaire

PubMed Central

El Miedany, Yasser; El Gaafary, Maha; Youssef, Sally; Ahmed, Ihab

2016-01-01

Objectives. To assess the validity, reliability, and responsiveness to change of a patient self-reported questionnaire combining the Widespread Pain Index and the Symptom Severity Score as well as construct outcome measures and comorbidities assessment in fibromyalgia patients. Methods. The PROMs-FM was conceptualized based on frameworks used by the WHO Quality of Life tool and the PROMIS. Initially, cognitive interviews were conducted to identify an item pool of questions. Item selection and reduction were based on input from patients as well as an interdisciplinary group of specialists. Rasch and internal consistency reliability analyses were implemented. The questionnaire included the modified ACR criteria main items (Symptom Severity Score and Widespread Pain Index), in addition to assessment of functional disability, quality of life (QoL), review of the systems, and comorbidities. Every patient completed HAQ and EQ-5D questionnaires. Results. A total of 146 fibromyalgia patients completed the questionnaire. The PROMs-FM questionnaire was reliable as demonstrated by a high standardized alpha (0.886–0.982). Content construct assessment of the functional disability and QoL revealed significant correlation (p < 0.01) with both HAQ and EQ-5D. Changes in functional disability and QoL showed significant (p < 0.01) variation with disease activity status in response to therapy. There was a higher prevalence of autonomic symptoms, CVS risk, sexual dysfunction, and falling. Conclusions. The developed PROMs-FM questionnaire is a reliable and valid instrument for assessment of fibromyalgia patients. A phased treatment regimen depending on the severity of FMS as well as preferences and comorbidities of the patient is the best approach to tailored patient management. PMID:27190648
Identifying content for the glaucoma-specific item bank to measure quality-of-life parameters.

PubMed

Khadka, Jyoti; McAlinden, Colm; Craig, Jamie E; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

2015-01-01

Patient-reported outcomes (PROs) have become essential clinical trial end points. However, a comprehensive, multidimensional, patient-relevant, and precise glaucoma-specific PRO instrument is not available. Therefore, the purpose of this study was to identify content for a new, glaucoma-specific, quality-of-life (QOL) item bank. Content identification was undertaken in 5 phases: (1) identification of extant items in glaucoma-specific instruments and the qualitative literature; (2) focus groups and interviews with glaucoma patients; (3) item classification and selection; (4) expert review and revision of items; and (5) cognitive interviews with patients. A total of 737 unique items (extant items from PRO instruments, 247; qualitative articles, 14 items; focus groups and semistructured interviews, 476 items) were identified. These items were classified into 10 QOL domains. Four criteria (item redundancy, item inconsistent with domain definition, item content too narrow to have wider applicability, and item clarity) were used to remove and refine the items. After the cognitive interviews, the final minimally representative item set had a total of 342 unique items belonging to 10 domains: activity limitation (88), mobility (20), visual symptoms (19), ocular surface symptoms (22), general symptoms (15), convenience (39), health concerns (45), emotional well-being (49), social issues (23), and economic issues (22). The systematic content identification process identified 10 QOL domains, which were important to patients with glaucoma. The majority of the items were identified from the patient-specific focus groups and semistructured interviews suggesting that the existing PRO instruments do not adequately address QOL issues relevant to individuals with glaucoma.
A Study of Two Instructional Sequences Informed by Alternative Learning Progressions in Genetics

NASA Astrophysics Data System (ADS)

Duncan, Ravit Golan; Choi, Jinnie; Castro-Faix, Moraima; Cavera, Veronica L.

2017-12-01

Learning progressions (LPs) are hypothetical models of how learning in a domain develops over time with appropriate instruction. In the domain of genetics, there are two independently developed alternative LPs. The main difference between the two progressions hinges on their assumptions regarding the accessibility of classical (Mendelian) versus molecular genetics and the order in which they should be taught. In order to determine the relative difficulty of the different genetic ideas included in the two progressions, and to test which one is a better fit with students' actual learning, we developed two modules in classical and molecular genetics and alternated their sequence in an implementation study with 11th grade students studying biology. We developed a set of 56 ordered multiple-choice items that collectively assessed both molecular and classical genetic ideas. We found significant gains in students' learning in both molecular and classical genetics, with the largest gain relating to understanding the informational content of genes and the smallest gain in understanding modes of inheritance. Using multidimensional item response modeling, we found no statistically significant differences between the two instructional sequences. However, there was a trend of slightly higher gains for the molecular-first sequence for all genetic ideas.
Rasch analysis of the Patient Rated Elbow Evaluation questionnaire.

PubMed

Vincent, Joshua I; MacDermid, Joy C; King, Graham J W; Grewal, Ruby

2015-06-20

The Patient Rated Elbow Evaluation (PREE) was developed as an elbow joint specific measure of pain and disability and validated with classical psychometric methods. More recently, Rasch analysis has contributed new methods for analyzing the clinical measurement properties of self-report outcome measures. The objective of the study was to determine aspects of validity of the PREE using the Rasch model to assess the overall fit of the PREE data, the response scaling, individual item fit, differential item functioning (DIF), local dependency, unidimensionality and person separation index (PSI). A convenience sample of 236 patients (age range 21-79 years; male:female = 97:139) with elbow disorders was recruited from the Roth | McFarlane Hand and Upper Limb Centre, London, Ontario, Canada. The baseline scores of the PREE were used. Rasch analysis was conducted using RUMM 2030 software on the 3 subscales of the PREE separately. The 3 subscales initially showed misfit, with disordered thresholds (on 17 out of 20 items), uniform DIF for two items ("Carrying a 10lbs object" from the specific activities subscale for age group; and "household work" from the usual activities subscale for gender), multidimensionality, and local dependency. The Pain subscale satisfied Rasch expectations when item 2 "Pain - At rest" was split for age group, while the usual activities subscale readily stood up to Rasch requirements when item 2 "household work" was split for gender. The specific activities subscale demonstrated fit to the Rasch model when subtest analysis accounted for local dependency. All three subscales of the PREE were well targeted and had high reliability (PSI >0.80). The three subscales of the PREE appear to be robust when tested against the Rasch model when subject to a few alterations. The value of changing the 0-10 format is questionable given its widespread use; further Rasch-based analysis of whether these findings are stable in other samples is warranted.
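To make "disordered thresholds" concrete, the following is a rough sketch of category probabilities under a polytomous Rasch (partial credit) parameterization, with a simple check of threshold ordering. The function names and threshold values are hypothetical. This is not RUMM 2030's estimation procedure, only the response model that such analyses assume.

```python
import numpy as np

def pcm_probs(theta, thresholds):
    """Category probabilities for one polytomous item at person location theta.

    thresholds: threshold locations tau_1..tau_m (hypothetical values).
    P(X = k) is proportional to exp(sum over j <= k of (theta - tau_j)).
    """
    tau = np.asarray(thresholds, dtype=float)
    # Category 0 corresponds to the empty sum, i.e. a logit of 0.
    logits = np.concatenate(([0.0], np.cumsum(theta - tau)))
    logits -= logits.max()          # stabilize the exponentials
    p = np.exp(logits)
    return p / p.sum()

def thresholds_ordered(thresholds):
    """True when each threshold lies above the previous one."""
    return bool(np.all(np.diff(np.asarray(thresholds, dtype=float)) > 0))
```

When `thresholds_ordered` returns False for estimated thresholds, some response category is never the most probable one at any `theta`, which is the usual motivation for collapsing or rescoring categories.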
Translating patient reported outcome measures: methodological issues explored using cognitive interviewing with three rheumatoid arthritis measures in six European languages.

PubMed

Hewlett, Sarah; Nicklin, Joanna; Bode, Chistina; Carmona, Loreto; Dures, Emma; Engelbrecht, Matthias; Hagel, Sofia; Kirwan, John; Molto, Anna; Redondo, Marta; Gossec, Laure

2016-06-01

Cross-cultural translation of patient-reported outcome measures (PROMs) is a lengthy process, often performed professionally. Cognitive interviewing assesses patient comprehension of PROMs. The objective was to evaluate the usefulness of cognitive interviewing to assess translations and compare professional (full) with non-professional (simplified) translation processes. A full protocol used for the Bristol RA Fatigue Multi-dimensional Questionnaire and Numerical Rating Scale (BRAF-MDQ, BRAF-NRS) was compared with a simplified protocol used for the RA Impact of Disease scale (RAID). RA patients in the UK, France, the Netherlands, Germany, Spain and Sweden completed the PROMs during cognitive interviewing (BRAFs in the UK were omitted as these were performed during development). Transcripts were deductively analysed for understanding, information retrieval, judgement and response options. Usefulness of cognitive interviewing was assessed by the nature of problems identified, and translation processes by percentage of consistently problematic items (⩾40% patients per country with similar concerns). Sixty patients participated (72% women). For the BRAFs (full protocol) one problematic item was identified (of 23 items × 5 languages, 1/115 = 0.9%). For the RAID (simplified protocol) two problematic items were identified (of 7 items × 6 languages, 2/42 = 4.8%), of which one was revised (Dutch). Coping questions were problematic in both PROMs. Conceptual and cultural challenges though rare were important, as identified by formal evaluation, demonstrating that cognitive interviewing is crucial in PROM translations. Proportionately fewer problematic items were found for the full than for the simplified translation procedure, suggesting that while both are acceptable, professional PROM translation might be preferable. Coping may be a particularly challenging notion cross-culturally. © The Author 2016. 
Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
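The percentages of consistently problematic items quoted in this abstract follow from dividing the number of flagged items by the number of item-by-language combinations. A minimal sketch of that arithmetic, using only the counts stated in the abstract (the function name is an assumption made for illustration):

```python
def problematic_share(problem_items: int, items: int, languages: int) -> float:
    """Percentage of item-by-language combinations flagged as problematic."""
    total = items * languages
    return 100.0 * problem_items / total


# Counts as reported in the abstract.
braf = problematic_share(1, 23, 5)  # full protocol: 1 of 115
raid = problematic_share(2, 7, 6)   # simplified protocol: 2 of 42
print(f"BRAF: {braf:.1f}%  RAID: {raid:.1f}%")
```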
Methodological Measurement Fruitfulness of Exploratory Structural Equation Modeling (ESEM): New Approaches to Key Substantive Issues in Motivation and Engagement

ERIC Educational Resources Information Center

Marsh, Herbert W.; Liem, Gregory Arief D.; Martin, Andrew J.; Morin, Alexandre J. S.; Nagengast, Benjamin

2011-01-01

The most popular measures of multidimensional constructs typically fail to meet standards of good measurement: goodness of fit, measurement invariance, lack of differential item functioning, and well-differentiated factors that are not so highly correlated as to detract from their discriminant validity. Part of the problem, the authors argue, is…
On the Factor Structure of the Beck Depression Inventory-II: G Is the Key

ERIC Educational Resources Information Center

Brouwer, Danny; Meijer, Rob R.; Zevalkink, Jolien

2013-01-01

The Beck Depression Inventory-II (BDI-II; Beck, Steer, & Brown, 1996) is intended to measure severity of depression, and because items represent a broad range of depressive symptoms, some multidimensionality exists. In recent factor-analytic studies, there has been a debate about whether the BDI-II can be considered as one scale or whether…
The development of a multi-dimensional gambling accessibility scale.

PubMed

Hing, Nerilee; Haw, John

2009-12-01

The aim of the current study was to develop a scale of gambling accessibility that would have theoretical significance to exposure theory and also serve to highlight the accessibility risk factors for problem gambling. Scale items were generated from the Productivity Commission's (Australia's Gambling Industries: Report No. 10. AusInfo, Canberra, 1999) recommendations and tested on a group with high exposure to the gambling environment. In total, 533 gaming venue employees (aged 18-70 years; 67% women) completed a questionnaire that included six 13-item scales measuring accessibility across a range of gambling forms (gaming machines, keno, casino table games, lotteries, horse and dog racing, sports betting). Also included in the questionnaire was the Problem Gambling Severity Index (PGSI) along with measures of gambling frequency and expenditure. Principal components analysis indicated that a common three factor structure existed across all forms of gambling and these were labelled social accessibility, physical accessibility and cognitive accessibility. However, convergent validity was not demonstrated with inconsistent correlations between each subscale and measures of gambling behaviour. These results are discussed in light of exposure theory and the further development of a multi-dimensional measure of gambling accessibility.
Measuring Sexual Orientation: A Review and Critique of U.S. Data Collection Efforts and Implications for Health Policy.

PubMed

Wolff, Margaret; Wells, Brooke; Ventura-DiPersia, Christina; Renson, Audrey; Grov, Christian

The U.S. Department of Health and Human Services' (HHS) Healthy People 2020 goals sought to improve health outcomes among sexual minorities; HHS acknowledged that a dearth of sexual orientation items in federal and state health surveys obscured a broad understanding of sexual minority-related health disparities. The HHS 2011 data progression plan aimed to advance sexual orientation data collection efforts at the national level. Sexual orientation is a complex, multidimensional construct often composed of sexual identity, sexual attraction, and sexual behavior, thus posing challenges to its quantitative and practical measurement and analysis. In this review, we (a) present existing sexual orientation constructs; (b) evaluate current HHS sexual orientation data collection efforts; (c) review post-2011 data progression plan research on sexual minority health disparities, drawing on HHS survey data; (d) highlight the importance of and (e) identify obstacles to multidimensional sexual orientation measurement and analysis; and (f) discuss methods for multidimensional sexual orientation analysis and propose a matrix for addressing discordance/branchedness within these analyses. Multidimensional sexual orientation data collection and analysis would elucidate sexual minority-related health disparities, guide related health policies, and enhance population-based estimates of sexual minority individuals to steer health care practices.
Recent status scores for version 6 of the Addiction Severity Index (ASI-6).

PubMed

Cacciola, John S; Alterman, Arthur I; Habing, Brian; McLellan, A Thomas

2011-09-01

To describe the derivation of recent status scores (RSSs) for version 6 of the Addiction Severity Index (ASI-6). 118 ASI-6 recent status items were subjected to nonparametric item response theory (NIRT) analyses followed by confirmatory factor analysis (CFA). Generalizability and concurrent validity of the derived scores were determined. A total of 607 recent admissions to a variety of substance abuse treatment programs constituted the derivation sample; a subset (n = 252) comprised the validity sample. The ASI-6 interview and a validity battery of primarily self-report questionnaires that included at least one measure corresponding to each of the seven ASI domains were administered. Nine summary scales describing recent status that achieved or approached both high scalability and reliability were derived; one scale for each of six areas (medical, employment/finances, alcohol, drug, legal, psychiatric) and three scales for the family/social area. Intercorrelations among the RSSs also supported the multi-dimensionality of the ASI-6. Concurrent validity analyses yielded strong evidence supporting the validity of six of the RSSs (medical, alcohol, drug, employment, family/social problems, psychiatric). Evidence was weaker for the legal, family/social support and child problems RSSs. Generalizability analyses of the scales to males versus females and whites versus blacks supported the comparability of the findings, with slight exceptions. The psychometric analyses to derive Addiction Severity Index version 6 recent status scores support the multi-dimensionality of the Addiction Severity Index version 6 (i.e. the relative independence of different life functioning areas), consistent with research on earlier editions of the instrument. In general, the Addiction Severity Index version 6 scales demonstrate acceptable scalability, reliability and concurrent validity. 
While questions remain about the generalizability of some scales to population subgroups, the overall findings coupled with updated and more extensive content in the Addiction Severity Index version 6 support its use in clinical practice and research. © 2011 The Authors, Addiction © 2011 Society for the Study of Addiction.
Recent Status Scores for Version 6 of the Addiction Severity Index (ASI-6)

PubMed Central

Cacciola, John S.; Alterman, Arthur I; Habing, Brian; McLellan, A. Thomas

2012-01-01

Aims: To describe the derivation of Recent Status Scores (RSSs) for Version 6 of the Addiction Severity Index (ASI-6). Design: 118 ASI-6 recent status items were subjected to nonparametric item response theory (NIRT) analyses followed by confirmatory factor analysis (CFA). Generalizability and concurrent validity of the derived scores were determined. Setting and Participants: 607 recent admissions to a variety of substance abuse treatment programs constituted the derivation sample; a subset (N = 254) comprised the validity sample. Measurements: The ASI-6 interview and a validity battery of primarily self-report questionnaires that included at least one measure corresponding to each of the seven ASI domains were administered. Findings: Nine summary scales describing recent status that achieved or approached both high scalability and reliability were derived; one scale for each of six areas (medical, employment/finances, alcohol, drug, legal, psychiatric), and three scales for the family/social area. Intercorrelations among the RSSs also supported the multidimensionality of the ASI-6. Concurrent validity analyses yielded strong evidence supporting the validity of six of the RSSs (Medical, Alcohol, Drug, Employment, Family/Social Problems, Psychiatric). Evidence was weaker for the Legal, Family/Social Support and Child Problems RSSs. Generalizability analyses of the scales to males versus females and whites versus blacks supported the comparability of the findings with slight exceptions. Conclusions: The psychometric analyses to derive Addiction Severity Index-6 Recent Status Scores (RSSs) support the multidimensionality of the ASI-6 (i.e., the relative independence of different life functioning areas), consistent with research on earlier editions of the instrument. In general, the ASI-6 scales demonstrate acceptable scalability, reliability and concurrent validity. 
While questions remain about the generalizability of some scales to population subgroups, the overall findings coupled with updated and more extensive content in the ASI-6 support its use in clinical practice and research. PMID:21545666
Classification of natural and supernatural causes of mental distress. Development of a Mental Distress Explanatory Model Questionnaire.

PubMed

Eisenbruch, M

1990-11-01

This paper describes the background and development of a Mental Distress Explanatory Model Questionnaire designed to explore how people from different cultures explain mental distress. A 45-item questionnaire was developed with items derived from the Murdock et al. categories, with additional items covering western notions of physiological causation and stress. The questionnaire was administered to 261 people, mostly college students. Multi-dimensional scaling analysis shows four clusters of mental distress: a) stress; b) western physiological; c) nonwestern physiological; and d) supernatural. These clusters form two dimensions: western physiological vs. supernatural and impersonal vs. personalistic explanations. Natural and stress items are separated from supernatural and nonwestern physiological items along the first dimension. Brain damage, physical illness, and genetic defects have the greatest separation along the first dimension. Being hot, the body being out of balance, and wind currents passing through the body most strongly represent the non-western physiological category. The questionnaire has the potential to be used for community health screening and for monitoring patient care, as well as with students in the health sciences and with health practitioners.

Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

ERIC Educational Resources Information Center

Andrews, Benjamin James

2011-01-01

The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…
High-Performance Psychometrics: The Parallel-E Parallel-M Algorithm for Generalized Latent Variable Models. Research Report. ETS RR-16-34

ERIC Educational Resources Information Center

von Davier, Matthias

2016-01-01

This report presents results on a parallel implementation of the expectation-maximization (EM) algorithm for multidimensional latent variable models. The developments presented here are based on code that parallelizes both the E step and the M step of the parallel-E parallel-M algorithm. Examples presented in this report include item response…
Using an analytical hierarchy process (AHP) for weighting items of a measurement scale: a pilot study.

PubMed

Benaïm, C; Perennou, D-A; Pelissier, J-Y; Daures, J-P

2010-02-01

Many clinical scales contain items that are scored separately prior to being compiled into a single score. However, if the items have different degrees of importance, they should be weighted differently before being compiled. The principal aims of this study were to show how the "analytic hierarchy process" (AHP), which has never been used for this purpose, can be applied to weighting the six items of the "London handicap scale", and to compare the AHP to the "conjoint analysis" (CA), which was previously implemented by Harwood et al. (1994) [1]. In order to assess the relative importance of the six items, we submitted AHP and CA to a group of 10 physiatrists. We compared the methods in terms of item ranking according to importance, assessment of fictitious patients based on weights determined by each method, and perceived difficulty by the physiatrist. For both techniques, "Physical independence" (PHY) was the best-weighted item, but other ranks varied depending on the technique. AHP was better than CA in terms of accuracy (global assessment of the clinical status) and perceived difficulty. AHP may be used to reveal the importance that experts assign to the items of a multidimensional scale, and to calculate the appropriate weights for specific items. For this purpose, AHP seems to be more accurate than CA.
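To make the weighting step concrete, the sketch below derives AHP priority weights from a pairwise comparison matrix. It uses the row geometric-mean approximation rather than the principal-eigenvector computation, which is a simplification chosen here and not necessarily the procedure used in the study. The three-item matrix is hypothetical and does not reproduce the physiatrists' judgments or the six London handicap scale items.

```python
import math


def ahp_weights(pairwise):
    """Approximate AHP priority weights by the row geometric-mean method.

    pairwise[i][j] states how many times more important item i is judged
    to be than item j, so pairwise[j][i] should hold the reciprocal and
    the diagonal should be 1.
    """
    n = len(pairwise)
    geo_means = [math.prod(row) ** (1.0 / n) for row in pairwise]
    total = sum(geo_means)
    return [g / total for g in geo_means]


# Hypothetical judgments for three items (not data from the study).
matrix = [
    [1.0, 3.0, 5.0],
    [1 / 3, 1.0, 2.0],
    [1 / 5, 1 / 2, 1.0],
]
weights = ahp_weights(matrix)
print([round(w, 3) for w in weights])
```

The weights sum to 1, so each can be read as the share of the total score assigned to that item.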
The relationships of coping, negative thinking, life satisfaction, social support, and selected demographics with anxiety of young adult college students.

PubMed

Mahmoud, Jihan S R; Staten, Ruth Topsy; Lennie, Terry A; Hall, Lynne A

2015-05-01

Understanding young adults' anxiety requires applying a multidimensional approach to assess the psychosocial, behavioral, and cognitive aspects of this phenomenon. A hypothesized model of the relationships among coping style, thinking style, life satisfaction, social support, and selected demographics and anxiety among college students was tested using path analysis. A total of 257 undergraduate students aged 18-24 years completed an online survey. The independent variables were measured using the Multidimensional Scale of Perceived Social Support, the Brief Students' Multidimensional Life Satisfaction Scale, the Brief COPE Inventory, the Positive Automatic Thoughts Questionnaire, and the Cognition Checklist-Anxiety. The outcome, anxiety, was measured using the Anxiety subscale of the 21-item Depression Anxiety and Stress Scale. Only negative thinking and maladaptive coping had a direct relationship with anxiety. Negative thinking was the strongest predictor of both maladaptive coping and anxiety. These findings suggest that helping undergraduates manage their anxiety by reducing their negative thinking is critical. Designing and testing interventions to decrease negative thinking in college students is recommended for future research. © 2015 Wiley Periodicals, Inc.
[The patient and family satisfaction with the department of mental health in Rome].

PubMed

Cozza, M; Amara, M; Butera, N; Infantino, G; Monti, A M; Provénzano, R

1997-01-01

To measure the satisfaction of patients and their relatives with Mental Health Services. A satisfaction scale was administered to patients treated in a community-based psychiatric service from 1 January 1996 to 31 March 1996 and to the relatives primarily involved in caring for them. The setting was the ASL Rome "C" community-based psychiatric service. The instrument was the Verona Service Satisfaction Scale-54, a multidimensional instrument that measures satisfaction with community-based psychiatric services. The main results (301 scales for patients, 163 scales for relatives) showed that patients were most satisfied with the technical and interpersonal skills of psychiatrists and psychologists (scores on the specific items > 4). The lowest satisfaction scores concerned the appearance, comfort level and physical layout of the facility (score 2.95) and the response of the service to emergencies during the night, at weekends and on bank holidays (score 2.87). Relatives gave a low rating to the item on help in finding open employment (score 2.76). Furthermore, patients and their relatives gave a negative evaluation of the publicity and information offered by the Mental Health Services. Analysis by dimension reached the same conclusions as the items' average scores. The study shows that patients were more satisfied than relatives. The results point to three aspects that the Mental Health Service should improve in order to meet the demands of patients and relatives: 1. appearance, comfort level and physical layout of the facility, 2. publicity and information, 3. social activities and social skills.
Multidimensional Patient Impression of Change Following Interdisciplinary Pain Management.

PubMed

Gagnon, Christine M; Scholten, Paul; Atchison, James

2018-04-20

To assess patient impression of change following interdisciplinary pain management utilizing a newly developed Multidimensional Patient Impression of Change (MPIC) questionnaire. A heterogeneous group of chronic pain patients (N = 601) participated in an interdisciplinary treatment program. Programs included individual and group therapies (pain psychology, physical therapy, occupational therapy, relaxation training/biofeedback, aerobic conditioning, patient education and medical management). Patients completed measures of pain, mood, coping, physical functioning and pain acceptance both prior to and at completion of their treatment programs. The newly developed MPIC is an expansion to the Patient Global Impression of Change (PGIC) including seven additional domains (Pain, Mood, Sleep, Physical Functioning, Cope with Pain, Manage Pain Flare-ups, and Medication Effectiveness). The MPIC was administered to the patients post-treatment. There were statistically significant pre- to post-treatment improvements found on all outcome measures. The majority of these improvements were significantly correlated with all domains of the MPIC. The original PGIC item was significantly associated with all of the new MPIC domains and the domains were significantly associated with each other; but there were variations in the distribution of responses highlighting variation of perceived improvements among the domains. Our results support the use of the MPIC as a quick and easy post-treatment assessment screening tool. Future research is needed to examine relevant correlates to Medication Effectiveness. This article is protected by copyright. All rights reserved.
Feasibility, Validity, and Reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale for Adults in Inpatients with Severe Obesity

PubMed Central

Manzoni, Gian Mauro; Rossi, Alessandro; Marazzi, Nicoletta; Agosti, Fiorenza; De Col, Alessandra; Pietrabissa, Giada; Castelnuovo, Gianluca; Molinari, Enrico; Sartorio, Allessandro

2018-01-01

Objective: This study aimed to examine the feasibility, validity, and reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale (PedsQL™ MFS) for adult inpatients with severe obesity. Methods: 200 inpatients (81% females) with severe obesity (BMI ≥ 35 kg/m²) completed the PedsQL MFS (General Fatigue, Sleep/Rest Fatigue and Cognitive Fatigue domains), the Fatigue Severity Scale, and the Center for Epidemiologic Studies Depression Scale immediately after admission to a 3-week residential body weight reduction program. A randomized subsample of 48 patients re-completed the PedsQL MFS after 3 days. Results: Confirmatory factor analysis showed that a modified hierarchical model, with two items moved from the Sleep/Rest Fatigue domain to the General Fatigue domain and a second-order latent factor, best fitted the data. Internal consistency and test-retest reliabilities were acceptable to high in all scales, and small to high statistically significant correlations were found with all convergent measures, with the exception of BMI. Significant floor effects were found in two scales (Cognitive Fatigue and Sleep/Rest Fatigue). Conclusion: The Italian modified PedsQL MFS for adults proved to be a valid and reliable tool for the assessment of fatigue in inpatients with severe obesity. Future studies should assess its discriminant validity as well as its responsiveness to weight reduction. PMID:29402854
Systematic review of the multidimensional fatigue symptom inventory-short form.

PubMed

Donovan, Kristine A; Stein, Kevin D; Lee, Morgan; Leach, Corinne R; Ilozumba, Onaedo; Jacobsen, Paul B

2015-01-01

Fatigue is a subjective complaint that is believed to be multifactorial in its etiology and multidimensional in its expression. Fatigue may be experienced by individuals in different dimensions as physical, mental, and emotional tiredness. The purposes of this study were to review and characterize the use of the 30-item Multidimensional Fatigue Symptom Inventory-Short Form (MFSI-SF) in published studies and to evaluate the available evidence for its psychometric properties. A systematic review was conducted to identify published articles reporting results for the MFSI-SF. Data were analyzed to characterize internal consistency reliability of multi-item MFSI-SF scales and test-retest reliability. Correlation coefficients were summarized to characterize concurrent, convergent, and divergent validity. Standardized effect sizes were calculated to characterize the discriminative validity of the MFSI-SF and its sensitivity to change. Seventy articles were identified. Sample sizes reported ranged from 10 to 529 and nearly half consisted exclusively of females. More than half the samples were composed of cancer patients; of those, 59% were breast cancer patients. Mean alpha coefficients for MFSI-SF fatigue subscales ranged from 0.84 for physical fatigue to 0.93 for general fatigue. The MFSI-SF demonstrated moderate test-retest reliability in a small number of studies. Correlations with other fatigue and vitality measures were moderate to large in size and in the expected direction. The MFSI-SF fatigue subscales were positively correlated with measures of distress, depressive, and anxious symptoms. Effect sizes for discriminative validity ranged from medium to large, while effect sizes for sensitivity to change ranged from small to large. Findings demonstrate the positive psychometric properties of the MFSI-SF, provide evidence for its usefulness in medically ill and nonmedically ill individuals, and support its use in future studies.
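The alpha coefficients summarized in this review are measures of internal consistency. As a hedged illustration of how such a coefficient is typically obtained, the sketch below computes Cronbach's alpha from a respondents-by-items score table. The data are hypothetical and are not MFSI-SF responses, and the function name is an assumption.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: five respondents, three items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 1],
    [3, 3, 4],
    [5, 4, 5],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Alpha rises when the items covary strongly relative to their individual variances, which is why it is read as evidence that the items of a subscale measure a common construct.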
A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function.

PubMed

Wilson, Stephen M; Eriksson, Dana K; Schneck, Sarah M; Lucanie, Jillian M

2018-01-01

This paper describes a quick aphasia battery (QAB) that aims to provide a reliable and multidimensional assessment of language function in about a quarter of an hour, bridging the gap between comprehensive batteries that are time-consuming to administer, and rapid screening instruments that provide limited detail regarding individual profiles of deficits. The QAB is made up of eight subtests, each comprising sets of items that probe different language domains, vary in difficulty, and are scored with a graded system to maximize the informativeness of each item. From the eight subtests, eight summary measures are derived, which constitute a multidimensional profile of language function, quantifying strengths and weaknesses across core language domains. The QAB was administered to 28 individuals with acute stroke and aphasia, 25 individuals with acute stroke but no aphasia, 16 individuals with chronic post-stroke aphasia, and 14 healthy controls. The patients with chronic post-stroke aphasia were tested 3 times each and scored independently by 2 raters to establish test-retest and inter-rater reliability. The Western Aphasia Battery (WAB) was also administered to these patients to assess concurrent validity. We found that all QAB summary measures were sensitive to aphasic deficits in the two groups with aphasia. All measures showed good or excellent test-retest reliability (overall summary measure: intraclass correlation coefficient (ICC) = 0.98), and excellent inter-rater reliability (overall summary measure: ICC = 0.99). Sensitivity and specificity for diagnosis of aphasia (relative to clinical impression) were 0.91 and 0.95 respectively. All QAB measures were highly correlated with corresponding WAB measures where available. Individual patients showed distinct profiles of spared and impaired function across different language domains. In sum, the QAB efficiently and reliably characterized individual profiles of language deficits.
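Sensitivity and specificity, as reported in this abstract, are proportions computed from a two-by-two table of test result against reference diagnosis (here, clinical impression). A minimal sketch with hypothetical counts, which are not the QAB study data:

```python
def sensitivity_specificity(tp: int, fn: int, fp: int, tn: int):
    """Return (sensitivity, specificity) from confusion-matrix counts.

    tp/fn count reference-positive cases the test did / did not detect;
    tn/fp count reference-negative cases the test did / did not clear.
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    return sensitivity, specificity


# Hypothetical counts chosen for illustration only.
sens, spec = sensitivity_specificity(tp=45, fn=5, fp=10, tn=40)
print(f"sensitivity={sens:.2f} specificity={spec:.2f}")
```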
A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function

PubMed Central

Eriksson, Dana K.; Schneck, Sarah M.; Lucanie, Jillian M.

2018-01-01

This paper describes a quick aphasia battery (QAB) that aims to provide a reliable and multidimensional assessment of language function in about a quarter of an hour, bridging the gap between comprehensive batteries that are time-consuming to administer, and rapid screening instruments that provide limited detail regarding individual profiles of deficits. The QAB is made up of eight subtests, each comprising sets of items that probe different language domains, vary in difficulty, and are scored with a graded system to maximize the informativeness of each item. From the eight subtests, eight summary measures are derived, which constitute a multidimensional profile of language function, quantifying strengths and weaknesses across core language domains. The QAB was administered to 28 individuals with acute stroke and aphasia, 25 individuals with acute stroke but no aphasia, 16 individuals with chronic post-stroke aphasia, and 14 healthy controls. The patients with chronic post-stroke aphasia were tested 3 times each and scored independently by 2 raters to establish test-retest and inter-rater reliability. The Western Aphasia Battery (WAB) was also administered to these patients to assess concurrent validity. We found that all QAB summary measures were sensitive to aphasic deficits in the two groups with aphasia. All measures showed good or excellent test-retest reliability (overall summary measure: intraclass correlation coefficient (ICC) = 0.98), and excellent inter-rater reliability (overall summary measure: ICC = 0.99). Sensitivity and specificity for diagnosis of aphasia (relative to clinical impression) were 0.91 and 0.95 respectively. All QAB measures were highly correlated with corresponding WAB measures where available. Individual patients showed distinct profiles of spared and impaired function across different language domains. In sum, the QAB efficiently and reliably characterized individual profiles of language deficits. PMID:29425241
Adaptation and psychometric properties of the ISPCAN Child Abuse Screening Tool for use in trials (ICAST-Trial) among South African adolescents and their primary caregivers.

PubMed

Meinck, Franziska; Boyes, Mark E; Cluver, Lucie; Ward, Catherine L; Schmidt, Peter; DeStone, Sachin; Dunne, Michael P

2018-05-31

Child abuse prevention research has been hampered by a lack of validated multi-dimensional non-proprietary instruments, sensitive enough to measure change in abuse victimization or behavior. This study aimed to adapt the ICAST child abuse self-report measure (parent and child) for use in intervention studies and to investigate the psychometric properties of this substantially modified tool in a South African sample. First, cross-cultural and sensitivity adaptation of the original ICAST tools resulted in two preliminary measures (ICAST-Trial adolescents: 27 items, ICAST-Trial caregivers: 19 items). Second, ICAST-Trial data from a cluster randomized trial of a parenting intervention for families with adolescents (N = 1104, 552 caregiver-adolescent dyads) was analyzed. Confirmatory factor analysis established the hypothesized 6-factor (adolescents) and 4-factor (caregivers) structure. Removal of two items for adolescents and five for caregivers resulted in adequate model fit. Concurrent criterion validity analysis confirmed hypothesized relationships between child abuse and adolescent and caregiver mental health, adolescent behavior, discipline techniques and caregiver childhood abuse history. The resulting ICAST-Trial measures have 25 (adolescent) and 14 (caregiver) items respectively and measure physical, emotional and contact sexual abuse, neglect (both versions), and witnessing intimate partner violence and sexual harassment (adolescent version). The study established that both tools are sensitive to measuring change over time in response to a parenting intervention. The ICAST-Trial should have utility for evaluating the effectiveness of child abuse prevention efforts in similar socioeconomic contexts. Further research is needed to replicate these findings and examine cultural appropriateness, barriers for disclosure, and willingness to engage in child abuse research. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
Comparison of SF-36 vitality scale and Fatigue Symptom Inventory in assessing cancer-related fatigue.

PubMed

Brown, Linda F; Kroenke, Kurt; Theobald, Dale E; Wu, Jingwei

2011-08-01

Cancer-related fatigue (CRF) is an important symptom in clinical practice and research. The best way to measure it, however, remains unsettled. The SF-36 vitality scale, a general measure of energy/fatigue, is a frequently cited measure. With only four items, however, its ability to adequately represent multiple CRF facets has been questioned. The 13-item Fatigue Symptom Inventory (FSI) was developed to assess multidimensional aspects of CRF. Our objectives were to assess the convergent validity and to compare the sensitivity to change of the two scales. We administered both scales at 1 month (n = 68) and 6 months (n = 96) to a subset of heterogeneous patients receiving treatment in 16 cancer centers who were enrolled in a clinical trial of pain and depression. Distributions of standardized response means (SRMs) were compared to assess sensitivity to change. Results of both scales were compared to scores on a single fatigue item from the Patient Health Questionnaire (PHQ). Mean scores for both the FSI and the vitality scale demonstrated clinically significant fatigue in the sample. The vitality scale was strongly correlated with all three FSI scales (r = -0.68 to -0.77). The vitality and FSI scales also correlated strongly with the PHQ fatigue item. Moreover, distributions of SRMs for both scales were approximately normal. Both the FSI and the vitality scale are supported as valid measures of CRF. Both demonstrated sensitivity to change across a range of effect sizes. The vitality scale may be an excellent choice when brevity is paramount; the FSI may be more appropriate when tapping specific dimensions is warranted.
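The standardized response mean (SRM) used in this abstract to compare sensitivity to change is conventionally the mean change score divided by the standard deviation of the change scores. A minimal sketch under that conventional definition, with hypothetical scores that are not the trial data:

```python
from statistics import mean, stdev


def standardized_response_mean(baseline, follow_up):
    """SRM: mean of the paired change scores divided by their sample SD."""
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical paired fatigue scores for five patients.
baseline = [5, 6, 4, 7, 5]
follow_up = [4, 4, 3, 6, 3]
srm = standardized_response_mean(baseline, follow_up)
print(round(srm, 2))
```

The sign of the SRM follows the scoring direction of the scale, so a negative value here simply reflects lower scores at follow-up.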
Multicultural Mastery Scale for youth: multidimensional assessment of culturally mediated coping strategies.

PubMed

Fok, Carlotta Ching Ting; Allen, James; Henry, David; Mohatt, Gerald V

2012-06-01

Self-mastery refers to problem-focused coping facilitated through personal agency. Communal mastery describes problem solving through an interwoven social network. This study investigates an adaptation of self- and communal mastery measures for youth. Given the important distinction between family and peers in the lives of youth, these adaptation efforts produced Mastery-Family and Mastery-Friends subscales, along with a Mastery-Self subscale. We tested these measures for psychometric properties and internal structure with 284 predominately Yup'ik Eskimo Alaska Native adolescents (12- to 18-year-olds) from rural, remote communities, a non-Western culturally distinct group hypothesized to display higher levels of collectivism and communal mastery. Results demonstrate a subset of items adapted for youth function satisfactorily, a 3-response alternative format provided meaningful information, and the subscale's underlying structure is best described through 3 distinct first-order factors organized under 1 higher order mastery factor. (c) 2012 APA, all rights reserved
Multicultural Mastery Scale for Youth: Multidimensional Assessment of Culturally Mediated Coping Strategies

PubMed Central

Fok, Carlotta Ching Ting; Allen, James; Henry, David; Mohatt, Gerald V.

2012-01-01

Self-mastery refers to problem-focused coping facilitated through personal agency. Communal mastery describes problem solving through an interwoven social network. This study investigates an adaptation of self- and communal mastery measures for youth. Given the important distinction between family and peers in the lives of youth, these adaptation efforts produced Mastery-Family and Mastery-Friends subscales, along with a Mastery-Self subscale. We tested these measures for psychometric properties and internal structure with 284 12 to 18-year-old predominately Yup’ik Eskimo Alaska Native adolescents from rural, remote communities — a non-Western culturally distinct group hypothesized to display higher levels of collectivism and communal mastery. Results demonstrate a subset of items adapted for youth function satisfactorily, a three-response alternative format provided meaningful information, and the subscale’s underlying structure is best described through three distinct first-order factors organized under one higher order mastery factor. PMID:21928912
Refinement and initial validation of a multidimensional composite scale for use in assessing acute postoperative pain in cats.

PubMed

Brondani, Juliana Tabarelli; Luna, Stelio Pacca Loureiro; Padovani, Carlos Roberto

2011-02-01

To refine and test construct validity and reliability of a composite pain scale for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy. 40 cats that underwent ovariohysterectomy in a previous study. In a previous randomized, double-blind, placebo-controlled study, a composite pain scale was developed to assess postoperative pain in cats that received a placebo or an analgesic (tramadol, vedaprofen, or tramadol-vedaprofen combination). In the present study, the scale was refined via item analysis (distribution frequency and occurrence), a nonparametric ANOVA, and item-to-total score correlation. Construct validity was assessed via factor analysis and known-groups discrimination, and reliability was measured by assessing internal consistency. Respiratory rate and respiratory pattern were rejected after item analysis. Factor analysis resulted in 5 dimensions (F1 [psychomotor change], posture, comfort, activity, mental status, and miscellaneous behaviors; F2 [protection of wound area], reaction to palpation of the surgical wound and palpation of the abdomen and flank; F3 [physiologic variables], systolic arterial blood pressure and appetite; F4 [vocal expression of pain], vocalization; and F5 [heart rate]). Internal consistency was excellent for the overall scale and for F1, F2, and F3; very good for F4; and unacceptable for F5. Except for heart rate, the identified factors and scale total score could be used to detect differences between the analgesic and placebo groups and differences among the analgesic treatments. Results provided initial evidence of construct validity and reliability of a multidimensional composite tool for use in assessing acute postoperative pain in cats undergoing ovariohysterectomy.
Dimensions of insight in schizophrenia: Exploratory factor analysis of items from multiple self- and interviewer-rated measures of insight.

PubMed

Konsztowicz, Susanna; Schmitz, Norbert; Lepage, Martin

2018-03-10

Insight in schizophrenia is regarded as a multidimensional construct that comprises aspects such as awareness of the disorder and recognition of the need for treatment. The proposed number of underlying dimensions of insight is variable in the literature. In an effort to identify a range of existing dimensions of insight, we conducted a factor analysis on combined items from multiple measures of insight. We recruited 165 participants with enduring schizophrenia (treated for >3 years). Exploratory factor analysis was conducted on itemized scores from two interviewer-rated measures of insight: the Schedule for the Assessment of Insight-Expanded and the abbreviated Scale to assess Unawareness of Mental Disorder; and two self-report measures: the Birchwood Insight Scale and the Beck Cognitive Insight Scale. A five-factor solution was selected as the best-fitting model, with the following dimensions of insight: 1) awareness of illness and the need for treatment; 2) awareness and attribution of symptoms and consequences; 3) self-certainty; 4) self-reflectiveness for objectivity and fallibility; and 5) self-reflectiveness for errors in reasoning and openness to feedback. Insight in schizophrenia is a multidimensional construct comprised of distinct clinical and cognitive domains of awareness. Multiple measures of insight, both clinician- and self-rated, are needed to capture all of the existing dimensions of insight. Future exploration of associations between the various dimensions and their potential determinants will facilitate the development of clinically useful models of insight and effective interventions to improve outcome. Copyright © 2018 Elsevier B.V. All rights reserved.
Development of Elderly Quality of Life Index – Eqoli: Item Reduction and Distribution into Dimensions

PubMed Central

Paschoal, Sérgio Márcio Pacheco; Filho, Wilson Jacob; Litvoc, Júlio

2008-01-01

OBJECTIVE To describe item reduction and its distribution into dimensions in the construction process of a quality of life evaluation instrument for the elderly. METHODS The sampling method was chosen by convenience through quotas, with selection of elderly subjects from four programs to achieve heterogeneity in the “health status”, “functional capacity”, “gender”, and “age” variables. The Clinical Impact Method was used, consisting of the spontaneous and elicited selection by the respondents of relevant items to the construct Quality of Life in Old Age from a previously elaborated item pool. The respondents rated each item’s importance using a 5-point Likert scale. The product of the proportion of elderly selecting the item as relevant (frequency) and the mean importance score they attributed to it (importance) represented the overall impact of that item in their quality of life (impact). The items were ordered according to their impact scores and the top 46 scoring items were grouped in dimensions by three experts. A review of the negative items was performed. RESULTS One hundred and ninety three people (122 women and 71 men) were interviewed. Experts distributed the 46 items into eight dimensions. Closely related items were grouped and dimensions not reaching the minimum expected number of items received additional items resulting in eight dimensions and 43 items. DISCUSSION The sample was heterogeneous and similar to what was expected. The dimensions and items demonstrated the multidimensionality of the construct. The Clinical Impact Method was appropriate to construct the instrument, which was named Elderly Quality of Life Index - EQoLI. An accuracy process will be examined in the future. PMID:18438571
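The impact score described in the abstract above (proportion of respondents selecting an item, multiplied by the mean importance those respondents gave it) can be sketched as follows. This is a minimal illustration under stated assumptions: the item names, ratings, and function names are hypothetical and do not come from the EQoLI item pool.

```python
def impact_scores(selections, n_respondents):
    """Impact = frequency x importance for each item.

    selections maps an item to the list of 1-5 importance ratings given by
    the respondents who selected that item as relevant.
    """
    scores = {}
    for item, ratings in selections.items():
        if not ratings:  # nobody selected the item
            scores[item] = 0.0
            continue
        frequency = len(ratings) / n_respondents   # proportion selecting the item
        importance = sum(ratings) / len(ratings)   # mean rating among selectors
        scores[item] = frequency * importance
    return scores


def top_items(scores, k):
    """Return the k items with the highest impact, best first."""
    return sorted(scores, key=scores.get, reverse=True)[:k]


# Hypothetical data for 10 respondents (illustrative only).
selections = {
    "good health": [5, 5, 4, 5, 5, 4, 5, 5],
    "family support": [4, 4, 5, 3, 4],
    "hobbies": [3, 2],
}
scores = impact_scores(selections, n_respondents=10)
ranked = top_items(scores, k=2)
print(scores, ranked)
```

In the study the same ordering step was used to keep the 46 highest-impact items; here `k=2` is only a placeholder.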
[A methodological approach to assessing the quality of medical health information on its way from science to the mass media].

PubMed

Serong, Julia; Anhäuser, Marcus; Wormer, Holger

2015-01-01

A current research project deals with the question of how the quality of medical health information changes on its way from the academic journal via press releases to the news media. In an exploratory study a sample of 30 news items has been selected stage-by-stage from an adjusted total sample of 1,695 journalistic news items on medical research in 2013. Using a multidimensional set of criteria the news items as well as the corresponding academic articles, abstracts and press releases are examined by science journalists and medical experts. Together with a content analysis of the expert assessments, it will be verified to what extent established quality standards for medical journalism can be applied to medical health communication and public relations or even to studies and abstracts as well. Copyright © 2015. Published by Elsevier GmbH.
The Stahl Multidimensional Inventory of Values and Attitudes (SMIVA): A Report on the Development of an Instrument to Measure the Effects of One Approach to Values Education.

ERIC Educational Resources Information Center

Stahl, Robert J.

1986-01-01

Reports the steps taken to develop a satisfactory group measure of the Casteel-Stahl model of cognitive-affect-process education. The resulting 60-item Likert format instrument measures a wide array of instructional outcomes, from empathy, communications, decision making, problem solving and personal consistency to acceptance of self and…
An Application of a Multidimensional Extension of the Two-Parameter Logistic Latent Trait Model.

DTIC Science & Technology

1983-08-01

theory, models, technical issues, and applications. Review of Educational Research, 1978, 48, 467-510. Marco, G. L. Item characteristic curve...solutions to three intractable testing problems. Journal of Educational Measurement, 1977, 14, 139-160. McKinley, R. L. and Reckase, M. D. A successful...application of latent trait theory to tailored achievement testing (Research Report 80-1). Columbia: University of Missouri, Department of Educational

Psychometric properties of the multidimensional fatigue inventory in Brazilian Hodgkin's lymphoma survivors.

PubMed

Baptista, Renata Lyrio R; Biasoli, Irene; Scheliga, Adriana; Soares, Andrea; Brabo, Eloa; Morais, José Carlos; Werneck, Guilherme Loureiro; Spector, Nelson

2012-12-01

Fatigue is the most common symptom among Hodgkin's lymphoma survivors. To evaluate the psychometric properties of the Brazilian version of the Multidimensional Fatigue Inventory (MFI). The MFI was translated into Brazilian Portuguese using established forward-backward translation procedures, and the psychometric properties were evaluated in a sample of 200 Hodgkin's lymphoma survivors. The psychometric properties evaluated included internal consistency and construct validity. The MFI was administered along with the informed consent form. The overall Cronbach's alpha coefficient for the 20 items was 0.84, ranging from 0.59 to 0.81 for each of the five scales. Correlations between items and scales ranged from 0.32 to 0.72. The factor analysis yielded a five-factor solution that explained 65% of the variance. The first factor merged the original "general fatigue" and "physical fatigue" scales, as has been previously reported. The second factor identified the original "mental fatigue" scale and the fifth factor identified the original "reduced activity" scale. Questions from the original "reduced motivation" scale were represented in both factors three and four. The Brazilian version of the MFI showed satisfactory psychometric properties and can be considered a valid research tool for assessing cancer-related fatigue. Copyright © 2012 U.S. Cancer Pain Relief Committee. Published by Elsevier Inc. All rights reserved.
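Internal consistency in the abstract above is reported as Cronbach's alpha. The sketch below shows the standard formula, alpha = k/(k-1) × (1 − sum of item variances / variance of total scores), on a small made-up response matrix. The data and function name are hypothetical and unrelated to the MFI sample.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(responses[0])
    if k < 2 or any(len(row) != k for row in responses):
        raise ValueError("need at least two items and equal-length rows")
    item_columns = list(zip(*responses))                  # one tuple per item
    item_variance_sum = sum(variance(col) for col in item_columns)
    total_variance = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - item_variance_sum / total_variance)


# Hypothetical scores: five respondents, three items (illustrative only).
responses = [
    [3, 4, 3],
    [2, 2, 3],
    [5, 4, 4],
    [1, 2, 1],
    [4, 5, 5],
]
alpha = cronbach_alpha(responses)
print(f"alpha = {alpha:.3f}")
```

The sample variance (n − 1 denominator) is used for both the items and the totals; using the population variance for both gives the same alpha.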
Development, content validity, and piloting of an instrument designed to measure managers' attitude toward workplace breastfeeding support.

PubMed

Chow, Tan; Wolfe, Edward W; Olson, Beth H

2012-07-01

Manager attitude is influential in female employees' perceptions of workplace breastfeeding support. Currently, no instrument is available to assess manager attitude toward supporting women who wish to combine breastfeeding with work. We developed and piloted an instrument to measure manager attitudes toward workplace breastfeeding support entitled the "Managers' Attitude Toward Breastfeeding Support Questionnaire," an instrument that measures four constructs using 60 items that are rated agree/disagree on a 4-point Likert rating scale. We established the content validity of the Managers' Attitude Toward Breastfeeding Support Questionnaire measures through expert content review (n=22), expert assessment of item fit (n=11), and cognitive interviews (n=8). Data were collected from a purposive sample of 185 front-line managers who had experience supervising female employees, and responses were scaled using the Multidimensional Random Coefficients Multinomial Logit Model. Dimensionality analyses supported the proposed four-construct model. Reliability ranged from 0.75 to 0.86, and correlations between the constructs were moderately strong (0.47 to 0.71). Four items in two constructs exhibited model-to-data misfit and/or a low score-measure correlation. One item was revised and the other three items were retained in the Managers' Attitude Toward Breastfeeding Support Questionnaire. Findings of this study suggest that the Managers' Attitude Toward Breastfeeding Support Questionnaire measures are reliable and valid indicators of manager attitude toward workplace breastfeeding support, and future research should be conducted to establish external validity. The Managers' Attitude Toward Breastfeeding Support Questionnaire could be used to collect data in a standardized manner within and across companies to measure and compare manager attitudes toward supporting breastfeeding. Organizations can subsequently develop targeted strategies to improve support for breastfeeding employees through efforts influencing managerial attitude. Copyright © 2012 Academy of Nutrition and Dietetics. Published by Elsevier Inc. All rights reserved.
Development and Validation of the Consumer Health Activation Index.

PubMed

Wolf, Michael S; Smith, Samuel G; Pandit, Anjali U; Condon, David M; Curtis, Laura M; Griffith, James; O'Conor, Rachel; Rush, Steven; Bailey, Stacy C; Kaplan, Gordon; Haufle, Vincent; Martin, David

2018-04-01

Although there has been increasing interest in patient engagement, few measures are publicly available and suitable for patients with limited health literacy. We sought to develop a Consumer Health Activation Index (CHAI) for use among diverse patients. Expert opinion, a systematic literature review, focus groups, and cognitive interviews with patients were used to create and revise a potential set of items. Psychometric testing guided by item response theory was then conducted among 301 English-speaking, community-dwelling adults. This included differential item functioning analyses to evaluate item performance across participant health literacy levels. To determine construct validity, CHAI scores were compared to scales measuring similar personality constructs. Associations between the CHAI and physical and mental health established predictive validity. A second study among 9,478 adults was used to confirm CHAI associations with health outcomes. Exploratory factor analyses revealed a single-factor solution with a 10-item scale. The CHAI showed good internal consistency (alpha = 0.81) and moderate test-retest reliability (ICC = 0.53). Reading grade level was found to be at the 6th grade. Moderate to strong correlations were found with similar constructs (Multidimensional Health Locus of Control, r = 0.38, P < 0.001; Conscientiousness, r = 0.41, P < 0.001). Predictive validity was demonstrated through associations with functional health status measures (depression, r = -0.28, P < 0.001; anxiety, r = -0.22, P < 0.001; and physical functioning, r = 0.22, P < 0.001). In the validation sample, the CHAI was significantly associated with self-reported physical and mental health (r = 0.31 and 0.32, respectively; both P < 0.001). The CHAI appears to be a valid, reliable, and easily administered tool that can be used to assess health activation among adults, including those with limited health literacy. Future studies should test the tool in actual use and explore further applications.
A Mixed Effects Randomized Item Response Model

ERIC Educational Resources Information Center

Fox, J.-P.; Wyrick, Cheryl

2008-01-01

The randomized response technique ensures that individual item responses, denoted as true item responses, are randomized before observing them and so-called randomized item responses are observed. A relationship is specified between randomized item response data and true item response data. True item response data are modeled with a (non)linear…
Disparity between General Symptom Relief and Remission Criteria in the Positive and Negative Syndrome Scale (PANSS): A Post-treatment Bifactor Item Response Theory Model.

PubMed

Anderson, Ariana E; Reise, Steven P; Marder, Stephen R; Mansolf, Maxwell; Han, Carol; Bilder, Robert M

2017-12-01

Objective: Total scale scores derived by summing ratings from the 30-item PANSS are commonly used in clinical trial research to measure overall symptom severity, and percentage reductions in the total scores are sometimes used to document the efficacy of treatment. Acknowledging that some patients may have substantial changes in PANSS total scores but still be sufficiently symptomatic to warrant diagnosis, ratings on a subset of 8 items, referred to here as the "Remission set," are sometimes used to determine if patients' symptoms no longer satisfy diagnostic criteria. An unanswered question remains: is the goal of treatment better conceptualized as reduction in overall symptom severity, or reduction in symptoms below the threshold for diagnosis? We evaluated the psychometric properties of PANSS total scores, to assess whether having low symptom severity post-treatment is equivalent to attaining Remission. Design: We applied a bifactor item response theory (IRT) model to post-treatment PANSS ratings of 3,647 subjects diagnosed with schizophrenia assessed at the termination of 11 clinical trials. The bifactor model specified one general dimension to reflect overall symptom severity, and five domain-specific dimensions. We assessed how PANSS item discrimination and information parameters varied across the range of overall symptom severity (θ), with a special focus on low levels of symptoms (i.e., θ<-1), which we refer to as "Relief" from symptoms. A score of θ=-1 corresponds to an expected PANSS item score of 1.83, a rating between "Absent" and "Minimal" for a PANSS symptom. Results: The application of the bifactor IRT model revealed: (1) 88% of total score variation was attributable to variation in general symptom severity, and only 8% reflected secondary domain factors. This implies that a general factor may provide a good indicator of symptom severity, and that interpretation is not overly complicated by multidimensionality; (2) Post-treatment, 534 individuals (about 15% of the whole sample) scored in the "Relief" range of general symptom severity, but more than twice that number (n = 1351) satisfied Remission criteria (37%). Two in three Remitted patients had scores that were not in a low symptom range (corresponding to Absent or Minimal item scores); (3) PANSS items vary greatly in their ability to measure the general symptom severity dimension; while many items are highly discriminating and relatively "pure" indicators of general symptom severity (delusions, conceptual disorganization), others are better indicators of specific dimensions (blunted affect, depression). The utility of a given PANSS item for assessing a patient depended on the illness level of the patient. Conclusion: Satisfying conventional Remission criteria was not strongly associated with low levels of symptoms. The items providing the most information for patients in the symptom Relief range were Delusions, Preoccupation, Suspiciousness/Persecution, Unusual Thought Content, Conceptual Disorganization, Stereotyped Thinking, Active Social Avoidance, and Lack of Judgment and Insight. Lower scores on these items (item scores ≤2) were strongly associated with having a low latent trait θ or experiencing overall symptom relief. The inter-rater agreement between Remission and Relief subjects suggested that these criteria identified different subsets of patients. Alternative subsets of items may offer better indicators of general symptom severity and provide better discrimination (and lower standard errors) for scaling individuals and judging symptom relief, where the "best" subset of items ultimately depends on the illness range and treatment phase being evaluated.
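The point that an item's usefulness depends on the patient's position on θ can be illustrated with item information curves. The sketch below is a deliberate simplification: it uses a dichotomous two-parameter logistic (2PL) item, whereas PANSS items are rated on several ordered categories and the study fitted a bifactor model. The parameter values are hypothetical.

```python
import math


def p_2pl(theta, a, b):
    """Probability of endorsing a 2PL item with discrimination a and location b."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at theta: a^2 * P * (1 - P)."""
    p = p_2pl(theta, a, b)
    return a * a * p * (1.0 - p)


# Two hypothetical items evaluated at theta = -1 (a low-severity point).
info_item_near = item_information(-1.0, a=2.0, b=-1.0)  # located at theta = -1
info_item_far = item_information(-1.0, a=0.8, b=1.0)    # located at higher severity
print(info_item_near, info_item_far)
```

Under these assumed parameters, the highly discriminating item located near θ = −1 carries far more information at that severity level than the weaker item located elsewhere, which is the general pattern the abstract describes.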
Development and Application of a Test for Food-Induced Emotions.

PubMed

Geier, Uwe; Büssing, Arndt; Kruse, Pamela; Greiner, Ramona; Buchecker, Kirsten

2016-01-01

This study aimed to develop a test to measure food-induced emotions suitable for stable food and beverages. All of the experiments were conducted under the conditions of a consumer sensory evaluation according to German standard DIN 10974. Test development included descriptors' derivation and factor analysis as well as a comparison between the new test (empathic food test, EFT) and a hedonic sensory test and an unspecific psychological test, known as a multidimensional mood questionnaire (MDMQ). Nineteen sensory experts derived twelve items using free-choice profiling. After an exploratory factor analyses, ten of the intended twelve items were integrated into two scales. To compare the new questionnaire (EFT) to the MDMQ and a hedonic test, panels of 59 (EFT), 64 (MDMQ) and 63 (hedonic sensory test) untrained individuals described their perceptions after consuming sensorially similar pairs of milk, water, bread and sugar. The benchmark of comparison was the power to discriminate between the food pairs. Test-retest replicability was demonstrated. All three tests presented slight differences in sample preference and effect size depending on the offered products. These findings underscore the need to test new methods with a wide range of products. Further research is needed to investigate the relationship between sensorial perception and emotional response.
Development and Application of a Test for Food-Induced Emotions

PubMed Central

Geier, Uwe; Büssing, Arndt; Kruse, Pamela; Greiner, Ramona; Buchecker, Kirsten

2016-01-01

This study aimed to develop a test to measure food-induced emotions suitable for stable food and beverages. All of the experiments were conducted under the conditions of a consumer sensory evaluation according to German standard DIN 10974. Test development included descriptors’ derivation and factor analysis as well as a comparison between the new test (empathic food test, EFT) and a hedonic sensory test and an unspecific psychological test, known as a multidimensional mood questionnaire (MDMQ). Nineteen sensory experts derived twelve items using free-choice profiling. After an exploratory factor analyses, ten of the intended twelve items were integrated into two scales. To compare the new questionnaire (EFT) to the MDMQ and a hedonic test, panels of 59 (EFT), 64 (MDMQ) and 63 (hedonic sensory test) untrained individuals described their perceptions after consuming sensorially similar pairs of milk, water, bread and sugar. The benchmark of comparison was the power to discriminate between the food pairs. Test-retest replicability was demonstrated. All three tests presented slight differences in sample preference and effect size depending on the offered products. These findings underscore the need to test new methods with a wide range of products. Further research is needed to investigate the relationship between sensorial perception and emotional response. PMID:27861503
Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations.

PubMed

Bauer, Greta R; Braimoh, Jessica; Scheim, Ayden I; Dharma, Christoffer

2017-01-01

Given that an estimated 0.6% of the U.S. population is transgender (trans) and that large health disparities for this population have been documented, government and research organizations are increasingly expanding measures of sex/gender to be trans inclusive. Options suggested for trans community surveys, such as expansive check-all-that-apply gender identity lists and write-in options that offer maximum flexibility, are generally not appropriate for broad population surveys. These require limited questions and a small number of categories for analysis. Limited evaluation has been undertaken of trans-inclusive population survey measures for sex/gender, including those currently in use. Using an internet survey and follow-up of 311 participants, and cognitive interviews from a maximum-diversity sub-sample (n = 79), we conducted a mixed-methods evaluation of two existing measures: a two-step question developed in the United States and a multidimensional measure developed in Canada. We found very low levels of item missingness, and no indicators of confusion on the part of cisgender (non-trans) participants for both measures. However, a majority of interview participants indicated problems with each question item set. Agreement between the two measures in assessment of gender identity was very high (K = 0.9081), but gender identity was a poor proxy for other dimensions of sex or gender among trans participants. Issues to inform measure development or adaptation that emerged from analysis included dimensions of sex/gender measured, whether non-binary identities were trans, Indigenous and cultural identities, proxy reporting, temporality concerns, and the inability of a single item to provide a valid measure of sex/gender. 
Based on this evaluation, we recommend that population surveys meant for multi-purpose analysis consider a new Multidimensional Sex/Gender Measure for testing that includes three simple items (one asked only of a small sub-group) to assess gender identity and lived gender, with optional additions. We provide considerations for adaptation of this measure to different contexts.
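The agreement statistic reported above (K) appears to be a kappa coefficient. Assuming Cohen's kappa for two categorical classifications of the same participants, a minimal sketch looks like this. The category labels and data are hypothetical placeholders, not study data.

```python
from collections import Counter


def cohens_kappa(ratings_a, ratings_b):
    """Cohen's kappa: chance-corrected agreement between two classifications."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("need two non-empty, equal-length sequences")
    n = len(ratings_a)
    observed = sum(a == b for a, b in zip(ratings_a, ratings_b)) / n
    count_a, count_b = Counter(ratings_a), Counter(ratings_b)
    # Agreement expected by chance from the marginal category proportions.
    expected = sum(count_a[c] * count_b[c] for c in count_a) / (n * n)
    return (observed - expected) / (1 - expected)


# Hypothetical classifications of ten participants by two measures.
measure_1 = ["A", "A", "B", "B", "C", "A", "B", "C", "A", "B"]
measure_2 = ["A", "A", "B", "B", "C", "A", "B", "C", "A", "C"]
kappa = cohens_kappa(measure_1, measure_2)
print(f"kappa = {kappa:.4f}")
```

The function is undefined when chance agreement equals 1 (both measures put everyone in a single category); a production version would need to handle that case.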
Transgender-inclusive measures of sex/gender for population surveys: Mixed-methods evaluation and recommendations

PubMed Central

Bauer, Greta R.; Braimoh, Jessica; Scheim, Ayden I.; Dharma, Christoffer

2017-01-01

Given that an estimated 0.6% of the U.S. population is transgender (trans) and that large health disparities for this population have been documented, government and research organizations are increasingly expanding measures of sex/gender to be trans inclusive. Options suggested for trans community surveys, such as expansive check-all-that-apply gender identity lists and write-in options that offer maximum flexibility, are generally not appropriate for broad population surveys. These require limited questions and a small number of categories for analysis. Limited evaluation has been undertaken of trans-inclusive population survey measures for sex/gender, including those currently in use. Using an internet survey and follow-up of 311 participants, and cognitive interviews from a maximum-diversity sub-sample (n = 79), we conducted a mixed-methods evaluation of two existing measures: a two-step question developed in the United States and a multidimensional measure developed in Canada. We found very low levels of item missingness, and no indicators of confusion on the part of cisgender (non-trans) participants for both measures. However, a majority of interview participants indicated problems with each question item set. Agreement between the two measures in assessment of gender identity was very high (K = 0.9081), but gender identity was a poor proxy for other dimensions of sex or gender among trans participants. Issues to inform measure development or adaptation that emerged from analysis included dimensions of sex/gender measured, whether non-binary identities were trans, Indigenous and cultural identities, proxy reporting, temporality concerns, and the inability of a single item to provide a valid measure of sex/gender. 
Based on this evaluation, we recommend that population surveys meant for multi-purpose analysis consider a new Multidimensional Sex/Gender Measure for testing that includes three simple items (one asked only of a small sub-group) to assess gender identity and lived gender, with optional additions. We provide considerations for adaptation of this measure to different contexts. PMID:28542498
An evaluation of scanpath-comparison and machine-learning classification algorithms used to study the dynamics of analogy making.

PubMed

French, Robert M; Glady, Yannick; Thibaut, Jean-Pierre

2017-08-01

In recent years, eyetracking has begun to be used to study the dynamics of analogy making. Numerous scanpath-comparison algorithms and machine-learning techniques are available that can be applied to the raw eyetracking data. We show how scanpath-comparison algorithms, combined with multidimensional scaling and a classification algorithm, can be used to resolve an outstanding question in analogy making, namely, whether or not children's and adults' strategies in solving analogy problems are different. (They are.) We show which of these scanpath-comparison algorithms is best suited to the kinds of analogy problems that have formed the basis of much analogy-making research over the years. Furthermore, we use machine-learning classification algorithms to examine the item-to-item saccade vectors making up these scanpaths. We show which of these algorithms best predicts, from very early on in a trial, on the basis of the frequency of various item-to-item saccades, whether a child or an adult is doing the problem. This type of analysis can also be used to predict, on the basis of the item-to-item saccade dynamics in the first third of a trial, whether or not a problem will be solved correctly.
Food parenting practices for 5 to 12 year old children: a concept map analysis of parenting and nutrition experts input.

PubMed

O'Connor, Teresia M; Mâsse, Louise C; Tu, Andrew W; Watts, Allison W; Hughes, Sheryl O; Beauchamp, Mark R; Baranowski, Tom; Pham, Truc; Berge, Jerica M; Fiese, Barbara; Golley, Rebecca; Hingle, Melanie; Kremers, Stef P J; Rhee, Kyung E; Skouteris, Helen; Vaughn, Amber

2017-09-11

Parents are an important influence on children's dietary intake and eating behaviors. However, the lack of a conceptual framework and inconsistent assessment of food parenting practices limits our understanding of which food parenting practices are most influential on children. The aim of this study was to develop a food parenting practice conceptual framework using systematic approaches of literature reviews and expert input. A previously completed systematic review of food parenting practice instruments and a qualitative study of parents informed the development of a food parenting practice item bank consisting of 3632 food parenting practice items. The original item bank was further reduced to 110 key food parenting concepts using binning and winnowing techniques. A panel of 32 experts in parenting and nutrition were invited to sort the food parenting practice concepts into categories that reflected their perceptions of a food parenting practice conceptual framework. Multi-dimensional scaling produced a point map of the sorted concepts and hierarchical cluster analysis identified potential solutions. Subjective modifications were used to identify two potential solutions, with additional feedback from the expert panel requested. The experts came from 8 countries and 25 participated in the sorting and 23 provided additional feedback. A parsimonious and a comprehensive concept map were developed based on the clustering of the food parenting practice constructs. The parsimonious concept map contained 7 constructs, while the comprehensive concept map contained 17 constructs and was informed by a previously published content map for food parenting practices. Most of the experts (52%) preferred the comprehensive concept map, while 35% preferred to present both solutions. The comprehensive food parenting practice conceptual map will provide the basis for developing a calibrated Item Response Modeling (IRM) item bank that can be used with computerized adaptive testing. 
Such an item bank will allow for more consistency in measuring food parenting practices across studies to better assess the impact of food parenting practices on child outcomes and the effect of interventions that target parents as agents of change.
Confirming the Multidimensionality of Psychologically Controlling Parenting among Chinese-American Mothers: Love Withdrawal, Guilt Induction, and Shaming.

PubMed

Cheah, Charissa; Yu, Jing; Hart, Craig; Sun, Shuyan; Olsen, Joseph

2015-05-01

Despite the theoretical conceptualization of parental psychological control as a multidimensional construct, the majority of previous studies have examined psychological control as a unidimensional scale. Moreover, the conceptualization of shaming and its associations with love withdrawal and guilt induction are unclear. The current study aimed to fill these gaps by evaluating the latent factor structure underlying 18 items from Olsen et al. (2002) that were conceptually relevant to love withdrawal, guilt induction, and shaming practices in a sample of 169 mothers of Chinese-American preschoolers. A multidimensional three-factor model and bi-factor model were specified based on our formulated operational definitions for the three dimensions of psychological control. Both models were found to be superior to the unidimensional model. In addition, results from the bi-factor model and an additional second-order factor model indicated that psychological control is essentially empirically isomorphic with guilt induction. Although love withdrawal and shaming factors were also fairly strong indicators of psychological control, each exhibited important additional unique variability and mutual distinctiveness. Implications for the conceptualization of love withdrawal, guilt induction, and shaming as well as directions for future studies are discussed.
Confirming the Multidimensionality of Psychologically Controlling Parenting among Chinese-American Mothers: Love Withdrawal, Guilt Induction, and Shaming

PubMed Central

Cheah, Charissa; Yu, Jing; Hart, Craig; Sun, Shuyan; Olsen, Joseph

2014-01-01

Despite the theoretical conceptualization of parental psychological control as a multidimensional construct, the majority of previous studies have examined psychological control as a unidimensional scale. Moreover, the conceptualization of shaming and its associations with love withdrawal and guilt induction are unclear. The current study aimed to fill these gaps by evaluating the latent factor structure underlying 18 items from Olsen et al. (2002) that were conceptually relevant to love withdrawal, guilt induction, and shaming practices in a sample of 169 mothers of Chinese-American preschoolers. A multidimensional three-factor model and bi-factor model were specified based on our formulated operational definitions for the three dimensions of psychological control. Both models were found to be superior to the unidimensional model. In addition, results from the bi-factor model and an additional second-order factor model indicated that psychological control is essentially empirically isomorphic with guilt induction. Although love withdrawal and shaming factors were also fairly strong indicators of psychological control, each exhibited important additional unique variability and mutual distinctiveness. Implications for the conceptualization of love withdrawal, guilt induction, and shaming as well as directions for future studies are discussed. PMID:26052168
Phonological and acoustic bases for earliest grammatical category assignment: a cross-linguistic perspective.

PubMed

Shi, R; Morgan, J L; Allopenna, P

1998-02-01

Maternal infant-directed speech in Mandarin Chinese and Turkish (two mother-child dyads each; ages of children between 0;11 and 1;8) was examined to see if cues exist in input that might assist infants' assignment of words to lexical and functional item categories. Distributional, phonological, and acoustic measures were analysed. In each language, lexical and functional items (i.e. syllabic morphemes) differed significantly on numerous measures. Despite differences in mean values between categories, distributions of values typically displayed substantial overlap. However, simulations with self-organizing neural networks supported the conclusion that although individual dimensions had low cue validity, in each language multidimensional constellations of presyntactic cues are sufficient to guide assignment of words to rudimentary grammatical categories.
Measuring Men's Gender Norms and Gender Role Conflict/Stress in a High HIV-Prevalence South African Setting.

PubMed

Gottert, Ann; Barrington, Clare; Pettifor, Audrey; McNaughton-Reyes, Heath Luz; Maman, Suzanne; MacPhail, Catherine; Kahn, Kathleen; Selin, Amanda; Twine, Rhian; Lippman, Sheri A

2016-08-01

Gender norms and gender role conflict/stress may influence HIV risk behaviors among men; however scales measuring these constructs need further development and evaluation in African settings. We conducted exploratory and confirmatory factor analyses to evaluate the Gender Equitable Men's Scale (GEMS) and the Gender Role Conflict/Stress (GRC/S) scale among 581 men in rural northeast South Africa. The final 17-item GEMS was unidimensional, with adequate model fit and reliability (alpha = 0.79). Factor loadings were low (0.2-0.3) for items related to violence and sexual relationships. The final 24-item GRC/S scale was multidimensional with four factors: Success, power, competition; Subordination to women; Restrictive emotionality; and Sexual prowess. The scale had adequate model fit and good reliability (alpha = 0.83). While GEMS is a good measure of inequitable gender norms, new or revised scale items may need to be explored in the South African context. Adding the GRC/S scale to capture men's strain related to gender roles could provide important insights into men's risk behaviors.
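The reliability figures quoted in this record (alpha = 0.79 and 0.83) are presumably Cronbach's alpha, the usual internal-consistency coefficient. As a minimal sketch of how that coefficient is computed, assuming complete item-level scores are available and using the hypothetical helper name `cronbach_alpha` (not taken from the study):

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    Assumes k >= 2, at least two respondents, and no missing responses.
    """
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Illustrative data only: three respondents, two perfectly consistent items.
example = [[1, 1], [2, 2], [3, 3]]
alpha = cronbach_alpha(example)
```

With perfectly consistent items, as in the illustrative data, the coefficient reaches its upper bound of 1; real scales such as those described here fall below that.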
Psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale: A Rasch rating scale analysis and confirmatory factor analysis.

PubMed

Pilatti, Angelina; Lozano, Oscar M; Cyders, Melissa A

2015-12-01

The present study was aimed at determining the psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale in a sample of college students. Participants were 318 college students (36.2% men; mean age = 20.9 years, SD = 6.4 years). The psychometric properties of this Spanish version were analyzed using the Rasch model, and the factor structure was examined using confirmatory factor analysis. The verification of the global fit of the data showed adequate indexes for persons and items. The reliability estimates were high for both items and persons. Differential item functioning across gender was found for 23 items, which likely reflects known differences in impulsivity levels between men and women. The factor structure of the Spanish version of the UPPS-P replicates previous work with the original UPPS-P Scale. Overall, results suggest that test scores from the Spanish version of the UPPS-P show adequate psychometric properties to accurately assess the multidimensional model of impulsivity, which represents the most exhaustive measure of this construct. (c) 2015 APA, all rights reserved.
Condoms and US college-aged men and women: briefly assessing attitudes toward condoms and general condom use behaviours.

PubMed

Hill, Brandon J; Amick, Erick E; Sanders, Stephanie A

2011-09-01

The purpose of this study was to develop an abbreviated reliable tool for assessing the attitudes US college-aged men and women have about condoms and condom use. An online questionnaire was constructed and completed by 674 participants incorporating modified items from the Attitudes Towards Condom Scale (1984) and the Multidimensional Condom Attitude Scale (1994), with the addition of gender-neutral worded and condom positive or erotic items. The original 40 items were reduced to 18 Likert-type items comprising the Brief Condom Attitude Scale (BCAS). Gender comparisons on a subset of 584 self-identified heterosexual participants indicated that women were significantly more likely to consider condoms as less protective, while men were significantly more likely to consider condoms as more interruptive. Additional analyses examining partnership indicated that monogamous participants were significantly more likely to view condoms as less interruptive, more erotic and less negative than non-monogamous participants. The BCAS appears to be a reliable measure for assessing US college-aged individuals' attitudes about condoms.
Multidimensional Risk Analysis: MRISK

NASA Technical Reports Server (NTRS)

McCollum, Raymond; Brown, Douglas; O'Shea, Sarah Beth; Reith, William; Rabulan, Jennifer; Melrose, Graeme

2015-01-01

Multidimensional Risk (MRISK) calculates the combined multidimensional score using Mahalanobis distance. MRISK accounts for covariance between consequence dimensions, which de-conflicts the interdependencies of consequence dimensions, providing a clearer depiction of risks. Additionally, in the event the dimensions are not correlated, Mahalanobis distance reduces to Euclidean distance normalized by the variance and, therefore, represents the most flexible and optimal method to combine dimensions. MRISK is currently being used in NASA's Environmentally Responsible Aviation (ERA) project to assess risk and prioritize scarce resources.
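The Mahalanobis distance mentioned in this record can be illustrated with a short sketch. This is not the MRISK implementation; the function names, the mean vector, and the covariance matrices are hypothetical and chosen only to show the formula d = sqrt((x - m)^T S^-1 (x - m)) and its reduction to a variance-normalized Euclidean distance when the dimensions are uncorrelated.

```python
import math

def solve(a, b):
    """Solve the linear system a @ x = b by Gauss-Jordan elimination."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        # Partial pivoting: bring the largest remaining entry to the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

def mahalanobis(x, mean, cov):
    """Mahalanobis distance of point x from mean under covariance cov."""
    d = [xi - mi for xi, mi in zip(x, mean)]
    y = solve(cov, d)  # y = cov^-1 @ d, without forming the inverse
    return math.sqrt(sum(di * yi for di, yi in zip(d, y)))

# Uncorrelated dimensions (hypothetical variances 4 and 9): each squared
# deviation is simply divided by its variance.
d_uncorrelated = mahalanobis([2.0, 3.0], [0.0, 0.0], [[4.0, 0.0], [0.0, 9.0]])

# Correlated dimensions (hypothetical covariance 0.5): the shared variation
# is discounted, so the distance is smaller than the plain Euclidean one.
d_correlated = mahalanobis([1.0, 1.0], [0.0, 0.0], [[1.0, 0.5], [0.5, 1.0]])
```

Solving the linear system rather than inverting the covariance matrix is a common numerical choice; it assumes the matrix is non-singular.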
Trust in the Medical Profession: Conceptual and Measurement Issues

PubMed Central

Hall, Mark A; Camacho, Fabian; Dugan, Elizabeth; Balkrishnan, Rajesh

2002-01-01

Objective: To develop and test a multi-item measure for general trust in physicians, in contrast with trust in a specific physician. Data Sources: Random national telephone survey of 502 adult subjects with a regular physician and source of payment. Study Design: Based on a multidimensional conceptual model, a large pool of candidate items was generated, tested, and revised using focus groups, expert reviewers, and pilot testing. The scale was analyzed for its factor structure, internal consistency, construct validity, and other psychometric properties. Principal Findings: The resulting 11-item scale measuring trust in physicians generally is consistent with most aspects of the conceptual model except that it does not include the dimension of confidentiality. This scale has a single-factor structure, good internal consistency (alpha=.89), and good response variability (range=11–54; mean=33.5; SD=6.9). This scale is related to satisfaction with care, trust in one's physician, following doctors' recommendations, having no prior disputes with physicians, not having sought second opinions, and not having changed doctors. No association was found with race/ethnicity. While general trust and interpersonal trust are qualitatively similar, they are only moderately correlated with each other and general trust is substantially lower. Conclusions: Emerging research on patients' trust has focused on interpersonal trust in a specific, known physician. Trust in physicians in general is also important and differs significantly from interpersonal physician trust. General physician trust potentially has a strong influence on important behaviors and attitudes, and on the formation of interpersonal physician trust. PMID:12479504
Performance of the Swedish version of the Revised Piper Fatigue Scale.

PubMed

Jakobsson, Sofie; Taft, Charles; Östlund, Ulrika; Ahlberg, Karin

2013-12-01

The Revised Piper Fatigue Scale (RPFS) is one of the most widely used instruments internationally to assess cancer-related fatigue. The aim of the present study was to evaluate selected psychometric properties of a Swedish version of the RPFS (SPFS). An earlier translation of the SPFS was further evaluated and developed. The new version was mailed to 300 patients undergoing curative radiotherapy. The internal validity was assessed using Principal Axis Factor Analysis with oblimin rotation and multitrait analysis. External validity was examined in relation to the Multidimensional Fatigue Inventory-20 (MFI-20) and in known-groups analyses. In total, 196 patients (response rate = 65%) returned evaluable questionnaires. Principal axis factoring analysis yielded three factors (74% of the variance) rather than four as in the original RPFS. Multitrait analyses confirmed the adequacy of scaling assumptions. Known-groups analyses failed to support the discriminative validity. Concurrent validity was satisfactory. The new Swedish version of the RPFS showed good acceptability, reliability, and convergent and discriminant item-scale validity. Our results converge with other international versions of the RPFS in failing to support the four-dimension conceptual model of the instrument. Hence, the suitability of the RPFS for use in international comparisons may be limited, which may also have implications for the cross-cultural validity of the newly released 12-item version of the RPFS. Further research on the Swedish version should address reasons for high missing rates for certain items in the subscale of affective meaning, further evaluation of the discriminative validity and assessment of its sensitivity in detecting changes over time. Copyright © 2013 Elsevier Ltd. All rights reserved.

Feasibility, Validity, and Reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale for Adults in Inpatients with Severe Obesity.

PubMed

Manzoni, Gian Mauro; Rossi, Alessandro; Marazzi, Nicoletta; Agosti, Fiorenza; De Col, Alessandra; Pietrabissa, Giada; Castelnuovo, Gianluca; Molinari, Enrico; Sartorio, Allessandro

2018-01-01

This study aimed to examine the feasibility, validity, and reliability of the Italian Pediatric Quality of Life Inventory Multidimensional Fatigue Scale (PedsQL™ MFS) for adult inpatients with severe obesity. A total of 200 inpatients (81% females) with severe obesity (BMI ≥ 35 kg/m²) completed the PedsQL MFS (General Fatigue, Sleep/Rest Fatigue and Cognitive Fatigue domains), the Fatigue Severity Scale, and the Center for Epidemiologic Studies Depression Scale immediately after admission to a 3-week residential body weight reduction program. A randomized subsample of 48 patients re-completed the PedsQL MFS after 3 days. Confirmatory factor analysis showed that a modified hierarchical model with two items moved from the Sleep/Rest Fatigue domain to the General Fatigue domain and a second-order latent factor best fitted the data. Internal consistency and test-retest reliabilities were acceptable to high in all scales, and small to high statistically significant correlations were found with all convergent measures, with the exception of BMI. Significant floor effects were found in two scales (Cognitive Fatigue and Sleep/Rest Fatigue). The Italian modified PedsQL MFS for adults proved to be a valid and reliable tool for the assessment of fatigue in inpatients with severe obesity. Future studies should assess its discriminant validity as well as its responsiveness to weight reduction. © 2018 The Author(s) Published by S. Karger GmbH, Freiburg.
A combined analysis of the Frost Multidimensional Perfectionism Scale (FMPS), Child and Adolescent Perfectionism Scale (CAPS), and Almost Perfect Scale-Revised (APS-R): Different perfectionist profiles in adolescent high school students.

PubMed

Sironic, Amanda; Reeve, Robert A

2015-12-01

To investigate differences and similarities in the dimensional constructs of the Frost Multidimensional Perfectionism Scale (FMPS; Frost, Marten, Lahart, & Rosenblate, 1990), Child and Adolescent Perfectionism Scale (CAPS; Flett, Hewitt, Boucher, Davidson, & Munro, 2000), and Almost Perfect Scale-Revised (APS-R; Slaney, Rice, Mobley, Trippi, & Ashby, 2001), 938 high school students completed the 3 perfectionism questionnaires, as well as the Depression Anxiety Stress Scales (DASS; Lovibond & Lovibond, 1995). Preliminary analyses revealed commonly observed factor structures for each perfectionism questionnaire. Exploratory factor analysis of item responses from the questionnaires (combined) yielded a 4-factor solution (factors were labeled High Personal Standards; Concerns, Doubts and Discrepancy; Externally Motivated Perfectionism; and Organization and Order). A latent class analysis of individuals' mean ratings on each of the 4 factors yielded a 6-class solution. Three of the 6 classes represented perfectionist subgroups (labeled adaptive perfectionist, externally motivated maladaptive perfectionist, and mixed maladaptive perfectionist), and 3 represented nonperfectionist subgroups (labeled nonperfectionist A, nonperfectionist B, and order and organization nonperfectionist). Each of the 6 subgroups was meaningfully associated with the DASS. Findings showed that 3 out of 10 students were classified as maladaptive perfectionists, and maladaptive perfectionists were more prevalent than adaptive perfectionists. In sum, it is evident that combined ratings from the FMPS, CAPS, and APS-R offer a meaningful characterization of perfectionism. (c) 2015 APA, all rights reserved.
The validation and translation of Multidimensional Measure of Informed Choice in Greek.

PubMed

Gourounti, Kleanthi; Sandall, Jane

2011-04-01

To translate the original English version of the Multidimensional Measure of Informed Choice (MMIC) into Greek, to adapt it culturally to Greece, and to determine its psychometric properties for the assessment of informed choice in antenatal screening for Down syndrome. Survey using self-administered questionnaires. Public hospital in Athens, Greece. 135 pregnant women with gestational age between the 11th and 20th week, just prior to having antenatal screening for Down syndrome. 96% of women had a positive attitude towards screening and 45% had a good level of knowledge concerning the screening process for Down syndrome. Using a standard measure of informed choice, validated for use in Greek, it was found that 44% of women made an informed choice, and thus 56% of women made an uninformed choice. The internal consistency of the scales was good; Cronbach's alpha was found to be 0.76 for the attitude scale and 0.64 for the knowledge scale, suggesting that all items were appropriate to measure. The factor analysis of the attitude scale indicated three factors with an eigenvalue over 1.0. Those factors were responsible for 87% of the variance. This study indicates that the Greek version of the MMIC appears to be a reliable and valid tool for measuring informed choice in antenatal screening for Down syndrome. Because of its short length and completion time, it seems to be a practical instrument for use in Greek antenatal clinics. Copyright © 2009 Elsevier Ltd. All rights reserved.
Interactions across Multiple Stimulus Dimensions in Primary Auditory Cortex.

PubMed

Sloas, David C; Zhuo, Ran; Xue, Hongbo; Chambers, Anna R; Kolaczyk, Eric; Polley, Daniel B; Sen, Kamal

2016-01-01

Although sensory cortex is thought to be important for the perception of complex objects, its specific role in representing complex stimuli remains unknown. Complex objects are rich in information along multiple stimulus dimensions. The position of cortex in the sensory hierarchy suggests that cortical neurons may integrate across these dimensions to form a more gestalt representation of auditory objects. Yet, studies of cortical neurons typically explore single or few dimensions due to the difficulty of determining optimal stimuli in a high dimensional stimulus space. Evolutionary algorithms (EAs) provide a potentially powerful approach for exploring multidimensional stimulus spaces based on real-time spike feedback, but two important issues arise in their application. First, it is unclear whether it is necessary to characterize cortical responses to multidimensional stimuli or whether it suffices to characterize cortical responses to a single dimension at a time. Second, quantitative methods for analyzing complex multidimensional data from an EA are lacking. Here, we apply a statistical method for nonlinear regression, the generalized additive model (GAM), to address these issues. The GAM quantitatively describes the dependence between neural response and all stimulus dimensions. We find that auditory cortical neurons in mice are sensitive to interactions across dimensions. These interactions are diverse across the population, indicating significant integration across stimulus dimensions in auditory cortex. This result strongly motivates using multidimensional stimuli in auditory cortex. Together, the EA and the GAM provide a novel quantitative paradigm for investigating neural coding of complex multidimensional stimuli in auditory and other sensory cortices.
Detecting Multidimensionality: Which Residual Data-Type Works Best?

ERIC Educational Resources Information Center

Linacre, John Michael

1998-01-01

Simulation studies indicate that, for responses to complete tests, construction of Rasch measures from observational data, followed by principal components factor analysis of Rasch residuals, provides an effective means of identifying multidimensionality. The most diagnostically useful residual form was found to be the standardized residual. (SLD)
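The standardized residual named in this record can be sketched as follows for dichotomous responses. This is only an illustration under stated assumptions: person measures (`thetas`) and item difficulties (`difficulties`) are taken as already estimated, and the function names are hypothetical rather than drawn from the report. Under the Rasch model the expected score is P = exp(theta - b) / (1 + exp(theta - b)), and the standardized residual is z = (x - P) / sqrt(P(1 - P)).

```python
import math

def rasch_prob(theta, b):
    """Probability of a correct (1) response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def standardized_residuals(responses, thetas, difficulties):
    """Return the persons-by-items matrix of standardized Rasch residuals.

    responses[n][i] is the 0/1 response of person n to item i.
    """
    z = []
    for x_row, theta in zip(responses, thetas):
        row = []
        for x, b in zip(x_row, difficulties):
            p = rasch_prob(theta, b)
            # Observed minus expected, scaled by the model standard deviation.
            row.append((x - p) / math.sqrt(p * (1.0 - p)))
        z.append(row)
    return z

# Illustrative values only: two persons, two items, all measures at 0 logits.
z = standardized_residuals([[1, 0], [0, 1]], [0.0, 0.0], [0.0, 0.0])
```

In the procedure the abstract describes, a principal components analysis of the inter-item correlations of such residuals would follow; that step is omitted here.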
Establishing a coherent and replicable measurement model of the Edinburgh Postnatal Depression Scale.

PubMed

Martin, Colin R; Redshaw, Maggie

2018-06-01

The 10-item Edinburgh Postnatal Depression Scale (EPDS) is an established screening tool for postnatal depression. Inconsistent findings in factor structure and replication difficulties have limited the scope of development of the measure as a multi-dimensional tool. The current investigation sought to robustly determine the underlying factor structure of the EPDS and the replicability and stability of the most plausible model identified. A between-subjects design was used. EPDS data were collected postpartum from two independent cohorts using identical data capture methods. Datasets were examined with confirmatory factor analysis, model invariance testing and systematic evaluation of relational and internal aspects of the measure. Participants were two samples of postpartum women in England assessed at three months (n = 245) and six months (n = 217). The findings showed a three-factor seven-item model of the EPDS offered an excellent fit to the data, and was observed to be replicable in both datasets and invariant as a function of time point of assessment. Some EPDS sub-scale scores were significantly higher at six months. The EPDS is multi-dimensional and a robust measurement model comprises three factors that are replicable. The potential utility of the sub-scale components identified requires further research to identify a role in contemporary screening practice. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Parentification of Adult Children of Divorce: A Multidimensional Analysis.

ERIC Educational Resources Information Center

Jurkovic, Gregory J.; Thirkield, Alison; Morrell, Richard

2001-01-01

Compared the responses of 381 late adolescent and young adult children of divorce and nondivorce on a new multidimensional measure of parentification assessing the extent and fairness of past and present family caregiving. Evidence that problematic forms of parentification in children of divorce continue into late adolescence and young adulthood…
Assessing Multidimensional Energy Literacy of Secondary Students Using Contextualized Assessment

ERIC Educational Resources Information Center

Chen, Kuan-Li; Liu, Shiang-Yao; Chen, Po-Hsi

2015-01-01

Energy literacy is multidimensional, comprising broad content knowledge as well as affect and behavior. Our previous study has defined four core dimensions for the assessment framework, including energy concepts, reasoning on energy issues, low-carbon lifestyle, and civic responsibility for a sustainable society. The present study compiled a…
The Cyber Aggression in Relationships Scale: A New Multidimensional Measure of Technology-Based Intimate Partner Aggression.

PubMed

Watkins, Laura E; Maldonado, Rosalita C; DiLillo, David

2018-07-01

The purpose of this study was to develop and provide initial validation for a measure of adult cyber intimate partner aggression (IPA): the Cyber Aggression in Relationships Scale (CARS). Drawing on recent conceptual models of cyber IPA, items from previous research exploring general cyber aggression and cyber IPA were modified and new items were generated for inclusion in the CARS. Two samples of adults 18 years or older were recruited online. We used item factor analysis to test the factor structure, model fit, and invariance of the measure structure across women and men. Results confirmed that three-factor models for both perpetration and victimization demonstrated good model fit, and that, in general, the CARS measures partner cyber aggression similarly for women and men. The CARS also demonstrated validity through significant associations with in-person IPA, trait anger, and jealousy. Findings suggest the CARS is a useful tool for assessing cyber IPA in both research and clinical settings.
Sharing medicine: the candidacy of medicines and other household items for sharing, Dominican Republic.

PubMed

Dohn, Michael N; Pilkington, Hugo

2014-01-01

People share medicines and problems can result from this behavior. Successful interventions to change sharing behavior will require understanding people's motives and purposes for sharing medicines. Better information about how medicines fit into the gifting and reciprocity system could be useful in designing interventions to modify medicine sharing behavior. However, it is uncertain how people situate medicines among other items that might be shared. This investigation is a descriptive study of how people sort medicines and other shareable items. This study in the Dominican Republic examined how a convenience sample (31 people) sorted medicines and rated their shareability in relation to other common household items. We used non-metric multidimensional scaling to produce association maps in which the distances between items offer a visual representation of the collective opinion of the participants regarding the relationships among the items. In addition, from a pile sort constrained by four categories of whether sharing or loaning the item was acceptable (on a scale from not shareable to very shareable), we assessed the degree to which the participants rated the medicines as shareable compared to other items. Participants consistently grouped medicines together in all pile sort activities; yet, medicines were mixed with other items when rated by their candidacy to be shared. Compared to the other items, participants had more variability of opinion as to whether medicines should be shared. People think of medicines as a distinct group, suggesting that interventions might be designed to apply to medicines as a group. People's differing opinions as to whether it was appropriate to share medicines imply a degree of uncertainty or ambiguity that health promotion interventions might exploit to alter attitudes and behaviors. These findings have implications for the design of health promotion interventions to impact medicine sharing behavior.
The Danieli Inventory of Multigenerational Legacies of Trauma, Part I: Survivors' posttrauma adaptational styles in their children's eyes.

PubMed

Danieli, Yael; Norris, Fran H; Lindert, Jutta; Paisner, Vera; Engdahl, Brian; Richter, Julia

2015-09-01

A comprehensive valid behavioral measure for assessing multidimensional multigenerational impacts of massive trauma has been missing thus far. We describe the development of the Posttrauma Adaptational Styles questionnaire (Part I of the three-part Danieli Inventory of Multigenerational Legacies of Trauma), a self-report questionnaire of Holocaust survivors' children's perceptions of each parent and their own upbringing (60 items per parent). The items were based on literature and cognitive interviewing of 18 survivors' offspring. A web-based convenience sample survey was designed in English and Hebrew and completed by 482 adult children (M age = 59; 67% women) of Holocaust survivors. Exploratory factor analyses were conducted by using maximum likelihood extraction with Geomin rotation to examine the factor structure of the original 70 items for each parent. Conducted hierarchically, the analysis yielded three higher-order factors reflecting intensities of victim, numb, and fighter styles. The 30-item Victim Style Scale (α = .92-.93) and 18-item Numb Style Scale (α = .89) had excellent internal consistency; the consistency of the 12-item Fighter Style Scale (α = .69-.70) was more modest. English-Hebrew analyses suggested good-to-excellent congruence in factor structure (φ = .87-.99). Further research is needed to evaluate the validity of the measure in other samples and populations. Copyright © 2015 Elsevier Ltd. All rights reserved.
DEVELOPMENT OF MOTIVATION SCALE - CLINICAL VALIDATION WITH ALCOHOL DEPENDENTS

PubMed Central

Neeliyara, Teresa; Nagalakshmi, S.V.

1994-01-01

This study focusses on the development of a comprehensive multi-dimensional scale for assessing motivation for change in the alcohol dependent population. After establishing face validity, the items evolved were administered to a normal sample of 600 male subjects in whom psychiatric illness was ruled out. The data thus obtained were subjected to factor analysis. Six factors were obtained which accounted for 55.2% of variance. These together formed an 80-item five-point scale and norms were established on a sample of 600 normal subjects. Further clinical validation was established on 30 alcohol dependent subjects and 30 normals. The status of motivation was found to be inadequate in alcohol dependent individuals as compared to the normals. Split-half reliability was carried out and the tool was found to be highly reliable. PMID:21743674
[Statistical validity of the Mexican Food Security Scale and the Latin American and Caribbean Food Security Scale].

PubMed

Villagómez-Ornelas, Paloma; Hernández-López, Pedro; Carrasco-Enríquez, Brenda; Barrios-Sánchez, Karina; Pérez-Escamilla, Rafael; Melgar-Quiñónez, Hugo

2014-01-01

This article validates the statistical consistency of two food security scales: the Mexican Food Security Scale (EMSA) and the Latin American and Caribbean Food Security Scale (ELCSA). Validity tests were conducted in order to verify that both scales were consistent instruments, composed of independent, properly calibrated and adequately sorted items, arranged in a continuum of severity. The following tests were developed: sorting of items; Cronbach's alpha analysis; parallelism of prevalence curves; Rasch models; sensitivity analysis through hypothesis tests of mean differences. The tests showed that both scales meet the required attributes and are robust statistical instruments for food security measurement. This is relevant given that the lack of access to food indicator, included in multidimensional poverty measurement in Mexico, is calculated with EMSA.
A psychometric evaluation of the four-item version of the Control Attitudes Scale for patients with cardiac disease and their partners.

PubMed

Årestedt, Kristofer; Ågren, Susanna; Flemme, Inger; Moser, Debra K; Strömberg, Anna

2015-08-01

The four-item Control Attitudes Scale (CAS) was developed to measure control perceived by patients with cardiac disease and their family members, but extensive psychometric evaluation has not been performed. The aim was to translate, culturally adapt and psychometrically evaluate the CAS in a Swedish sample of implantable cardioverter defibrillator (ICD) recipients, heart failure (HF) patients and their partners. A sample (n=391) of ICD recipients, HF patients and partners was used. Descriptive statistics, item-total and inter-item correlations, exploratory factor analysis, ordinal regression modelling and Cronbach's alpha were used to validate the CAS. The findings from the factor analyses revealed that the CAS is a multidimensional scale including two factors, Control and Helplessness. The internal consistency was satisfactory for all scales (α=0.74-0.85), except the family version total scale (α=0.62). No differential item functioning was detected, which implies that the CAS can be used to make invariant comparisons between groups of different age and sex. The psychometric properties, together with the simple and short format of the CAS, make it a useful tool for measuring perceived control among patients with cardiac diseases and their family members. When using the CAS, subscale scores should be preferred. © The European Society of Cardiology 2014.
The Novel Object and Unusual Name (NOUN) Database: A collection of novel images for use in experimental research.

PubMed

Horst, Jessica S; Hout, Michael C

2016-12-01

Many experimental research designs require images of novel objects. Here we introduce the Novel Object and Unusual Name (NOUN) Database. This database contains 64 primary novel object images and additional novel exemplars for ten basic- and nine global-level object categories. The objects' novelty was confirmed by both self-report and a lack of consensus on questions that required participants to name and identify the objects. We also found that object novelty correlated with qualifying naming responses pertaining to the objects' colors. The results from a similarity sorting task (and a subsequent multidimensional scaling analysis on the similarity ratings) demonstrated that the objects are complex and distinct entities that vary along several featural dimensions beyond simply shape and color. A final experiment confirmed that additional item exemplars comprised both sub- and superordinate categories. These images may be useful in a variety of settings, particularly for developmental psychology and other research in the language, categorization, perception, visual memory, and related domains.
1999 Survey of Active Duty Personnel: Administration, Datasets, and Codebook. Appendix G: Frequency and Percentage Distributions for Variables in the Survey Analysis Files.

DTIC Science & Technology

2000-12-01

A SKIP FLAG INDICATING THE RESULT OF CHECKING THE RESPONSE ON THE PARENT (SCREENING) ITEM AGAINST THE RESPONSE(S) ON THE ITEMS WITHIN THE SKIP...RESPONSE ON THE PARENT (SCREENING) ITEM AGAINST THE RESPONSE(S) ON THE ITEMS WITHIN THE SKIP PATTERN. SEE TABLE D-5, NOTE 2, IN APPENDIX D. G-52...RESULT OF CHECKING THE RESPONSE ON THE PARENT (SCREENING) ITEM AGAINST THE RESPONSE(S) ON THE ITEMS WITHIN THE SKIP PATTERN. SEE TABLE D-5
Cross-cultural evaluation of the French version of the LEIPAD, a health-related quality of life instrument for use in the elderly living at home.

PubMed

Jalenques, I; Auclair, C; Roblin, J; Morand, D; Tourtauchaux, R; May, R; Vaille-Perret, E; Watts, J; Gerbaud, L; De Leo, D

2013-04-01

To cross-culturally adapt a French version of the LEIPAD, a self-administered questionnaire assessing the health-related quality of life (HRQoL) in adults aged 65 years and over living at home, and to evaluate its psychometric properties. After having translated LEIPAD in accordance with guidelines, we studied psychometric properties: reliability and construct validity (factor analysis, relationships between items and scales, internal consistency, concurrent validity with the Medical Outcome Study Short-Form 36 and known-groups validity). The results obtained in a sample of 195 elderly people from the general population showed very good acceptability, with response rates above 93%. Exploratory factor analysis extracted eight factors providing a multidimensional structure with five misclassifications of items in the seven theoretical scales. Good internal consistency (Cronbach's alpha ranging from 0.73 to 0.86) and strong test-retest reliability (ICCs higher than 0.80 for six scales and 0.70 for one) were demonstrated. Concurrent validity with the SF-36 showed small to strong expected correlations. This first evaluation of the French version of LEIPAD's psychometric properties provides evidence of construct validity and reliability. It would allow HRQoL assessment in clinical and common practice, and investigators would be able to take part in national and international research projects.
Self-Management and Transition Readiness Assessment: Development, Reliability, and Factor Structure of the STARx Questionnaire.

PubMed

Ferris, M; Cohen, S; Haberman, C; Javalkar, K; Massengill, S; Mahan, J D; Kim, S; Bickford, K; Cantu, G; Medeiros, M; Phillips, A; Ferris, M T; Hooper, S R

2015-01-01

The Self-Management and Transition to Adulthood with Rx=Treatment (STARx) Questionnaire was developed to collect information on self-management and health care transition (HCT) skills, via self-report, in a broad population of adolescents and young adults (AYAs) with chronic conditions. Over several iterations, the STARx questionnaire was created with AYA, family, and health provider input. The development and pilot testing of the STARx Questionnaire took place with the assistance of 1219 AYAs with different chronic health conditions, in multiple institutions and settings over three phases: item development, pilot testing, reliability and factor structuring. The three development phases resulted in a final version of the STARx Questionnaire. The exploratory factor analysis of the third version of the 18-item STARx identified six factors that accounted for about 65% of the variance: Medication management, Provider communication, Engagement during appointments, Disease knowledge, Adult health responsibilities, and Resource utilization. Reliability estimates revealed good internal consistency and temporal stability, with the alpha coefficient for the overall scale being .80. The STARx was developmentally sensitive, with older patients scoring significantly higher on nearly every factor than younger patients. The STARx Questionnaire is a reliable, self-report tool with adequate internal consistency, temporal stability, and a strong, multidimensional factor structure. It provides another assessment strategy to measure self-management and transition skills in AYAs with chronic conditions. Copyright © 2015 Elsevier Inc. All rights reserved.
The Employment Precariousness Scale (EPRES): psychometric properties of a new tool for epidemiological studies among waged and salaried workers.

PubMed

Vives, Alejandra; Amable, Marcelo; Ferrer, Montserrat; Moncada, Salvador; Llorens, Clara; Muntaner, Carles; Benavides, Fernando G; Benach, Joan

2010-08-01

Despite the fact that labour market flexibility has resulted in an expansion of precarious employment in industrialised countries, to date there is limited empirical evidence concerning its health consequences. The Employment Precariousness Scale (EPRES) is a newly developed, theory-based, multidimensional questionnaire specifically devised for epidemiological studies among waged and salaried workers. To assess the acceptability, reliability and construct validity of EPRES in a sample of waged and salaried workers in Spain. A sample of 6968 temporary and permanent workers from a population-based survey carried out in 2004-2005 was analysed. The survey questionnaire was interviewer administered and included the six EPRES subscales, and measures of the psychosocial work environment (COPSOQ ISTAS21) and perceived general and mental health (SF-36). A high response rate to all EPRES items indicated good acceptability; Cronbach's alpha coefficients, over 0.70 for all subscales and the global score, demonstrated good internal consistency reliability; exploratory factor analysis using principal axis analysis and varimax rotation confirmed the six-subscale structure and the theoretical allocation of all items. Patterns across known groups and correlation coefficients with psychosocial work environment measures and perceived health demonstrated the expected relations, providing evidence of construct validity. Our results provide evidence in support of the psychometric properties of EPRES, which appears to be a promising tool for the measurement of employment precariousness in public health research.
The Cyclic Nature of Problem Solving: An Emergent Multidimensional Problem-Solving Framework

ERIC Educational Resources Information Center

Carlson, Marilyn P.; Bloom, Irene

2005-01-01

This paper describes the problem-solving behaviors of 12 mathematicians as they completed four mathematical tasks. The emergent problem-solving framework draws on the large body of research, as grounded by and modified in response to our close observations of these mathematicians. The resulting "Multidimensional Problem-Solving Framework" has four…

Testing Multidimensional Models of Youth Civic Engagement: Model Comparisons, Measurement Invariance, and Age Differences

ERIC Educational Resources Information Center

Wray-Lake, Laura; Metzger, Aaron; Syvertsen, Amy K.

2017-01-01

Despite recognition that youth civic engagement is multidimensional, different modeling approaches are rarely compared or tested for measurement invariance. Using a diverse sample of 2,467 elementary, middle, and high school-aged youth, we measured eight dimensions of civic engagement: social responsibility values, informal helping, political…
Response Mixture Modeling: Accounting for Heterogeneity in Item Characteristics across Response Times.

PubMed

Molenaar, Dylan; de Boeck, Paul

2018-06-01

In item response theory modeling of responses and response times, it is commonly assumed that the item responses have the same characteristics across the response times. However, heterogeneity might arise in the data if subjects resort to different response processes when solving the test items. These differences may be within-subject effects, that is, a subject might use a certain process on some of the items and a different process with different item characteristics on the other items. If the probability of using one process over the other process depends on the subject's response time, within-subject heterogeneity of the item characteristics across the response times arises. In this paper, the method of response mixture modeling is presented to account for such heterogeneity. Contrary to traditional mixture modeling where the full response vectors are classified, response mixture modeling involves classification of the individual elements in the response vector. In a simulation study, the response mixture model is shown to be viable in terms of parameter recovery. In addition, the response mixture model is applied to a real dataset to illustrate its use in investigating within-subject heterogeneity in the item characteristics across response times.
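The idea of classifying individual responses, rather than whole response vectors, can be sketched numerically. The example below is only one plausible parameterization and is not claimed to be the authors' model: it assumes two response processes ("fast" and "slow"), each with its own two-parameter logistic item parameters, and a logistic link from log response time to the probability that the slow process was used. All function names and parameter values are hypothetical.

```python
import math


def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))


def p_correct_2pl(theta, a, b):
    """Two-parameter logistic item response function."""
    return logistic(a * (theta - b))


def p_slow_class(log_rt, zeta0, zeta1):
    """Assumed link: longer response times make the 'slow' process more likely."""
    return logistic(zeta1 * (log_rt - zeta0))


def response_mixture_prob(theta, log_rt, fast_item, slow_item, zeta0=0.0, zeta1=2.0):
    """Marginal P(correct) for one response, mixing over the two processes.

    fast_item and slow_item are (a, b) pairs: the same item has different
    characteristics depending on which process produced the response.
    """
    pi_slow = p_slow_class(log_rt, zeta0, zeta1)
    p_fast = p_correct_2pl(theta, *fast_item)
    p_slow = p_correct_2pl(theta, *slow_item)
    return pi_slow * p_slow + (1.0 - pi_slow) * p_fast


# Same subject and item, three response times (log seconds, hypothetical).
for log_rt in (-1.0, 0.0, 1.0):
    print(log_rt, response_mixture_prob(0.0, log_rt, (1.5, 0.5), (0.8, -0.5)))
```

Since the class probability is attached to each response through its own response time, one subject can be assigned to different processes on different items, which is the within-subject heterogeneity the abstract describes.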
A general theoretical framework for interpreting patient-reported outcomes estimated from ordinally scaled item responses.

PubMed

Massof, Robert W

2014-10-01

A simple theoretical framework explains patient responses to items in rating scale questionnaires. Fixed latent variables position each patient and each item on the same linear scale. Item responses are governed by a set of fixed category thresholds, one for each ordinal response category. A patient's item responses are magnitude estimates of the difference between the patient variable and the patient's estimate of the item variable, relative to his/her personally defined response category thresholds. Differences between patients in their personal estimates of the item variable and in their personal choices of category thresholds are represented by random variables added to the corresponding fixed variables. Effects of intervention correspond to changes in the patient variable, the patient's response bias, and/or latent item variables for a subset of items. Intervention effects on patients' item responses were simulated by assuming the random variables are normally distributed with a constant scalar covariance matrix. Rasch analysis was used to estimate latent variables from the simulated responses. The simulations demonstrate that changes in the patient variable and changes in response bias produce indistinguishable effects on item responses and manifest as changes only in the estimated patient variable. Changes in a subset of item variables manifest as intervention-specific differential item functioning and as changes in the estimated person variable that equals the average of changes in the item variables. Simulations demonstrate that intervention-specific differential item functioning produces inefficiencies and inaccuracies in computer adaptive testing. © The Author(s) 2013 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
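The response mechanism described in this abstract can be illustrated with a small simulation. This is a rough sketch under stated assumptions rather than the author's simulation code: the patient's estimate of the item variable and each category threshold receive independent normal deviations with a common standard deviation, and the perturbed thresholds are re-sorted so that categories stay ordered (a simplification introduced here). All names and values are hypothetical.

```python
import random
from bisect import bisect_right


def simulate_response(person, item, thresholds, noise_sd, rng):
    """Return an ordinal category (0 .. len(thresholds)) for one person-item pair."""
    # Patient-specific estimate of the item variable: fixed value plus random deviation.
    perceived_item = item + rng.gauss(0.0, noise_sd)
    # Patient-specific thresholds: fixed values plus random deviations, kept ordered.
    personal_thresholds = sorted(t + rng.gauss(0.0, noise_sd) for t in thresholds)
    # The response is the number of thresholds at or below the perceived difference.
    return bisect_right(personal_thresholds, person - perceived_item)


rng = random.Random(2014)  # fixed seed so the run is repeatable
thresholds = [-1.0, 0.5, 2.0]  # three thresholds define four response categories
before = [simulate_response(0.0, 0.0, thresholds, 0.5, rng) for _ in range(1000)]
after = [simulate_response(1.0, 0.0, thresholds, 0.5, rng) for _ in range(1000)]
print(sum(before) / len(before), sum(after) / len(after))
```

In this sketch, raising the person value by a constant and lowering every threshold by the same constant yield the same distribution of categories, which mirrors the abstract's point that changes in the patient variable and in response bias are indistinguishable in the item responses.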
Translation, Transcultural Adaptation, and Validation of the Empathy, Spirituality, and Wellness in Medicine Scale to the Brazilian Portuguese Language.

PubMed

Cangussu Silva, Alexander; Ezequiel, Oscarina da Silva; Damiano, Rodolfo Furlan; Granero Lucchetti, Alessandra Lamas; DiLalla, Lisabeth Fisher; Dorsey, J Kevin; Lucchetti, Giancarlo

2018-04-09

Construct: The Empathy, Spirituality, and Wellness in Medicine Scale (ESWIM) is a 43-item multidimensional scale developed to investigate different dimensions of physicians and medical students. Medical education research requires the use of several different instruments with dozens of items that evaluate each construct separately, making their application slow and increasing the likelihood of students providing a large number of incomplete or missing responses. To provide an alternative measure, this study aims to translate, adapt, and validate the multidimensional ESWIM instrument for Brazilian medical students. This is a very promising instrument because it is multidimensional, relatively short, and cost free; it evaluates important constructs; and it has been explicitly designed for use in the medical context. The English-language instrument was translated and adapted into the Brazilian Portuguese language using standard procedures: translation, transcultural adaptation, and back-translation. ESWIM was administered to students in all years of the medical curriculum. A retest was given 45 days later to evaluate reliability. To assess validity, the questionnaire also included sociodemographic data, the Duke Religion Index, the Empathy Inventory, the brief version of the World Health Organization Quality of Life (WHOQOL-Bref), and the Oldenburg Burnout Inventory. A total of 776 medical students (M age = 22.34 years, SD = 3.11) were assessed. The Brazilian Portuguese version of ESWIM showed good internal consistency for the factor of Empathy (α = 0.79-0.81) and borderline internal consistency for the other factors: Openness to Spirituality (α = 0.61-0.66), Wellness (α = 0.57-0.68), and Tolerance (α = 0.56-0.65). The principal component analysis revealed a four-factor structure; however, the confirmatory factor analysis showed a better fit for a three-factor structure. 
We found a significant positive correlation between ESWIM empathy and empathy measured by the Empathy Inventory (r = .444, p < .01), as well as negative correlations between ESWIM empathy and burnout (r = -.145 to -.224, p < .01). ESWIM openness to spirituality was also significantly correlated with different subscales of religiosity (r = .301-.417, p < .01), and ESWIM wellness was significantly correlated with the WHOQOL-Bref factors (r = .390-.673, p < .01). The test-retest reliability (applied to 83 students) was high for all factors except Tolerance. This study provides supportive evidence regarding the reliability and validity of ESWIM empathy scores. The ESWIM scale opens a new field of research in relation to openness to spirituality by introducing a scale that measures this openness attitude. Despite borderline internal consistency, ESWIM wellness was strongly associated with quality of life and had good test-retest reliability. Thus, ESWIM appears to be a valid option for evaluating these constructs in medical students.
Portuguese Medical Students' Knowledge and Attitudes Towards Homosexuality.

PubMed

Lopes, Lucas; Gato, Jorge; Esteves, Manuel

2016-11-01

Lesbian, gay, bisexual and transgender people still face discrimination in healthcare environments and physicians often report lack of knowledge on this population's specific healthcare needs. In fact, recommendations have been put forward to include lesbian, gay, bisexual and transgender health in medical curricula. This study aimed to explore factors associated with medical students' knowledge and attitudes towards homosexuality in different years of the medical course. An anonymous online-based questionnaire was sent to all medical students enrolled at the Faculty of Medicine - University of Porto, Portugal, in December 2015. The questionnaire included socio-demographic questions, the Multidimensional Scale of Attitudes Toward Lesbians and Gay Men (27 items) and a Homosexuality Knowledge Questionnaire (17 items). Descriptive statistics, ANOVAs, Chi-square tests and Pearson's correlations were used in the analysis. A total of 489 completed responses was analyzed. Male gender, religiosity and absence of lesbian, gay or bisexual friends were associated with more negative attitudes towards homosexuality. Attitudinal scores did not correlate with advanced years in medical course or contact with lesbian, gay or bisexual patients. Students aiming to pursue technique-oriented specialties presented higher scores in the 'Modern Heterosexism' subscale than students seeking patient-oriented specialties. Although advanced years in medical course correlated significantly with higher knowledge scores, items related with lesbian, gay or bisexual health showed the lowest percentage of correct answers. There seems to be a lack of exploration of medical students' personal attitudes towards lesbians and gay men, and also a lack of knowledge on lesbian, gay or bisexual specific healthcare needs. This study highlights the importance of inclusive undergraduate curriculum development in order to foster quality healthcare.
Psychometric properties of the Polish version of the Multidimensional Fatigue Inventory-20 in cancer patients.

PubMed

Buss, Tomasz; Kruk, Agnieszka; Wiśniewski, Piotr; Modlinska, Aleksandra; Janiszewska, Justyna; Lichodziejewska-Niemierko, Monika

2014-10-01

Multidimensional questionnaires estimating cancer-related fatigue (CRF) as a symptom cluster or a clinical syndrome primarily have been used and validated in English-speaking populations. However, cultural issues and language peculiarities can affect CRF assessment. The main aims of this study were to evaluate the psychometric properties of the Polish version of the Multidimensional Fatigue Inventory-20 (MFI-20) and to deliver to clinicians a multidimensional tool for CRF assessment in Polish-speaking patients with cancer. After forward-backward translation procedures, the Polish version of MFI-20 was administered to 340 cancer patients. The Polish MFI-20 was appraised in terms of acceptability, reliability, and validity. Internal consistency was assessed by calculating Cronbach's alpha coefficients. Structural validity was evaluated with confirmatory factor analysis. The translated MFI-20 was well accepted; 90% of subjects fully completed the questionnaire. The overall Cronbach's alpha coefficient was 0.9, ranging from 0.57 to 0.81. All correlation coefficients among Numeric Rating Scale-fatigue, fatigue-related items from the European Organization for Research and Treatment of Cancer Quality of Life Core-30 questionnaire, and the MFI-20 were statistically significant (P < 0.001). Confirmatory factor analysis demonstrated good structural validity and revealed only three dimensions in the Polish version of the MFI-20: physical fatigue, mental fatigue, and reduced motivation. The Polish version of the MFI-20 is well accepted by patients, reliable, and a valid instrument to assess CRF in Polish cancer patients. Copyright © 2014 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
Another Look at the PART-O Using the Traumatic Brain Injury Model Systems National Database: Scoring to Optimize Psychometrics.

PubMed

Malec, James F; Whiteneck, Gale G; Bogner, Jennifer A

2016-02-01

To integrate previous approaches to scoring the Participation Assessment with Recombined Tools-Objective (PART-O) in a unidimensional scale. Retrospective analysis of PART-O data from the Traumatic Brain Injury Model Systems. Community. Data from individuals (N=469) selected randomly from participants who completed 1-year follow-up in the Traumatic Brain Injury Model Systems were used in Rasch model development. The model was subsequently tested on data from additional random samples of similar size at 1-, 2-, 5-, 10-, and >15-year follow-ups. Not applicable. PART-O. After combining items for productivity and social interaction, the initial analysis at 1-year follow-up indicated relatively good fit to the Rasch model (person reliability=.80) but also suggested item misfit and that the 0-to-5 scale used for most items did not consistently show clear separation between rating levels. Reducing item rating scales to 3 levels (except combined and dichotomous items) resolved these issues and demonstrated good item level discrimination, fit, and person reliability (.81), with no evidence of multidimensionality. These results replicated in analyses at each additional follow-up period. Modifications to item scoring for the PART-O resulted in a unidimensional parametric equivalent measure that addresses previous concerns about competing item relations, and it fit the Rasch model consistently across follow-up periods. The person-item map shows a progression toward greater community participation from solitary and dyadic activities, such as leaving the house and having a friend through social and productivity activities, to group activities with others who share interests or beliefs. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Effectiveness of Multidimensional Family Therapy with Higher Severity Substance-Abusing Adolescents: Report from Two Randomized Controlled Trials

ERIC Educational Resources Information Center

Henderson, Craig E.; Dakof, Gayle A.; Greenbaum, Paul E.; Liddle, Howard A.

2010-01-01

Objective: We used growth mixture modeling to examine heterogeneity in treatment response in a secondary analysis of 2 randomized controlled trials testing multidimensional family therapy (MDFT), an established evidence-based therapy for adolescent drug abuse and delinquency. Method: The first study compared 2 evidence-based adolescent substance…
Development and validation of a premature ejaculation diagnostic tool.

PubMed

Symonds, Tara; Perelman, Michael A; Althof, Stanley; Giuliano, François; Martin, Mona; May, Kathryn; Abraham, Lucy; Crossland, Anna; Morris, Mark

2007-08-01

Diagnosis of premature ejaculation (PE) for clinical trial purposes has typically relied on intravaginal ejaculation latency time (IELT) for entry, but this parameter does not capture the multidimensional nature of PE. Therefore, the aim was to develop a brief, multidimensional, psychometrically validated instrument for diagnosing PE status. The questionnaire development involved three stages: (1) Five focus groups and six individual interviews were conducted to develop the content; (2) psychometric validation using three different groups of men; and (3) generation of a scoring system. For psychometric validation/scoring system development, data was collected from (1) men with PE based on clinician diagnosis, using DSM-IV-TR, who also had IELTs ≤2 min (n=292); (2) men self-reporting PE (n=309); and (3) men self-reporting no-PE (n=701). Standard psychometric analyses were conducted to produce the final questionnaire. Sensitivity/specificity analysis was used to determine an appropriate scoring system. The qualitative research identified 9 items to capture the essence of DSM-IV-TR PE classification. The psychometric validation resulted in a 5-item, unidimensional measure, which captures the essence of DSM-IV-TR: control, frequency, minimal stimulation, distress, and interpersonal difficulty. Sensitivity/specificity analyses suggested a score of ≤8 indicated no-PE, 9 and 10 probable PE, and ≥11 PE. The development and validation of this new PE diagnostic tool has resulted in a new, user-friendly, and brief self-report questionnaire for use in clinical trials to diagnose PE.
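The cut-offs reported in this abstract amount to a three-way classification of the total score. A minimal sketch of that rule follows; the function name is hypothetical, and the abstract does not state the possible score range, so the function only checks that the total is an integer.

```python
def classify_total_score(total):
    """Map a questionnaire total score to the categories given in the abstract."""
    if not isinstance(total, int):
        raise TypeError("total score must be an integer")
    if total <= 8:
        return "no PE"
    if total <= 10:  # scores of 9 and 10
        return "probable PE"
    return "PE"  # scores of 11 and above


for score in (8, 9, 10, 11):
    print(score, classify_total_score(score))
```

The middle band is narrow (two score points), so a single item response can move a respondent across both boundaries.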
Latino Immigrant Family Socialization Scale: Development and Validation of a Multidimensional Ethnic-Racial Socialization Measurement.

PubMed

Ayón, Cecilia

2018-04-26

The study describes multiple steps taken to develop and test the Latino Immigrant Family Socialization (LIFS) scale. Scale items were developed based on qualitative interviews, and feedback on the items was solicited from content experts including an academic, a practitioner, and a group of promotoras (or lay health workers). The scale was completed by 300 Latino immigrant parents in the state of Arizona. Exploratory and confirmatory factor analysis confirmed a six-factor model. The six factors were cultural socialization, adapt, advocate, value diversity, promote mistrust, and educate about nativity and documentation. Follow-up studies are needed to continue the measurement validation process and assess how strategies are used in conjunction with each other, the application of the six strategies across different policy contexts, and how the ethnic-racial socialization process supports children's health and well-being.
The Effects of Q-Matrix Design on Classification Accuracy in the Log-Linear Cognitive Diagnosis Model.

PubMed

Madison, Matthew J; Bradshaw, Laine P

2015-06-01

Diagnostic classification models are psychometric models that aim to classify examinees according to their mastery or non-mastery of specified latent characteristics. These models are well-suited for providing diagnostic feedback on educational assessments because of their practical efficiency and increased reliability when compared with other multidimensional measurement models. A priori specifications of which latent characteristics or attributes are measured by each item are a core element of the diagnostic assessment design. This item-attribute alignment, expressed in a Q-matrix, precedes and supports any inference resulting from the application of the diagnostic classification model. This study investigates the effects of Q-matrix design on classification accuracy for the log-linear cognitive diagnosis model. Results indicate that classification accuracy, reliability, and convergence rates improve when the Q-matrix contains isolated information from each measured attribute.
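To make the role of the Q-matrix concrete, the sketch below pairs a small Q-matrix with a log-linear item response function of the kind used in the log-linear cognitive diagnosis model: the log-odds of a correct response are an intercept plus main effects and interactions, restricted to the attributes the Q-matrix assigns to the item. The Q-matrix, parameter values, and function name are hypothetical illustrations, not values from the study.

```python
import math
from itertools import combinations

# Rows are items, columns are attributes; 1 means the item measures the attribute.
# Items 0 and 1 each isolate one attribute, item 2 measures both.
Q_MATRIX = [
    [1, 0],
    [0, 1],
    [1, 1],
]


def lcdm_prob(q_row, alpha, intercept, effects):
    """P(correct) for one item given a 0/1 attribute mastery profile `alpha`.

    `effects` maps tuples of attribute indices to log-odds effects, e.g.
    (0,) is a main effect and (0, 1) is a two-way interaction.
    """
    measured = [k for k, q in enumerate(q_row) if q == 1]
    logit = intercept
    for size in range(1, len(measured) + 1):
        for subset in combinations(measured, size):
            # An effect applies only if every attribute in the subset is mastered.
            if all(alpha[k] == 1 for k in subset):
                logit += effects.get(subset, 0.0)
    return 1.0 / (1.0 + math.exp(-logit))


effects = {(0,): 1.0, (1,): 1.0, (0, 1): 1.0}
for profile in ([0, 0], [1, 0], [0, 1], [1, 1]):
    print(profile, round(lcdm_prob(Q_MATRIX[2], profile, -2.0, effects), 3))
```

Items such as rows 0 and 1, which measure a single attribute, respond to mastery of that attribute alone; this is the sense in which a Q-matrix can contain isolated information about each attribute.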
Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

PubMed

Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

2013-07-01

Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.
Assessing birth experience in fathers as an important aspect of clinical obstetrics: how applicable is Salmon's Item List for men?

PubMed

Gawlik, Stephanie; Müller, Mitho; Hoffmann, Lutz; Dienes, Aimée; Reck, Corinna

2015-01-01

Validated questionnaire assessment of fathers' experiences during childbirth is lacking in routine clinical practice. Salmon's Item List is a short, validated method used for the assessment of birth experience in mothers in both English- and German-speaking communities. With little to no validated data available for fathers, this pilot study aimed to assess the applicability of the German version of Salmon's Item List, including a multidimensional birth experience concept, in fathers. Longitudinal study. Data were collected by questionnaires. University hospital in Germany. The birth experiences of 102 fathers were assessed four to six weeks post partum using the German version of Salmon's Item List. Construct validity testing with exploratory factor analysis using principal component analysis with varimax rotation was performed to identify the dimensions of childbirth experiences. Internal consistency was also analysed. Factor analysis yielded a four-factor solution comprising 17 items that accounted for 54.5% of the variance. The main domain was 'fulfilment', and the secondary domains were 'emotional distress', 'physical discomfort' and 'emotional adaption'. For fulfilment, Cronbach's α met conventional reliability standards (0.87). Salmon's Item List is an appropriate instrument to assess birth experience in fathers in terms of fulfilment. Larger samples need to be examined in order to prove the stability of the factor structure before this can be extended to routine clinical assessment. A reduced version of Salmon's Item List may be useful as a screening tool for general assessment. Copyright © 2014 Elsevier Ltd. All rights reserved.
An NCME Instructional Module on Polytomous Item Response Theory Models

ERIC Educational Resources Information Center

Penfield, Randall David

2014-01-01

A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…
Factorial validity and reliability of the Malaysian simplified Chinese version of Multidimensional Scale of Perceived Social Support (MSPSS-SCV) among a group of university students.

PubMed

Guan, Ng Chong; Seng, Loh Huai; Hway Ann, Anne Yee; Hui, Koh Ong

2015-03-01

This study was aimed at validating the simplified Chinese version of the Multidimensional Scale of Perceived Social Support (MSPSS-SCV) among a group of medical and dental students in University Malaya. Two hundred and two students who took part in this study were given the MSPSS-SCV, the Medical Outcome Study social support survey, the Malay version of the Beck Depression Inventory, the Malay version of the General Health Questionnaire, and the English version of the MSPSS. After 1 week, these students were again required to complete the MSPSS-SCV but with the item sequences shuffled. This scale displayed excellent internal consistency (Cronbach's α = .924), high test-retest reliability (.71), parallel form reliability (.92; Spearman's ρ, P < .01), and validity. In conclusion, the MSPSS-SCV demonstrated sound psychometric properties in measuring social support among a group of medical and dental students. It could therefore be used as a simple screening tool among young educated Malaysian adolescents. © 2013 APJPH.
The Career and Work Adaptability Questionnaire (CWAQ): a first contribution to its validation.

PubMed

Nota, Laura; Ginevra, Maria Cristina; Soresi, Salvatore

2012-12-01

Over the last decade, the rapidly changing job market has begun to demand that people more actively construct their professional lives and acquire career adaptability. The aim of the present study was to develop a specific, new instrument, "Career and Work Adaptability", to assess the degree of adaptability in adolescents planning their futures. We conducted three studies, the first of which aimed to formulate the instrument's items and to verify its factor structure; the second study confirmed the instrument's multidimensional structure and evaluated its discriminant validity; the third study was conducted to verify the factorial structure's across-gender invariance and to evaluate its stability over time. Our results showed that the instrument is an effective and multidimensional instrument for accurately measuring career adaptability. Specifically, it can serve as a useful vocational guidance tool in analyzing adolescents' career adaptability. Copyright © 2012 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Different Characteristics of the Female Sexual Function Index in a Sample of Sexually Active and Inactive Women.

PubMed

Hevesi, Krisztina; Mészáros, Veronika; Kövi, Zsuzsanna; Márki, Gabriella; Szabó, Marianna

2017-09-01

The Female Sexual Function Index (FSFI) is a widely used measurement tool to assess female sexual function along the six dimensions of desire, arousal, lubrication, orgasm, satisfaction, and pain. However, the structure of the questionnaire is not clear, and several studies have found high correlations among the dimensions, indicating that a common underlying "sexual function" factor might be present. To investigate whether female sexual function is best understood as a multidimensional construct or, alternatively, whether a common underlying factor explains most of the variance in FSFI scores, and to investigate the possible effect of the common practice of including sexually inactive women in studies using the FSFI. The sample consisted of 508 women: 202 university students, 177 patients with endometriosis, and 129 patients with polycystic ovary syndrome. Participants completed the FSFI, and confirmatory factor analyses were used to test the underlying structure of this instrument in the total sample and in samples including sexually active women only. The FSFI is a multidimensional self-report questionnaire composed of 19 items. Strong positive correlations were found among five of the six original factors on the FSFI. Confirmatory factor analyses showed that in the total sample items loaded mainly on the general sexual function factor and very little variance was explained by the specific factors. However, when only sexually active women were included in the analyses, a clear factor structure emerged, with items loading on their six specific factors, and most of the variance in FSFI scores was explained by the specific factors, rather than the general factor. University students reported higher scores, indicating better functioning compared with the patient samples. The reliable and valid assessment of female sexual function can contribute to better understanding, prevention, and treatment of different sexual difficulties and dysfunctions. 
This study provides a rigorous statistical test of the structure of the FSFI and an explicit decision rule for categorizing sexually inactive women. Limitations include a lack of control over the circumstances of data collection. This study supports the use of the FSFI as a multidimensional measurement of female sexual function but highlights the need to establish clear decision rules for the inclusion or exclusion of sexually active and inactive respondents. Hevesi K, Mészáros V, Kövi Z, et al. Different Characteristics of the Female Sexual Function Index in a Sample of Sexually Active and Inactive Women. J Sex Med 2017;14:1133-1141. Copyright © 2017 International Society for Sexual Medicine. Published by Elsevier Inc. All rights reserved.
An empirical study of multidimensional fidelity of COMPASS consultation.

PubMed

Wong, Venus; Ruble, Lisa A; McGrew, John H; Yu, Yue

2018-06-01

Consultation is essential to the daily practice of school psychologists (National Association of School Psychologists, 2010). Successful consultation requires fidelity at both the consultant (implementation) and consultee (intervention) levels. We applied a multidimensional, multilevel conception of fidelity (Dunst, Trivette, & Raab, 2013) to a consultative intervention called the Collaborative Model for Promoting Competence and Success (COMPASS) for students with autism. The study provided 3 main findings. First, multidimensional, multilevel fidelity is a stable construct and increases over time with consultation support. Second, mediation analyses revealed that implementation-level fidelity components had distant, indirect effects on student Individualized Education Program (IEP) outcomes. Third, 3 fidelity components correlated with IEP outcomes: teacher coaching responsiveness at the implementation level, and teacher quality of delivery and student responsiveness at the intervention levels. Implications and future directions are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Inverse MDS: Inferring Dissimilarity Structure from Multiple Item Arrangements

PubMed Central

Kriegeskorte, Nikolaus; Mur, Marieke

2012-01-01

The pairwise dissimilarities of a set of items can be intuitively visualized by a 2D arrangement of the items, in which the distances reflect the dissimilarities. Such an arrangement can be obtained by multidimensional scaling (MDS). We propose a method for the inverse process: inferring the pairwise dissimilarities from multiple 2D arrangements of items. Perceptual dissimilarities are classically measured using pairwise dissimilarity judgments. However, alternative methods including free sorting and 2D arrangements have previously been proposed. The present proposal is novel (a) in that the dissimilarity matrix is estimated by “inverse MDS” based on multiple arrangements of item subsets, and (b) in that the subsets are designed by an adaptive algorithm that aims to provide optimal evidence for the dissimilarity estimates. The subject arranges the items (represented as icons on a computer screen) by means of mouse drag-and-drop operations. The multi-arrangement method can be construed as a generalization of simpler methods: It reduces to pairwise dissimilarity judgments if each arrangement contains only two items, and to free sorting if the items are categorically arranged into discrete piles. Multi-arrangement combines the advantages of these methods. It is efficient (because the subject communicates many dissimilarity judgments with each mouse drag), psychologically attractive (because dissimilarities are judged in context), and can characterize continuous high-dimensional dissimilarity structures. We present two procedures for estimating the dissimilarity matrix: a simple weighted-aligned-average of the partial dissimilarity matrices and a computationally intensive algorithm, which estimates the dissimilarity matrix by iteratively minimizing the error of MDS-predictions of the subject’s arrangements. The Matlab code for interactive arrangement and dissimilarity estimation is available from the authors upon request. PMID:22848204
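The idea of recovering dissimilarities from several 2D arrangements can be illustrated with a deliberately simplified sketch. It is not the authors' weighted-aligned-average or their iterative algorithm: it rescales each arrangement to unit root-mean-square distance (on the assumption that on-screen scale is arbitrary) and takes an unweighted average per item pair. All names and the demo coordinates are hypothetical.

```python
import math
from itertools import combinations


def arrangement_distances(positions):
    """Map each item pair to its Euclidean on-screen distance.

    positions: dict of item name -> (x, y).
    """
    return {
        frozenset((a, b)): math.dist(positions[a], positions[b])
        for a, b in combinations(sorted(positions), 2)
    }


def estimate_dissimilarities(arrangements):
    """Unweighted average of scale-normalised distances across arrangements."""
    sums, counts = {}, {}
    for positions in arrangements:
        dists = arrangement_distances(positions)
        if not dists:
            continue  # fewer than two items: no pairwise evidence
        rms = math.sqrt(sum(v * v for v in dists.values()) / len(dists))
        if rms == 0:
            continue  # all items stacked on one point
        for pair, v in dists.items():
            sums[pair] = sums.get(pair, 0.0) + v / rms
            counts[pair] = counts.get(pair, 0) + 1
    return {pair: sums[pair] / counts[pair] for pair in sums}


# Hypothetical trials: the same triangle at two zoom levels, plus a subset trial.
trial_1 = {"a": (0, 0), "b": (3, 0), "c": (0, 4)}
trial_2 = {"a": (0, 0), "b": (6, 0), "c": (0, 8)}
trial_3 = {"a": (1, 1), "b": (2, 1)}
estimate = estimate_dissimilarities([trial_1, trial_2])
```

A pair that appears in only one arrangement is estimated from that arrangement alone, which is one reason the published method weights and aligns the partial matrices rather than averaging them naively.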
Ramsay-Curve Item Response Theory for the Three-Parameter Logistic Item Response Model

ERIC Educational Resources Information Center

Woods, Carol M.

2008-01-01

In Ramsay-curve item response theory (RC-IRT), the latent variable distribution is estimated simultaneously with the item parameters of a unidimensional item response model using marginal maximum likelihood estimation. This study evaluates RC-IRT for the three-parameter logistic (3PL) model with comparisons to the normal model and to the empirical…
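For readers unfamiliar with the 3PL model named above, here is a minimal sketch of its item response function in the common logistic form, with discrimination a, difficulty b, and lower asymptote (guessing) c. The optional scaling constant 1.7 used in some texts is omitted, and the Ramsay-curve estimation of the latent distribution is not shown. The function name and parameter values are illustrative assumptions.

```python
import math


def irf_3pl(theta, a, b, c):
    """Probability of a correct response under the three-parameter logistic model."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: a = 1.2, b = 0.0, c = 0.2.
for theta in (-3.0, 0.0, 3.0):
    print(theta, round(irf_3pl(theta, 1.2, 0.0, 0.2), 3))
```

At theta = b the probability is halfway between c and 1, and as theta decreases it approaches c rather than 0.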

Using the Nominal Response Model to Evaluate Response Category Discrimination in the PROMIS Emotional Distress Item Pools

ERIC Educational Resources Information Center

Preston, Kathleen; Reise, Steven; Cai, Li; Hays, Ron D.

2011-01-01

The authors used a nominal response item response theory model to estimate category boundary discrimination (CBD) parameters for items drawn from the Emotional Distress item pools (Depression, Anxiety, and Anger) developed in the Patient-Reported Outcomes Measurement Information Systems (PROMIS) project. For polytomous items with ordered response…
Multidimensional Latent Markov Models in a Developmental Study of Inhibitory Control and Attentional Flexibility in Early Childhood

ERIC Educational Resources Information Center

Bartolucci, Francesco; Solis-Trapala, Ivonne L.

2010-01-01

We demonstrate the use of a multidimensional extension of the latent Markov model to analyse data from studies with repeated binary responses in developmental psychology. In particular, we consider an experiment based on a battery of tests which was administered to pre-school children, at three time periods, in order to measure their inhibitory…
Item Response Models for Examinee-Selected Items

ERIC Educational Resources Information Center

Wang, Wen-Chung; Jin, Kuan-Yu; Qiu, Xue-Lan; Wang, Lei

2012-01-01

In some tests, examinees are required to choose a fixed number of items from a set of given items to answer. This practice creates a challenge to standard item response models, because more capable examinees may have an advantage by making wiser choices. In this study, we developed a new class of item response models to account for the choice…
Detecting Differential Item Discrimination (DID) and the Consequences of Ignoring DID in Multilevel Item Response Models

ERIC Educational Resources Information Center

Lee, Woo-yeol; Cho, Sun-Joo

2017-01-01

Cross-level invariance in a multilevel item response model can be investigated by testing whether the within-level item discriminations are equal to the between-level item discriminations. Testing the cross-level invariance assumption is important to understand constructs in multilevel data. However, in most multilevel item response model…
An NCME Instructional Module on Latent DIF Analysis Using Mixture Item Response Models

ERIC Educational Resources Information Center

Cho, Sun-Joo; Suh, Youngsuk; Lee, Woo-yeol

2016-01-01

The purpose of this ITEMS module is to provide an introduction to differential item functioning (DIF) analysis using mixture item response models. The mixture item response models for DIF analysis involve comparing item profiles across latent groups, instead of manifest groups. First, an overview of DIF analysis based on latent groups, called…
Forced-Choice Assessment of Work-Related Maladaptive Personality Traits: Preliminary Evidence From an Application of Thurstonian Item Response Modeling.

PubMed

Guenole, Nigel; Brown, Anna A; Cooper, Andrew J

2018-06-01

This article describes an investigation of whether Thurstonian item response modeling is a viable method for assessment of maladaptive traits. Forced-choice responses from 420 working adults to a broad-range personality inventory assessing six maladaptive traits were considered. The Thurstonian item response model's fit to the forced-choice data was adequate, while the fit of a counterpart item response model to responses to the same items but arranged in a single-stimulus design was poor. Monotrait heteromethod correlations indicated corresponding traits in the two formats overlapped substantially, although they did not measure equivalent constructs. A better goodness of fit and higher factor loadings for the Thurstonian item response model, coupled with a clearer conceptual alignment to the theoretical trait definitions, suggested that the single-stimulus item responses were influenced by biases that the independent clusters measurement model did not account for. Researchers may wish to consider forced-choice designs and appropriate item response modeling techniques such as Thurstonian item response modeling for personality questionnaire applications in industrial psychology, especially when assessing maladaptive traits. We recommend further investigation of this approach in actual selection situations and with different assessment instruments.
Development of the Bullying and Health Experiences Scale

PubMed Central

2012-01-01

Background Until recently, researchers have studied forms of bullying separately. For 40 years, research has looked at the traditional forms of bullying, including physical (eg, hitting), verbal (eg, threats), and social (eg, exclusion). Attention focused on cyberbullying in the early 2000s. Although accumulating research suggests that bullying has multiple negative effects for children who are targeted, these effects excluded cyberbullying from the definition of bullying. Objective This paper responds to the need for a multidimensional measure of the impact of various forms of bullying. We used a comprehensive definition of bullying, which includes all of its forms, to identify children who had been targeted or who had participated in bullying. We then examined various ways in which they were impacted. Methods We used an online method to administer 37 impact items to 377 (277 female, 100 male) children and youth, to develop and test the Bullying and Health Experience Scale. Results A principal components analysis of the bullying impact items with varimax rotation resulted in 8 factors with eigenvalues greater than one, explaining 68.0% of the variance. These scales include risk, relationships, anger, physical injury, drug use, anxiety, self-esteem, and eating problems, which represent many of the cognitive, psychological, and behavioral consequences of bullying. The Cronbach alpha coefficients for the 8 scales range from .73 to .90, indicating good inter-item consistency. Comparisons between the groups showed that children involved in bullying had significantly higher negative outcomes on all scales than children not involved in bullying. Conclusions The high Cronbach alpha values indicate that the 8 impact scales provide reliable scores. In addition, comparisons between the groups indicate that the 8 scales provide accurate scores, with more negative outcomes reported by children involved in bullying compared to those who are not involved in bullying. 
This evidence of reliability and validity indicates that these scales are useful for research and clinical purposes to measure the multidimensional experiences of children who bully and are bullied. PMID:23612028
Development of the bullying and health experiences scale.

PubMed

Beran, Tanya; Stanton, Lauren; Hetherington, Ross; Mishna, Faye; Shariff, Shaheen

2012-11-09

Until recently, researchers have studied forms of bullying separately. For 40 years, research has looked at the traditional forms of bullying, including physical (eg, hitting), verbal (eg, threats), and social (eg, exclusion). Attention focused on cyberbullying in the early 2000s. Although accumulating research suggests that bullying has multiple negative effects for children who are targeted, these effects excluded cyberbullying from the definition of bullying. This paper responds to the need for a multidimensional measure of the impact of various forms of bullying. We used a comprehensive definition of bullying, which includes all of its forms, to identify children who had been targeted or who had participated in bullying. We then examined various ways in which they were impacted. We used an online method to administer 37 impact items to 377 (277 female, 100 male) children and youth, to develop and test the Bullying and Health Experience Scale. A principal components analysis of the bullying impact items with varimax rotation resulted in 8 factors with eigenvalues greater than one, explaining 68.0% of the variance. These scales include risk, relationships, anger, physical injury, drug use, anxiety, self-esteem, and eating problems, which represent many of the cognitive, psychological, and behavioral consequences of bullying. The Cronbach alpha coefficients for the 8 scales range from .73 to .90, indicating good inter-item consistency. Comparisons between the groups showed that children involved in bullying had significantly higher negative outcomes on all scales than children not involved in bullying. The high Cronbach alpha values indicate that the 8 impact scales provide reliable scores. In addition, comparisons between the groups indicate that the 8 scales provide accurate scores, with more negative outcomes reported by children involved in bullying compared to those who are not involved in bullying. 
This evidence of reliability and validity indicates that these scales are useful for research and clinical purposes to measure the multidimensional experiences of children who bully and are bullied.
Psychometric evaluation of the Arabic version of the multidimensional assessment of fatigue scale (MAF) for use in patients with ankylosing spondylitis.

PubMed

Bahouq, Hanane; Rostom, Samira; Bahiri, Rachid; Hakkou, Jinane; Aissaoui, Nawal; Hajjaj-Hassouni, Najia

2012-12-01

Fatigue is a frequent and often underestimated symptom of ankylosing spondylitis (AS), and its intensity needs to be measured properly with appropriate instruments, such as the multidimensional assessment of fatigue (MAF). The aims of this study were to translate the MAF questionnaire into classical Arabic and to validate its use for assessing fatigue in Moroccan patients with AS. The MAF contains 16 items with a global fatigue index (IGF). The MAF was translated and back-translated into Arabic, pretested and reviewed by a committee following the Guillemin criteria (J Clin Epidemiol 46:1417-1432, 1993). It was then validated on 110 Moroccan patients with AS. Reliability for the 3-day test-retest was assessed using internal consistency by Cronbach's alpha coefficient and the intra-class correlation coefficient (ICC). External construct validity was assessed by correlation with pain, disease activity and other key variables. The reproducibility of the 15 items was satisfactory, with a kappa statistic of agreement above 0.6. The ICC for IGF score reproducibility was good and reached 0.98 (95% CI, 0.96-0.99). The internal consistency was at 0.991 with Cronbach's alpha coefficient. The construct validity showed a positive correlation between MAF and the axial (r = 0.34) and peripheral (r = 0.32) visual analogue scale, the Bath ankylosing spondylitis disease activity index (BASDAI) (r = 0.77), the first item of BASDAI (r = 0.85), the functional disability by the Bath ankylosing spondylitis functional index (r = 0.64), the erythrocyte sedimentation rate (r = 0.43) and the C reactive protein (r = 0.30) (for all P < 0.001). There was no statistical correlation between MAF and the other variables. The Arabic version of the MAF has good comprehensibility, internal consistency, reliability and validity for the evaluation of Arabic speaking patients with AS.
Validation of the Modified Fatigue Impact Scale in mild to moderate traumatic brain injury.

PubMed

Schiehser, Dawn M; Delano-Wood, Lisa; Jak, Amy J; Matthews, Scott C; Simmons, Alan N; Jacobson, Mark W; Filoteo, J Vincent; Bondi, Mark W; Orff, Henry J; Liu, Lin

2015-01-01

To evaluate the validity of the Modified Fatigue Impact Scale (MFIS) in veterans with a history of mild to moderate traumatic brain injury (TBI). Veterans (N = 106) with mild (92%) or moderate (8%) TBI. Veterans Administration Health System. Factor structure, internal consistency, convergent validity, sensitivity, and specificity of the MFIS were examined. Principal component analysis identified 2 viable MFIS factors: a Cognitive subscale and a Physical/Activities subscale. Item analysis revealed high internal consistency of the MFIS Total scale and subscale items. Strong convergent validity of the MFIS scales was established with 2 Beck Depression Inventory II fatigue items. Receiver operating characteristic curve analysis revealed good to excellent accuracy of the MFIS in classifying fatigued versus nonfatigued individuals. The MFIS is a valid multidimensional measure that can be used to evaluate the impact of fatigue on cognitive and physical functioning in individuals with mild to moderate TBI. The psychometric properties of the MFIS make it useful for evaluating fatigue and provide the potential for improving research on fatigue in this population.
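The receiver operating characteristic analysis mentioned above can be illustrated with a small sketch that computes the area under the ROC curve as the probability that a randomly chosen fatigued case scores higher than a randomly chosen nonfatigued case, together with sensitivity and specificity at one cutoff. The scores, labels, and cutoff are made up for illustration and are not MFIS data.

```python
def roc_auc(scores, labels):
    """Area under the ROC curve via pairwise comparison (ties count one half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


def sensitivity_specificity(scores, labels, cutoff):
    """Classify score >= cutoff as positive and return (sensitivity, specificity)."""
    tp = sum(1 for s, y in zip(scores, labels) if y == 1 and s >= cutoff)
    fn = sum(1 for s, y in zip(scores, labels) if y == 1 and s < cutoff)
    tn = sum(1 for s, y in zip(scores, labels) if y == 0 and s < cutoff)
    fp = sum(1 for s, y in zip(scores, labels) if y == 0 and s >= cutoff)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical scale totals; label 1 = fatigued, 0 = nonfatigued.
demo_scores = [10, 20, 30, 40]
demo_labels = [0, 1, 0, 1]
print(roc_auc(demo_scores, demo_labels))
print(sensitivity_specificity(demo_scores, demo_labels, 25))
```

The pairwise form is quadratic in sample size, which is acceptable for samples of the size reported here; rank-based formulas give the same value more efficiently.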
Design and pilot results of a single blind randomized controlled trial of systematic demand-led home visits by nurses to frail elderly persons in primary care [ISRCTN05358495].

PubMed

van Hout, Hein P J; Nijpels, Giel; van Marwijk, Harm W J; Jansen, Aaltje P D; Van't Veer, Petronella J; Tybout, Willemijn; Stalman, Wim A B

2005-09-08

The objective of this article is to describe the design of an evaluation of the cost-effectiveness of systematic home visits by nurses to frail elderly primary care patients. Pilot objectives were: 1. To determine the feasibility of postal multidimensional frailty screening instruments; 2. To identify the need for home visits to elderly persons. Main study: The main study concerns a randomized controlled trial in primary care practices (PCP) with 18 months follow-up and blinded PCPs. Frail persons aged 75 years or older and living at home but neither terminally ill nor demented from 33 PCPs were eligible. Trained community nurses (1) visit patients at home and assess the care needs with the Resident Assessment Instrument-Home Care, a multidimensional computerized geriatric assessment instrument, enabling direct identification of problem areas; (2) determine the care priorities together with the patient; (3) design and execute interventions according to protocols; (4) and visit patients at least five times during a year in order to execute and monitor the care-plan. Controls receive usual care. Outcome measures are Quality of life, and Quality Adjusted Life Years; time to nursing home admission; mortality; hospital admissions; health care utilization. Pilot 1: Three brief postal multidimensional screening measures to identify frail health among elderly persons (selected after a literature search) were tested on percentage complete item response: 1) Vulnerable Elders Screen, 2) Strawbridge's frailty screen, and 3) COOP-WONCA charts. Pilot 2: Three nurses visited elderly frail patients as identified by PCPs in a health center of 5400 patients and used an assessment protocol to identify psychosocial and medical problems. The needs and experiences of all participants were gathered by semi-structured interviews. The design holds several unique elements such as early identification of frail persons combined with case-management by nurses. 
From two pilots we learned that of three potential postal frailty measures, the COOP-WONCA charts were completed best by elderly and that preventive home visits by nurses were positively evaluated to have potential for quality of care improvement.
A Quasi-Parametric Method for Fitting Flexible Item Response Functions

ERIC Educational Resources Information Center

Liang, Longjuan; Browne, Michael W.

2015-01-01

If standard two-parameter item response functions are employed in the analysis of a test with some newly constructed items, it can be expected that, for some items, the item response function (IRF) will not fit the data well. This lack of fit can also occur when standard IRFs are fitted to personality or psychopathology items. When investigating…
Qualitative Development of the PROMIS® Pediatric Stress Response Item Banks

PubMed Central

Gardner, William; Pajer, Kathleen; Riley, Anne W.; Forrest, Christopher B.

2013-01-01

Objective To describe the qualitative development of the Patient-Reported Outcome Measurement Information System (PROMIS®) Pediatric Stress Response item banks. Methods Stress response concepts were specified through a literature review and interviews with content experts, children, and parents. A library comprising 2,677 items derived from 71 instruments was developed. Items were classified into conceptual categories; new items were written and redundant items were removed. Items were then revised based on cognitive interviews (n = 39 children), readability analyses, and translatability reviews. Results 2 pediatric Stress Response sub-domains were identified: somatic experiences (43 items) and psychological experiences (64 items). Final item pools cover the full range of children’s stress experiences. Items are comprehensible among children aged ≥8 years and ready for translation. Conclusions Child- and parent-report versions of the item banks assess children’s somatic and psychological states when demands tax their adaptive capabilities. PMID:23124904
The Comprehensive Geriatric Assessment and the multidimensional approach. A new look at the older patient with gastroenterological disorders.

PubMed

Pilotto, Alberto; Addante, Filomena; D'Onofrio, Grazia; Sancarlo, Daniele; Ferrucci, Luigi

2009-01-01

The Comprehensive Geriatric Assessment (CGA) is a multidimensional, usually interdisciplinary, diagnostic process intended to determine an elderly person's medical, psychosocial, and functional capacity and problems with the objective of developing an overall plan for treatment and short- and long-term follow-up. The potential usefulness of the CGA in evaluating treatment and follow-up of older patients with gastroenterological disorders is unknown. In this paper we reported the efficacy of a Multidimensional-Prognostic Index (MPI), calculated from information collected by a standardized CGA, in predicting mortality risk in older patients hospitalized with upper gastrointestinal bleeding and liver cirrhosis. Patients underwent a CGA that included six standardized scales, i.e. Activities of Daily Living (ADL), Instrumental Activities of Daily Living (IADL), Short-Portable Mental Status Questionnaire (SPMSQ), Mini-Nutritional Assessment (MNA), Exton-Smith Score (ESS) and Comorbidity Index Rating Scale (CIRS), as well as information on medication history and cohabitation, for a total of 63 items. The MPI was calculated from the integrated total scores and expressed as MPI 1=low risk, MPI 2=moderate risk and MPI 3=severe risk of mortality. Higher MPI values were significantly associated with higher short- and long-term mortality in older patients with both upper gastrointestinal bleeding and liver cirrhosis. A close agreement was found between the estimated mortality by MPI and the observed mortality. Moreover, MPI seems to have a greater discriminatory power than organ-specific prognostic indices such as Rockall and Blatchford scores (in upper gastrointestinal bleeding patients) and Child-Pugh score (in liver cirrhosis patients). All these findings support the concept that a multidimensional approach may be appropriate for the evaluation of older patients with gastroenterological disorders, as has been reported for patients with other pathological conditions.
Practical methods for dealing with 'not applicable' item responses in the AMC Linear Disability Score project

PubMed Central

Holman, Rebecca; Glas, Cees AW; Lindeboom, Robert; Zwinderman, Aeilko H; de Haan, Rob J

2004-01-01

Background Whenever questionnaires are used to collect data on constructs, such as functional status or health related quality of life, it is unlikely that all respondents will respond to all items. This paper examines ways of dealing with responses in a 'not applicable' category to items included in the AMC Linear Disability Score (ALDS) project item bank. Methods The data examined in this paper come from the responses of 392 respondents to 32 items and form part of the calibration sample for the ALDS item bank. The data are analysed using the one-parameter logistic item response theory model. The four practical strategies for dealing with this type of response are: cold deck imputation; hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. Results The item and respondent population parameter estimates were very similar for the strategies involving hot deck imputation; treating the missing responses as if these items had never been offered to those individual patients; and using a model which takes account of the 'tendency to respond to items'. The estimates obtained using the cold deck imputation method were substantially different. Conclusions The cold deck imputation method was not considered suitable for use in the ALDS item bank. The other three methods described can be usefully implemented in the ALDS item bank, depending on the purpose of the data analysis to be carried out. These three methods may be useful for other data sets examining similar constructs, when item response theory based methods are used. PMID:15200681
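One of the strategies described above, treating 'not applicable' responses as if the item had never been offered, can be sketched under the one-parameter logistic model as a likelihood that simply skips those items. This sketch assumes dichotomous scoring (1 = can do, 0 = cannot), known item difficulties, and a crude grid search for ability; all names and values are hypothetical and this is not the ALDS project's software.

```python
import math


def rasch_prob(theta, b):
    """One-parameter logistic probability of a positive response."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def log_likelihood(theta, responses, difficulties):
    """Log-likelihood over observed items only; None marks 'not applicable'."""
    ll = 0.0
    for x, b in zip(responses, difficulties):
        if x is None:
            continue  # treated as if the item had never been offered
        p = rasch_prob(theta, b)
        ll += math.log(p) if x == 1 else math.log(1.0 - p)
    return ll


def estimate_theta(responses, difficulties, lo=-6.0, hi=6.0, steps=1200):
    """Grid-search maximum-likelihood ability estimate on [lo, hi]."""
    grid = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    return max(grid, key=lambda t: log_likelihood(t, responses, difficulties))


# Hypothetical respondent: the second item was marked 'not applicable'.
answers = [1, None, 0]
item_difficulties = [0.0, 2.0, 0.0]
print(estimate_theta(answers, item_difficulties))
```

Imputation strategies, by contrast, would replace each None with a borrowed value before the likelihood is evaluated, so the skipped item would then contribute to the estimate.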
Relationship between Item Responses of Negative Affect Items and the Distribution of the Sum of the Item Scores in the General Population

PubMed Central

Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka

2016-01-01

Background Several studies have shown that total depressive symptom scores in the general population approximate an exponential pattern, except for the lower end of the distribution. The Center for Epidemiologic Studies Depression Scale (CES-D) consists of 20 items, each of which may take on four scores: “rarely,” “some,” “occasionally,” and “most of the time.” Recently, we reported that the item responses for 16 negative affect items commonly exhibit exponential patterns, except for the level of “rarely,” leading us to hypothesize that the item responses at the level of “rarely” may be related to the non-exponential pattern typical of the lower end of the distribution. To verify this hypothesis, we investigated how the item responses contribute to the distribution of the sum of the item scores. Methods Data collected from 21,040 subjects who had completed the CES-D questionnaire as part of a Japanese national survey were analyzed. To assess the item responses of negative affect items, we used a parameter r, which denotes the ratio of “rarely” to “some” in each item response. The distributions of the sum of negative affect items in various combinations were analyzed using log-normal scales and curve fitting. Results The sum of the item scores approximated an exponential pattern regardless of the combination of items, whereas, at the lower end of the distributions, there was a clear divergence between the actual data and the predicted exponential pattern. At the lower end of the distributions, the sum of the item scores with high values of r exhibited higher scores compared to those predicted from the exponential pattern, whereas the sum of the item scores with low values of r exhibited lower scores compared to those predicted. Conclusions The distributional pattern of the sum of the item scores could be predicted from the item responses of such items. PMID:27806132
Gender Differences in Scientific Literacy of HKPISA 2006: A Multidimensional Differential Item Functioning and Multilevel Mediation Study

NASA Astrophysics Data System (ADS)

Wong, Kwan Yin

The aim of this study is to investigate the effect of gender differences of 15-year-old students on scientific literacy and their impacts on students’ motivation to pursue science education and careers (Future-oriented Science Motivation) in Hong Kong. The data for this study was collected from the Program for International Student Assessment in Hong Kong (HKPISA). It was carried out in 2006. A total of 4,645 students were randomly selected from 146 secondary schools including government, aided and private schools by two-stage stratified sampling method for the assessment. HKPISA 2006, like most of other large-scale international assessments, presents its assessment frameworks in multidimensional subscales. To fulfill the requirements of this multidimensional assessment framework, this study deployed new approaches to model and investigate gender differences in cognitive and affective latent traits of scientific literacy by using multidimensional differential item functioning (MDIF) and multilevel mediation (MLM). Compared with mean score difference t-test, MDIF improves the precision of each subscales measure at item level and the gender differences in science performance can be accurately estimated. In the light of Eccles et al (1983) Expectancy-value Model of Achievement-related Choices (Eccles’ Model), MLM examines the pattern of gender effects on Future-oriented Science Motivation mediated through cognitive and affective factors. As for MLM investigation, Single-Group Confirmatory Factor Analysis (Single-Group CFA) was used to confirm the applicability and validity of six affective factors which was, originally prepared by OECD. These six factors are Science Self-concept, Personal Value of Science, Interest in Science Learning, Enjoyment of Science Learning, Instrumental Motivation to Learn Science and Future-oriented Science Motivation. Then, Multiple Group CFA was used to verify measurement invariance of these factors across gender groups. 
The results of Single-Group CFA confirmed that five of the six affective factors, all except Interest in Science Learning, had strong psychometric properties in the context of Hong Kong. Multiple-Group CFA results also confirmed measurement invariance of these factors across gender groups. The findings of this study suggest that 15-year-old boys consistently outperformed girls in most of the cognitive dimensions except identifying scientific issues. Similarly, boys have higher affective learning outcomes than girls. The effect sizes of gender differences in affective learning outcomes are larger than those in cognitive outcomes. The MLM study reveals that gender effects on Future-oriented Science Motivation are mediated through affective factors including Science Self-concept, Enjoyment of Science Learning, Interest in Science Learning, Instrumental Motivation to Learn Science and Personal Value of Science. Girls are significantly affected by the negative impacts of these mediating factors, and thus so is their Future-oriented Science Motivation. The MLM results were consistent with the predictions of Eccles’ Model. Overall, the CFA and MLM results provide strong support for the cross-cultural validity of Eccles’ Model. In light of these findings, recommendations to reduce the gender differences in science achievement and Future-oriented Science Motivation are made for science education participants, teachers, parents, curriculum leaders, examination bodies and policy makers.
Stochastic Approximation Methods for Latent Regression Item Response Models

ERIC Educational Resources Information Center

von Davier, Matthias; Sinharay, Sandip

2010-01-01

This article presents an application of a stochastic approximation expectation maximization (EM) algorithm using a Metropolis-Hastings (MH) sampler to estimate the parameters of an item response latent regression model. Latent regression item response models are extensions of item response theory (IRT) to a latent variable model with covariates…
Factor Structure and Item Level Psychometrics of the Social Problem Solving Inventory Revised-Short Form in Traumatic Brain Injury

PubMed Central

Li, Chih-Ying; Waid-Ebbs, Julia; Velozo, Craig A.; Heaton, Shelley C.

2016-01-01

Primary Objective: Social problem solving deficits characterize individuals with traumatic brain injury (TBI). Poor social problem solving interferes with daily functioning and productive lifestyles. Therefore, it is of vital importance to use the appropriate instrument to identify deficits in social problem solving for individuals with TBI. This study investigates factor structure and item-level psychometrics of the Social Problem Solving Inventory-Revised Short Form (SPSI-R:S), for adults with moderate and severe TBI. Research Design: Secondary analysis of 90 adults with moderate and severe TBI who completed the SPSI-R:S. Methods and Procedures: An exploratory factor analysis (EFA), principal components analysis (PCA) and Rasch analysis examined the factor structure and item-level psychometrics of the SPSI-R:S. Main Outcomes and Results: The EFA showed three dominant factors, with positively worded items represented as the most definite factor. The other two factors are negative problem solving orientation and skills; and negative problem solving emotion. Rasch analyses confirmed the three factors are each unidimensional constructs. Conclusions: The total score interpretability of the SPSI-R:S may be challenging due to the multidimensional structure of the total measure. Instead, we propose using three separate SPSI-R:S subscores to measure social problem solving for the TBI population. PMID:26052731
Lay beliefs about the causes and cures of schizophrenia.

PubMed

Park, Subin; Lee, Minji; Furnham, Adrian; Jeon, Mina; Ko, Young-Mi

2017-09-01

Lay beliefs about schizophrenia are an important factor associated with treatment-seeking behavior. This study was conducted to investigate the lay beliefs about the causes and treatments of schizophrenia in South Korea. A total of 654 adults (mean age, 35.96 ± 11.33 years) completed two questionnaires assessing their views on the causes and cures of schizophrenia. The factor structures of lay beliefs about the causes and treatments of schizophrenia were then analyzed and the correlations between the resultant factors investigated. From the cause items, four factors were extracted: Health/Lifestyle, God/Fate, Social/Environmental and Biological. Four factors were also extracted from the treatment items: Self-Help/Stress Management, Physical Treatment/Health Management, Religious Help and Mental Health Service Utilization. Notably, most participants believed that items in the Social/Environmental and Biological factors were the causes of schizophrenia, while they believed that items in the Mental Health Service Utilization and Self-Help/Stress Management factors were the treatments. Participants' beliefs about the causes and treatments of schizophrenia were systematically correlated. Overall, laypeople have reasonably accurate beliefs and a multidimensional view of the causes and treatments of schizophrenia. Nevertheless, our results suggest that public education about the etiology and treatment of schizophrenia is necessary to increase actual usage of mental health services and treatments for schizophrenia.
