Sample records for common item equating

  1. Robust Scale Transformation Methods in IRT True Score Equating under Common-Item Nonequivalent Groups Design

    ERIC Educational Resources Information Center

    He, Yong

    2013-01-01

    Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…

  2. Using a Linear Regression Method to Detect Outliers in IRT Common Item Equating

    ERIC Educational Resources Information Center

    He, Yong; Cui, Zhongmin; Fang, Yu; Chen, Hanwei

    2013-01-01

    Common test items play an important role in equating alternate test forms under the common item nonequivalent groups design. When the item response theory (IRT) method is applied in equating, inconsistent item parameter estimates among common items can lead to large bias in equated scores. It is prudent to evaluate inconsistency in parameter…

  3. Selection of Common Items as an Unrecognized Source of Variability in Test Equating: A Bootstrap Approximation Assuming Random Sampling of Common Items

    ERIC Educational Resources Information Center

    Michaelides, Michalis P.; Haertel, Edward H.

    2014-01-01

    The standard error of equating quantifies the variability in the estimation of an equating function. Because common items for deriving equated scores are treated as fixed, the only source of variability typically considered arises from the estimation of common-item parameters from responses of samples of examinees. Use of alternative, equally…

  4. The Impact of Test Dimensionality, Common-Item Set Format, and Scale Linking Methods on Mixed-Format Test Equating

    ERIC Educational Resources Information Center

    Öztürk-Gübes, Nese; Kelecioglu, Hülya

    2016-01-01

    The purpose of this study was to examine the impact of dimensionality, common-item set format, and different scale linking methods on preserving equity property with mixed-format test equating. Item response theory (IRT) true-score equating (TSE) and IRT observed-score equating (OSE) methods were used under common-item nonequivalent groups design.…

  5. Investigation of IRT-Based Equating Methods in the Presence of Outlier Common Items

    ERIC Educational Resources Information Center

    Hu, Huiqin; Rogers, W. Todd; Vukmirovic, Zarko

    2008-01-01

    Common items with inconsistent b-parameter estimates may have a serious impact on item response theory (IRT)--based equating results. To find a better way to deal with the outlier common items with inconsistent b-parameters, the current study investigated the comparability of 10 variations of four IRT-based equating methods (i.e., concurrent…

  6. Combined Common Person and Common Item Equating of Medical Science Examinations.

    ERIC Educational Resources Information Center

    Kelley, Paul R.

    This equating study of the National Board of Medical Examiners Examinations was a combined common persons and common items equating, using the Rasch model. The 1,000-item test was administered to about 3,000 second-year medical students in seven equal-length subtests: anatomy, physiology, biochemistry, pathology, microbiology, pharmacology, and…

  7. A Review of the Effects on IRT Item Parameter Estimates with a Focus on Misbehaving Common Items in Test Equating

    PubMed Central

    Michaelides, Michalis P.

    2010-01-01

    Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items. PMID:21833230

  8. A Review of the Effects on IRT Item Parameter Estimates with a Focus on Misbehaving Common Items in Test Equating.

    PubMed

    Michaelides, Michalis P

    2010-01-01

    Many studies have investigated the topic of change or drift in item parameter estimates in the context of item response theory (IRT). Content effects, such as instructional variation and curricular emphasis, as well as context effects, such as the wording, position, or exposure of an item have been found to impact item parameter estimates. The issue becomes more critical when items with estimates exhibiting differential behavior across test administrations are used as common for deriving equating transformations. This paper reviews the types of effects on IRT item parameter estimates and focuses on the impact of misbehaving or aberrant common items on equating transformations. Implications relating to test validity and the judgmental nature of the decision to keep or discard aberrant common items are discussed, with recommendations for future research into more informed and formal ways of dealing with misbehaving common items.

  9. Examining the Impact of Drifted Polytomous Anchor Items on Test Characteristic Curve (TCC) Linking and IRT True Score Equating. Research Report. ETS RR-12-09

    ERIC Educational Resources Information Center

    Li, Yanmei

    2012-01-01

    In a common-item (anchor) equating design, the common items should be evaluated for item parameter drift. Drifted items are often removed. For a test that contains mostly dichotomous items and only a small number of polytomous items, removing some drifted polytomous anchor items may result in anchor sets that no longer resemble mini-versions of…

  10. Sensitivity of Equated Aggregate Scores to the Treatment of Misbehaving Common Items

    ERIC Educational Resources Information Center

    Michaelides, Michalis P.

    2010-01-01

    The delta-plot method (Angoff, 1972) is a graphical technique used in the context of test equating for identifying common items with aberrant changes in their item difficulties across administrations or alternate forms. This brief research report explores the effects on equated aggregate scores when delta-plot outliers are either retained in or…

  11. An Evaluation of the Single-Group Growth Model as an Alternative to Common-Item Equating. Research Report. ETS RR-16-01

    ERIC Educational Resources Information Center

    Wei, Youhua; Morgan, Rick

    2016-01-01

    As an alternative to common-item equating when common items do not function as expected, the single-group growth model (SGGM) scaling uses common examinees or repeaters to link test scores on different forms. The SGGM scaling assumes that, for repeaters taking adjacent administrations, the conditional distribution of scale scores in later…

  12. The Importance of Content Representation for Common-Item Equating with Nonrandom Groups.

    ERIC Educational Resources Information Center

    Klein, Lawrence W.; Jarjoura, David

    1985-01-01

    The test equating accuracy of content-representative anchors (subsets of items in common) versus nonrepresentative, but substantially longer, anchors was compared for a professional certification examination. Through a chain of equatings, it was found that content representation in anchors was critical. (Author/GDC)

  13. A Comparison of Methods of Vertical Equating.

    ERIC Educational Resources Information Center

    Loyd, Brenda H.; Hoover, H. D.

    Rasch model vertical equating procedures were applied to three mathematics computation tests for grades six, seven, and eight. Each level of the test was composed of 45 items in three sets of 15 items, arranged in such a way that tests for adjacent grades had two sets (30 items) in common, and the sixth and eighth grades had 15 items in common. In…

  14. Evaluating Common Item Block Options When Faced with Practical Constraints

    ERIC Educational Resources Information Center

    Wolkowitz, Amanda; Davis-Becker, Susan

    2015-01-01

    This study evaluates the impact of common item characteristics on the outcome of equating in credentialing examinations when traditionally recommended representation is not possible. This research used real data sets from several credentialing exams to test the impact of content representation, item statistics, and number of common items on…

  15. Maintaining Equivalent Cut Scores for Small Sample Test Forms

    ERIC Educational Resources Information Center

    Dwyer, Andrew C.

    2016-01-01

    This study examines the effectiveness of three approaches for maintaining equivalent performance standards across test forms with small samples: (1) common-item equating, (2) resetting the standard, and (3) rescaling the standard. Rescaling the standard (i.e., applying common-item equating methodology to standard setting ratings to account for…

  16. Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

    ERIC Educational Resources Information Center

    Arce-Ferrer, Alvaro J.; Bulut, Okan

    2017-01-01

    This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

  17. The Examination of the Classification of Students into Performance Categories by Two Different Equating Methods

    ERIC Educational Resources Information Center

    Keller, Lisa A.; Keller, Robert R.; Parker, Pauline A.

    2011-01-01

    This study investigates the comparability of two item response theory based equating methods: true score equating (TSE), and estimated true equating (ETE). Additionally, six scaling methods were implemented within each equating method: mean-sigma, mean-mean, two versions of fixed common item parameter, Stocking and Lord, and Haebara. Empirical…

  18. Mixed-Format Test Score Equating: Effect of Item-Type Multidimensionality, Length and Composition of Common-Item Set, and Group Ability Difference

    ERIC Educational Resources Information Center

    Wang, Wei

    2013-01-01

    Mixed-format tests containing both multiple-choice (MC) items and constructed-response (CR) items are now widely used in many testing programs. Mixed-format tests often are considered to be superior to tests containing only MC items although the use of multiple item formats leads to measurement challenges in the context of equating conducted under…

  19. Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

    ERIC Educational Resources Information Center

    Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong

    2010-01-01

    The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

  20. Effects of Misbehaving Common Items on Aggregate Scores and an Application of the Mantel-Haenszel Statistic in Test Equating. CSE Report 688

    ERIC Educational Resources Information Center

    Michaelides, Michalis P.

    2006-01-01

    Consistent behavior is a desirable characteristic that common items are expected to have when administered to different groups. Findings from the literature have established that items do not always behave in consistent ways; item indices and IRT item parameter estimates of the same items differ when obtained from different administrations.…

  1. A Comparison of Linking and Concurrent Calibration under the Graded Response Model.

    ERIC Educational Resources Information Center

    Kim, Seock-Ho; Cohen, Allan S.

    Applications of item response theory to practical testing problems including equating, differential item functioning, and computerized adaptive testing, require that item parameter estimates be placed onto a common metric. In this study, two methods for developing a common metric for the graded response model under item response theory were…

  2. Reliability of Summed Item Scores Using Structural Equation Modeling: An Alternative to Coefficient Alpha

    ERIC Educational Resources Information Center

    Green, Samuel B.; Yang, Yanyun

    2009-01-01

    A method is presented for estimating reliability using structural equation modeling (SEM) that allows for nonlinearity between factors and item scores. Assuming the focus is on consistency of summed item scores, this method for estimating reliability is preferred to those based on linear SEM models and to the most commonly reported estimate of…

  3. Sample Invariance of the Structural Equation Model and the Item Response Model: A Case Study.

    ERIC Educational Resources Information Center

    Breithaupt, Krista; Zumbo, Bruno D.

    2002-01-01

    Evaluated the sample invariance of item discrimination statistics in a case study using real data, responses of 10 random samples of 500 people to a depression scale. Results lend some support to the hypothesized superiority of a two-parameter item response model over the common form of structural equation modeling, at least when responses are…

  4. The Effect of Error in Item Parameter Estimates on the Test Response Function Method of Linking.

    ERIC Educational Resources Information Center

    Kaskowitz, Gary S.; De Ayala, R. J.

    2001-01-01

    Studied the effect of item parameter estimation for computation of linking coefficients for the test response function (TRF) linking/equating method. Simulation results showed that linking was more accurate when there was less error in the parameter estimates, and that 15 or 25 common items provided better results than 5 common items under both…

  5. A Comparison of Four Linear Equating Methods for the Common-Item Nonequivalent Groups Design Using Simulation Methods. ACT Research Report Series, 2013 (2)

    ERIC Educational Resources Information Center

    Topczewski, Anna; Cui, Zhongmin; Woodruff, David; Chen, Hanwei; Fang, Yu

    2013-01-01

    This paper investigates four methods of linear equating under the common item nonequivalent groups design. Three of the methods are well known: Tucker, Angoff-Levine, and Congeneric-Levine. A fourth method is presented as a variant of the Congeneric-Levine method. Using simulation data generated from the three-parameter logistic IRT model we…

  6. Structural Zeros and Their Implications with Log-Linear Bivariate Presmoothing under the Internal-Anchor Design

    ERIC Educational Resources Information Center

    Kim, Hyung Jin; Brennan, Robert L.; Lee, Won-Chan

    2017-01-01

    In equating, when common items are internal and scoring is conducted in terms of the number of correct items, some pairs of total scores ("X") and common-item scores ("V") can never be observed in a bivariate distribution of "X" and "V"; these pairs are called "structural zeros." This simulation…

  7. The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

    ERIC Educational Resources Information Center

    Meyers, Jason L.; Murphy, Stephen; Goodman, Joshua; Turhan, Ahmet

    2012-01-01

    Operational testing programs employing item response theory (IRT) applications benefit from the property of item parameter invariance whereby item parameter estimates obtained from one sample can be applied to other samples (when the underlying assumptions are satisfied). In theory, this feature allows for applications such as computer-adaptive…

  8. Kernel Equating Under the Non-Equivalent Groups With Covariates Design

    PubMed Central

    Bränberg, Kenny

    2015-01-01

    When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests. PMID:29881012

  9. Kernel Equating Under the Non-Equivalent Groups With Covariates Design.

    PubMed

    Wiberg, Marie; Bränberg, Kenny

    2015-07-01

    When equating two tests, the traditional approach is to use common test takers and/or common items. Here, the idea is to use variables correlated with the test scores (e.g., school grades and other test scores) as a substitute for common items in a non-equivalent groups with covariates (NEC) design. This is performed in the framework of kernel equating and with an extension of the method developed for post-stratification equating in the non-equivalent groups with anchor test design. Real data from a college admissions test were used to illustrate the use of the design. The equated scores from the NEC design were compared with equated scores from the equivalent group (EG) design, that is, equating with no covariates as well as with equated scores when a constructed anchor test was used. The results indicate that the NEC design can produce lower standard errors compared with an EG design. When covariates were used together with an anchor test, the smallest standard errors were obtained over a large range of test scores. The results obtained, that an EG design equating can be improved by adjusting for differences in test score distributions caused by differences in the distribution of covariates, are useful in practice because not all standardized tests have anchor tests.

  10. Equating TIMSS Mathematics Subtests with Nonlinear Equating Methods Using NEAT Design: Circle-Arc Equating Approaches

    ERIC Educational Resources Information Center

    Ozdemir, Burhanettin

    2017-01-01

    The purpose of this study is to equate Trends in International Mathematics and Science Study (TIMSS) mathematics subtest scores obtained from TIMSS 2011 to scores obtained from TIMSS 2007 form with different nonlinear observed score equating methods under Non-Equivalent Anchor Test (NEAT) design where common items are used to link two or more test…

  11. High Agreement was Obtained Across Scores from Multiple Equated Scales for Social Anxiety Disorder using Item Response Theory.

    PubMed

    Sunderland, Matthew; Batterham, Philip; Calear, Alison; Carragher, Natacha; Baillie, Andrew; Slade, Tim

    2018-04-10

    There is no standardized approach to the measurement of social anxiety. Researchers and clinicians are faced with numerous self-report scales with varying strengths, weaknesses, and psychometric properties. The lack of standardization makes it difficult to compare scores across populations that utilise different scales. Item response theory offers one solution to this problem via equating different scales using an anchor scale to set a standardized metric. This study is the first to equate several scales for social anxiety disorder. Data from two samples (n=3,175 and n=1,052), recruited from the Australian community using online advertisements, were utilised to equate a network of 11 self-report social anxiety scales via a fixed parameter item calibration method. Comparisons between actual and equated scores for most of the scales indicated a high level of agreement with mean differences <0.10 (equivalent to a mean difference of less than one point on the standardized metric). This study demonstrates that scores from multiple scales that measure social anxiety can be converted to a common scale. Re-scoring observed scores to a common scale provides opportunities to combine research from multiple studies and ultimately better assess social anxiety in treatment and research settings.

  12. Impact of Eliminating Anchor Items Flagged from Statistical Criteria on Test Score Classifications in Common Item Equating

    ERIC Educational Resources Information Center

    Karkee, Thakur; Choi, Seung

    2005-01-01

    Proper maintenance of a scale established in the baseline year would assure the accurate estimation of growth in subsequent years. Scale maintenance is especially important when the state performance standards must be preserved for future administrations. To ensure proper maintenance of a scale, the selection of anchor items and evaluation of…

  13. Impact of Accumulated Error on Item Response Theory Pre-Equating with Mixed Format Tests

    ERIC Educational Resources Information Center

    Keller, Lisa A.; Keller, Robert; Cook, Robert J.; Colvin, Kimberly F.

    2016-01-01

    The equating of tests is an essential process in high-stakes, large-scale testing conducted over multiple forms or administrations. By adjusting for differences in difficulty and placing scores from different administrations of a test on a common scale, equating allows scores from these different forms and administrations to be directly compared…

  14. The Effect of Anchor Test Construction on Scale Drift

    ERIC Educational Resources Information Center

    Antal, Judit; Proctor, Thomas P.; Melican, Gerald J.

    2014-01-01

    In common-item equating the anchor block is generally built to represent a miniature form of the total test in terms of content and statistical specifications. The statistical properties frequently reflect equal mean and spread of item difficulty. Sinharay and Holland (2007) suggested that the requirement for equal spread of difficulty may be too…

  15. Least Squares Method for Equating Logistic Ability Scales: A General Approach and Evaluation. Iowa Testing Programs Occasional Papers, Number 30.

    ERIC Educational Resources Information Center

    Haebara, Tomokazu

    When several ability scales in item response models are separately derived from different test forms administered to different samples of examinees, these scales must be equated to a common scale because their units and origins are arbitrarily determined and generally different from scale to scale. A general method for equating logistic ability…

  16. Local Equating Using the Rasch Model, the OPLM, and the 2PL IRT Model--or--What Is It Anyway if the Model Captures Everything There Is to Know about the Test Takers?

    ERIC Educational Resources Information Center

    von Davier, Matthias; González B., Jorge; von Davier, Alina A.

    2013-01-01

    Local equating (LE) is based on Lord's criterion of equity. It defines a family of true transformations that aim at the ideal of equitable equating. van der Linden (this issue) offers a detailed discussion of common issues in observed-score equating relative to this local approach. By assuming an underlying item response theory model, one of…

  17. Cognitive Abilities Explain Wording Effects in the Rosenberg Self-Esteem Scale.

    PubMed

    Gnambs, Timo; Schroeders, Ulrich

    2017-12-01

    There is consensus that the 10 items of the Rosenberg Self-Esteem Scale (RSES) reflect wording effects resulting from positively and negatively keyed items. The present study examined the effects of cognitive abilities on the factor structure of the RSES with a novel, nonparametric latent variable technique called local structural equation models. In a nationally representative German large-scale assessment including 12,437 students, competing measurement models for the RSES were compared: a bifactor model with a common factor and a specific factor for all negatively worded items had an optimal fit. Local structural equation models showed that the unidimensionality of the scale increased with higher levels of reading competence and reasoning, while the proportion of variance attributed to the negatively keyed items declined. Wording effects on the factor structure of the RSES seem to represent a response style artifact associated with cognitive abilities.

  18. Point model equations for neutron correlation counting: Extension of Böhnel's equations to any order

    DOE PAGES

    Favalli, Andrea; Croft, Stephen; Santi, Peter

    2015-06-15

    Various methods of autocorrelation neutron analysis may be used to extract information about a measurement item containing spontaneously fissioning material. The two predominant approaches are the time correlation analysis methods (which make use of a coincidence gate) of multiplicity shift register logic and Feynman sampling. The common feature is that the correlated nature of the pulse train can be described by a vector of reduced factorial multiplet rates. We call these singlets, doublets, triplets, etc. Within the point reactor model the multiplet rates may be related to the properties of the item, the parameters of the detector, and basic nuclear data constants by a series of coupled algebraic equations, the so-called point model equations. Solving, or inverting, the point model equations using experimental calibration model parameters is how assays of unknown items are performed. Currently only the first three multiplets are routinely used. In this work we develop the point model equations to higher order multiplets using the probability generating functions approach combined with the general derivative chain rule, the so-called Faà di Bruno formula. Explicit expressions up to 5th order are provided, as well as the general iterative formula to calculate any order. This study represents the first necessary step towards determining if higher order multiplets can add value to nondestructive measurement practice for nuclear materials control and accountancy.

  19. Assessing First- and Second-Order Equity for the Common-Item Nonequivalent Groups Design Using Multidimensional IRT

    ERIC Educational Resources Information Center

    Andrews, Benjamin James

    2011-01-01

    The equity properties can be used to assess the quality of an equating. The degree to which expected scores conditional on ability are similar between test forms is referred to as first-order equity. Second-order equity is the degree to which conditional standard errors of measurement are similar between test forms after equating. The purpose of…

  20. Item Response Theory Equating Using Bayesian Informative Priors.

    ERIC Educational Resources Information Center

    de la Torre, Jimmy; Patz, Richard J.

    This paper seeks to extend the application of Markov chain Monte Carlo (MCMC) methods in item response theory (IRT) to include the estimation of equating relationships along with the estimation of test item parameters. A method is proposed that incorporates estimation of the equating relationship in the item calibration phase. Item parameters from…

  1. Enhancing the Equating of Item Difficulty Metrics: Estimation of Reference Distribution. Research Report. ETS RR-14-07

    ERIC Educational Resources Information Center

    Ali, Usama S.; Walker, Michael E.

    2014-01-01

    Two methods are currently in use at Educational Testing Service (ETS) for equating observed item difficulty statistics. The first method involves the linear equating of item statistics in an observed sample to reference statistics on the same items. The second method, or the item response curve (IRC) method, involves the summation of conditional…

  2. Effect of Differential Item Functioning on Test Equating

    ERIC Educational Resources Information Center

    Kabasakal, Kübra Atalay; Kelecioglu, Hülya

    2015-01-01

    This study examines the effect of differential item functioning (DIF) items on test equating through multilevel item response models (MIRMs) and traditional IRMs. The performances of three different equating models were investigated under 24 different simulation conditions, and the variables whose effects were examined included sample size, test…

  3. Use of Jackknifing to Evaluate Effects of Anchor Item Selection on Equating with the Nonequivalent Groups with Anchor Test (NEAT) Design. Research Report. ETS RR-15-10

    ERIC Educational Resources Information Center

    Lu, Ru; Haberman, Shelby; Guo, Hongwen; Liu, Jinghua

    2015-01-01

    In this study, we apply jackknifing to anchor items to evaluate the impact of anchor selection on equating stability. In an ideal world, the choice of anchor items should have little impact on equating results. When this ideal does not correspond to reality, selection of anchor items can strongly influence equating results. This influence does not…

  4. Test Score Equating Using Discrete Anchor Items versus Passage-Based Anchor Items: A Case Study Using "SAT"® Data. Research Report. ETS RR-14-14

    ERIC Educational Resources Information Center

    Liu, Jinghua; Zu, Jiyun; Curley, Edward; Carey, Jill

    2014-01-01

    The purpose of this study is to investigate the impact of discrete anchor items versus passage-based anchor items on observed score equating using empirical data. This study compares an "SAT"® critical reading anchor that contains more discrete items proportionally, compared to the total tests to be equated, to another anchor that…

  5. An analysis of the DuPage County Regional Office of Education physics exam

    NASA Astrophysics Data System (ADS)

    Muehsler, Hans

    In 2009, the DuPage County Regional Office of Education (ROE) tasked volunteer physics teachers with creating a basic skills physics exam reflecting what the participants valued and shared in common across curricula. Mechanics, electricity & magnetism (E&M), and wave phenomena emerged as the primary constructs. The resulting exam was intended for first-exposure physics students. The most recently completed version was psychometrically assessed for unidimensionality within the constructs using a robust WLS structural equation model and for reliability. An item analysis using a 3-PL IRT model was performed on the mechanics items and a 2-PL IRT model was performed on the E&M and waves items; a distractor analysis was also performed on all items. Lastly, differential item functioning (DIF) and differential test functioning (DTF) analyses, using the Mantel-Haenszel procedure, were performed using gender, ethnicity, year in school, ELL, physics level, and math level as groupings.

  6. Asymptotic Standard Errors for Item Response Theory True Score Equating of Polytomous Items

    ERIC Educational Resources Information Center

    Cher Wong, Cheow

    2015-01-01

    Building on previous works by Lord and Ogasawara for dichotomous items, this article proposes an approach to derive the asymptotic standard errors of item response theory true score equating involving polytomous items, for equivalent and nonequivalent groups of examinees. This analytical approach could be used in place of empirical methods like…

  7. The computation of equating errors in international surveys in education.

    PubMed

    Monseur, Christian; Berezner, Alla

    2007-01-01

    Since the IEA's Third International Mathematics and Science Study, one of the major objectives of international surveys in education has been to report trends in achievement. The names of the two current IEA surveys reflect this growing interest: Trends in International Mathematics and Science Study (TIMSS) and Progress in International Reading Literacy Study (PIRLS). Similarly a central concern of the OECD's PISA is with trends in outcomes over time. To facilitate trend analyses these studies link their tests using common item equating in conjunction with item response modelling methods. IEA and PISA policies differ in terms of reporting the error associated with trends. In IEA surveys, the standard errors of the trend estimates do not include the uncertainty associated with the linking step while PISA does include a linking error component in the standard errors of trend estimates. In other words, PISA implicitly acknowledges that trend estimates partly depend on the selected common items, while the IEA's surveys do not recognise this source of error. Failing to recognise the linking error leads to an underestimation of the standard errors and thus increases the Type I error rate, thereby resulting in reporting of significant changes in achievement when in fact these are not significant. The growing interest of policy makers in trend indicators and the impact of the evaluation of educational reforms appear to be incompatible with such underestimation. However, the procedure implemented by PISA raises a few issues about the underlying assumptions for the computation of the equating error. After a brief introduction, this paper will describe the procedure PISA implemented to compute the linking error. The underlying assumptions of this procedure will then be discussed. Finally an alternative method based on replication techniques will be presented, based on a simulation study and then applied to the PISA 2000 data.

  8. Using Multigroup Confirmatory Factor Analysis to Test Measurement Invariance in Raters: A Clinical Skills Examination Application

    ERIC Educational Resources Information Center

    Kahraman, Nilufer; Brown, Crystal B.

    2015-01-01

    Psychometric models based on structural equation modeling framework are commonly used in many multiple-choice test settings to assess measurement invariance of test items across examinee subpopulations. The premise of the current article is that they may also be useful in the context of performance assessment tests to test measurement invariance…

  9. Combining item response theory with multiple imputation to equate health assessment questionnaires.

    PubMed

    Gu, Chenyang; Gutman, Roee

    2017-09-01

    The assessment of patients' functional status across the continuum of care requires a common patient assessment tool. However, assessment tools that are used in various health care settings differ and cannot be easily contrasted. For example, the Functional Independence Measure (FIM) is used to evaluate the functional status of patients who stay in inpatient rehabilitation facilities, the Minimum Data Set (MDS) is collected for all patients who stay in skilled nursing facilities, and the Outcome and Assessment Information Set (OASIS) is collected if they choose home health care provided by home health agencies. All three instruments or questionnaires include functional status items, but the specific items, rating scales, and instructions for scoring different activities vary between the different settings. We consider equating different health assessment questionnaires as a missing data problem, and propose a variant of predictive mean matching method that relies on Item Response Theory (IRT) models to impute unmeasured item responses. Using real data sets, we simulated missing measurements and compared our proposed approach to existing methods for missing data imputation. We show that, for all of the estimands considered, and in most of the experimental conditions that were examined, the proposed approach provides valid inferences, and generally has better coverages, relatively smaller biases, and shorter interval estimates. The proposed method is further illustrated using a real data set. © 2016, The International Biometric Society.

  10. Food Insecurity and Common Mental Disorders among Ethiopian Youth: Structural Equation Modeling.

    PubMed

    Jebena, Mulusew G; Lindstrom, David; Belachew, Tefera; Hadley, Craig; Lachat, Carl; Verstraeten, Roos; De Cock, Nathalie; Kolsteren, Patrick

    2016-01-01

    Although the consequences of food insecurity on physical health and nutritional status of youth living have been reported, its effect on their mental health remains less investigated in developing countries. The aim of this study was to examine the pathways through which food insecurity is associated with poor mental health status among youth living in Ethiopia. We used data from Jimma Longitudinal Family Survey of Youth (JLFSY) collected in 2009/10. A total of 1,521 youth were included in the analysis. We measured food insecurity using a 5-items scale and common mental disorders using the 20-item Self-Reporting Questionnaire (SRQ-20). Structural and generalized equation modeling using maximum likelihood estimation method was used to analyze the data. The prevalence of common mental disorders was 30.8% (95% CI: 28.6, 33.2). Food insecurity was independently associated with common mental disorders (β = 0.323, P<0.05). Most (91.8%) of the effect of food insecurity on common mental disorders was direct and only 8.2% of their relationship was partially mediated by physical health. In addition, poor self-rated health (β = 0.285, P<0.05), high socioeconomic status (β = -0.076, P<0.05), parental education (β = 0.183, P<0.05), living in urban area (β = 0.139, P<0.05), and female-headed household (β = 0.192, P<0.05) were associated with common mental disorders. Food insecurity is directly associated with common mental disorders among youth in Ethiopia. Interventions that aim to improve mental health status of youth should consider strategies to improve access to sufficient, safe and nutritious food.

  11. Food Insecurity and Common Mental Disorders among Ethiopian Youth: Structural Equation Modeling

    PubMed Central

    Lindstrom, David; Belachew, Tefera; Hadley, Craig; Lachat, Carl; Verstraeten, Roos; De Cock, Nathalie; Kolsteren, Patrick

    2016-01-01

    Background: Although the consequences of food insecurity on physical health and nutritional status of youth living have been reported, its effect on their mental health remains less investigated in developing countries. The aim of this study was to examine the pathways through which food insecurity is associated with poor mental health status among youth living in Ethiopia. Methods: We used data from Jimma Longitudinal Family Survey of Youth (JLFSY) collected in 2009/10. A total of 1,521 youth were included in the analysis. We measured food insecurity using a 5-items scale and common mental disorders using the 20-item Self-Reporting Questionnaire (SRQ-20). Structural and generalized equation modeling using maximum likelihood estimation method was used to analyze the data. Results: The prevalence of common mental disorders was 30.8% (95% CI: 28.6, 33.2). Food insecurity was independently associated with common mental disorders (β = 0.323, P<0.05). Most (91.8%) of the effect of food insecurity on common mental disorders was direct and only 8.2% of their relationship was partially mediated by physical health. In addition, poor self-rated health (β = 0.285, P<0.05), high socioeconomic status (β = -0.076, P<0.05), parental education (β = 0.183, P<0.05), living in urban area (β = 0.139, P<0.05), and female-headed household (β = 0.192, P<0.05) were associated with common mental disorders. Conclusions: Food insecurity is directly associated with common mental disorders among youth in Ethiopia. Interventions that aim to improve mental health status of youth should consider strategies to improve access to sufficient, safe and nutritious food. PMID:27846283

  12. A Unified Approach to IRT Scale Linking and Scale Transformations. Research Report. RR-04-09

    ERIC Educational Resources Information Center

    von Davier, Matthias; von Davier, Alina A.

    2004-01-01

    This paper examines item response theory (IRT) scale transformations and IRT scale linking methods used in the Non-Equivalent Groups with Anchor Test (NEAT) design to equate two tests, X and Y. It proposes a unifying approach to the commonly used IRT linking methods: mean-mean, mean-var linking, concurrent calibration, Stocking and Lord and…
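
    As an illustration of the simplest linking methods named in the abstract above, the following is a minimal sketch of mean-mean and mean-sigma linking. It is not the unified approach the report proposes. It assumes that "mean-var" in the abstract corresponds to what is commonly called mean-sigma linking, that the anchor items have already been calibrated separately on each form, and that the transformation has the linear form b_Y = A * b_X + B with a_Y = a_X / A. The function and variable names are hypothetical.

```python
from statistics import mean, pstdev


def mean_sigma(b_x, b_y):
    """Mean-sigma linking: match mean and SD of anchor-item difficulties.

    b_x, b_y: difficulty estimates of the same anchor items on the
    X scale and the Y scale. Returns (A, B) with b_Y ~= A * b_X + B.
    """
    A = pstdev(b_y) / pstdev(b_x)
    B = mean(b_y) - A * mean(b_x)
    return A, B


def mean_mean(a_x, a_y, b_x, b_y):
    """Mean-mean linking: slope from mean discriminations (a_Y = a_X / A),
    intercept from mean difficulties."""
    A = mean(a_x) / mean(a_y)
    B = mean(b_y) - A * mean(b_x)
    return A, B


# Illustrative (made-up) anchor-item estimates; the Y scale is an exact
# linear transformation of the X scale with A = 2 and B = 0.5.
b_x = [-1.0, 0.0, 1.0]
b_y = [-1.5, 0.5, 2.5]
a_x = [1.0, 1.2, 0.8]
a_y = [0.5, 0.6, 0.4]

print(mean_sigma(b_x, b_y))
print(mean_mean(a_x, a_y, b_x, b_y))
```

    When the anchor-item estimates on the two scales are related by an exact linear transformation, as in the made-up data here, both methods recover the same constants. With real estimates they generally differ, which is one motivation for comparing linking methods.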

  13. Use of Item Parceling in Structural Equation Modeling with Missing Data

    ERIC Educational Resources Information Center

    Orcan, Fatih

    2013-01-01

    Parceling is referred to as a procedure for computing sums or average scores across multiple items. Parcels instead of individual items are then used as indicators of latent factors in the structural equation modeling analysis (Bandalos 2002, 2008; Little et al., 2002; Yang, Nay, & Hoyle, 2010). Item parceling may be applied to alleviate some…

  14. Psychological distress in cancer survivors: the further development of an item bank.

    PubMed

    Smith, Adam B; Armes, Jo; Richardson, Alison; Stark, Dan P

    2013-02-01

    Assessment of psychological distress by patient report is necessary to meet patients' needs throughout the cancer journey. We have previously developed an item bank to assess psychological distress but not evaluated it for cancer survivors. Our first aim in this study was to test whether we could extend our item bank to include cancer survivors. The second aim was to examine whether the item bank could assess positive affect as a single construct alongside negative psychological symptoms. Responses from 1315 cancer survivors to the Hospital Anxiety and Depression Scale (HADS) and the Positive and Negative Affect Scale (PANAS) were considered for inclusion in a pre-existing item bank created from a heterogeneous sample of 4914 cancer patients. Differential item functioning (DIF) was used to assess whether HADS responses drawn from the two samples were equivalent. Common-item equating was used to anchor the shared (HADS) items, whilst the PANAS items were added. Item fit was evaluated at each stage, and misfitting items were removed. Unidimensionality was assessed with a principal components factor analysis. The DIF analysis did not reveal any differences between the HADS item locations from the two samples. Three misfitting PANAS items were removed, resulting in a final unidimensional bank of 80 items with good internal reliability (α = 0.85). The new item bank is valid for use across the cancer journey, including cancer survivors, and modestly improves the assessment of all levels of psychological distress and positive psychological function. Copyright © 2011 John Wiley & Sons, Ltd.

  15. An Evaluation of Three Approximate Item Response Theory Models for Equating Test Scores.

    ERIC Educational Resources Information Center

    Marco, Gary L.; And Others

    Three item response models were evaluated for estimating item parameters and equating test scores. The models, which approximated the traditional three-parameter model, included: (1) the Rasch one-parameter model, operationalized in the BICAL computer program; (2) an approximate three-parameter logistic model based on coarse group data divided…

  16. An Extension of IRT-Based Equating to the Dichotomous Testlet Response Theory Model

    ERIC Educational Resources Information Center

    Tao, Wei; Cao, Yi

    2016-01-01

    Current procedures for equating number-correct scores using traditional item response theory (IRT) methods assume local independence. However, when tests are constructed using testlets, one concern is the violation of the local item independence assumption. The testlet response theory (TRT) model is one way to accommodate local item dependence.…

  17. Test Bias: An Objective Definition for Test Items.

    ERIC Educational Resources Information Center

    Durovic, Jerry J.

    A test bias definition, applicable at the item-level of a test is presented. The definition conceptually equates test bias with measuring different things in different groups, and operationally equates test bias with a difference in item fit to the Rasch Model, greater than one, between groups. It is suggested that the proposed definition avoids…

  18. Using Kernel Equating to Assess Item Order Effects on Test Scores

    ERIC Educational Resources Information Center

    Moses, Tim; Yang, Wen-Ling; Wilson, Christine

    2007-01-01

    This study explored the use of kernel equating for integrating and extending two procedures proposed for assessing item order effects in test forms that have been administered to randomly equivalent groups. When these procedures are used together, they can provide complementary information about the extent to which item order effects impact test…

  19. Observed Score and True Score Equating Procedures for Multidimensional Item Response Theory

    ERIC Educational Resources Information Center

    Brossman, Bradley Grant

    2010-01-01

    The purpose of this research was to develop observed score and true score equating procedures to be used in conjunction with the Multidimensional Item Response Theory (MIRT) framework. Currently, MIRT scale linking procedures exist to place item parameter estimates and ability estimates on the same scale after separate calibrations are conducted.…

  20. Asymptotic Standard Errors of Observed-Score Equating with Polytomous IRT Models

    ERIC Educational Resources Information Center

    Andersson, Björn

    2016-01-01

    In observed-score equipercentile equating, the goal is to make scores on two scales or tests measuring the same construct comparable by matching the percentiles of the respective score distributions. If the tests consist of different items with multiple categories for each item, a suitable model for the responses is a polytomous item response…
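
    The percentile-matching idea described in the abstract above can be sketched as follows. This is a simplified illustration only, not the polytomous IRT procedure or the standard-error derivation of the article. It assumes integer scores, uses mid-percentile ranks, and interpolates linearly between adjacent form-Y scores. All names and data are hypothetical.

```python
def percentile_rank(scores, x):
    """Mid-percentile rank (0-100) of score x within a sample of scores."""
    n = len(scores)
    below = sum(s < x for s in scores)
    at = sum(s == x for s in scores)
    return 100.0 * (below + 0.5 * at) / n


def equipercentile(x, scores_x, scores_y):
    """Map score x on form X to the form-Y score with the same percentile rank."""
    target = percentile_rank(scores_x, x)
    grid = list(range(min(scores_y), max(scores_y) + 1))
    ranks = [percentile_rank(scores_y, y) for y in grid]
    # Clamp to the observed form-Y score range.
    if target <= ranks[0]:
        return float(grid[0])
    if target >= ranks[-1]:
        return float(grid[-1])
    # Ranks are non-decreasing, so find the bracketing pair and interpolate.
    for i in range(len(grid) - 1):
        if ranks[i] <= target <= ranks[i + 1]:
            if ranks[i + 1] == ranks[i]:
                return float(grid[i])
            frac = (target - ranks[i]) / (ranks[i + 1] - ranks[i])
            return grid[i] + frac
    return float(grid[-1])


# Made-up samples: form Y scores are form X scores shifted up by 2 points.
scores_x = [1, 2, 3, 4, 5]
scores_y = [3, 4, 5, 6, 7]
print(equipercentile(3, scores_x, scores_y))
```

    In operational equating the score distributions are usually smoothed before percentiles are matched; that step is omitted here for brevity.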

  1. The medial temporal lobes distinguish between within-item and item-context relations during autobiographical memory retrieval.

    PubMed

    Sheldon, Signy; Levine, Brian

    2015-12-01

    During autobiographical memory retrieval, the medial temporal lobes (MTL) relate together multiple event elements, including object (within-item relations) and context (item-context relations) information, to create a cohesive memory. There is consistent support for a functional specialization within the MTL according to these relational processes, much of which comes from recognition memory experiments. In this study, we compared brain activation patterns associated with retrieving within-item relations (i.e., associating conceptual and sensory-perceptual object features) and item-context relations (i.e., spatial relations among objects) with respect to naturalistic autobiographical retrieval. We developed a novel paradigm that cued participants to retrieve information about past autobiographical events, non-episodic within-item relations, and non-episodic item-context relations with the perceptuomotor aspects of retrieval equated across these conditions. We used multivariate analysis techniques to extract common and distinct patterns of activity among these conditions within the MTL and across the whole brain, both in terms of spatial and temporal patterns of activity. The anterior MTL (perirhinal cortex and anterior hippocampus) was preferentially recruited for generating within-item relations later in retrieval whereas the posterior MTL (posterior parahippocampal cortex and posterior hippocampus) was preferentially recruited for generating item-context relations across the retrieval phase. These findings provide novel evidence for functional specialization within the MTL with respect to naturalistic memory retrieval. © 2015 Wiley Periodicals, Inc.

  2. Linking Existing Instruments to Develop an Activity of Daily Living Item Bank.

    PubMed

    Li, Chih-Ying; Romero, Sergio; Bonilha, Heather S; Simpson, Kit N; Simpson, Annie N; Hong, Ickpyo; Velozo, Craig A

    2018-03-01

    This study examined dimensionality and item-level psychometric properties of an item bank measuring activities of daily living (ADL) across inpatient rehabilitation facilities and community living centers. Common person equating method was used in the retrospective veterans data set. This study examined dimensionality, model fit, local independence, and monotonicity using factor analyses and fit statistics, principal component analysis (PCA), and differential item functioning (DIF) using Rasch analysis. Following the elimination of invalid data, 371 veterans who completed both the Functional Independence Measure (FIM) and minimum data set (MDS) within 6 days were retained. The FIM-MDS item bank demonstrated good internal consistency (Cronbach's α = .98) and met three rating scale diagnostic criteria and three of the four model fit statistics (comparative fit index/Tucker-Lewis index = 0.98, root mean square error of approximation = 0.14, and standardized root mean residual = 0.07). PCA of Rasch residuals showed the item bank explained 94.2% variance. The item bank covered the range of θ from -1.50 to 1.26 (item), -3.57 to 4.21 (person) with person strata of 6.3. The findings indicated the ADL physical function item bank constructed from FIM and MDS measured a single latent trait with overall acceptable item-level psychometric properties, suggesting that it is an appropriate source for developing efficient test forms such as short forms and computerized adaptive tests.

  3. Weighting Test Samples in IRT Linking and Equating: Toward an Improved Sampling Design for Complex Equating. Research Report. ETS RR-13-39

    ERIC Educational Resources Information Center

    Qian, Jiahe; Jiang, Yanming; von Davier, Alina A.

    2013-01-01

    Several factors could cause variability in item response theory (IRT) linking and equating procedures, such as the variability across examinee samples and/or test items, seasonality, regional differences, native language diversity, gender, and other demographic variables. Hence, the following question arises: Is it possible to select optimal…

  4. A Practitioner's Introduction to Equating with Primers on Classical Test Theory and Item Response Theory

    ERIC Educational Resources Information Center

    Ryan, Joseph; Brockmann, Frank

    2009-01-01

    Equating is an essential tool in educational assessment due to the critical role it plays in several key areas: establishing validity across forms and years; fairness; test security; and, increasingly, continuity in programs that release items or require ongoing development. Although the practice of equating is rooted in long standing practices that…

  5. Single- versus Double-Scoring of Trend Responses in Trend Score Equating with Constructed-Response Tests. Research Report. ETS RR-10-12

    ERIC Educational Resources Information Center

    Tan, Xuan; Ricker, Kathryn L.; Puhan, Gautam

    2010-01-01

    This study examines the differences in equating outcomes between two trend score equating designs resulting from two different scoring strategies for trend scoring when operational constructed-response (CR) items are double-scored--the single group (SG) design, where each trend CR item is double-scored, and the nonequivalent groups with anchor…

  6. Item response theory and structural equation modelling for ordinal data: Describing the relationship between KIDSCREEN and Life-H.

    PubMed

    Titman, Andrew C; Lancaster, Gillian A; Colver, Allan F

    2016-10-01

    Both item response theory and structural equation models are useful in the analysis of ordered categorical responses from health assessment questionnaires. We highlight the advantages and disadvantages of the item response theory and structural equation modelling approaches to modelling ordinal data, from within a community health setting. Using data from the SPARCLE project focussing on children with cerebral palsy, this paper investigates the relationship between two ordinal rating scales, the KIDSCREEN, which measures quality-of-life, and Life-H, which measures participation. Practical issues relating to fitting models, such as non-positive definite observed or fitted correlation matrices, and approaches to assessing model fit are discussed. Item response theory models allow properties such as the conditional independence of particular domains of a measurement instrument to be assessed. When, as with the SPARCLE data, the latent traits are multidimensional, structural equation models generally provide a much more convenient modelling framework. © The Author(s) 2013.

  7. Item Parameter Changes and Equating: An Examination of the Effects of Lack of Item Parameter Invariance on Equating and Score Accuracy for Different Proficiency Levels

    ERIC Educational Resources Information Center

    Store, Davie

    2013-01-01

    The impact of particular types of context effects on actual scores is less understood although there has been some research carried out regarding certain types of context effects under the nonequivalent anchor test (NEAT) design. In addition, the issue of the impact of item context effects on scores has not been investigated extensively when item…

  8. Constructing general partial differential equations using polynomial and neural networks.

    PubMed

    Zjavka, Ladislav; Pedrycz, Witold

    2016-01-01

    Sum fraction terms can approximate multi-variable functions on the basis of discrete observations, replacing a partial differential equation definition with polynomial elementary data relation descriptions. Artificial neural networks commonly transform the weighted sum of inputs to describe overall similarity relationships of trained and new testing input patterns. Differential polynomial neural networks form a new class of neural networks, which construct and solve an unknown general partial differential equation of a function of interest with selected substitution relative terms using non-linear multi-variable composite polynomials. The layers of the network generate simple and composite relative substitution terms whose convergent series combinations can describe partial dependent derivative changes of the input variables. This regression is based on trained generalized partial derivative data relations, decomposed into a multi-layer polynomial network structure. The sigmoidal function, commonly used as a nonlinear activation of artificial neurons, may transform some polynomial items together with the parameters with the aim to improve the polynomial derivative term series ability to approximate complicated periodic functions, as simple low order polynomials are not able to fully make up for the complete cycles. The similarity analysis facilitates substitutions for differential equations or can form dimensional units from data samples to describe real-world problems. Copyright © 2015 Elsevier Ltd. All rights reserved.

  9. A standard description and costing methodology for the balance-of-plant items of a solar thermal electric power plant. Report of a multi-institutional working group

    NASA Technical Reports Server (NTRS)

    1983-01-01

    Standard descriptions for solar thermal power plants are established and uniform costing methodologies for nondevelopmental balance of plant (BOP) items are developed. The descriptions and methodologies developed are applicable to the major systems. These systems include the central receiver, parabolic dish, parabolic trough, hemispherical bowl, and solar pond. The standard plant is defined in terms of four categories comprising (1) solar energy collection, (2) power conversion, (3) energy storage, and (4) balance of plant. Each of these categories is described in terms of the type and function of components and/or subsystems within the category. A detailed description is given for the BOP category. BOP contains a number of nondevelopmental items that are common to all solar thermal systems. A standard methodology for determining the costs of these nondevelopmental BOP items is given. The methodology is presented in the form of cost equations involving cost factors such as unit costs. A set of baseline values for the normalized cost factors is also given.

  10. Redefining diagnostic symptoms of depression using Rasch analysis: testing an item bank suitable for DSM-V and computer adaptive testing.

    PubMed

    Mitchell, Alex J; Smith, Adam B; Al-salihy, Zerak; Rahim, Twana A; Mahmud, Mahmud Q; Muhyaldin, Asma S

    2011-10-01

    We aimed to redefine the optimal self-report symptoms of depression suitable for creation of an item bank that could be used in computer adaptive testing or to develop a simplified screening tool for DSM-V. Four hundred subjects (200 patients with primary depression and 200 non-depressed subjects), living in Iraqi Kurdistan were interviewed. The Mini International Neuropsychiatric Interview (MINI) was used to define the presence of major depression (DSM-IV criteria). We examined symptoms of depression using four well-known scales delivered in Kurdish. The Partial Credit Model was applied to each instrument. Common-item equating was subsequently used to create an item bank and differential item functioning (DIF) explored for known subgroups. A symptom level Rasch analysis reduced the original 45 items to 24 after the exclusion of 21 misfitting items. A further six items (CESD13 and CESD17, HADS-D4, HADS-D5 and HADS-D7, and CDSS3 and CDSS4) were removed due to misfit as the items were added together to form the item bank, and two items were subsequently removed following the DIF analysis by diagnosis (CESD20 and CDSS9, both of which were harder to endorse for women). Therefore the remaining optimal item bank consisted of 17 items and produced an area under the curve (AUC) of 0.987. Using a bank restricted to the optimal nine items revealed only minor loss of accuracy (AUC = 0.989, sensitivity 96%, specificity 95%). Finally, when restricted to only four items accuracy was still high (AUC was still 0.976; sensitivity 93%, specificity 96%). An item bank of 17 items may be useful in computer adaptive testing and nine or even four items may be used to develop a simplified screening tool for DSM-V major depressive disorder (MDD). Further examination of this item bank should be conducted in different cultural settings.

  11. Composition of key offensive odorants released from fresh food materials

    NASA Astrophysics Data System (ADS)

    Kim, Ki-Hyun; Kim, Yong-Hyun

    2014-06-01

    A refrigerator loaded with a variety of foods without sealed packaging can create quite an olfactory nuisance, and it may come as a surprise that fresh foods emit unpleasant odorants just as those that are decaying. To learn more about nuisance sources in our daily lives, we measured a list of 22 compounds designated as the key offensive odorants (e.g., reduced sulfur, nitrogenous, volatile fatty acid (VFA), and carbonyls) from nine types of common food items consumed in S. Korea: raw beef, raw fish, spam, yolks and albumin of boiled eggs (analyzed separately), milk, cheese, onions, and strawberries. The odor intensity (OI) of each food item was computed initially with the aid of previously used empirical equations. This indicates that the malodor properties of target foods tend to be governed by a few key odorants such as VFA, S, and N compounds. The extent of odorant mixing of a given food was then evaluated by exploring the correlation between the human olfaction (e.g., dilution-to-threshold (D/T) ratio) and the odor potential determined indirectly (instrumentally) such as odor activity value (OAV) or sum of odor intensity (SOI). The overall results of our study confirm the existence of malodorant compounds released from common food items and their contribution to their odor characteristics to a certain degree.

  12. Repeated retrieval practice and item difficulty: does criterion learning eliminate item difficulty effects?

    PubMed

    Vaughn, Kalif E; Rawson, Katherine A; Pyc, Mary A

    2013-12-01

    A wealth of previous research has established that retrieval practice promotes memory, particularly when retrieval is successful. Although successful retrieval promotes memory, it remains unclear whether successful retrieval promotes memory equally well for items of varying difficulty. Will easy items still outperform difficult items on a final test if all items have been correctly recalled equal numbers of times during practice? In two experiments, normatively difficult and easy Lithuanian-English word pairs were learned via test-restudy practice until each item had been correctly recalled a preassigned number of times (from 1 to 11 correct recalls). Despite equating the numbers of successful recalls during practice, performance on a delayed final cued-recall test was lower for difficult than for easy items. Experiment 2 was designed to diagnose whether the disadvantage for difficult items was due to deficits in cue memory, target memory, and/or associative memory. The results revealed a disadvantage for the difficult versus the easy items only on the associative recognition test, with no differences on cue recognition, and even an advantage on target recognition. Although successful retrieval enhanced memory for both difficult and easy items, equating retrieval success during practice did not eliminate normative item difficulty differences.

  13. Score Equating and Item Response Theory: Some Practical Considerations.

    ERIC Educational Resources Information Center

    Cook, Linda L.; Eignor, Daniel R.

    The purposes of this paper are five-fold: to discuss (1) when item response theory (IRT) equating methods should provide better results than traditional methods; (2) which IRT model, the three-parameter logistic or the one-parameter logistic (Rasch), is the most reasonable to use; (3) what unique contributions IRT methods can offer the equating…

  14. Overcoming redundancies in bedside nursing assessments by validating a parsimonious meta-tool: findings from a methodological exercise study.

    PubMed

    Palese, Alvisa; Marini, Eva; Guarnier, Annamaria; Barelli, Paolo; Zambiasi, Paola; Allegrini, Elisabetta; Bazoli, Letizia; Casson, Paola; Marin, Meri; Padovan, Marisa; Picogna, Michele; Taddia, Patrizia; Chiari, Paolo; Salmaso, Daniele; Marognolli, Oliva; Canzan, Federica; Ambrosi, Elisa; Saiani, Luisa; Grassetti, Luca

    2016-10-01

    There is growing interest in validating tools aimed at supporting the clinical decision-making process and research. However, an increased bureaucratization of clinical practice and redundancies in the measures collected have been reported by clinicians. Redundancies in clinical assessments negatively affect both patients and nurses. The aim was to validate a meta-tool measuring the risks/problems currently estimated by multiple tools used in daily practice. A secondary analysis of a database was performed, using cross-validation and longitudinal study designs. In total, 1464 patients admitted to 12 medical units in 2012 were assessed at admission with the Brass, Barthel, Conley and Braden tools. Pertinent outcomes such as the occurrence of post-discharge need for resources and functional decline at discharge, as well as falls and pressure sores, were measured. Explorative factor analysis of each tool, inter-tool correlations and a conceptual evaluation of the redundant/similar items across tools were performed. The validation of the meta-tool was then performed through explorative factor analysis, confirmatory factor analysis and the structural equation model to establish the ability of the meta-tool to predict the outcomes estimated by the original tools. High correlations between the tools emerged (r from 0.428 to 0.867) with a common variance from 18.3% to 75.1%. Through a conceptual evaluation and explorative factor analysis, the items were reduced from 42 to 20, and the three factors that emerged were confirmed by confirmatory factor analysis. According to the structural equation model results, two of the three factors that emerged predicted the outcomes. Reduced from the initial 42 items, the meta-tool is composed of 20 items capable of predicting the outcomes as the original tools do. © 2016 John Wiley & Sons, Ltd.

  15. The Effect of Mini and Midi Anchor Tests on Test Equating

    ERIC Educational Resources Information Center

    Arikan, Çigdem Akin

    2018-01-01

    The main purpose of this study is to compare the performance of the midi anchor test and the mini anchor test in equating test forms based on item response theory. The research was conducted using simulated data generated from the Rasch model. In order to equate two test forms, the anchor item nonequivalent groups (internal anchor test) was…

  16. A Comparison between Linear IRT Observed-Score Equating and Levine Observed-Score Equating under the Generalized Kernel Equating Framework

    ERIC Educational Resources Information Center

    Chen, Haiwen

    2012-01-01

    In this article, linear item response theory (IRT) observed-score equating is compared under a generalized kernel equating framework with Levine observed-score equating for nonequivalent groups with anchor test design. Interestingly, these two equating methods are closely related despite being based on different methodologies. Specifically, when…

  17. Equating Multidimensional Tests under a Random Groups Design: A Comparison of Various Equating Procedures

    ERIC Educational Resources Information Center

    Lee, Eunjung

    2013-01-01

    The purpose of this research was to compare the equating performance of various equating procedures for the multidimensional tests. To examine the various equating procedures, simulated data sets were used that were generated based on a multidimensional item response theory (MIRT) framework. Various equating procedures were examined, including…

  18. Comparison of Kernel Equating and Item Response Theory Equating Methods

    ERIC Educational Resources Information Center

    Meng, Yu

    2012-01-01

    The kernel method of test equating is a unified approach to test equating with some advantages over traditional equating methods. Therefore, it is important to evaluate in a comprehensive way the usefulness and appropriateness of the Kernel equating (KE) method, as well as its advantages and disadvantages compared with several popular item…

  19. Using structural equation modeling to detect response shifts and true change in discrete variables: an application to the items of the SF-36.

    PubMed

    Verdam, Mathilde G E; Oort, Frans J; Sprangers, Mirjam A G

    2016-06-01

    The structural equation modeling (SEM) approach for detection of response shift (Oort in Qual Life Res 14:587-598, 2005. doi: 10.1007/s11136-004-0830-y ) is especially suited for continuous data, e.g., questionnaire scales. The present objective is to explain how the SEM approach can be applied to discrete data and to illustrate response shift detection in items measuring health-related quality of life (HRQL) of cancer patients. The SEM approach for discrete data includes two stages: (1) establishing a model of underlying continuous variables that represent the observed discrete variables, (2) using these underlying continuous variables to establish a common factor model for the detection of response shift and to assess true change. The proposed SEM approach was illustrated with data of 485 cancer patients whose HRQL was measured with the SF-36, before and after start of antineoplastic treatment. Response shift effects were detected in items of the subscales mental health, physical functioning, role limitations due to physical health, and bodily pain. Recalibration response shifts indicated that patients experienced relatively fewer limitations with "bathing or dressing yourself" (effect size d = 0.51) and less "nervousness" (d = 0.30), but more "pain" (d = -0.23) and less "happiness" (d = -0.16) after antineoplastic treatment as compared to the other symptoms of the same subscale. Overall, patients' mental health improved, while their physical health, vitality, and social functioning deteriorated. No change was found for the other subscales of the SF-36. The proposed SEM approach to discrete data enables response shift detection at the item level. This will lead to a better understanding of the response shift phenomena at the item level and therefore enhances interpretation of change in the area of HRQL.

  20. Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.

    PubMed

    Zhao, Yue; Chan, Wai; Lo, Barbara Chuen Yee

    2017-04-04

    Item response theory (IRT) has been increasingly applied to patient-reported outcome (PRO) measures. The purpose of this study is to apply IRT to examine item properties (discrimination and severity of depressive symptoms), measurement precision and score comparability across five depression measures, which is the first study of its kind in the Chinese context. A clinical sample of 207 Hong Kong Chinese outpatients was recruited. Data analyses were performed including classical item analysis, IRT concurrent calibration and IRT true score equating. The IRT assumptions of unidimensionality and local independence were tested respectively using confirmatory factor analysis and chi-square statistics. The IRT linking assumptions of construct similarity, equity and subgroup invariance were also tested. The graded response model was applied to concurrently calibrate all five depression measures in a single IRT run, resulting in the item parameter estimates of these measures being placed onto a single common metric. IRT true score equating was implemented to perform the outcome score linking and construct score concordances so as to link scores from one measure to corresponding scores on another measure for direct comparability. Findings suggested that (a) symptoms on depressed mood, suicidality and feeling of worthlessness served as the strongest discriminating indicators, and symptoms concerning suicidality, changes in appetite, depressed mood, feeling of worthlessness and psychomotor agitation or retardation reflected high levels of severity in the clinical sample. (b) The five depression measures contributed to various degrees of measurement precision at varied levels of depression. (c) After outcome score linking was performed across the five measures, the cut-off scores led to either consistent or discrepant diagnoses for depression. 
    The study provides additional evidence regarding the psychometric properties and clinical utility of the five depression measures, offers methodological contributions to the appropriate use of IRT in PRO measures, and helps elucidate cultural variation in depressive symptomatology. The approach of concurrently calibrating and linking multiple PRO measures can be applied to the assessment of PROs other than the depression context.

  1. An Investigation of the Sampling Distributions of Equating Coefficients.

    ERIC Educational Resources Information Center

    Baker, Frank B.

    1996-01-01

    Using the characteristic curve method for dichotomously scored test items, the sampling distributions of equating coefficients were examined. Simulations indicate that for the equating conditions studied, the sampling distributions of the equating coefficients appear to have acceptable characteristics, suggesting confidence in the values obtained…

  2. Using Linear Equating to Map PROMIS® Global Health Items and the PROMIS-29 V2.0 Profile Measure to the Health Utilities Index Mark 3.

    PubMed

    Hays, Ron D; Revicki, Dennis A; Feeny, David; Fayers, Peter; Spritzer, Karen L; Cella, David

    2016-10-01

    Preference-based health-related quality of life (HR-QOL) scores are useful as outcome measures in clinical studies, for monitoring the health of populations, and for estimating quality-adjusted life-years. This was a secondary analysis of data collected in an internet survey as part of the Patient-Reported Outcomes Measurement Information System (PROMIS®) project. To estimate Health Utilities Index Mark 3 (HUI-3) preference scores, we used the ten PROMIS® global health items, the PROMIS-29 V2.0 single pain intensity item and seven multi-item scales (physical functioning, fatigue, pain interference, depressive symptoms, anxiety, ability to participate in social roles and activities, sleep disturbance), and the PROMIS-29 V2.0 items. Linear regression analyses were used to identify significant predictors, followed by simple linear equating to avoid regression to the mean. The regression models explained 48% (global health items), 61% (PROMIS-29 V2.0 scales), and 64% (PROMIS-29 V2.0 items) of the variance in the HUI-3 preference score. Linear equated scores were similar to observed scores, although differences tended to be larger for older study participants. HUI-3 preference scores can be estimated from the PROMIS® global health items or PROMIS-29 V2.0. The estimated HUI-3 scores from the PROMIS® health measures can be used for economic applications and as a measure of overall HR-QOL in research.
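
    The linear equating step mentioned in this abstract can be illustrated with a minimal sketch. It assumes the usual textbook form, in which a score is rescaled so that the equated scores share the mean and standard deviation of the target scale; the function name and the data are hypothetical and are not taken from the study.

```python
from statistics import mean, pstdev

def linear_equate(x, x_scores, y_scores):
    """Map score x from the X scale to the Y scale by matching means and SDs."""
    mu_x, mu_y = mean(x_scores), mean(y_scores)
    sd_x, sd_y = pstdev(x_scores), pstdev(y_scores)
    if sd_x == 0:
        raise ValueError("x_scores must vary")
    return mu_y + (sd_y / sd_x) * (x - mu_x)

# Hypothetical data: regression-predicted scores and observed scores.
# Regression predictions have less spread than the observed scores,
# which is the shrinkage that linear equating is meant to undo.
predicted = [0.50, 0.60, 0.70, 0.80]
observed = [0.30, 0.50, 0.70, 0.90]
equated = [linear_equate(p, predicted, observed) for p in predicted]
```

    After equating, the rescaled predictions have the same mean and standard deviation as the observed scores, so extreme scores are no longer pulled toward the mean.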

  3. The Missing Data Assumptions of the NEAT Design and Their Implications for Test Equating

    ERIC Educational Resources Information Center

    Sinharay, Sandip; Holland, Paul W.

    2010-01-01

    The Non-Equivalent groups with Anchor Test (NEAT) design involves "missing data" that are "missing by design." Three nonlinear observed score equating methods used with a NEAT design are the "frequency estimation equipercentile equating" (FEEE), the "chain equipercentile equating" (CEE), and the "item-response-theory observed-score-equating" (IRT…

  4. A Comparison of Limited-Information and Full-Information Methods in Mplus for Estimating Item Response Theory Parameters for Nonnormal Populations

    ERIC Educational Resources Information Center

    DeMars, Christine E.

    2012-01-01

    In structural equation modeling software, either limited-information (bivariate proportions) or full-information item parameter estimation routines could be used for the 2-parameter item response theory (IRT) model. Limited-information methods assume the continuous variable underlying an item response is normally distributed. For skewed and…

  5. A Markov Chain Monte Carlo Approach to Confirmatory Item Factor Analysis

    ERIC Educational Resources Information Center

    Edwards, Michael C.

    2010-01-01

    Item factor analysis has a rich tradition in both the structural equation modeling and item response theory frameworks. The goal of this paper is to demonstrate a novel combination of various Markov chain Monte Carlo (MCMC) estimation routines to estimate parameters of a wide variety of confirmatory item factor analysis models. Further, I show…

  6. Practical Consequences of Item Response Theory Model Misfit in the Context of Test Equating with Mixed-Format Test Data

    PubMed Central

    Zhao, Yue; Hambleton, Ronald K.

    2017-01-01

    In item response theory (IRT) models, assessing model-data fit is an essential step in IRT calibration. While no general agreement has ever been reached on the best methods or approaches to use for detecting misfit, perhaps the more important comment based upon the research findings is that rarely does the research evaluate IRT misfit by focusing on the practical consequences of misfit. The study investigated the practical consequences of IRT model misfit in examining the equating performance and the classification of examinees into performance categories in a simulation study that mimics a typical large-scale statewide assessment program with mixed-format test data. The simulation study was implemented by varying three factors, including choice of IRT model, amount of growth/change of examinees’ abilities between two adjacent administration years, and choice of IRT scaling methods. Findings indicated that the extent of significant consequences of model misfit varied over the choice of model and IRT scaling methods. In comparison with mean/sigma (MS) and Stocking and Lord characteristic curve (SL) methods, separate calibration with linking and fixed common item parameter (FCIP) procedure was more sensitive to model misfit and more robust against various amounts of ability shifts between two adjacent administrations regardless of model fit. SL was generally the least sensitive to model misfit in recovering equating conversion and MS was the least robust against ability shifts in recovering the equating conversion when a substantial degree of misfit was present. The key messages from the study are that practical ways are available to study model fit, and, model fit or misfit can have consequences that should be considered when choosing an IRT model. 
    The study not only addresses the consequences of IRT model misfit; we also hope it will help researchers and practitioners find practical ways to study model fit and to investigate the validity of particular IRT models for achieving a specified purpose, to ensure that the successful use of IRT models is realized, and to improve the applications of IRT models with educational and psychological test data. PMID:28421011
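
    For readers unfamiliar with the mean/sigma (MS) scaling method named in this abstract, a minimal sketch follows. It assumes the standard formulation, in which the slope and intercept of the scale transformation are computed from the means and standard deviations of the common-item difficulty estimates on the two forms. The function names and parameter values are hypothetical and do not come from the study.

```python
from statistics import mean, pstdev

def mean_sigma(b_new, b_old):
    """Return slope A and intercept B that place new-form parameters on the old scale.

    b_new, b_old: difficulty estimates of the same common items,
    calibrated separately on the new and old forms.
    """
    A = pstdev(b_old) / pstdev(b_new)
    B = mean(b_old) - A * mean(b_new)
    return A, B

def rescale_item(a, b, A, B):
    """Transform one item's discrimination a and difficulty b to the old scale."""
    return a / A, A * b + B

# Hypothetical common-item difficulties from two separate calibrations.
b_new = [-1.0, 0.0, 1.0]
b_old = [-0.5, 0.7, 1.9]
A, B = mean_sigma(b_new, b_old)
a_star, b_star = rescale_item(1.2, 0.0, A, B)
```

    Because the coefficients depend only on the first two moments of the difficulty estimates, a single outlying common item can shift them noticeably; characteristic curve methods such as Stocking and Lord use the full item response functions instead.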

  7. The factor structure of the Values in Action Inventory of Strengths (VIA-IS): An item-level exploratory structural equation modeling (ESEM) bifactor analysis.

    PubMed

    Ng, Vincent; Cao, Mengyang; Marsh, Herbert W; Tay, Louis; Seligman, Martin E P

    2017-08-01

    The factor structure of the Values in Action Inventory of Strengths (VIA-IS; Peterson & Seligman, 2004) has not been well established as a result of methodological challenges primarily attributable to a global positivity factor, item cross-loading across character strengths, and questions concerning the unidimensionality of the scales assessing character strengths. We sought to overcome these methodological challenges by applying exploratory structural equation modeling (ESEM) at the item level using a bifactor analytic approach to a large sample of 447,573 participants who completed the VIA-IS with all 240 character strengths items and a reduced set of 107 unidimensional character strength items. It was found that a 6-factor bifactor structure generally held for the reduced set of unidimensional character strength items; these dimensions were justice, temperance, courage, wisdom, transcendence, humanity, and an overarching general factor that is best described as dispositional positivity. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  8. Harmonizing Measures of Cognitive Performance Across International Surveys of Aging Using Item Response Theory.

    PubMed

    Chan, Kitty S; Gross, Alden L; Pezzin, Liliana E; Brandt, Jason; Kasper, Judith D

    2015-12-01

    To harmonize measures of cognitive performance using item response theory (IRT) across two international aging studies. Data for persons ≥65 years from the Health and Retirement Study (HRS, N = 9,471) and the English Longitudinal Study of Aging (ELSA, N = 5,444). Cognitive performance measures varied (HRS fielded 25, ELSA 13); 9 were in common. Measurement precision was examined for IRT scores based on (a) common items, (b) common items adjusted for differential item functioning (DIF), and (c) DIF-adjusted all items. Three common items (day of date, immediate word recall, and delayed word recall) demonstrated DIF by survey. Adding survey-specific items improved precision but mainly for HRS respondents at lower cognitive levels. IRT offers a feasible strategy for harmonizing cognitive performance measures across other surveys and for other multi-item constructs of interest in studies of aging. Practical implications depend on sample distribution and the difficulty mix of in-common and survey-specific items. © The Author(s) 2015.

  9. Long-Term Impact of Valid Case Criterion on Capturing Population-Level Growth under Item Response Theory Equating. Research Report. ETS RR-17-17

    ERIC Educational Resources Information Center

    Deng, Weiling; Monfils, Lora

    2017-01-01

    Using simulated data, this study examined the impact of different levels of stringency of the valid case inclusion criterion on item response theory (IRT)-based true score equating over 5 years in the context of K-12 assessment when growth in student achievement is expected. Findings indicate that the use of the most stringent inclusion criterion…

  10. Developing an Initial Physical Function Item Bank from Existing Sources.

    ERIC Educational Resources Information Center

    Bode, Rita K.; Cella, David; Lai, Jin-shei; Heinemann, Allen W.

    2003-01-01

    Illustrates incremental item banking using health-related quality of life data collected from two samples of patients receiving cancer treatment (n=1,755 and n=1,544). Results support findings from previous studies that have equated separate instruments by co-calibrating their items. (SLD)

  11. Evaluating Equating Accuracy and Assumptions for Groups that Differ in Performance

    ERIC Educational Resources Information Center

    Powers, Sonya; Kolen, Michael J.

    2014-01-01

    Accurate equating results are essential when comparing examinee scores across exam forms. Previous research indicates that equating results may not be accurate when group differences are large. This study compared the equating results of frequency estimation, chained equipercentile, item response theory (IRT) true-score, and IRT observed-score…

  12. Equating with Miditests Using IRT

    ERIC Educational Resources Information Center

    Fitzpatrick, Joseph; Skorupski, William P.

    2016-01-01

    The equating performance of two internal anchor test structures--miditests and minitests--is studied for four IRT equating methods using simulated data. Originally proposed by Sinharay and Holland, miditests are anchors that have the same mean difficulty as the overall test but less variance in item difficulties. Four popular IRT equating methods…

  13. The Effect of Repeaters on Equating

    ERIC Educational Resources Information Center

    Kim, HeeKyoung; Kolen, Michael J.

    2010-01-01

    Test equating might be affected by including in the equating analyses examinees who have taken the test previously. This study evaluated the effect of including such repeaters on Medical College Admission Test (MCAT) equating using a population invariance approach. Three-parameter logistic (3-PL) item response theory (IRT) true score and…

  14. Assessing Equating Results on Different Equating Criteria

    ERIC Educational Resources Information Center

    Tong, Ye; Kolen, Michael

    2005-01-01

    The performance of three equating methods--the presmoothed equipercentile method, the item response theory (IRT) true score method, and the IRT observed score method--were examined based on three equating criteria: the same distributions property, the first-order equity property, and the second-order equity property. The magnitude of the…

  15. Local Observed-Score Kernel Equating

    ERIC Educational Resources Information Center

    Wiberg, Marie; van der Linden, Wim J.; von Davier, Alina A.

    2014-01-01

    Three local observed-score kernel equating methods that integrate methods from the local equating and kernel equating frameworks are proposed. The new methods were compared with their earlier counterparts with respect to such measures as bias--as defined by Lord's criterion of equity--and percent relative error. The local kernel item response…

  16. Developmental changes in memorial comparisons: the effects of stimulus presentation mode.

    PubMed

    Wright, K P; Berch, D B

    1992-06-01

    First graders, fifth graders, and college students made comparative size judgments of either pictures (line drawings) or names (spoken words) of common objects by designating the "bigger" item in real life. Care was taken to equate the picture and word conditions on a number of critical parameters including method of item-pair presentation and activation of response-time intervals. All groups exhibited a symbolic distance effect. While judgments were faster with pictures than words, the magnitude of the difference did not change with age. Previous research suggesting a marked developmental decline in the magnitude of the "pictorial superiority effect" may have confounded reduced memory demands with stimulus presentation mode for young children. Finally, slopes of the symbolic distance functions were found to decrease with increasing grade level, at least from first to fifth grade. This is the first demonstration of an age-related decline in slopes for magnitude comparisons of concrete objects.

  17. Multiple determinants of lifespan memory differences.

    PubMed

    Henson, Richard N; Campbell, Karen L; Davis, Simon W; Taylor, Jason R; Emery, Tina; Erzinclioglu, Sharon; Kievit, Rogier A

    2016-09-07

    Memory problems are among the most common complaints as people grow older. Using structural equation modeling of commensurate scores of anterograde memory from a large (N = 315), population-derived sample (www.cam-can.org), we provide evidence for three memory factors that are supported by distinct brain regions and show differential sensitivity to age. Associative memory and item memory are dramatically affected by age, even after adjusting for education level and fluid intelligence, whereas visual priming is not. Associative memory and item memory are differentially affected by emotional valence, and the age-related decline in associative memory is faster for negative than for positive or neutral stimuli. Gray-matter volume in the hippocampus, parahippocampus and fusiform cortex, and a white-matter index for the fornix, uncinate fasciculus and inferior longitudinal fasciculus, show differential contributions to the three memory factors. Together, these data demonstrate the extent to which differential ageing of the brain leads to differential patterns of memory loss.

  18. Comparing the IRT Pre-equating and Section Pre-equating: A Simulation Study.

    ERIC Educational Resources Information Center

    Hwang, Chi-en; Cleary, T. Anne

    The results obtained from two basic types of pre-equatings of tests were compared: the item response theory (IRT) pre-equating and section pre-equating (SPE). The simulated data were generated from a modified three-parameter logistic model with a constant guessing parameter. Responses of two replication samples of 3000 examinees on two 72-item…

  19. Item Selection and Pre-equating with Empirical Item Characteristic Curves.

    ERIC Educational Resources Information Center

    Livingston, Samuel A.

    An empirical item characteristic curve shows the probability of a correct response as a function of the student's total test score. These curves can be estimated from large-scale pretest data. They enable test developers to select items that discriminate well in the score region where decisions are made. A similar set of curves can be used to…
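
    The definition in this abstract can be made concrete with a minimal sketch: for one item, compute the proportion of correct responses at each observed total score. The function name and the data are hypothetical, and no smoothing is applied, although smoothing might well be needed with real pretest data.

```python
from collections import defaultdict

def empirical_icc(item_responses, total_scores):
    """Proportion correct on one item at each observed total test score."""
    correct = defaultdict(int)
    count = defaultdict(int)
    for u, s in zip(item_responses, total_scores):
        correct[s] += u  # u is 1 for a correct response, 0 otherwise
        count[s] += 1
    return {s: correct[s] / count[s] for s in sorted(count)}

# Hypothetical responses of eight examinees to a single item,
# paired with each examinee's total test score.
item = [0, 0, 1, 0, 1, 1, 1, 1]
totals = [1, 1, 2, 2, 2, 3, 3, 3]
curve = empirical_icc(item, totals)
```

    An item discriminates well in a given score region when this curve rises steeply there.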

  20. Model Analysis and Model Creation: Capturing the Task-Model Structure of Quantitative Item Domains. Research Report. ETS RR-06-11

    ERIC Educational Resources Information Center

    Deane, Paul; Graf, Edith Aurora; Higgins, Derrick; Futagi, Yoko; Lawless, René

    2006-01-01

    This study focuses on the relationship between item modeling and evidence-centered design (ECD); it considers how an appropriately generalized item modeling software tool can support systematic identification and exploitation of task-model variables, and then examines the feasibility of this goal, using linear-equation items as a test case. The…

  1. [Development of competency to stand trial rating scale in offenders with mental disorders].

    PubMed

    Chen, Xiao-Bing; Cai, Wei-Xiong

    2013-04-01

    To develop a competency to stand trial rating scale for offenders with mental disorders, in accordance with the Chinese legal system. Starting from the legal elements, 15 items were extracted to form a preliminary instrument, named the competency to stand trial rating scale in offenders with mental disorders. The item analysis covered six aspects: critical ratio, item-total correlation, corrected item-total correlation, alpha value if item deleted, communalities of items, and factor loading. A logistic regression equation and the cut-off score of the ROC curve were used to explore diagnostic efficiency. The critical ratios of the extreme groups were 18.390-46.763; item-total correlations, 0.639-0.952; corrected item-total correlations, 0.582-0.944; communalities of items, 0.377-0.916; and factor loadings, 0.614-0.957. Seven items were included in the regression equation, and the accuracy of the back substitution test was 96.0%. A score of 33 was identified as the cut-off score by the ROC curve, and the agreement rate with expert evaluation was 95.8%. The sensitivity and specificity were 0.938 and 0.966, respectively, while the positive and negative likelihood ratios were 27.67 and 0.06, respectively. With all items satisfying the requirements of the homogeneity test, the rating scale has a reasonable construct and excellent diagnostic efficiency.

  2. A Comparison of Kernel Equating and Traditional Equipercentile Equating Methods and the Parametric Bootstrap Methods for Estimating Standard Errors in Equipercentile Equating

    ERIC Educational Resources Information Center

    Choi, Sae Il

    2009-01-01

    This study used simulation (a) to compare the kernel equating method to traditional equipercentile equating methods under the equivalent-groups (EG) design and the nonequivalent-groups with anchor test (NEAT) design and (b) to apply the parametric bootstrap method for estimating standard errors of equating. A two-parameter logistic item response…

  3. The Dutch Identity: A New Tool for the Study of Item Response Models.

    ERIC Educational Resources Information Center

    Holland, Paul W.

    1990-01-01

    The Dutch Identity is presented as a useful tool for expressing the basic equations of item response models that relate the manifest probabilities to the item response functions and the latent trait distribution. Ways in which the identity may be exploited are suggested and illustrated. (SLD)

  4. Developing physician pay arrangements: the cash and care equation.

    PubMed

    Levitch, J H

    1998-11-01

    Developing physician compensation packages that help a healthcare organization meet its business objectives while satisfying physician pay expectations requires new ways of linking pay to physician performance. Such compensation arrangements specifically should include pay tied to defined performance standards, compensation linked to group performance, performance incentives based on realistic, achievable goals, work performance measured by common criteria, and similar pay ensured for similar work. Final pay arrangements also should include items that are sometimes overlooked, such as fully delineated job responsibilities, performance measures aligned correctly with performance areas, and the value of benefits considered in the cash compensation levels.

  5. Preequating with Empirical Item Characteristic Curves: An Observed-Score Preequating Method

    ERIC Educational Resources Information Center

    Zu, Jiyun; Puhan, Gautam

    2014-01-01

    Preequating is in demand because it reduces score reporting time. In this article, we evaluated an observed-score preequating method: the empirical item characteristic curve (EICC) method, which makes preequating without item response theory (IRT) possible. EICC preequating results were compared with a criterion equating and with IRT true-score…

  6. Qualitative investigation of students' views about experimental physics

    NASA Astrophysics Data System (ADS)

    Hu, Dehui; Zwickl, Benjamin M.; Wilcox, Bethany R.; Lewandowski, H. J.

    2017-12-01

    This study examines students' reasoning surrounding seemingly contradictory Likert-scale responses within five items in the Colorado Learning Attitudes About Science Survey for Experimental Physics (E-CLASS). We administered the E-CLASS with embedded open-ended prompts, which asked students to provide explanations after making a Likert-scale selection. The quantitative scores on those items showed that our sample of the 216 students enrolled in first year and beyond first year physics courses demonstrated the same trends as previous national data. A qualitative analysis of students' open-ended responses was used to examine common reasoning patterns related to particular Likert-scale responses. When explaining responses to items regarding the role of experiments in confirming known results and also contributing to the growth of scientific knowledge, a common reasoning pattern suggested that confirming known results in a classroom experiment can help with understanding concepts. Thus, physics experiments contribute to students' personal scientific knowledge growth, while also confirming widely known results. Many students agreed that having correct formatting and making well-reasoned conclusions are the main goal for communicating experimental results. Students who focused on sections and formatting emphasized how it enables clear and efficient communication. However, very few students discussed the link between well-reasoned conclusions and effective scientific communication. Lastly, many students argued it was possible to complete experiments without understanding equations and physics concepts. The most common justification was that they could simply follow instructions to finish the lab without understanding. 
    The findings suggest several implications for teaching physics laboratory courses; for example, incorporating some lab activities with outcomes that are unknown to the students might have a significant impact on students' understanding of experiments as an important approach for developing scientific knowledge.

  7. An Evaluation of the Kernel Equating Method: A Special Study with Pseudotests Constructed from Real Test Data. Research Report. ETS RR-06-02

    ERIC Educational Resources Information Center

    von Davier, Alina A.; Holland, Paul W.; Livingston, Samuel A.; Casabianca, Jodi; Grant, Mary C.; Martin, Kathleen

    2006-01-01

    This study examines how closely the kernel equating (KE) method (von Davier, Holland, & Thayer, 2004a) approximates the results of other observed-score equating methods--equipercentile and linear equatings. The study used pseudotests constructed of item responses from a real test to simulate three equating designs: an equivalent groups (EG)…

  8. Effect of Item Response Theory (IRT) Model Selection on Testlet-Based Test Equating. Research Report. ETS RR-14-19

    ERIC Educational Resources Information Center

    Cao, Yi; Lu, Ru; Tao, Wei

    2014-01-01

    The local item independence assumption underlying traditional item response theory (IRT) models is often not met for tests composed of testlets. There are 3 major approaches to addressing this issue: (a) ignore the violation and use a dichotomous IRT model (e.g., the 2-parameter logistic [2PL] model), (b) combine the interdependent items to form a…

  9. Using Patient Health Questionnaire-9 item parameters of a common metric resulted in similar depression scores compared to independent item response theory model reestimation.

    PubMed

    Liegl, Gregor; Wahl, Inka; Berghöfer, Anne; Nolte, Sandra; Pieh, Christoph; Rose, Matthias; Fischer, Felix

    2016-03-01

    To investigate the validity of a common depression metric in independent samples. We applied a common metrics approach based on item-response theory for measuring depression to four German-speaking samples that completed the Patient Health Questionnaire (PHQ-9). We compared the PHQ item parameters reported for this common metric to reestimated item parameters that derived from fitting a generalized partial credit model solely to the PHQ-9 items. We calibrated the new model on the same scale as the common metric using two approaches (estimation with shifted prior and Stocking-Lord linking). By fitting a mixed-effects model and using Bland-Altman plots, we investigated the agreement between latent depression scores resulting from the different estimation models. We found different item parameters across samples and estimation methods. Although differences in latent depression scores between different estimation methods were statistically significant, these were clinically irrelevant. Our findings provide evidence that it is possible to estimate latent depression scores by using the item parameters from a common metric instead of reestimating and linking a model. The use of common metric parameters is simple, for example, using a Web application (http://www.common-metrics.org) and offers a long-term perspective to improve the comparability of patient-reported outcome measures. Copyright © 2016 Elsevier Inc. All rights reserved.

  10. An Approach to Scoring and Equating Tests with Binary Items: Piloting With Large-Scale Assessments

    ERIC Educational Resources Information Center

    Dimitrov, Dimiter M.

    2016-01-01

    This article describes an approach to test scoring, referred to as "delta scoring" (D-scoring), for tests with dichotomously scored items. The D-scoring uses information from item response theory (IRT) calibration to facilitate computations and interpretations in the context of large-scale assessments. The D-score is computed from the…

  11. The Missing Data Assumptions of the Nonequivalent Groups with Anchor Test (NEAT) Design and Their Implications for Test Equating. Research Report. ETS RR-09-16

    ERIC Educational Resources Information Center

    Sinharay, Sandip; Holland, Paul W.

    2008-01-01

    The nonequivalent groups with anchor test (NEAT) design involves missing data that are missing by design. Three popular equating methods that can be used with a NEAT design are the poststratification equating method, the chain equipercentile equating method, and the item-response-theory observed-score-equating method. These three methods each…

  12. ESPACOMP Medication Adherence Reporting Guidelines (EMERGE): a reactive-Delphi study protocol

    PubMed Central

    Helmy, R; Zullig, L L; Dunbar-Jacob, J; Hughes, D A; Vrijens, B; Wilson, I B; De Geest, S

    2017-01-01

    Introduction Medication adherence is fundamental to achieving optimal patient outcomes. Reporting research on medication adherence suffers from some issues—including conceptualisation, measurement and data analysis—that thwart its advancement. Using the ABC taxonomy for medication adherence as the conceptual basis, a steering committee of members of the European Society for Patient Adherence, COMpliance, and Persistence (ESPACOMP) launched an initiative to develop ESPACOMP Medication Adherence Reporting Guidelines (EMERGE). This paper is a protocol for a Delphi study that aims to build consensus among a group of topic experts regarding an item list that will support developing EMERGE. Methods and analysis This study uses a reactive-Delphi design where a group of topic experts will be asked to rate the relevance and clarity of an initial list of items, in addition to suggesting further items and/or modifications of the initial items. The initial item list, generated by the EMERGE steering committee through a structured process, consists of 26 items distributed in 2 sections: 4 items representing the taxonomy-based minimum reporting criteria, and 22 items organised according to the common reporting sections. A purposive sample of experts will be selected from relevant disciplines and diverse geographical locations. Consensus will be achieved through predefined decision rules to keep, delete or modify the items. An iterative process of online survey rounds will be carried out until consensus is reached. Ethics and dissemination An ethics approval was not required for the study according to the Swiss federal act on research involving human beings. The participating experts will be asked to give an informed consent. The results of this Delphi study will feed into EMERGE, which will be disseminated through peer-reviewed publications and presentations at conferences. 
    Additionally, the steering committee will encourage their endorsement by registering the guidelines at the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) network and other relevant organisations. PMID:28188154

  13. IRT Equating of the MCAT. MCAT Monograph.

    ERIC Educational Resources Information Center

    Hendrickson, Amy B.; Kolen, Michael J.

    This study compared various equating models and procedures for a sample of data from the Medical College Admission Test (MCAT), considering how item response theory (IRT) equating results compare with classical equipercentile results and how the results based on use of various IRT models, observed score versus true score, direct versus linked…

  14. Item response theory - A first approach

    NASA Astrophysics Data System (ADS)

    Nunes, Sandra; Oliveira, Teresa; Oliveira, Amílcar

    2017-07-01

    Item Response Theory (IRT) has become one of the most popular scoring frameworks for measurement data, frequently used in computerized adaptive testing, cognitively diagnostic assessment and test equating. According to Andrade et al. (2000), IRT can be defined as a set of mathematical models (Item Response Models - IRM) constructed to represent the probability of an individual giving the right answer to an item of a particular test. The number of Item Response Models available for measurement analysis has increased considerably in the last fifteen years due to increasing computer power and due to a demand for accuracy and more meaningful inferences grounded in complex data. The developments in modeling with Item Response Theory were related to developments in estimation theory, most remarkably Bayesian estimation with Markov chain Monte Carlo algorithms (Patz & Junker, 1999). The popularity of Item Response Theory has also prompted numerous overviews in books and journals, and many connections between IRT and other statistical estimation procedures, such as factor analysis and structural equation modeling, have been made repeatedly (van der Linden & Hambleton, 1997). As stated before, Item Response Theory covers a variety of measurement models, ranging from basic one-dimensional models for dichotomously and polytomously scored items and their multidimensional analogues to models that incorporate information about cognitive sub-processes which influence the overall item response process. The aim of this work is to introduce the main concepts associated with one-dimensional models of Item Response Theory, to specify the logistic models with one, two and three parameters, to discuss some properties of these models and to present the main estimation procedures.
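    The one-, two- and three-parameter logistic models named in this abstract can be illustrated with a minimal sketch. The function name, the parameter names (theta for ability, b for difficulty, a for discrimination, c for the pseudo-guessing lower asymptote) and the example values are assumptions chosen for illustration, not taken from the record; the optional scaling constant D = 1.7 used in some texts is omitted.

```python
import math


def irt_probability(theta, b, a=1.0, c=0.0):
    """Probability of a correct response under the logistic IRT family.

    theta: examinee ability; b: item difficulty;
    a: item discrimination (a=1, c=0 gives the one-parameter form);
    c: lower asymptote (c=0 gives the two-parameter form).
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical items, for illustration only.
p_1pl = irt_probability(theta=0.0, b=0.0)                # ability equals difficulty
p_2pl = irt_probability(theta=1.0, b=0.0, a=2.0)         # steeper item
p_3pl = irt_probability(theta=0.0, b=0.0, a=1.0, c=0.2)  # guessing floor of 0.2
```

    When ability equals difficulty, the one- and two-parameter forms give a probability of 0.5, while a non-zero lower asymptote c raises it to c + (1 - c)/2.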

  15. Structural Equation Model Approach to the Use of Response Times for Improving Estimation in Item Response Models

    ERIC Educational Resources Information Center

    Sen, Rohini

    2012-01-01

    In the last five decades, research on the uses of response time has extended into the field of psychometrics (Schnipke & Scrams, 1999; van der Linden, 2006; van der Linden, 2007), where interest has centered around the usefulness of response time information in item calibration and person measurement within an item response theory framework.…

  16. Solving Differential Equations Analytically. Elementary Differential Equations. Modules and Monographs in Undergraduate Mathematics and Its Applications Project. UMAP Unit 335.

    ERIC Educational Resources Information Center

    Goldston, J. W.

    This unit introduces analytic solutions of ordinary differential equations. The objective is to enable the student to decide whether a given function solves a given differential equation. Examples of problems from biology and chemistry are covered. Problem sets, quizzes, and a model exam are included, and answers to all items are provided. The…

  17. Multiple determinants of lifespan memory differences

    PubMed Central

    Henson, Richard N.; Campbell, Karen L.; Davis, Simon W.; Taylor, Jason R.; Emery, Tina; Erzinclioglu, Sharon; Tyler, Lorraine K.; Brayne, Carol; Bullmore, Edward T.; Calder, Andrew C.; Cusack, Rhodri; Dalgleish, Tim; Duncan, John; Matthews, Fiona E.; Marslen-Wilson, William D.; Rowe, James B.; Shafto, Meredith A.; Cheung, Teresa; Geerligs, Linda; McCarrey, Anna; Mustafa, Abdur; Price, Darren; Samu, David; Treder, Matthias; Tsvetanov, Kamen A.; van Belle, Janna; Williams, Nitin; Bates, Lauren; Gadie, Andrew; Gerbase, Sofia; Georgieva, Stanimira; Hanley, Claire; Parkin, Beth; Troy, David; Auer, Tibor; Correia, Marta; Gao, Lu; Green, Emma; Henriques, Rafael; Allen, Jodie; Amery, Gillian; Amunts, Liana; Barcroft, Anne; Castle, Amanda; Dias, Cheryl; Dowrick, Jonathan; Fair, Melissa; Fisher, Hayley; Goulding, Anna; Grewa, Adarsh; Hale, Geoff; Hilton, Andrew; Johnson, Frances; Johnston, Patricia; Kavanagh-Williamson, Thea; Kwasniewska, Magdalena; McMinn, Alison; Norman, Kim; Penrose, Jessica; Roby, Fiona; Rowland, Diane; Sargeant, John; Squire, Maggie; Stevens, Beth; Stoddart, Aldabra; Stone, Cheryl; Thompson, Tracy; Yazlik, Ozlem; Barnes, Dan; Dixon, Marie; Hillman, Jaya; Mitchell, Joanne; Villis, Laura; Kievit, Rogier A.

    2016-01-01

    Memory problems are among the most common complaints as people grow older. Using structural equation modeling of commensurate scores of anterograde memory from a large (N = 315), population-derived sample (www.cam-can.org), we provide evidence for three memory factors that are supported by distinct brain regions and show differential sensitivity to age. Associative memory and item memory are dramatically affected by age, even after adjusting for education level and fluid intelligence, whereas visual priming is not. Associative memory and item memory are differentially affected by emotional valence, and the age-related decline in associative memory is faster for negative than for positive or neutral stimuli. Gray-matter volume in the hippocampus, parahippocampus and fusiform cortex, and a white-matter index for the fornix, uncinate fasciculus and inferior longitudinal fasciculus, show differential contributions to the three memory factors. Together, these data demonstrate the extent to which differential ageing of the brain leads to differential patterns of memory loss. PMID:27600595

  18. Grouping and binding in visual short-term memory.

    PubMed

    Quinlan, Philip T; Cohen, Dale J

    2012-09-01

    Findings of 2 experiments are reported that challenge the current understanding of visual short-term memory (VSTM). In both experiments, a single study display, containing 6 colored shapes, was presented briefly and then probed with a single colored shape. At stake is how VSTM retains a record of different objects that share common features: In the 1st experiment, 2 study items sometimes shared a common feature (either a shape or a color). The data revealed a color sharing effect, in which memory was much better for items that shared a common color than for items that did not. The 2nd experiment showed that the size of the color sharing effect depended on whether a single pair of items shared a common color or whether 2 pairs of items were so defined: memory for all items improved when 2 color groups were presented. In explaining performance, an account is advanced in which items compete for a fixed number of slots, but then memory recall for any given stored item is prone to error. A critical assumption is that items that share a common color are stored together in a slot as a chunk. The evidence provides further support for the idea that principles of perceptual organization may determine the manner in which items are stored in VSTM. PsycINFO Database Record (c) 2012 APA, all rights reserved.

  19. Ipsative imputation for a 15-item Geriatric Depression Scale in community-dwelling elderly people.

    PubMed

    Imai, Hissei; Furukawa, Toshiaki A; Kasahara, Yoriko; Ishimoto, Yasuko; Kimura, Yumi; Fukutomi, Eriko; Chen, Wen-Ling; Tanaka, Mire; Sakamoto, Ryota; Wada, Taizo; Fujisawa, Michiko; Okumiya, Kiyohito; Matsubayashi, Kozo

    2014-09-01

    Missing data are inevitable in almost all medical studies. Imputation methods using the probabilistic model are common, but they cannot impute individual data and require special software. In contrast, the ipsative imputation method, which substitutes the missing items by the mean of the remaining items within the individual, is easy and does not need any special software, but it can provide individual scores. The aim of the present study was to evaluate the validity of the ipsative imputation method using data involving the 15-item Geriatric Depression Scale. Participants were community-dwelling elderly individuals (n = 1178). A structural equation model was constructed. The model fit indexes were calculated to assess the validity of the imputation method when it is used for individuals who were missing 20% of data or less and 40% of data or less, depending on whether we assumed that their correlation coefficients were the same as those of the dataset with no missing items. Finally, we compared path coefficients of the dataset imputed by ipsative imputation with those by multiple imputation. When compared with the assumption that the datasets differed, all of the model fit indexes were better under the assumption that the dataset without missing data is the same as the one that was missing 20% of data or less. However, by the same assumption, the model fit indexes were worse in the dataset that was missing 40% of data or less. The path coefficients of the dataset imputed by ipsative imputation and by multiple imputation were compatible with each other if the proportion of missing items was 20% or less. Ipsative imputation appears to be a valid imputation method and can be used to impute data in studies using the 15-item Geriatric Depression Scale, if the percentage of its missing items is 20% or less. © 2014 The Authors. Psychogeriatrics © 2014 Japanese Psychogeriatric Society.
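    The ipsative rule described in this abstract (replace each missing item with the mean of the same person's answered items) can be sketched as follows. The function name, the use of None for a missing item, the 0/1 item coding and the example responses are assumptions for illustration; the 20% cut-off mirrors the proportion the abstract reports as acceptable.

```python
def ipsative_impute(responses, max_missing_fraction=0.2):
    """Fill missing items (None) with the respondent's own mean.

    Returns the completed list of item scores, or None when the share of
    missing items exceeds max_missing_fraction (or nothing was answered).
    """
    n_items = len(responses)
    answered = [r for r in responses if r is not None]
    n_missing = n_items - len(answered)
    if not answered or n_missing / n_items > max_missing_fraction:
        return None
    person_mean = sum(answered) / len(answered)
    return [person_mean if r is None else r for r in responses]


# Hypothetical 15-item record with 3 missing answers (20% missing).
record = [1, 0, 0, 1, None, 0, 0, 1, 0, None, 0, 0, 1, 0, None]
completed = ipsative_impute(record)
total_score = sum(completed)

# Hypothetical record with 7 of 15 items missing: too sparse to impute.
sparse = [1, None, None, 0, None, None, 1, None, 0, None, 0, None, 1, 0, 0]
rejected = ipsative_impute(sparse)
```

    For a sum score this is equivalent to prorating: the mean of the answered items multiplied by the number of items.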

  20. Toward A Theory of Construct Definition.

    ERIC Educational Resources Information Center

    Stenner, A. Jackson; And Others

    1983-01-01

    In an attempt to restore the symmetry and balance between the study of person and item variation, this paper presents a novel methodology, construct specification equations, which allows one to ascertain from the lawful behavior of items what an instrument is measuring. (Author/PN)

  1. Application of a General Polytomous Testlet Model to the Reading Section of a Large-Scale English Language Assessment. Research Report. ETS RR-10-21

    ERIC Educational Resources Information Center

    Li, Yanmei; Li, Shuhong; Wang, Lin

    2010-01-01

    Many standardized educational tests include groups of items based on a common stimulus, known as "testlets". Standard unidimensional item response theory (IRT) models are commonly used to model examinees' responses to testlet items. However, it is known that local dependence among testlet items can lead to biased item parameter estimates…

  2. [Development of a Questionnaire Measuring Sexual Mental Health of Tibetan University Students].

    PubMed

    Chen, Jun-cheng; Yan, Yu-ruo; Ai, Li; Guo, Xue-hua; He, Jian-xiu; Yuan, Ping

    2016-05-01

    To develop a questionnaire measuring sexual mental health of Tibetan university students. A draft questionnaire was developed with reference to the Sexual Civilization Survey for University Students of New Century and other published literature, and in consultation with experts. The questionnaire was tested in 230 students. Exploratory factor analyses with principal component analysis and varimax orthogonal rotation were performed. Common factors with eigenvalues > 1 and ≥ 3 loaded items (factor loading ≥ 0.4) were retained. Items with a factor loading < 0.4, a communality < 0.2, or falling into a common factor with < 3 items were excluded. The revised questionnaire was administered in another sample of 481 university students. Cronbach's α and split-half reliabilities were estimated. Confirmatory factor analyses were performed to test the construct validity of the questionnaire. Four rounds of exploratory factor analyses reduced the draft questionnaire items from 39 to 34 with a 7-factor structure. The questionnaire had Cronbach's α values of 0.920, 0.898, 0.812, 0.844, 0.787, 0.684, 0.703, and 0.608, and Spearman-Brown coefficients of 0.763, 0.867, 0.742, 0.838, 0.746, 0.822, 0.677, and 0.564 for the overall questionnaire and its 7 domains, respectively, suggesting good internal reliability. The structural equation of confirmatory factor analysis fitted well with the raw data: χ²/df = 3.736; root mean square residual (RMR) = 0.081; root mean square error of approximation (RMSEA) = 0.076; goodness of fit index (GFI) = 0.805; adjusted goodness of fit index (AGFI) = 0.770; normed fit index (NFI) = 0.774; relative fit index (RFI) = 0.749; incremental fit index (IFI) = 0.824; non-normed fit index (NNFI) = 0.803; comparative fit index (CFI) = 0.823; parsimony goodness of fit index (PGFI) = 0.684; parsimony normed fit index (PNFI) = 0.698; parsimony comparative fit index (PCFI) = 0.742, suggesting good construct validity of the questionnaire. The Sexual Mental Health Questionnaire for Tibetan University Students has demonstrated good reliability and validity.
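    The two reliability statistics reported in this abstract, Cronbach's α and the Spearman-Brown split-half coefficient, follow standard textbook formulas, sketched below. The function names and the toy score matrix are assumptions for illustration and are unrelated to the study's data.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items matrix of scores.

    alpha = k / (k - 1) * (1 - sum of item variances / variance of totals)
    """
    k = len(scores[0])
    item_variances = [variance([row[j] for row in scores]) for j in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def spearman_brown(r_half):
    """Step a half-test correlation up to full-test reliability."""
    return 2 * r_half / (1 + r_half)


# Toy data: 4 respondents, 3 items that agree perfectly (alpha should be 1).
toy_scores = [[1, 1, 1], [2, 2, 2], [3, 3, 3], [4, 4, 4]]
alpha = cronbach_alpha(toy_scores)
stepped_up = spearman_brown(0.5)
```

    Real item data agree less perfectly than this toy matrix, which is why the abstract's values fall below 1.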

  3. Random Item IRT Models

    ERIC Educational Resources Information Center

    De Boeck, Paul

    2008-01-01

    It is common practice in IRT to consider items as fixed and persons as random. Both continuous and categorical person parameters are most often random variables, whereas for items only continuous parameters are used and they are commonly of the fixed type, although exceptions occur. It is shown in the present article that random item parameters…

  4. Using Automated Essay Scores as an Anchor When Equating Constructed Response Writing Tests

    ERIC Educational Resources Information Center

    Almond, Russell G.

    2014-01-01

    Assessments consisting of only a few extended constructed response items (essays) are not typically equated using anchor test designs as there are typically too few essay prompts in each form to allow for meaningful equating. This article explores the idea that output from an automated scoring program designed to measure writing fluency (a common…

  5. The stroke impairment assessment set: its internal consistency and predictive validity.

    PubMed

    Tsuji, T; Liu, M; Sonoda, S; Domen, K; Chino, N

    2000-07-01

    To study the scale quality and predictive validity of the Stroke Impairment Assessment Set (SIAS) developed for stroke outcome research. Rasch analysis of the SIAS; stepwise multiple regression analysis to predict discharge functional independence measure (FIM) raw scores from demographic data, the SIAS scores, and the admission FIM scores; cross-validation of the prediction rule. Tertiary rehabilitation center in Japan. One hundred ninety stroke inpatients for the study of the scale quality and the predictive validity; a second sample of 116 stroke inpatients for the cross-validation study. Mean square fit statistics to study the degree of fit to the unidimensional model; logits to express item difficulties; discharge FIM scores for the study of predictive validity. The degree of misfit was acceptable except for the shoulder range of motion (ROM), pain, visuospatial function, and speech items; and the SIAS items could be arranged on a common unidimensional scale. The difficulty patterns were identical at admission and at discharge except for the deep tendon reflexes, ROM, and pain items. They were also similar for the right- and left-sided brain lesion groups except for the speech and visuospatial items. For the prediction of the discharge FIM scores, the independent variables selected were age, the SIAS total scores, and the admission FIM scores; and the adjusted R2 was .64 (p < .0001). Stability of the predictive equation was confirmed in the cross-validation sample (R2 = .68, p < .001). The unidimensionality of the SIAS was confirmed, and the SIAS total scores proved useful for stroke outcome prediction.

  6. An analysis of the optimal multiobjective inventory clustering decision with small quantity and great variety inventory by applying a DPSO.

    PubMed

    Wang, Shen-Tsu; Li, Meng-Hua

    2014-01-01

    When an enterprise has thousands of varieties in its inventory, a single management method may not be feasible. A better way to manage this problem is to categorise inventory items into several clusters according to inventory decisions and to use different management methods for different clusters. The present study applies DPSO (dynamic particle swarm optimisation) to the problem of clustering inventory items. Without requiring prior inventory knowledge, inventory items are automatically clustered into a near-optimal number of clusters. The clustering results should satisfy the inventory objective equation, which consists of different objectives such as total cost, backorder rate, demand relevance, and inventory turnover rate. This study integrates these four objectives into a multiobjective equation, and inputs the actual inventory items of the enterprise into DPSO. In comparison with other clustering methods, the proposed method can consider different objectives and obtain an overall better solution, with better convergence results and inventory decisions.

  7. Assessing Psycho-social Barriers to Rehabilitation in Injured Workers with Chronic Musculoskeletal Pain: Development and Item Properties of the Yellow Flag Questionnaire (YFQ).

    PubMed

    Salathé, Cornelia Rolli; Trippolini, Maurizio Alen; Terribilini, Livio Claudio; Oliveri, Michael; Elfering, Achim

    2018-06-01

    Purpose To develop a multidimensional scale to assess psychosocial beliefs, the Yellow Flag Questionnaire (YFQ), aimed at guiding interventions for workers with chronic musculoskeletal (MSK) pain. Methods Phase 1 consisted of item selection based on literature search, item development and expert consensus rounds. In phase 2, items were reduced by calculating a quality score per item, using structural equation modeling and confirmatory factor analysis on data from 666 workers. In phase 3, Cronbach's α and Pearson correlation coefficients were computed to compare YFQ with disability, anxiety, depression and self-efficacy and the YFQ score based on data from 253 injured workers. Regressions of YFQ total score on disability, anxiety, depression and self-efficacy were calculated. Results After phase 1, the YFQ included 116 items and 15 domains. Further reductions of items in phase 2 by applying the item quality criteria reduced the total to 48 items. Phase factor analysis with structural equation modeling confirmed 32 items in seven domains: activity, work, emotions, harm & blame, diagnosis beliefs, co-morbidity and control. Cronbach's α was 0.91 for the total score and between 0.49 and 0.81 for the 7 domain scores. Correlations of the YFQ total score with disability, anxiety, depression and self-efficacy were .58, .66, .73, and -.51, respectively. After controlling for age and gender, the YFQ total score explained between 27% and 53% (R2) of the variance of disability, anxiety, depression and self-efficacy. Conclusions The YFQ, a multidimensional screening scale, is recommended for use to assess psychosocial beliefs of workers with chronic MSK pain. Further evaluation of the measurement properties such as the test-retest reliability, responsiveness and prognostic validity is warranted.

  8. Modeling Local Item Dependence Due to Common Test Format with a Multidimensional Rasch Model

    ERIC Educational Resources Information Center

    Baghaei, Purya; Aryadoust, Vahid

    2015-01-01

    Research shows that test method can exert a significant impact on test takers' performance and thereby contaminate test scores. We argue that common test method can exert the same effect as common stimuli and violate the conditional independence assumption of item response theory models because, in general, subsets of items which have a shared…

  9. Developing standards for reporting implementation studies of complex interventions (StaRI): a systematic review and e-Delphi.

    PubMed

    Pinnock, Hilary; Epiphaniou, Eleni; Sheikh, Aziz; Griffiths, Chris; Eldridge, Sandra; Craig, Peter; Taylor, Stephanie J C

    2015-03-30

    Dissemination and implementation of health care interventions are currently hampered by the variable quality of reporting of implementation research. Reporting of other study types has been improved by the introduction of reporting standards (e.g. CONSORT). We are therefore developing guidelines for reporting implementation studies (StaRI). Using established methodology for developing health research reporting guidelines, we systematically reviewed the literature to generate items for a checklist of reporting standards. We then recruited an international, multidisciplinary panel for an e-Delphi consensus-building exercise which comprised an initial open round to revise/suggest a list of potential items for scoring in the subsequent two scoring rounds (scale 1 to 9). Consensus was defined a priori as 80% agreement with the priority scores of 7, 8, or 9. We identified eight papers from the literature review from which we derived 36 potential items. We recruited 23 experts to the e-Delphi panel. Open round comments resulted in revisions, and 47 items went forward to the scoring rounds. Thirty-five items achieved consensus: 19 achieved 100% agreement. Prioritised items addressed the need to: provide an evidence-based justification for implementation; describe the setting, professional/service requirements, eligible population and intervention in detail; measure process and clinical outcomes at population level (using routine data); report impact on health care resources; describe local adaptations to the implementation strategy and describe barriers/facilitators. Over-arching themes from the free-text comments included balancing the need for detailed descriptions of interventions with publishing constraints, addressing the dual aims of reporting on the process of implementation and effectiveness of the intervention and monitoring fidelity to an intervention whilst encouraging adaptation to suit diverse local contexts. 
    We have identified priority items for reporting implementation studies and key issues for further discussion. An international, multidisciplinary workshop, where participants will debate the issues raised, clarify specific items and develop StaRI standards that fit within the suite of EQUATOR reporting guidelines, is planned. The protocol is registered with Equator: http://www.equator-network.org/library/reporting-guidelines-under-development/#17 .
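    The a priori consensus rule stated in this abstract (80% agreement with priority scores of 7, 8, or 9 on a 1 to 9 scale) can be read as a simple proportion check, sketched below. This is one plausible reading, not the authors' procedure; the function name and the example ratings are hypothetical, with 23 ratings chosen only to match the reported panel size.

```python
def reaches_consensus(scores, threshold=0.8, priority_scores=(7, 8, 9)):
    """Return True when the share of panel ratings that fall in the
    priority band (7, 8 or 9 by default) is at least the threshold."""
    agreeing = sum(1 for s in scores if s in priority_scores)
    return agreeing / len(scores) >= threshold


# Hypothetical ratings from a 23-member panel for two candidate items.
item_a = [9] * 10 + [8] * 6 + [7] * 3 + [5] * 4   # 19 of 23 in the band
item_b = [9] * 10 + [8] * 5 + [7] * 3 + [4] * 5   # 18 of 23 in the band

item_a_kept = reaches_consensus(item_a)
item_b_kept = reaches_consensus(item_b)
```

    With 23 panelists, 19 ratings in the 7 to 9 band (about 83%) meet the threshold, while 18 (about 78%) fall just short.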

  10. The Long-Term Sustainability of Different Item Response Theory Scaling Methods

    ERIC Educational Resources Information Center

    Keller, Lisa A.; Keller, Robert R.

    2011-01-01

    This article investigates the accuracy of examinee classification into performance categories and the estimation of the theta parameter for several item response theory (IRT) scaling techniques when applied to six administrations of a test. Previous research has investigated only two administrations; however, many testing programs equate tests…

  11. The study on the outsourcing of Taiwan's hospitals: a questionnaire survey research

    PubMed Central

    Hsiao, Chih-Tung; Pai, Jar-Yuan; Chiu, Hero

    2009-01-01

    Background The aim of this study was to assess the outsourcing situation in Taiwanese hospitals and to compare differences by hospital ownership and accreditation level. Methods This research combined two methods: a questionnaire survey and in-depth interviews with two CEOs of the sample hospitals. One hospital is not-for-profit, while the other is a public hospital. The research samples are from Taiwan's 2005 to 2007 Department of Health qualifying lists of hospital accreditation. The returned questionnaires were analyzed with STATISTICA® version 7.1 software. Results The results for non-medical items showed medical waste and common trash both have the highest rate (94.6 percent) of being outsourced. The gift store (75 percent) and linen (73 percent) follow close behind, while the lowest rate of outsourcing is in utility maintenance (13.5 percent). For medical items, the highest rate of outsourcing is in the ambulance units (51.4 percent), while the hemodialysis center follows close behind with a rate of 50 percent. For departments of nutrition, pharmacy, and nursing, however, the outsourcing rate is lower than 3 percent. This shows that Taiwan's hospitals are still conservative in their willingness to outsource medical items. The results of the satisfaction paired t-test show that the non-medical items have a higher score than the medical items. The factor analysis showed that the three significant factors in the outsourcing of non-medical items are "performance", "finance", and "human resource". For medical items, the two factors are "operation" and "satisfaction". To further examine the factor validity and reliability of the satisfaction model, a confirmatory factor analysis (CFA) was conducted using the structural equation modeling (SEM) method, and the model was found to fit well. Conclusion Hospitals, especially public hospitals, can benefit from outsourcing to relieve full-time-equivalent and human resource limitations. PMID:19435526

  12. The study on the outsourcing of Taiwan's hospitals: a questionnaire survey research.

    PubMed

    Hsiao, Chih-Tung; Pai, Jar-Yuan; Chiu, Hero

    2009-05-13

    The aim of this study was to assess the outsourcing situation in Taiwanese hospitals and to compare differences by hospital ownership and accreditation level. This research combined two methods: a questionnaire survey and in-depth interviews with two CEOs of the sample hospitals. One hospital is not-for-profit, while the other is a public hospital. The research samples are from Taiwan's 2005 to 2007 Department of Health qualifying lists of hospital accreditation. The returned questionnaires were analyzed with STATISTICA version 7.1 software. The results for non-medical items showed medical waste and common trash both have the highest rate (94.6 percent) of being outsourced. The gift store (75 percent) and linen (73 percent) follow close behind, while the lowest rate of outsourcing is in utility maintenance (13.5 percent). For medical items, the highest rate of outsourcing is in the ambulance units (51.4 percent), while the hemodialysis center follows close behind with a rate of 50 percent. For departments of nutrition, pharmacy, and nursing, however, the outsourcing rate is lower than 3 percent. This shows that Taiwan's hospitals are still conservative in their willingness to outsource medical items. The results of the satisfaction paired t-test show that the non-medical items have a higher score than the medical items. The factor analysis showed that the three significant factors in the outsourcing of non-medical items are "performance", "finance", and "human resource". For medical items, the two factors are "operation" and "satisfaction". To further examine the factor validity and reliability of the satisfaction model, a confirmatory factor analysis (CFA) was conducted using the structural equation modeling (SEM) method, and the model was found to fit well. Hospitals, especially public hospitals, can benefit from outsourcing to relieve full-time-equivalent and human resource limitations.

  13. Measuring attributes of health literate health care organizations from the patients' perspective: Development and validation of a questionnaire to assess health literacy-sensitive communication (HL-COM).

    PubMed

    Ernstmann, Nicole; Halbach, Sarah; Kowalski, Christoph; Pfaff, Holger; Ansmann, Lena

    2017-04-01

    Studies addressing the organizational contexts of care that may help increase the patients' ability to cope with a disease and to navigate through the health care system are still rare. Especially instruments allowing the assessment of such organizational efforts from the patients' perspective are missing. The aim of our study was to develop a survey instrument assessing organizational health literacy (HL) from the patients' perspective, i. e., health care organizations' responsiveness to patients' individual needs. A pool of 30 items was developed by a group of experts based on a literature review. The items were developed, tested and prioritized according to their importance in 11 semi-structured interviews and cognitive think-aloud interviews with cancer patients. The resulting 16 items were rated in a standardized postal survey involving a total of N=453 colon and breast cancer patients treated in cancer centers in Germany. An exploratory factor analysis, a confirmatory factor analysis and structural equation modelling were conducted. Item properties were analyzed. 83.2 % of the patients were diagnosed with breast cancer, 16.8 % had a diagnosis of colon cancer. The patients' mean age was 61 (26-88), 89.4 % were female. The most common comorbidities were hypertension (34.0 %) and cardiovascular disease (11.0 %). The final prediction model included nine items measuring the degree of health literacy-sensitivity of communication. The model showed an acceptable model fit. The nine items showed corrected item-total correlations between .622 and .762 and item difficulties between 0.77 and 0.87. Cronbach's α was .912. In a comprehensive development process, the original item pool comprising several aspects of organizational HL was reduced to a one-dimensional scale. The instrument measures an important aspect of organizational HL; i.e., the degree of health literacy-sensitivity of communication (HL-COM). 
HL-COM was found to impact patient enablement, mediated through the support by physicians. Future research will have to test these associations in the context of other diseases or institutions. Copyright © 2017. Published by Elsevier GmbH.
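
    The reliability figures quoted in this abstract (Cronbach's α and corrected item-total correlations) follow standard formulas. A minimal Python sketch, assuming complete responses stored one respondent per row; the function names and the toy data are hypothetical and are not taken from the study:

```python
from statistics import mean, variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    item_vars = [variance(col) for col in zip(*rows)]   # variance of each item
    total_var = variance([sum(r) for r in rows])        # variance of the sum score
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def pearson(x, y):
    """Plain Pearson correlation between two equally long sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def corrected_item_total(rows, item):
    """Correlation of one item with the sum of all *other* items."""
    item_scores = [r[item] for r in rows]
    rest_scores = [sum(r) - r[item] for r in rows]
    return pearson(item_scores, rest_scores)


# Hypothetical toy data: 3 respondents, 3 perfectly consistent items.
responses = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
alpha = cronbach_alpha(responses)
r_item0 = corrected_item_total(responses, 0)
```

    The "corrected" correlation excludes the item itself from the total, so an item does not inflate its own correlation with the scale.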

  14. ESPACOMP Medication Adherence Reporting Guidelines (EMERGE): a reactive-Delphi study protocol.

    PubMed

    Helmy, R; Zullig, L L; Dunbar-Jacob, J; Hughes, D A; Vrijens, B; Wilson, I B; De Geest, S

    2017-02-10

    Medication adherence is fundamental to achieving optimal patient outcomes. Reporting research on medication adherence suffers from some issues (including conceptualisation, measurement and data analysis) that thwart its advancement. Using the ABC taxonomy for medication adherence as the conceptual basis, a steering committee of members of the European Society for Patient Adherence, COMpliance, and Persistence (ESPACOMP) launched an initiative to develop ESPACOMP Medication Adherence Reporting Guidelines (EMERGE). This paper is a protocol for a Delphi study that aims to build consensus among a group of topic experts regarding an item list that will support developing EMERGE. This study uses a reactive-Delphi design where a group of topic experts will be asked to rate the relevance and clarity of an initial list of items, in addition to suggesting further items and/or modifications of the initial items. The initial item list, generated by the EMERGE steering committee through a structured process, consists of 26 items distributed in 2 sections: 4 items representing the taxonomy-based minimum reporting criteria, and 22 items organised according to the common reporting sections. A purposive sample of experts will be selected from relevant disciplines and diverse geographical locations. Consensus will be achieved through predefined decision rules to keep, delete or modify the items. An iterative process of online survey rounds will be carried out until consensus is reached. Ethics approval was not required for the study according to the Swiss federal act on research involving human beings. The participating experts will be asked to give informed consent. The results of this Delphi study will feed into EMERGE, which will be disseminated through peer-reviewed publications and presentations at conferences. 
Additionally, the steering committee will encourage their endorsement by registering the guidelines at the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) network and other relevant organisations. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/.

  15. A new predictive indicator for development of pressure ulcers in bedridden patients based on common laboratory tests results.

    PubMed

    Hatanaka, N; Yamamoto, Y; Ichihara, K; Mastuo, S; Nakamura, Y; Watanabe, M; Iwatani, Y

    2008-04-01

    Various scales have been devised to predict development of pressure ulcers on the basis of clinical and laboratory data, such as the Braden Scale (Braden score), which is used to monitor activity and skin conditions of bedridden patients. However, none of these scales facilitates clinically reliable prediction. This study aimed to develop a clinical laboratory data-based predictive equation for the development of pressure ulcers. Subjects were 149 hospitalised patients with respiratory disorders who were monitored for the development of pressure ulcers over a 3-month period. The proportional hazards model (Cox regression) was used to analyse the results of 12 basic laboratory tests on the day of hospitalisation in comparison with the Braden score. Pressure ulcers developed in 38 patients within the study period. A Cox regression model consisting solely of Braden scale items showed that none of these items contributed significantly to predicting pressure ulcers. Rather, a combination of haemoglobin (Hb), C-reactive protein (CRP), albumin (Alb), age, and gender produced the best model for prediction. Using this set of explanatory variables, we created a new indicator based on a multiple logistic regression equation. The new indicator showed high sensitivity (0.73) and specificity (0.70), and its diagnostic power was higher than that of Alb, Hb, CRP, or the Braden score alone. The new indicator may become a more useful clinical tool for predicting pressure ulcers than the Braden score. The new indicator warrants verification studies to facilitate its clinical implementation in the future.
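
    Sensitivity and specificity of the kind reported in this abstract are computed by thresholding a predicted score and comparing it with the observed outcome. A minimal Python sketch; the scores, outcomes, and the 0.5 cut-off are made-up illustrations, not the study's data or its regression equation:

```python
def sensitivity_specificity(outcomes, scores, threshold):
    """Return (sensitivity, specificity) for scores classified at `threshold`.

    outcomes: 1 if the event (e.g. an ulcer) occurred, else 0.
    scores:   predicted risk in [0, 1], e.g. from a logistic model.
    """
    predicted = [s >= threshold for s in scores]
    tp = sum(1 for o, p in zip(outcomes, predicted) if o and p)
    fn = sum(1 for o, p in zip(outcomes, predicted) if o and not p)
    tn = sum(1 for o, p in zip(outcomes, predicted) if not o and not p)
    fp = sum(1 for o, p in zip(outcomes, predicted) if not o and p)
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical data: 4 patients with the event, 6 without.
outcomes = [1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.6, 0.3, 0.7, 0.4, 0.2, 0.2, 0.1, 0.1]
sens, spec = sensitivity_specificity(outcomes, scores, threshold=0.5)
```

    Moving the threshold trades one quantity against the other, which is why both are usually reported together.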

  16. Test Score Equating Using a Mini-Version Anchor and a Midi Anchor: A Case Study Using SAT® Data

    ERIC Educational Resources Information Center

    Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Curley, Edward; Feigenbaum, Miriam

    2011-01-01

    This study explores an anchor that is different from the traditional miniature anchor in test score equating. In contrast to a traditional "mini" anchor that has the same spread of item difficulties as the tests to be equated, the studied anchor, referred to as a "midi" anchor (Sinharay & Holland), has a smaller spread of…

  17. The Effects of Different Types of Anchor Tests on Observed Score Equating. Research Report. ETS RR-09-41

    ERIC Educational Resources Information Center

    Liu, Jinghua; Sinharay, Sandip; Holland, Paul W.; Feigenbaum, Miriam; Curley, Edward

    2009-01-01

    This study explores the use of a different type of anchor, a "midi anchor", that has a smaller spread of item difficulties than the tests to be equated, and then contrasts its use with the use of a "mini anchor". The impact of different anchors on observed score equating was evaluated and compared with respect to systematic…

  18. Force Limited Vibration Testing

    NASA Technical Reports Server (NTRS)

    Scharton, Terry; Chang, Kurng Y.

    2005-01-01

    This slide presentation reviews the concept and applications of Force Limited Vibration Testing. The goal of vibration testing of aerospace hardware is to identify problems that would result in flight failures. The commonly used aerospace vibration tests use artificially high shaker forces and responses at the resonance frequencies of the test item. It has become common to limit the acceleration responses in the test to those predicted for the flight. This requires an analysis of the acceleration response, and requires placing accelerometers on the test item. With the advent of piezoelectric gages it has become possible to improve vibration testing. The basic equations are reviewed. Force limits are analogous and complementary to the acceleration specifications used in conventional vibration testing. Just as the acceleration specification is the frequency spectrum envelope of the in-flight acceleration at the interface between the test item and flight mounting structure, the force limit is the envelope of the in-flight force at the interface. In force limited vibration tests, both the acceleration and force specifications are needed, and the force specification is generally based on and proportional to the acceleration specification. Therefore, force limiting does not compensate for errors in the development of the acceleration specification, e.g., too much conservatism or the lack thereof. These errors will carry over into the force specification. Since in-flight vibratory force data are scarce, force limits are often derived from coupled system analyses and impedance information obtained from measurements or finite element models (FEM). Fortunately, data on the interface forces between systems and components are now available from system acoustic and vibration tests of development test models and from a few flight experiments. 
Semi-empirical methods of predicting force limits are currently being developed on the basis of the limited flight and system test data. A simple two degree of freedom system is shown and the governing equations for basic force limiting results for this system are reviewed. The design and results of the shuttle vibration forces (SVF) experiments are reviewed. The Advanced Composition Explorer (ACE) also was used to validate force limiting. Test instrumentation and supporting equipment are reviewed including piezo-electric force transducers, signal processing and conditioning systems, test fixtures, and vibration controller systems. Several examples of force limited vibration testing are presented with some results.

  19. Thoughts of death and self-harm in patients with epilepsy or multiple sclerosis in a tertiary care center.

    PubMed

    Dickstein, Leah P; Viguera, Adele C; Nowacki, Amy S; Thompson, Nicolas R; Griffith, Sandra D; Baldessarini, Ross J; Katzan, Irene L

    2015-01-01

    Patients with epilepsy or multiple sclerosis (MS) have high risks of depression and increased risks of suicide, but little is known about their risks of suicidal ideation. We sought to (1) estimate the prevalence of thoughts of being better off dead or of self-harm among patients with epilepsy or MS, (2) identify risk factors for such thoughts, and (3) determine whether any risk factors interact with depression to predict such thoughts. A Cleveland Clinic database provided information on 20,734 visits of 6586 outpatients with epilepsy or MS. Outcome measures were thoughts of death or self-harm (Patient Health Questionnaire [PHQ] item-9), and total score ≥10 for the 8 remaining PHQ items (probable major depression). Generalized estimating equations accounted for repeat visits in tests of associations of PHQ item-9 responses with depression, age, sex, race, household income, disease severity, and quality of life. Prevalence of thoughts of death or self-harm averaged 14.4% overall (epilepsy, 14.0% and MS, 14.7%). Factors associated with positive PHQ item-9 responses in epilepsy were depression and male sex, modified by poor quality of life. Factors associated with positive PHQ item-9 in MS were depression, male sex, medical comorbidity, and poor quality of life; the effect of depression was worse with greater MS severity and being unmarried. Among patients with common neurologic disorders (epilepsy or MS), 14%-15% reported thoughts of death or self-harm associated with illness severity, depression, quality of life, male sex, and being unmarried. Such patients require further evaluation of clinical outcomes and effects of treatment. Copyright © 2015 The Academy of Psychosomatic Medicine. Published by Elsevier Inc. All rights reserved.

  20. Dissociative effects of orthographic distinctiveness in pure and mixed lists: an item-order account.

    PubMed

    McDaniel, Mark A; Cahill, Michael; Bugg, Julie M; Meadow, Nathaniel G

    2011-10-01

    We apply the item-order theory of list composition effects in free recall to the orthographic distinctiveness effect. The item-order account assumes that orthographically distinct items advantage item-specific encoding in both mixed and pure lists, but at the expense of exploiting relational information present in the list. Experiment 1 replicated the typical free recall advantage of orthographically distinct items in mixed lists and the elimination of that advantage in pure lists. Supporting the item-order account, recognition performances indicated that orthographically distinct items received greater item-specific encoding than did orthographically common items in mixed and pure lists (Experiments 1 and 2). Furthermore, order memory (input-output correspondence and sequential contiguity effects) was evident in recall of pure unstructured common lists, but not in recall of unstructured distinct lists (Experiment 1). These combined patterns, although not anticipated by prevailing views, are consistent with an item-order account.

  1. Measuring Emergent Organizational Properties: A Structural Equation Modeling Test of Self- versus Group-Referent Perceptions

    ERIC Educational Resources Information Center

    Goddard, Roger D.; LoGerfo, Laura F.

    2007-01-01

    This article presents a theoretical rationale and empirical evidence regarding the validity of scores obtained from two competing approaches to operationalizing scale items to measure emergent organizational properties. The authors consider whether items in scales intended to measure organizational properties should prompt survey takers to provide…

  2. The Matching Criterion Purification for Differential Item Functioning Analyses in a Large-Scale Assessment

    ERIC Educational Resources Information Center

    Lee, HyeSun; Geisinger, Kurt F.

    2016-01-01

    The current study investigated the impact of matching criterion purification on the accuracy of differential item functioning (DIF) detection in large-scale assessments. The three matching approaches for DIF analyses (block-level matching, pooled booklet matching, and equated pooled booklet matching) were employed with the Mantel-Haenszel…

  3. The Effect of Year-to-Year Rater Variation on IRT Linking

    ERIC Educational Resources Information Center

    Yen, Shu Jing; Ochieng, Charles; Michaels, Hillary; Friedman, Greg

    2005-01-01

    Year-to-year rater variation may result in constructed response (CR) parameter changes, making CR items inappropriate to use in anchor sets for linking or equating. This study demonstrates how rater severity affected the writing and reading scores. Rater adjustments were made to statewide results using an item response theory (IRT) methodology…

  4. Observed-Score Equating as a Test Assembly Problem.

    ERIC Educational Resources Information Center

    van der Linden, Wim J.; Luecht, Richard M.

    1998-01-01

    Derives a set of linear conditions of item-response functions that guarantees identical observed-score distributions on two test forms. The conditions can be added as constraints to a linear programming model for test assembly. An example illustrates the use of the model for an item pool from the Law School Admissions Test (LSAT). (SLD)
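
    The observed-score distribution this abstract refers to is determined, at each ability level, by the items' response probabilities. A minimal Python sketch of that link, using the well-known Lord-Wingersky recursion for the number-correct distribution; the 2PL response function and all parameter values are assumptions for illustration, and the paper's own linear conditions are not reproduced here:

```python
import math


def irf_2pl(theta, a, b):
    """Two-parameter logistic item response function (an assumed model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def score_distribution(probs):
    """Lord-Wingersky recursion: P(number-correct = x) for x = 0..n,
    given each item's probability of a correct response at a fixed ability."""
    dist = [1.0]                       # zero items: score 0 with certainty
    for p in probs:
        new = [0.0] * (len(dist) + 1)
        for score, mass in enumerate(dist):
            new[score] += mass * (1 - p)   # item answered incorrectly
            new[score + 1] += mass * p     # item answered correctly
        dist = new
    return dist


# Hypothetical two-item form evaluated at theta = 0.
form = [(1.0, 0.0), (1.0, 0.0)]            # (a, b) pairs, made up
probs = [irf_2pl(0.0, a, b) for a, b in form]
dist = score_distribution(probs)
```

    Two forms whose distributions from `score_distribution` agree at every ability level would have identical observed-score distributions in any population, which is the property the abstract describes.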

  5. PE Metrics: Background, Testing Theory, and Methods

    ERIC Educational Resources Information Center

    Zhu, Weimo; Rink, Judy; Placek, Judith H.; Graber, Kim C.; Fox, Connie; Fisette, Jennifer L.; Dyson, Ben; Park, Youngsik; Avery, Marybell; Franck, Marian; Raynes, De

    2011-01-01

    New testing theories, concepts, and psychometric methods (e.g., item response theory, test equating, and item bank) developed during the past several decades have many advantages over previous theories and methods. In spite of their introduction to the field, they have not been fully accepted by physical educators. Further, the manner in which…

  6. An Analysis of the Optimal Multiobjective Inventory Clustering Decision with Small Quantity and Great Variety Inventory by Applying a DPSO

    PubMed Central

    Li, Meng-Hua

    2014-01-01

    When an enterprise has thousands of varieties in its inventory, a single management method may not be feasible. A better way to manage this problem would be to categorise inventory items into several clusters according to inventory decisions and to use different management methods for managing different clusters. The present study applies DPSO (dynamic particle swarm optimisation) to a problem of clustering of inventory items. Without requiring prior inventory knowledge, inventory items are automatically clustered into a near-optimal number of clusters. The obtained clustering results should satisfy the inventory objective equation, which consists of different objectives such as total cost, backorder rate, demand relevance, and inventory turnover rate. This study integrates the above four objectives into a multiobjective equation, and inputs the actual inventory items of the enterprise into DPSO. In comparison with other clustering methods, the proposed method can consider different objectives and obtain an overall better solution, with better convergence results and inventory decisions. PMID:25197713

  7. Variation in passing standards for graduation-level knowledge items at UK medical schools.

    PubMed

    Taylor, Celia A; Gurnell, Mark; Melville, Colin R; Kluth, David C; Johnson, Neil; Wass, Val

    2017-06-01

    Given the absence of a common passing standard for students at UK medical schools, this paper compares independently set standards for common 'one from five' single-best-answer (multiple-choice) items used in graduation-level applied knowledge examinations and explores potential reasons for any differences. A repeated cross-sectional study was conducted. Participating schools were sent a common set of graduation-level items (55 in 2013-2014; 60 in 2014-2015). Items were selected against a blueprint and subjected to a quality review process. Each school employed its own standard-setting process for the common items. The primary outcome was the passing standard for the common items by each medical school set using the Angoff or Ebel methods. Of 31 invited medical schools, 22 participated in 2013-2014 (71%) and 30 (97%) in 2014-2015. Schools used a mean of 49 and 53 common items in 2013-2014 and 2014-2015, respectively, representing around one-third of the items in the examinations in which they were embedded. Data from 19 (61%) and 26 (84%) schools, respectively, met the inclusion criteria for comparison of standards. There were statistically significant differences in the passing standards set by schools in both years (effect sizes (f²): 0.041 in 2013-2014 and 0.218 in 2014-2015; both p < 0.001). The interquartile range of standards was 5.7 percentage points in 2013-2014 and 6.5 percentage points in 2014-2015. There was a positive correlation between the relative standards set by schools in the 2 years (Pearson's r = 0.57, n = 18, p = 0.014). Time allowed per item, method of standard setting and timing of examination in the curriculum did not have a statistically significant impact on standards. Independently set standards for common single-best-answer items used in graduation-level examinations vary across UK medical schools. 
Further work to examine standard-setting processes in more detail is needed to help explain this variability and develop methods to reduce it. © 2017 John Wiley & Sons Ltd and The Association for the Study of Medical Education.
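
    For readers unfamiliar with the Angoff method named in this abstract, a minimal Python sketch of the basic calculation follows. It assumes the common variant in which each judge estimates, per item, the probability that a borderline candidate answers correctly; the ratings are invented and do not come from the study:

```python
from statistics import mean


def angoff_cut_score(ratings):
    """Passing standard as a percentage of the available marks.

    ratings[j][i] is judge j's estimated probability that a minimally
    competent (borderline) candidate answers item i correctly.
    """
    per_judge = [mean(judge) for judge in ratings]   # each judge's implied standard
    return 100 * mean(per_judge)                     # average over judges


# Hypothetical panel: 2 judges rating 2 items.
ratings = [[0.5, 0.7], [0.6, 0.8]]
cut = angoff_cut_score(ratings)
```

    Because the result depends entirely on the judges' estimates, different panels rating the same items can arrive at different standards, which is the kind of variation the study measures.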

  8. Diagrams benefit symbolic problem-solving.

    PubMed

    Chu, Junyi; Rittle-Johnson, Bethany; Fyfe, Emily R

    2017-06-01

    The format of a mathematics problem often influences students' problem-solving performance. For example, providing diagrams in conjunction with story problems can benefit students' understanding, choice of strategy, and accuracy on story problems. However, it remains unclear whether providing diagrams in conjunction with symbolic equations can benefit problem-solving performance as well. We tested the impact of diagram presence on students' performance on algebra equation problems to determine whether diagrams increase problem-solving success. We also examined the influence of item- and student-level factors to test the robustness of the diagram effect. We worked with 61 seventh-grade students who had received 2 months of pre-algebra instruction. Students participated in an experimenter-led classroom session. Using a within-subjects design, students solved algebra problems in two matched formats (equation and equation-with-diagram). The presence of diagrams increased equation-solving accuracy and the use of informal strategies. This diagram benefit was independent of student ability and item complexity. The benefits of diagrams found previously for story problems generalized to symbolic problems. The findings are consistent with cognitive models of problem-solving and suggest that diagrams may be a useful additional representation of symbolic problems. © 2017 The British Psychological Society.

  9. Confirming Testlet Effects

    ERIC Educational Resources Information Center

    DeMars, Christine E.

    2012-01-01

    A testlet is a cluster of items that share a common passage, scenario, or other context. These items might measure something in common beyond the trait measured by the test as a whole; if so, the model for the item responses should allow for this testlet trait. But modeling testlet effects that are negligible makes the model unnecessarily…

  10. Evaluation of MIMIC-Model Methods for DIF Testing with Comparison to Two-Group Analysis

    ERIC Educational Resources Information Center

    Woods, Carol M.

    2009-01-01

    Differential item functioning (DIF) occurs when an item on a test or questionnaire has different measurement properties for 1 group of people versus another, irrespective of mean differences on the construct. This study focuses on the use of multiple-indicator multiple-cause (MIMIC) structural equation models for DIF testing, parameterized as item…

  11. A SEM Model in Assessing the Effect of Convergent, Divergent and Logical Thinking on Students' Understanding of Chemical Phenomena

    ERIC Educational Resources Information Center

    Stamovlasis, D.; Kypraios, N.; Papageorgiou, G.

    2015-01-01

    In this study, structural equation modeling (SEM) is applied to an instrument assessing students' understanding of chemical change. The instrument comprised items on understanding the structure of substances, chemical changes and their interpretation. The structural relationships among particular groups of items are investigated and analyzed using…

  12. Normal Theory Two-Stage ML Estimator When Data Are Missing at the Item Level

    ERIC Educational Resources Information Center

    Savalei, Victoria; Rhemtulla, Mijke

    2017-01-01

    In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately…

  13. Understanding the Equals Sign as a Gateway to Algebraic Thinking

    ERIC Educational Resources Information Center

    Matthews, Percival G.; Rittle-Johnson, Bethany; Taylor, Roger S.; McEldoon, Katherine L.

    2010-01-01

    In this study, the authors wanted to examine whether success on items testing basic equivalence knowledge, such as the meaning of the equal sign and ability to solve problems such as 3 + 5 = 4 + _, predicted success on items testing more advanced algebraic thinking (i.e. principles of equality and solving equations that use letter variables). This…

  14. Seasonal food habits of the coyote in the South Carolina coastal plain.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schrecengost, J. D.; Kilgo, J. C.; Mallard, D.

    2008-07-01

    Spatial and temporal plasticity in Canis latrans (coyote) diets requires regional studies to understand the ecological role of this omnivorous canid. Because coyotes have recently become established in South Carolina, we investigated their food habits by collecting 415 coyote scats on the Savannah River Site in western South Carolina from May 2005 to July 2006. Seasonally available soft mast was the most common food item in 12 of the 15 months we sampled. Odocoileus virginianus (white-tailed deer) was the most common food item during December (40%) and March (37%). During May-June, fruits of Prunus spp. and Rubus spp. were the most commonly occurring food items. Fawns were the most common mammalian food item during May and June of both years despite low deer density.

  15. An Empirical Comparison of Methods for Equating with Randomly Equivalent Groups of 50 to 400 Test Takers. Research Report. ETS RR-10-05

    ERIC Educational Resources Information Center

    Livingston, Samuel A.; Kim, Sooyeon

    2010-01-01

    A series of resampling studies investigated the accuracy of equating by four different methods in a random groups equating design with samples of 400, 200, 100, and 50 test takers taking each form. Six pairs of forms were constructed. Each pair was constructed by assigning items from an existing test taken by 9,000 or more test takers. The…

  16. Exploring the Robustness of a Unidimensional Item Response Theory Model with Empirically Multidimensional Data

    ERIC Educational Resources Information Center

    Anderson, Daniel; Kahn, Joshua D.; Tindal, Gerald

    2017-01-01

    Unidimensionality and local independence are two common assumptions of item response theory. The former implies that all items measure a common latent trait, while the latter implies that responses are independent, conditional on respondents' location on the latent trait. Yet, few tests are truly unidimensional. Unmodeled dimensions may result in…

  17. [Analysis of nursing-related content portrayed in middle and high school textbooks under the national common basic curriculum in Korea].

    PubMed

    Jung, Myun Sook; Choi, Hyeong Wook; Li, Dong Mei

    2010-02-01

    The purpose of this study was to analyze nursing-related content in middle and high school textbooks under the National Common Basic Curriculum in Korea. Nursing-related content from 43 middle school textbooks and 13 high school textbooks was analyzed. There were 28 items of nursing-related content in the selected textbooks. Among them, 13 items were in the 'nursing activity' area, 6 items were in the 'nurse as an occupation' area, 2 items were in the 'major and career choice' area, 6 items were 'just one word' and 1 item was in 'others'. The main nursing-related content portrayed in the middle and high school textbooks was caring for patients (7 items accounting for 46.5%) and nurses working in hospitals (6 items accounting for 21.4%). In terms of gender perspective, female nurses (15 items accounting for 53.6%) were most prevalent.

  18. A Reporting Tool for Practice Guidelines in Health Care: The RIGHT Statement.

    PubMed

    Chen, Yaolong; Yang, Kehu; Marušic, Ana; Qaseem, Amir; Meerpohl, Joerg J; Flottorp, Signe; Akl, Elie A; Schünemann, Holger J; Chan, Edwin S Y; Falck-Ytter, Yngve; Ahmed, Faruque; Barber, Sarah; Chen, Chiehfeng; Zhang, Mingming; Xu, Bin; Tian, Jinhui; Song, Fujian; Shang, Hongcai; Tang, Kun; Wang, Qi; Norris, Susan L

    2017-01-17

    The quality of reporting practice guidelines is often poor, and there is no widely accepted guidance or standards for such reporting in health care. The international RIGHT (Reporting Items for practice Guidelines in HealThcare) Working Group was established to address this gap. The group followed an existing framework for developing guidelines for health research reporting and the EQUATOR (Enhancing the QUAlity and Transparency Of health Research) Network approach. It developed a checklist and an explanation and elaboration statement. The RIGHT checklist includes 22 items that are considered essential for good reporting of practice guidelines: basic information (items 1 to 4), background (items 5 to 9), evidence (items 10 to 12), recommendations (items 13 to 15), review and quality assurance (items 16 and 17), funding and declaration and management of interests (items 18 and 19), and other information (items 20 to 22). The RIGHT checklist can assist developers in reporting guidelines, support journal editors and peer reviewers when considering guideline reports, and help health care practitioners understand and implement a guideline.

  19. Mathematical and Numerical Studies of Nonstandard Difference Equation Models of Differential Equations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mickens, Ronald E.

    2008-12-22

    This research examined the following items/issues: the NSFD methodology, technical achievements and applications, dissemination efforts and research-related professional activities. A list of unresolved issues was also identified that could form the basis for future research in the area of constructing and analyzing NSFD schemes for both ODEs and PDEs.

  20. Toward the Development of a Model to Estimate the Readability of Credentialing-Examination Materials

    ERIC Educational Resources Information Center

    Badgett, Barbara A.

    2010-01-01

    The purpose of this study was to develop a set of procedures to establish readability, including an equation, that accommodates the multiple-choice item format and occupational-specific language related to credentialing examinations. The procedures and equation should be appropriate for learning materials, examination materials, and occupational…

  1. Bi-Factor MIRT Observed-Score Equating for Mixed-Format Tests

    ERIC Educational Resources Information Center

    Lee, Guemin; Lee, Won-Chan

    2016-01-01

    The main purposes of this study were to develop bi-factor multidimensional item response theory (BF-MIRT) observed-score equating procedures for mixed-format tests and to investigate relative appropriateness of the proposed procedures. Using data from a large-scale testing program, three types of pseudo data sets were formulated: matched samples,…

  2. Structural Equation Modeling in Assessing Students' Understanding of the State Changes of Matter

    ERIC Educational Resources Information Center

    Stamovlasis, Dimitrios; Tsitsipis, Georgios; Papageorgiou, George

    2012-01-01

    In this study, structural equation modeling (SEM) is applied to an instrument assessing students' understanding of the particulate nature of matter, the collective properties and physical changes, such as melting, evaporation, boiling and condensation. The structural relationships among particular groups of items were investigated. In addition,…

  3. An NCME Instructional Module on Polytomous Item Response Theory Models

    ERIC Educational Resources Information Center

    Penfield, Randall David

    2014-01-01

    A polytomous item is one for which the responses are scored according to three or more categories. Given the increasing use of polytomous items in assessment practices, item response theory (IRT) models specialized for polytomous items are becoming increasingly common. The purpose of this ITEMS module is to provide an accessible overview of…

  4. Examining an Alternative to Score Equating: A Randomly Equivalent Forms Approach. Research Report. ETS RR-08-14

    ERIC Educational Resources Information Center

    Liao, Chi-Wen; Livingston, Samuel A.

    2008-01-01

    Randomly equivalent forms (REF) of tests in listening and reading for nonnative speakers of English were created by stratified random assignment of items to forms, stratifying on item content and predicted difficulty. The study included 50 replications of the procedure for each test. Each replication generated 2 REFs. The equivalence of those 2…

  5. Avoiding and Correcting Bias in Score-Based Latent Variable Regression with Discrete Manifest Items

    ERIC Educational Resources Information Center

    Lu, Irene R. R.; Thomas, D. Roland

    2008-01-01

    This article considers models involving a single structural equation with latent explanatory and/or latent dependent variables where discrete items are used to measure the latent variables. Our primary focus is the use of scores as proxies for the latent variables and carrying out ordinary least squares (OLS) regression on such scores to estimate…

  6. Odds Ratio, Delta, ETS Classification, and Standardization Measures of DIF Magnitude for Binary Logistic Regression

    ERIC Educational Resources Information Center

    Monahan, Patrick O.; McHorney, Colleen A.; Stump, Timothy E.; Perkins, Anthony J.

    2007-01-01

    Previous methodological and applied studies that used binary logistic regression (LR) for detection of differential item functioning (DIF) in dichotomously scored items either did not report an effect size or did not employ several useful measures of DIF magnitude derived from the LR model. Equations are provided for these effect size indices.…
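
    The effect size indices named in this title can be illustrated from a single logistic regression coefficient. A minimal Python sketch, assuming the group coefficient is a conditional log odds ratio, the usual ETS delta rescaling by -2.35, and a magnitude-only A/B/C rule (the full ETS rules also involve significance tests, which are omitted here); the function name is hypothetical and the sign of delta depends on how the groups are coded:

```python
import math


def lr_dif_effect_sizes(beta_group):
    """Effect sizes for uniform DIF from a logistic regression group coefficient.

    Returns (odds_ratio, delta, category) where category is a simplified,
    magnitude-only version of the ETS A/B/C classification.
    """
    odds_ratio = math.exp(beta_group)   # conditional odds ratio
    delta = -2.35 * beta_group          # ETS delta scale: -2.35 * ln(odds ratio)
    magnitude = abs(delta)
    if magnitude < 1.0:
        category = "A"                  # negligible
    elif magnitude < 1.5:
        category = "B"                  # moderate
    else:
        category = "C"                  # large
    return odds_ratio, delta, category


# Hypothetical coefficient of 0.5 on the group indicator.
odds_ratio, delta, category = lr_dif_effect_sizes(0.5)
```

    Reporting the delta or category alongside the significance test helps separate statistically detectable DIF from DIF that is large enough to matter in practice.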

  7. An Information Analysis of 2-, 3-, and 4-Word Verbal Discrimination Learning.

    ERIC Educational Resources Information Center

    Arima, James K.; Gray, Francis D.

    Information theory was used to qualify the difficulty of verbal discrimination (VD) learning tasks and to measure VD performance. Words for VD items were selected with high background frequency and equal a priori probabilities of being selected as a first response. Three VD lists containing only 2-, 3-, or 4-word items were created and equated for…

  8. Translation of P = kT into a Pictorial External Representation by High School Seniors

    ERIC Educational Resources Information Center

    Matijaševic, Igor; Korolija, Jasminka N.; Mandic, Ljuba M.

    2016-01-01

    This paper describes the results achieved by high school seniors on an item which involves translation of the equation P = kT into a corresponding pictorial external representation. The majority of students (the classes of 2011, 2012 and 2013) did not give the correct answer to the multiple choice part of the translation item. They chose pictorial…

  9. Archaeological Investigations in Upper McNary Reservoir: 1981-1982.

    DTIC Science & Technology

    1983-01-01

    Sokulk have been equated with the ethnographic Wanapum (Smith 1982). In 1811 David Thompson of the British North West Company and Alexander Ross traveled...subdivided into three sub-clusters. It is not correct to statistically equate this solution to that of 11 clusters (8 original and the 3 subdivisions of...accept the assumption that increases in the quantity of materials roughly equate with increased use of an area. The average number of items per 50 m

  10. Using Mutual Information for Adaptive Item Comparison and Student Assessment

    ERIC Educational Resources Information Center

    Liu, Chao-Lin

    2005-01-01

    The author analyzes properties of mutual information between dichotomous concepts and test items. The properties generalize some common intuitions about item comparison, and provide principled foundations for designing item-selection heuristics for student assessment in computer-assisted educational systems. The proposed item-selection strategies…
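    As a reminder of the quantity involved, here is a minimal sketch of mutual information between a dichotomous concept and a dichotomous item response, computed from a 2x2 joint probability table. The function name and table layout are assumptions for illustration; the article's item-selection strategies themselves are not reproduced.

```python
import math


def mutual_information(joint):
    """Mutual information in bits for a joint probability table.

    `joint[c][x]` is P(concept state = c, item response = x); entries
    are assumed to sum to 1.
    """
    row = [sum(r) for r in joint]                # marginal over concept states
    col = [sum(r[j] for r in joint) for j in range(len(joint[0]))]  # marginal over responses

    mi = 0.0
    for i, r in enumerate(joint):
        for j, p in enumerate(r):
            if p > 0:  # zero cells contribute nothing
                mi += p * math.log2(p / (row[i] * col[j]))
    return mi
```

    An item whose response is independent of the concept yields 0 bits, while an item that perfectly tracks an evenly split concept yields 1 bit.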

  11. Standard Error Estimation of 3PL IRT True Score Equating with an MCMC Method

    ERIC Educational Resources Information Center

    Liu, Yuming; Schulz, E. Matthew; Yu, Lei

    2008-01-01

    A Markov chain Monte Carlo (MCMC) method and a bootstrap method were compared in the estimation of standard errors of item response theory (IRT) true score equating. Three test form relationships were examined: parallel, tau-equivalent, and congeneric. Data were simulated based on Reading Comprehension and Vocabulary tests of the Iowa Tests of…

  12. Three Approaches to Using Lengthy Ordinal Scales in Structural Equation Models: Parceling, Latent Scoring, and Shortening Scales

    ERIC Educational Resources Information Center

    Yang, Chongming; Nay, Sandra; Hoyle, Rick H.

    2010-01-01

    Lengthy scales or testlets pose certain challenges for structural equation modeling (SEM) if all the items are included as indicators of a latent construct. Three general approaches to modeling lengthy scales in SEM (parceling, latent scoring, and shortening) have been reviewed and evaluated. A hypothetical population model is simulated containing…

  13. Flipping an Algebra Classroom: Analyzing, Modeling, and Solving Systems of Linear Equations

    ERIC Educational Resources Information Center

    Kirvan, Rebecca; Rakes, Christopher R.; Zamora, Regie

    2015-01-01

    The present study investigated whether flipping an algebra classroom led to a stronger focus on conceptual understanding and improved learning of systems of linear equations for 54 seventh- and eighth-grade students using teacher journal data and district-mandated unit exam items. Multivariate analysis of covariance was used to compare scores on…

  14. Applying Hierarchical Model Calibration to Automatically Generated Items.

    ERIC Educational Resources Information Center

    Williamson, David M.; Johnson, Matthew S.; Sinharay, Sandip; Bejar, Isaac I.

    This study explored the application of hierarchical model calibration as a means of reducing, if not eliminating, the need for pretesting of automatically generated items from a common item model prior to operational use. Ultimately the successful development of automatic item generation (AIG) systems capable of producing items with highly similar…

  15. Improved collaborative filtering recommendation algorithm of similarity measure

    NASA Astrophysics Data System (ADS)

    Zhang, Baofu; Yuan, Baoping

    2017-05-01

    The collaborative filtering recommendation algorithm is one of the most widely used recommendation algorithms in personalized recommender systems. The key is to find the nearest-neighbor set of the active user by using a similarity measure. However, traditional similarity measures focus mainly on the similarity of the items that users have rated in common and ignore the relationship between those commonly rated items and all the items a user rates. In addition, because the rating matrix is very sparse, the traditional collaborative filtering recommendation algorithm is not very efficient. To obtain better accuracy, this paper presents an improved similarity measure that takes into account the common preferences between users, differences in rating scale, and the scores of common items, and proposes a collaborative filtering recommendation algorithm based on this measure. Experimental results show that the algorithm can effectively improve the quality of recommendation and thus alleviate the impact of data sparseness.
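    The abstract does not state the paper's formula. The sketch below is therefore an illustrative stand-in, not the authors' measure: cosine similarity over commonly rated items, scaled by the share of all rated items that the two users have in common. The function name and the overlap weighting are assumptions.

```python
import math


def common_item_similarity(ratings_u, ratings_v):
    """Similarity of two users given dicts mapping item id -> rating.

    Cosine similarity on co-rated items, multiplied by the Jaccard
    overlap of the two users' rated-item sets (an illustrative choice).
    """
    common = ratings_u.keys() & ratings_v.keys()
    if not common:
        return 0.0

    dot = sum(ratings_u[i] * ratings_v[i] for i in common)
    norm_u = math.sqrt(sum(ratings_u[i] ** 2 for i in common))
    norm_v = math.sqrt(sum(ratings_v[i] ** 2 for i in common))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    cosine = dot / (norm_u * norm_v)

    # Penalize pairs whose common items are a small part of what they rated.
    overlap = len(common) / len(ratings_u.keys() | ratings_v.keys())
    return cosine * overlap
```

    With this weighting, two users who agree on two items but have rated many other items separately score lower than two users whose whole rating histories coincide.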

  16. Development and Calibration of an Item Bank for PE Metrics Assessments: Standard 1

    ERIC Educational Resources Information Center

    Zhu, Weimo; Fox, Connie; Park, Youngsik; Fisette, Jennifer L.; Dyson, Ben; Graber, Kim C.; Avery, Marybell; Franck, Marian; Placek, Judith H.; Rink, Judy; Raynes, De

    2011-01-01

    The purpose of this study was to develop and calibrate an assessment system, or bank, using the latest measurement theories and methods to promote valid and reliable student assessment in physical education. Using an anchor-test equating design, a total of 30 items or assessments were administered to 5,021 (2,568 boys and 2,453 girls) students in…

  17. Separating Common from Unique Variance Within Emotional Distress: An Examination of Reliability and Relations to Worry.

    PubMed

    Marshall, Andrew J; Evanovich, Emma K; David, Sarah Jo; Mumma, Gregory H

    2018-01-17

    High comorbidity rates among emotional disorders have led researchers to examine transdiagnostic factors that may contribute to shared psychopathology. Bifactor models provide a unique method for examining transdiagnostic variables by modelling the common and unique factors within measures. Previous findings suggest that the bifactor model of the Depression Anxiety and Stress Scale (DASS) may provide a method for examining transdiagnostic factors within emotional disorders. This study aimed to replicate the bifactor model of the DASS, a multidimensional measure of psychological distress, within a US adult sample and provide initial estimates of the reliability of the general and domain-specific factors. Furthermore, this study hypothesized that Worry, a theorized transdiagnostic variable, would show stronger relations to general emotional distress than domain-specific subscales. Confirmatory factor analysis was used to evaluate the bifactor model structure of the DASS in 456 US adult participants (279 females and 177 males, mean age 35.9 years) recruited online. The DASS bifactor model fitted well (CFI = 0.98; RMSEA = 0.05). The General Emotional Distress factor accounted for most of the reliable variance in item scores. Domain-specific subscales accounted for modest portions of reliable variance in items after accounting for the general scale. Finally, structural equation modelling indicated that Worry was strongly predicted by the General Emotional Distress factor. The DASS bifactor model is generalizable to a US community sample, and General Emotional Distress, but not domain-specific factors, strongly predicts the transdiagnostic variable Worry.

  18. General relaxation schemes in multigrid algorithms for higher order singularity methods

    NASA Technical Reports Server (NTRS)

    Oskam, B.; Fray, J. M. J.

    1981-01-01

    Relaxation schemes based on approximate and incomplete factorization technique (AF) are described. The AF schemes allow construction of a fast multigrid method for solving integral equations of the second and first kind. The smoothing factors for integral equations of the first kind, and comparison with similar results from the second kind of equations are a novel item. Application of the MD algorithm shows convergence to the level of truncation error of a second order accurate panel method.

  19. [A reporting tool for practice guidelines in health care: the RIGHT statement].

    PubMed

    Chen, Yaolong; Yang, Kehu; Marušić, Ana; Qaseem, Amir; Meerpohl, Joerg J; Flottorp, Signe; Akl, Elie A; Schünemann, Holger J; Chan, Edwin S Y; Falck-Ytter, Yngve; Ahmed, Faruque; Barber, Sarah; Chen, Chiehfeng; Zhang, Mingming; Xu, Bin; Tian, Jinhui; Song, Fujian; Shang, Hongcai; Tang, Kun; Wang, Qi; Norris, Susan L; Labonté, Valérie C; Möhler, Ralph; Kopp, Ina; Nothacker, Monika; Meerpohl, Joerg J

    2017-11-01

    The quality of reporting practice guidelines is often poor, and there is no widely accepted guidance or standards for such reporting in health care. The international RIGHT (Reporting Items for practice Guidelines in HealThcare) Working Group was established to address this gap. The group followed an existing framework for developing guidelines for health research reporting and the EQUATOR (Enhancing the QUAlity and Transparency Of health Research) Network approach. A checklist and an explanation and elaboration statement were developed. The RIGHT checklist includes 22 items that are considered essential for good reporting of practice guidelines: basic information (items 1 to 4), background (items 5 to 9), evidence (items 10 to 12), recommendations (items 13 to 15), review and quality assurance (items 16 and 17), funding and declaration and management of interests (items 18 and 19), and other information (items 20 to 22). The RIGHT checklist can assist developers in reporting guidelines, support journal editors and peer reviewers when considering guideline reports, and help health care practitioners understand and implement a guideline. Copyright © 2017. Published by Elsevier GmbH.

  20. IRT Item Parameter Scaling for Developing New Item Pools

    ERIC Educational Resources Information Center

    Kang, Hyeon-Ah; Lu, Ying; Chang, Hua-Hua

    2017-01-01

    Increasing use of item pools in large-scale educational assessments calls for an appropriate scaling procedure to achieve a common metric among field-tested items. The present study examines scaling procedures for developing a new item pool under a spiraled block linking design. The three scaling procedures are considered: (a) concurrent…

  1. Modeling Item-Position Effects within an IRT Framework

    ERIC Educational Resources Information Center

    Debeer, Dries; Janssen, Rianne

    2013-01-01

    Changing the order of items between alternate test forms to prevent copying and to enhance test security is a common practice in achievement testing. However, these changes in item order may affect item and test characteristics. Several procedures have been proposed for studying these item-order effects. The present study explores the use of…

  2. Comparison of Methods for Adjusting Incorrect Assignments of Items to Subtests: Oblique Multiple Group Method versus Confirmatory Common Factor Method

    ERIC Educational Resources Information Center

    Stuive, Ilse; Kiers, Henk A. L.; Timmerman, Marieke E.

    2009-01-01

    A common question in test evaluation is whether an a priori assignment of items to subtests is supported by empirical data. If the analysis results indicate the assignment of items to subtests under study is not supported by data, the assignment is often adjusted. In this study the authors compare two methods on the quality of their suggestions to…

  3. The Empirical Verification of an Assignment of Items to Subtests: The Oblique Multiple Group Method versus the Confirmatory Common Factor Method

    ERIC Educational Resources Information Center

    Stuive, Ilse; Kiers, Henk A. L.; Timmerman, Marieke E.; ten Berge, Jos M. F.

    2008-01-01

    This study compares two confirmatory factor analysis methods on their ability to verify whether correct assignments of items to subtests are supported by the data. The confirmatory common factor (CCF) method is used most often and defines nonzero loadings so that they correspond to the assignment of items to subtests. Another method is the oblique…

  4. Personality in general and clinical samples: Measurement invariance of the Multidimensional Personality Questionnaire.

    PubMed

    Eigenhuis, Annemarie; Kamphuis, Jan H; Noordhof, Arjen

    2017-09-01

    A growing body of research suggests that the same general dimensions can describe normal and pathological personality, but most of the supporting evidence is exploratory. We aim to determine in a confirmatory framework the extent to which responses on the Multidimensional Personality Questionnaire (MPQ) are identical across general and clinical samples. We tested the Dutch brief form of the MPQ (MPQ-BF-NL) for measurement invariance across a general population subsample (N = 365) and a clinical sample (N = 365), using Multiple Group Confirmatory Factor Analysis (MGCFA) and Multiple Group Exploratory Structural Equation Modeling (MGESEM). As an omnibus personality test, the MPQ-BF-NL revealed strict invariance, indicating absence of bias. Unidimensional per scale tests for measurement invariance revealed that 10% of items appeared to contain bias across samples. Item bias only affected the scale interpretation of Achievement, with individuals from the clinical sample more readily admitting to put high demands on themselves than individuals from the general sample, regardless of trait level. This formal test of equivalence provides strong evidence for the common structure of normal and pathological personality and lends further support to the clinical utility of the MPQ. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  5. An Empirical Comparison of Five Linear Equating Methods for the NEAT Design

    ERIC Educational Resources Information Center

    Suh, Youngsuk; Mroch, Andrew A.; Kane, Michael T.; Ripkey, Douglas R.

    2009-01-01

    In this study, a data base containing the responses of 40,000 candidates to 90 multiple-choice questions was used to mimic data sets for 50-item tests under the "nonequivalent groups with anchor test" (NEAT) design. Using these smaller data sets, we evaluated the performance of five linear equating methods for the NEAT design with five levels of…

  6. Seasonal food habits of the coyote in the South Carolina coastal plain.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Schrecengost, J., D.; Kilgo, J., C.; Mallard, D.

    2008-07-01

    Spatial and temporal plasticity in Canis latrans (coyote) diets require regional studies to understand the ecological role of this omnivorous canid. Because coyotes have recently become established in South Carolina, we investigated their food habits by collecting 415 coyote scats on the Savannah River Site in western South Carolina from May 2005-July 2006. Seasonally available soft mast was the most common food item in 12 of the 15 months we sampled. Odocoileus virginianus (white-tailed deer) was the most common food item during December (40%) and March (37%). During May-June, fruits of Prunus spp. and Rubus spp. were the most commonly occurring food items. Fawns were the most common mammalian food item during May and June of both years despite low deer density.

  7. Middle school students' reading comprehension of mathematical texts and algebraic equations

    NASA Astrophysics Data System (ADS)

    Duru, Adem; Koklu, Onder

    2011-06-01

    In this study, middle school students' abilities to translate mathematical texts into algebraic representations and vice versa were investigated. In addition, students' difficulties in making such translations and the potential sources for these difficulties were also explored. Both qualitative and quantitative methods were used to collect data for this study: questionnaire and clinical interviews. The questionnaire consisted of two general types of items: (1) selected-response (multiple-choice) items for which the respondent selects from multiple options and (2) open-ended items for which the respondent constructs a response. In order to further investigate the students' strategies while they were translating the given mathematical texts to algebraic equations and vice versa, five randomly chosen (n = 5) students were interviewed. Data were collected in the 2007-2008 school year from 185 middle-school students in five teachers' classrooms in three different schools in the city of Adıyaman, Turkey. After the analysis of data, it was found that students who participated in this study had difficulties in translating the mathematical texts into algebraic equations by using symbols. It was also observed that these students had difficulties in translating the symbolic representations into mathematical texts because of their weak reading comprehension. In addition, the findings of this research revealed that students' difficulties in translating the given mathematical texts into symbolic representations or vice versa come from different sources.

  8. Conjunctive and Disjunctive Item Response Functions.

    DTIC Science & Technology

    1984-10-01

    fed set ofvaluesof a, b, AI , B1 A2 2 . 2 A3 , and 13 , the f ’. g ’a. nd h’a in (7) are fied. Equation (7) must still hold for S - e19029e3,..* . Thus...for Item I Is -- b ?(a:1 , b1 ,O) (1 + ’)(I + e4 (22 where a and pi are arbitrary constants. These constants mst be the sam for all Items In a given...NETHERLIS I E3I1 Focility-Acquisitions 4133 Rugby Avnue 1 Lee Cronbach Bethesda, NO 20014 16 Laburnue Road Atherton, CA 94205 1 Dr. Benjamin A. Fairbank

  9. Differences between Presentation Methods in Working Memory Procedures: A Matter of Working Memory Consolidation

    PubMed Central

    Ricker, Timothy J.; Cowan, Nelson

    2014-01-01

    Understanding forgetting from working memory, the memory used in ongoing cognitive processing, is critical to understanding human cognition. In the last decade a number of conflicting findings have been reported regarding the role of time in forgetting from working memory. This has led to a debate concerning whether longer retention intervals necessarily result in more forgetting. An obstacle to directly comparing conflicting reports is a divergence in methodology across studies. Studies which find no forgetting as a function of retention-interval duration tend to use sequential presentation of memory items, while studies which find forgetting as a function of retention-interval duration tend to use simultaneous presentation of memory items. Here, we manipulate the duration of retention and the presentation method of memory items, presenting items either sequentially or simultaneously. We find that these differing presentation methods can lead to different rates of forgetting because they tend to differ in the time available for consolidation into working memory. The experiments detailed here show that equating the time available for working memory consolidation equates the rates of forgetting across presentation methods. We discuss the meaning of this finding in the interpretation of previous forgetting studies and in the construction of working memory models. PMID:24059859

  10. Equation Structure and the Meaning of the Equal Sign: The Impact of Task Selection in Eliciting Elementary Students' Understandings

    ERIC Educational Resources Information Center

    Stephens, Ana C.; Knuth, Eric J.; Blanton, Maria L.; Isler, Isil; Gardiner, Angela Murphy; Marum, Tim

    2013-01-01

    This paper reports results from a written assessment given to 290 third-, fourth-, and fifth-grade students prior to any instructional intervention. We share and discuss students' responses to items addressing their understanding of equation structure and the meaning of the equal sign. We found that many students held an operational conception of…

  11. Do Examinees Understand Score Reports for Alternate Methods of Scoring Computer Based Tests?

    ERIC Educational Resources Information Center

    Whittaker, Tiffany A.; Williams, Natasha J.; Dodd, Barbara G.

    2011-01-01

    This study assessed the interpretability of scaled scores based on either number correct (NC) scoring for a paper-and-pencil test or one of two methods of scoring computer-based tests: an item pattern (IP) scoring method and a method based on equated NC scoring. The equated NC scoring method for computer-based tests was proposed as an alternative…

  12. Evaluating Equating Results in the Non-Equivalent Groups with Anchor Test Design Using Equipercentile and Equity Criteria

    ERIC Educational Resources Information Center

    Duong, Minh Quang

    2011-01-01

    Testing programs often use multiple test forms of the same test to control item exposure and to ensure test security. Although test forms are constructed to be as similar as possible, they often differ. Test equating techniques are those statistical methods used to adjust scores obtained on different test forms of the same test so that they are…

  13. Implementing statistical equating for MRCP(UK) Parts 1 and 2.

    PubMed

    McManus, I C; Chis, Liliana; Fox, Ray; Waller, Derek; Tang, Peter

    2014-09-26

    The MRCP(UK) exam, in 2008 and 2010, changed the standard-setting of its Part 1 and Part 2 examinations from a hybrid Angoff/Hofstee method to statistical equating using Item Response Theory, the reference group being UK graduates. The present paper considers the implementation of the change, the question of whether the pass rate increased amongst non-UK candidates, any possible role of Differential Item Functioning (DIF), and changes in examination predictive validity after the change. Analysis of data of MRCP(UK) Part 1 exam from 2003 to 2013 and Part 2 exam from 2005 to 2013. Inspection suggested that Part 1 pass rates were stable after the introduction of statistical equating, but showed greater annual variation probably due to stronger candidates taking the examination earlier. Pass rates seemed to have increased in non-UK graduates after equating was introduced, but was not associated with any changes in DIF after statistical equating. Statistical modelling of the pass rates for non-UK graduates found that pass rates, in both Part 1 and Part 2, were increasing year on year, with the changes probably beginning before the introduction of equating. The predictive validity of Part 1 for Part 2 was higher with statistical equating than with the previous hybrid Angoff/Hofstee method, confirming the utility of IRT-based statistical equating. Statistical equating was successfully introduced into the MRCP(UK) Part 1 and Part 2 written examinations, resulting in higher predictive validity than the previous Angoff/Hofstee standard setting. Concerns about an artefactual increase in pass rates for non-UK candidates after equating were shown not to be well-founded. Most likely the changes resulted from a genuine increase in candidate ability, albeit for reasons which remain unclear, coupled with a cognitive illusion giving the impression of a step-change immediately after equating began. Statistical equating provides a robust standard-setting method, with a better theoretical foundation than judgemental techniques such as Angoff, and is more straightforward and requires far less examiner time to provide a more valid result. The present study provides a detailed case study of introducing statistical equating, and issues which may need to be considered with its introduction.

  14. Automatic Item Generation of Probability Word Problems

    ERIC Educational Resources Information Center

    Holling, Heinz; Bertling, Jonas P.; Zeuch, Nina

    2009-01-01

    Mathematical word problems represent a common item format for assessing student competencies. Automatic item generation (AIG) is an effective way of constructing many items with predictable difficulties, based on a set of predefined task parameters. The current study presents a framework for the automatic generation of probability word problems…

  15. Common data items in seven European oesophagogastric cancer surgery registries: towards a European upper GI cancer audit (EURECCA Upper GI).

    PubMed

    de Steur, W O; Henneman, D; Allum, W H; Dikken, J L; van Sandick, J W; Reynolds, J; Mariette, C; Jensen, L; Johansson, J; Kolodziejczyk, P; Hardwick, R H; van de Velde, C J H

    2014-03-01

    Seven countries (Denmark, France, Ireland, the Netherlands, Poland, Sweden, United Kingdom) collaborated to initiate a EURECCA (European Registration of Cancer Care) Upper GI project. The aim of this study was to identify a core dataset of shared items in the different data registries which can be used for future collaboration between countries. Item lists from all participating Upper GI cancer registries were collected. Items were scored 'present' when included in the registry, or when the items could be deduced from other items in the registry. The definition of a common item was that it was present in at least six of the seven participating countries. The number of registered items varied between 40 (Poland) and 650 (Ireland). Among the 46 shared items were data on patient characteristics, staging and diagnostics, neoadjuvant treatment, surgery, postoperative course, pathology, and adjuvant treatment. Information on non-surgical treatment was available in only 4 registries. A list of 46 shared items from seven participating Upper GI cancer registries was created, providing a basis for future quality assurance and research in Upper GI cancer treatment on a European level. Copyright © 2013 Elsevier Ltd. All rights reserved.
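    The "common item" rule stated in the abstract (present in at least six of the seven registries) can be sketched as a simple count. The function name, the data layout, and any item names used with it are hypothetical; the registries' real item lists are not given here.

```python
from collections import Counter


def shared_items(registries, min_registries=6):
    """Return items present in at least `min_registries` registries.

    `registries` maps a country name to the collection of items scored
    'present' in that country's registry (hypothetical layout).
    """
    # Count each item at most once per registry.
    counts = Counter(item for items in registries.values() for item in set(items))
    return sorted(item for item, n in counts.items() if n >= min_registries)
```

    Calling `shared_items(registries)` with seven registries applies the six-of-seven threshold from the abstract; the threshold is a parameter so that stricter or looser definitions can be compared.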

  16. Using Game Play to Diagnose and Remediate Students’ Misconceptions in Solving Equations

    DTIC Science & Technology

    2015-07-21

    on pretest or posttest, and no main effects for assessment form or condition were found on pretest or posttest. Results further...attachment). Table 3 (see Appendix P2-D in the Attachment) reports item-level performance on the pretest and posttest. These findings indicated that...approximately 12% of students scored incorrect on two pretest items then correct on the posttest, and 18% increased from incorrect to correct on

  17. An evaluation of computerized adaptive testing for general psychological distress: combining GHQ-12 and Affectometer-2 in an item bank for public mental health research.

    PubMed

    Stochl, Jan; Böhnke, Jan R; Pickett, Kate E; Croudace, Tim J

    2016-05-20

    Recent developments in psychometric modeling and technology allow pooling well-validated items from existing instruments into larger item banks and their deployment through methods of computerized adaptive testing (CAT). Use of item response theory-based bifactor methods and integrative data analysis overcomes barriers in cross-instrument comparison. This paper presents the joint calibration of an item bank for researchers keen to investigate population variations in general psychological distress (GPD). Multidimensional item response theory was used on existing health survey data from the Scottish Health Education Population Survey (n = 766) to calibrate an item bank consisting of pooled items from the short common mental disorder screen (GHQ-12) and the Affectometer-2 (a measure of "general happiness"). Computer simulation was used to evaluate usefulness and efficacy of its adaptive administration. A bifactor model capturing variation across a continuum of population distress (while controlling for artefacts due to item wording) was supported. The numbers of items for different required reliabilities in adaptive administration demonstrated promising efficacy of the proposed item bank. Psychometric modeling of the common dimension captured by more than one instrument offers the potential of adaptive testing for GPD using individually sequenced combinations of existing survey items. The potential for linking other item sets with alternative candidate measures of positive mental health is discussed since an optimal item bank may require even more items than these.

  18. Normal Theory Two-Stage ML Estimator When Data Are Missing at the Item Level

    PubMed Central

    Savalei, Victoria; Rhemtulla, Mijke

    2017-01-01

    In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately handle missing data at the item level. Item-level multiple imputation (MI), however, can handle such missing data straightforwardly. In this article, we develop an analytic approach for dealing with item-level missing data—that is, one that obtains a unique set of parameter estimates directly from the incomplete data set and does not require imputations. The proposed approach is a variant of the two-stage maximum likelihood (TSML) methodology, and it is the analytic equivalent of item-level MI. We compare the new TSML approach to three existing alternatives for handling item-level missing data: scale-level full information maximum likelihood, available-case maximum likelihood, and item-level MI. We find that the TSML approach is the best analytic approach, and its performance is similar to item-level MI. We recommend its implementation in popular software and its further study. PMID:29276371

  19. Normal Theory Two-Stage ML Estimator When Data Are Missing at the Item Level.

    PubMed

    Savalei, Victoria; Rhemtulla, Mijke

    2017-08-01

    In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately handle missing data at the item level. Item-level multiple imputation (MI), however, can handle such missing data straightforwardly. In this article, we develop an analytic approach for dealing with item-level missing data, that is, one that obtains a unique set of parameter estimates directly from the incomplete data set and does not require imputations. The proposed approach is a variant of the two-stage maximum likelihood (TSML) methodology, and it is the analytic equivalent of item-level MI. We compare the new TSML approach to three existing alternatives for handling item-level missing data: scale-level full information maximum likelihood, available-case maximum likelihood, and item-level MI. We find that the TSML approach is the best analytic approach, and its performance is similar to item-level MI. We recommend its implementation in popular software and its further study.

  20. Trace DNA Sampling Success from Evidence Items Commonly Encountered in Forensic Casework.

    PubMed

    Dziak, Renata; Peneder, Amy; Buetter, Alicia; Hageman, Cecilia

    2018-05-01

    Trace DNA analysis is a significant part of a forensic laboratory's workload. Knowing optimal sampling strategies and item success rates for particular item types can assist in evidence selection and examination processes and shorten turnaround times. In this study, forensic short tandem repeat (STR) casework results were reviewed to determine how often STR profiles suitable for comparison were obtained from "handler" and "wearer" areas of 764 items commonly submitted for examination. One hundred and fifty-five (155) items obtained from volunteers were also sampled. Items were analyzed for best sampling location and strategy. For casework items, headwear and gloves provided the highest success rates. Experimentally, eyeglasses and earphones, T-shirts, fabric gloves and watches provided the highest success rates. Eyeglasses and latex gloves provided optimal results if the entire surfaces were swabbed. In general, at least 10%, and up to 88% of all trace DNA analyses resulted in suitable STR profiles for comparison. © 2017 American Academy of Forensic Sciences.

  1. Locally Dependent Linear Logistic Test Model with Person Covariates

    ERIC Educational Resources Information Center

    Ip, Edward H.; Smits, Dirk J. M.; De Boeck, Paul

    2009-01-01

    The article proposes a family of item-response models that allow the separate and independent specification of three orthogonal components: item attribute, person covariate, and local item dependence. Special interest lies in extending the linear logistic test model, which is commonly used to measure item attributes, to tests with embedded item…

  2. Cross-Classification and Category Representation in Children's Concepts

    ERIC Educational Resources Information Center

    Nguyen, Simone P.

    2007-01-01

    Items commonly belong to many categories. Cross-classification is the classification of a single item into more than one category. This research explored 2- to 6-year-old children's use of 2 different category systems for cross-classification: script (e.g., school-time items, birthday party items) and taxonomic (e.g., animals, clothes). The…

  3. Environmental Knowledge and Beliefs among Grade 10 Students in Australia.

    ERIC Educational Resources Information Center

    Eyers, Vivian George

    To develop environmental education in Australia, a survey of tenth-grade students was undertaken. Thirty knowledge items and ten belief items were constructed. A panel of environmentalists and educators identified best responses for the knowledge items, and a common reference point, preservation of homo sapiens, for the belief items, so a…

  4. F-35 Engine Quality Assurance Inspection

    DTIC Science & Technology

    2015-04-27

    area nor protected from common FOD items. The F135 engine final assembly area had FOD signage at the two entry lobbies of the building; however...there were no FOD signage within the engine final assembly areas. Pratt & Whitney FOD procedures also did not prevent common FOD items from entering

  5. T56. AN EXPLORATORY ANALYSIS CONVERTING SCORES BETWEEN THE PANSS AND BNSS

    PubMed Central

    Kott, Alan; Daniel, David

    2018-01-01

    Background: The Brief Negative Symptom Scale (BNSS) is a relatively new instrument designed specifically to measure the negative symptoms in schizophrenia. Recently, more clinical trials have included the BNSS as a secondary or exploratory outcome, typically along with the PANSS. In the current analysis we aimed to establish equations that would allow conversion between the BNSS total score and the PANSS negative subscale and PANSS negative factor scores, as well as conversion equations between the expressive deficits and avolition/apathy factors of the two scales (Kirkpatrick, 2011; Strauss, 2012). Methods: Data from 518 schizophrenia clinical trial subjects with both PANSS and BNSS data available were used. Regression analyses predicting the BNSS total score from the PANSS negative subscale score, and the BNSS total score from the PANSS Negative factor (NFS) score, were performed on data from all subjects. Regression analyses predicting the BNSS avolition/apathy factor (items 1, 2, 3, 5, 6, 7, and 8) from the PANSS avolition/apathy factor (items N2, N4 and G16), and the BNSS expressive deficits factor (items 4, 9, 10, 11, 12, and 13) from the PANSS expressive deficits factor (items N1, N3, N6, G5, G7, and G13), were performed on a sample of 318 subjects with individual BNSS item scores available. In addition to estimating the equations, we also calculated Pearson's correlations between the scales. Results: The PANSS and BNSS avolition/apathy factors were highly correlated (r=0.70), as were the expressive deficit factors (r=0.83). The following equations predicting the BNSS total score were obtained from regression analyses performed on 2,560 data points:

      BNSS_total = -11.64 + 2.10 * PANSS_negative_subscale
      BNSS_total = -9.26 + 2.11 * PANSS_NFS

    The following equations predicting the BNSS factor scores from the PANSS factor scores were obtained from regression analyses performed on 1,634 data points:

      BNSS_avolition/apathy = -2.40 + 2.38 * PANSS_avolition/apathy
      BNSS_expressive_deficit_factor = -4.21 + 1.27 * PANSS_expressive_deficit_factor

    Discussion: The BNSS differs from the PANSS negative factor because it addresses all five currently recognized domains of negative symptoms, including anhedonia, and attempts to differentiate anticipatory from consummatory states. In our analysis we have replicated the strong correlation between the BNSS total score and the PANSS negative subscale, and newly identified strong correlations between the BNSS total score and the NFS as well as between the avolition/apathy and expressive deficit factors of the BNSS and the PANSS (Kirkpatrick, 2011). The provided equations offer a useful tool that allows researchers and clinicians to easily convert data between the instruments, for reasons such as pooling data from multiple trials that used one of the instruments or interpreting results within the context of previously conducted research. They also offer a framework for risk-based monitoring to identify data deviating from the expected relationship and to allow a targeted exploration of the causes of such a disagreement. The data used for analysis included not only subjects with predominantly negative symptoms but also acutely psychotic subjects and subjects in stable condition, which allows the results to be generalized across the majority of schizophrenic subjects. This post-hoc analysis is exploratory. We plan to further explore the potential utility of equations addressing the relationships among schizophrenia measures of symptom severity in an iterative manner with larger datasets.
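
    The four regression equations reported in this abstract can be applied mechanically. The following is a minimal Python sketch, not the authors' code: the coefficients are copied from the abstract, while the dictionary keys and the function name `predict_bnss` are hypothetical names chosen for illustration. It covers only the PANSS-to-BNSS direction in which the equations are stated.

```python
# Coefficients (intercept, slope) exactly as reported in the abstract.
# The keys and the function name are illustrative assumptions, not from the source.
BNSS_FROM_PANSS = {
    "total_from_negative_subscale": (-11.64, 2.10),
    "total_from_nfs": (-9.26, 2.11),
    "avolition_apathy": (-2.40, 2.38),
    "expressive_deficit": (-4.21, 1.27),
}


def predict_bnss(panss_score: float, equation: str) -> float:
    """Return the BNSS score predicted from a PANSS score by one reported equation."""
    intercept, slope = BNSS_FROM_PANSS[equation]
    return intercept + slope * panss_score


if __name__ == "__main__":
    # A PANSS negative subscale score of 20 gives -11.64 + 2.10 * 20.
    print(round(predict_bnss(20, "total_from_negative_subscale"), 2))  # 30.36
```

    Because these are regression predictions, algebraically inverting a line is not the same as regressing PANSS on BNSS, so the sketch deliberately does not offer a reverse conversion.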

  6. Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population.

    PubMed

    Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A; Ono, Yutaka

    2016-01-01

    Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This led us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log-normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern.

  7. Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population

    PubMed Central

    Kawasaki, Yohei; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A.; Ono, Yutaka

    2016-01-01

    Background: Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This led us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Methods: Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. Results: The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log-normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. Discussion: The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern. PMID:27761346

  8. Applications of He's semi-inverse method, ITEM and GGM to the Davey-Stewartson equation

    NASA Astrophysics Data System (ADS)

    Zinati, Reza Farshbaf; Manafian, Jalil

    2017-04-01

    We investigate the Davey-Stewartson (DS) equation and find travelling wave solutions. In this paper, we demonstrate the effectiveness of three analytical methods, namely He's semi-inverse variational principle method (SIVPM), the improved tan(φ/2)-expansion method (ITEM) and the generalized G'/G-expansion method (GGM), for seeking more exact solutions of the DS equation. These methods are direct, concise and simple to implement compared to other existing methods. Exact solutions of four types have been obtained. The results demonstrate that the aforementioned methods are more efficient than the Ansatz method applied by Mirzazadeh (2015). Abundant exact travelling wave solutions, including soliton, kink, periodic and rational solutions, have been found by the improved tan(φ/2)-expansion and generalized G'/G-expansion methods. By He's semi-inverse variational principle we have obtained dark and bright soliton wave solutions. Also, the obtained semi-inverse variational principle has profound implications for physical understanding. These solutions may play an important role in engineering and physics. Moreover, some graphical simulations were done in Matlab to show the behavior of these solutions.

  9. Exploring the Effects of Rater Linking Designs and Rater Fit on Achievement Estimates within the Context of Music Performance Assessments

    ERIC Educational Resources Information Center

    Wind, Stefanie A.; Engelhard, George, Jr.; Wesolowski, Brian

    2016-01-01

    When good model-data fit is observed, the Many-Facet Rasch (MFR) model acts as a linking and equating model that can be used to estimate student achievement, item difficulties, and rater severity on the same linear continuum. Given sufficient connectivity among the facets, the MFR model provides estimates of student achievement that are equated to…

  10. Joining of thermoplastic substrates by microwaves

    DOEpatents

    Paulauskas, Felix L.; Meek, Thomas T.

    1997-01-01

    A method for joining two or more items having surfaces of thermoplastic material includes the steps of depositing an electrically-conductive material upon the thermoplastic surface of at least one of the items, and then placing the other of the two items adjacent the one item so that the deposited material is in intimate contact with the surfaces of both the one and the other items. The deposited material and the thermoplastic surfaces contacted thereby are then exposed to microwave radiation so that the thermoplastic surfaces in contact with the deposited material melt, and then pressure is applied to the two items so that the melted thermoplastic surfaces fuse to one another. Upon discontinuance of the exposure to the microwave energy, and after permitting the thermoplastic surfaces to cool from the melted condition, the two items are joined together by the fused thermoplastic surfaces. The deposited material has a thickness which is preferably no greater than a skin depth, δ_s, which is related to the frequency of the microwave radiation and characteristics of the deposited material in accordance with an equation.
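
    The abstract does not reproduce the equation for the skin depth. As a hedged illustration only, the sketch below uses the standard good-conductor approximation, δ_s = sqrt(2ρ / (ωμ)) with ω = 2πf and μ = μ0·μr, which is an assumption and may differ from the expression in the patent. The function name `skin_depth`, the copper resistivity, and the 2.45 GHz frequency are likewise illustrative choices, not values from the source.

```python
import math

MU_0 = 4 * math.pi * 1e-7  # vacuum permeability in H/m (classical value)


def skin_depth(resistivity_ohm_m: float, frequency_hz: float,
               relative_permeability: float = 1.0) -> float:
    """Skin depth in metres under the good-conductor approximation.

    Assumed formula (not taken from the patent abstract):
        delta_s = sqrt(2 * rho / (omega * mu)), omega = 2 * pi * f
    """
    omega = 2 * math.pi * frequency_hz
    mu = MU_0 * relative_permeability
    return math.sqrt(2 * resistivity_ohm_m / (omega * mu))


if __name__ == "__main__":
    # Illustrative inputs: copper (about 1.68e-8 ohm*m) at 2.45 GHz.
    # The result is on the order of one micrometre.
    print(skin_depth(1.68e-8, 2.45e9))
```

    Under this approximation the skin depth shrinks as frequency rises, so a thinner deposited layer would satisfy the "no greater than a skin depth" preference at higher microwave frequencies.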

  11. Item Parameter Estimation for the MIRT Model: Bias and Precision of Confirmatory Factor Analysis-Based Models

    ERIC Educational Resources Information Center

    Finch, Holmes

    2010-01-01

    The accuracy of item parameter estimates in the multidimensional item response theory (MIRT) model context is one that has not been researched in great detail. This study examines the ability of two confirmatory factor analysis models specifically for dichotomous data to properly estimate item parameters using common formulae for converting factor…

  12. Modeling Item-Level and Step-Level Invariance Effects in Polytomous Items Using the Partial Credit Model

    ERIC Educational Resources Information Center

    Gattamorta, Karina A.; Penfield, Randall D.; Myers, Nicholas D.

    2012-01-01

    Measurement invariance is a common consideration in the evaluation of the validity and fairness of test scores when the tested population contains distinct groups of examinees, such as examinees receiving different forms of a translated test. Measurement invariance in polytomous items has traditionally been evaluated at the item-level,…

  13. Conditional Covariance Theory and DETECT for Polytomous Items. Research Report. ETS RR-04-50

    ERIC Educational Resources Information Center

    Zhang, Jinming

    2004-01-01

    This paper extends the theory of conditional covariances to polytomous items. It has been mathematically proven that under some mild conditions, commonly assumed in the analysis of response data, the conditional covariance of two items, dichotomously or polytomously scored, is positive if the two items are dimensionally homogeneous and negative…

  14. Estimation of Item Response Theory Parameters in the Presence of Missing Data

    ERIC Educational Resources Information Center

    Finch, Holmes

    2008-01-01

    Missing data are a common problem in a variety of measurement settings, including responses to items on both cognitive and affective assessments. Researchers have shown that such missing data may create problems in the estimation of item difficulty parameters in the Item Response Theory (IRT) context, particularly if they are ignored. At the same…

  15. A Conditional Exposure Control Method for Multidimensional Adaptive Testing

    ERIC Educational Resources Information Center

    Finkelman, Matthew; Nering, Michael L.; Roussos, Louis A.

    2009-01-01

    In computerized adaptive testing (CAT), ensuring the security of test items is a crucial practical consideration. A common approach to reducing item theft is to define maximum item exposure rates, i.e., to limit the proportion of examinees to whom a given item can be administered. Numerous methods for controlling exposure rates have been proposed…

  16. Existing reporting guidelines for clinical trials are not completely relevant for implantable medical devices: a systematic review.

    PubMed

    Motte, Anne-France; Diallo, Stéphanie; van den Brink, Hélène; Châteauvieux, Constance; Serrano, Carole; Naud, Carole; Steelandt, Julie; Alsac, Jean-Marc; Aubry, Pierre; Cour, Florence; Pellerin, Olivier; Pineau, Judith; Prognon, Patrice; Borget, Isabelle; Bonan, Brigitte; Martelli, Nicolas

    2017-11-01

    The aim of this study was to determine relevant items for reporting clinical trials on implantable medical devices (IMDs) and to identify reporting guidelines which include these items. A panel of experts identified the most relevant items for evaluating IMDs from an initial list based on reference papers. We then conducted a systematic review of articles indexed in MEDLINE. We retrieved reporting guidelines from the EQUATOR network's library for health research reporting. Finally, we screened these reporting guidelines to find those using our set of reporting items. Seven relevant reporting items were selected that related to four topics: randomization, learning curve, surgical setting, and device information. A total of 348 reporting guidelines were identified, among which 26 met our inclusion criteria. However, none of the 26 reporting guidelines presented all seven items together. The most frequently reported item was timing of randomization (65%). On the contrary, device information and learning curve effects were poorly specified. To our knowledge, this study is the first to identify specific items related to IMDs in reporting guidelines for clinical trials. We have shown that no existing reporting guideline is totally suitable for these devices. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Unidimensional and Multidimensional Models for Item Response Theory.

    ERIC Educational Resources Information Center

    McDonald, Roderick P.

    This paper provides an up-to-date review of the relationship between item response theory (IRT) and (nonlinear) common factor theory and draws out of this relationship some implications for current and future research in IRT. Nonlinear common factor analysis yields a natural embodiment of the weak principle of local independence in appropriate…

  18. Effects of age on negative subsequent memory effects associated with the encoding of item and item-context information.

    PubMed

    Mattson, Julia T; Wang, Tracy H; de Chastelaine, Marianne; Rugg, Michael D

    2014-12-01

    It has consistently been reported that "negative" subsequent memory effects--lower study activity for later remembered than later forgotten items--are attenuated in older individuals. The present functional magnetic resonance imaging study investigated whether these findings extend to subsequent memory effects associated with successful encoding of item-context information. Older (n = 25) and young (n = 17) subjects were scanned while making 1 of 2 encoding judgments on a series of pictures. Memory was assessed for the study item and, for items judged old, the item's encoding task. Both memory judgments were made using confidence ratings, permitting item and source memory strength to be unconfounded and source confidence to be equated across age groups. Replicating prior findings, negative item effects in regions of the default mode network in young subjects were reversed in older subjects. Negative source effects, however, were invariant with respect to age and, in both age groups, the magnitude of the effects correlated with source memory performance. It is concluded that negative item effects do not reflect processes necessary for the successful encoding of item-context associations in older subjects. Negative source effects, in contrast, appear to reflect the engagement of processes that are equally important for successful episodic encoding in older and younger individuals. © The Author 2013. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  19. Integrating patient reported outcome measures and computerized adaptive test estimates on the same common metrics: an example from the assessment of activities in rheumatoid arthritis.

    PubMed

    Doğanay Erdoğan, Beyza; Elhan, Atilla Halil; Kaskatı, Osman Tolga; Öztuna, Derya; Küçükdeveci, Ayşe Adile; Kutlay, Şehim; Tennant, Alan

    2017-10-01

    This study aimed to explore the potential of an inclusive and fully integrated measurement system for the Activities component of the International Classification of Functioning, Disability and Health (ICF), incorporating four classical scales, including the Health Assessment Questionnaire (HAQ), and Computerized Adaptive Testing (CAT). Three hundred patients with rheumatoid arthritis (RA) answered relevant questions from four questionnaires. Rasch analysis was performed to create an item bank using this item pool. A further 100 RA patients were recruited for a CAT application. Both real and simulated CATs were applied and the agreement between these CAT-based scores and 'paper-pencil' scores was evaluated with the intraclass correlation coefficient (ICC). Anchoring strategies were used to obtain a direct translation from the item bank common metric to the HAQ score. Mean age of the 300 patients was 52.3 ± 11.7 years; disease duration was 11.3 ± 8.0 years; 74.7% were women. After testing for the assumptions of Rasch analysis, a 28-item Activities item bank was created. The agreement between CAT-based scores and paper-pencil scores was high (ICC = 0.993). Using those HAQ items in the item bank as anchoring items, another Rasch analysis was performed with HAQ-8 scores as separate items together with anchoring items. Finally a conversion table of the item bank common metric to the HAQ scores was created. A fully integrated and inclusive health assessment system, illustrating the Activities component of the ICF, was built to assess RA patients. Raw score to metric conversions and vice versa were available, giving access to the metric by a simple look-up table. © 2015 Asia Pacific League of Associations for Rheumatology and Wiley Publishing Asia Pty Ltd.

  20. Tracking functional status across the spinal cord injury lifespan: linking pediatric and adult patient-reported outcome scores.

    PubMed

    Tian, Feng; Ni, Pengsheng; Mulcahey, M J; Hambleton, Ronald K; Tulsky, David; Haley, Stephen M; Jette, Alan M

    2014-11-01

    To use item response theory (IRT) methods to link scores from 2 recently developed contemporary functional outcome measures, the adult Spinal Cord Injury-Functional Index (SCI-FI) and the Pedi SCI (both the parent version and the child version). Secondary data analysis of the physical functioning items of the adult SCI-FI and the Pedi SCI instruments. We used a nonequivalent group design with items common to both instruments and the Stocking-Lord method for the linking. Linking was conducted so that the adult SCI-FI and Pedi SCI scaled scores could be compared. Community. This study included a total sample of 1558 participants. Pedi SCI items were administered to a sample of children (n=381) with SCI aged 8 to 21 years, and of parents/caregivers (n=322) of children with SCI aged 4 to 21 years. Adult SCI-FI items were administered to a sample of adults (n=855) with SCI aged 18 to 92 years. Not applicable. Five scales common to both instruments were included in the analysis: Wheelchair, Daily Routine/Self-care, Daily Routine/Fine Motor, Ambulation, and General Mobility functioning. Confirmatory factor analysis and exploratory factor analysis results indicated that the 5 scales are unidimensional. A graded response model was used to calibrate the items. Misfitting items were identified and removed from the item banks. Items that function differently between the adult and child samples (ie, exhibit differential item functioning) were identified and removed from the common items used for linking. Domain scores from the Pedi SCI instruments were transformed onto the adult SCI-FI metric. This IRT linking allowed estimation of adult SCI-FI scale scores based on Pedi SCI scale scores and vice versa; therefore, it provides clinicians with a means of tracking long-term functional data for children with an SCI across their entire lifespan. Copyright © 2014 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  1. Development of the PROMIS positive emotional and sensory expectancies of smoking item banks.

    PubMed

    Tucker, Joan S; Shadel, William G; Edelen, Maria Orlando; Stucky, Brian D; Li, Zhen; Hansen, Mark; Cai, Li

    2014-09-01

    The positive emotional and sensory expectancies of cigarette smoking include improved cognitive abilities, positive affective states, and pleasurable sensorimotor sensations. This paper describes development of Positive Emotional and Sensory Expectancies of Smoking item banks that will serve to standardize the assessment of this construct among daily and nondaily cigarette smokers. Data came from daily (N = 4,201) and nondaily (N =1,183) smokers who completed an online survey. To identify a unidimensional set of items, we conducted item factor analyses, item response theory analyses, and differential item functioning analyses. Additionally, we evaluated the performance of fixed-item short forms (SFs) and computer adaptive tests (CATs) to efficiently assess the construct. Eighteen items were included in the item banks (15 common across daily and nondaily smokers, 1 unique to daily, 2 unique to nondaily). The item banks are strongly unidimensional, highly reliable (reliability = 0.95 for both), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.86). Results from simulated CATs indicated that, on average, less than 8 items are needed to assess the construct with adequate precision using the item banks. These analyses identified a new set of items that can assess the positive emotional and sensory expectancies of smoking in a reliable and standardized manner. Considerable efficiency in assessing this construct can be achieved by using the item bank SF, employing computer adaptive tests, or selecting subsets of items tailored to specific research or clinical purposes. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  2. The Mindful Attention Awareness Scale: Further Examination of Dimensionality, Reliability, and Concurrent Validity Estimates.

    PubMed

    Osman, Augustine; Lamis, Dorian A; Bagge, Courtney L; Freedenthal, Stacey; Barnes, Sean M

    2016-01-01

    We examined the factor structure and psychometric properties of the Mindful Attention Awareness Scale (MAAS) in a sample of 810 undergraduate students. Using common exploratory factor analysis (EFA), we obtained evidence for a 1-factor solution (41.84% common variance). To confirm unidimensionality of the 15-item MAAS, we conducted a 1-factor confirmatory factor analysis (CFA). Results of the EFA and CFA, respectively, provided support for a unidimensional model. Using differential item functioning analysis methods within item response theory modeling (IRT-based DIF), we found that individuals with high and low levels of nonattachment responded similarly to the MAAS items. Following a detailed item analysis, we proposed a 5-item short version of the instrument and present descriptive statistics and composite score reliability for the short and full versions of the MAAS. Finally, correlation analyses showed that scores on the full and short versions of the MAAS were associated with measures assessing related constructs. The 5-item MAAS is as useful as the original MAAS in enhancing our understanding of the mindfulness construct.

  3. Anchor Selection Strategies for DIF Analysis: Review, Assessment, and New Approaches

    ERIC Educational Resources Information Center

    Kopf, Julia; Zeileis, Achim; Strobl, Carolin

    2015-01-01

    Differential item functioning (DIF) indicates the violation of the invariance assumption, for instance, in models based on item response theory (IRT). For item-wise DIF analysis using IRT, a common metric for the item parameters of the groups that are to be compared (e.g., for the reference and the focal group) is necessary. In the Rasch model,…

  4. Polytomous versus Dichotomous Scoring on Multiple-Choice Examinations: Development of a Rubric for Rating Partial Credit

    ERIC Educational Resources Information Center

    Grunert, Megan L.; Raker, Jeffrey R.; Murphy, Kristen L.; Holme, Thomas A.

    2013-01-01

    The concept of assigning partial credit on multiple-choice test items is considered for items from ACS Exams. Because the items on these exams, particularly the quantitative items, use common student errors to define incorrect answers, it is possible to assign partial credits to some of these incorrect responses. To do so, however, it becomes…

  5. 47 CFR 32.25 - Unusual items and contingent liabilities.

    Code of Federal Regulations, 2011 CFR

    2011-10-01

    ... 47 Telecommunication 2 2011-10-01 2011-10-01 false Unusual items and contingent liabilities. 32.25 Section 32.25 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES UNIFORM SYSTEM OF ACCOUNTS FOR TELECOMMUNICATIONS COMPANIES General Instructions § 32.25 Unusual items and...

  6. 47 CFR 32.25 - Unusual items and contingent liabilities.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... 47 Telecommunication 2 2013-10-01 2013-10-01 false Unusual items and contingent liabilities. 32.25 Section 32.25 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES UNIFORM SYSTEM OF ACCOUNTS FOR TELECOMMUNICATIONS COMPANIES General Instructions § 32.25 Unusual items and...

  7. 47 CFR 32.25 - Unusual items and contingent liabilities.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... 47 Telecommunication 2 2014-10-01 2014-10-01 false Unusual items and contingent liabilities. 32.25 Section 32.25 Telecommunication FEDERAL COMMUNICATIONS COMMISSION (CONTINUED) COMMON CARRIER SERVICES UNIFORM SYSTEM OF ACCOUNTS FOR TELECOMMUNICATIONS COMPANIES General Instructions § 32.25 Unusual items and...

  8. Successfully Transitioning to Linear Equations

    ERIC Educational Resources Information Center

    Colton, Connie; Smith, Wendy M.

    2014-01-01

    The Common Core State Standards for Mathematics (CCSSI 2010) asks students as early as fourth grade to solve word problems using equations with variables. Equations studied at this level generate a single solution, such as the equation x + 10 = 25. For students in fifth grade, the Common Core standard for algebraic thinking expects them to…

  9. On Studying Common Factor Dominance and Approximate Unidimensionality in Multicomponent Measuring Instruments with Discrete Items

    ERIC Educational Resources Information Center

    Raykov, Tenko; Marcoulides, George A.

    2018-01-01

    This article outlines a procedure for examining the degree to which a common factor may be dominating additional factors in a multicomponent measuring instrument consisting of binary items. The procedure rests on an application of the latent variable modeling methodology and accounts for the discrete nature of the manifest indicators. The method…

  10. Development of a noise annoyance sensitivity scale

    NASA Technical Reports Server (NTRS)

    Bregman, H. L.; Pearson, R. G.

    1972-01-01

    Examining the problem of noise pollution from the psychological rather than the engineering view, a test of human sensitivity to noise was developed against the criterion of noise annoyance. Test development evolved from a previous study in which biographical, attitudinal, and personality data were collected on a sample of 166 subjects drawn from the adult community of Raleigh. Analysis revealed that only a small subset of the data collected was predictive of noise annoyance. Item analysis yielded 74 predictive items that composed the preliminary noise sensitivity test. This was administered to a sample of 80 adults who later rated the annoyance value of six sounds (equated in terms of peak sound pressure level) presented in a simulated home, living-room environment. A predictive model involving 20 test items was developed using multiple regression techniques, and an item weighting scheme was evaluated.

  11. Development of the PROMIS health expectancies of smoking item banks.

    PubMed

    Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Stucky, Brian D; Cerully, Jennifer; Li, Zhen; Hansen, Mark; Cai, Li

    2014-09-01

    Smokers' health-related outcome expectancies are associated with a number of important constructs in smoking research, yet there are no measures currently available that focus exclusively on this domain. This paper describes the development and evaluation of item banks for assessing the health expectancies of smoking. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of health expectancies items for daily and nondaily smokers. We also evaluated the performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess health expectancies. A total of 24 items were included in the Health Expectancies item banks; 13 items are common across daily and nondaily smokers, 6 are unique to daily, and 5 are unique to nondaily. For both daily and nondaily smokers, the Health Expectancies item banks are unidimensional, reliable (reliability = 0.95 and 0.96, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.87). Results from simulated CATs showed that health expectancies can be assessed with good precision with an average of 5-6 items adaptively selected from the item banks. Health expectancies of smoking can be assessed on the basis of these item banks via SFs, CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  12. Development of the PROMIS nicotine dependence item banks.

    PubMed

    Shadel, William G; Edelen, Maria Orlando; Tucker, Joan S; Stucky, Brian D; Hansen, Mark; Cai, Li

    2014-09-01

    Nicotine dependence is a core construct important for understanding cigarette smoking and smoking cessation behavior. This article describes analyses conducted to develop and evaluate item banks for assessing nicotine dependence among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of nicotine dependence items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess dependence. A total of 32 items were included in the Nicotine Dependence item banks; 22 items are common across daily and nondaily smokers, 5 are unique to daily smokers, and 5 are unique to nondaily smokers. For both daily and nondaily smokers, the Nicotine Dependence item banks are strongly unidimensional, highly reliable (reliability = 0.97 and 0.97, respectively), and perform similarly across gender, age, and race/ethnicity groups. SFs common to daily and nondaily smokers consist of 8 and 4 items (reliability = 0.91 and 0.81, respectively). Results from simulated CATs showed that dependence can be assessed with very good precision for most respondents using fewer than 6 items adaptively selected from the item banks. Nicotine dependence on cigarettes can be assessed on the basis of these item banks via one of the SFs, by using CATs, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Development of the PROMIS negative psychosocial expectancies of smoking item banks.

    PubMed

    Stucky, Brian D; Edelen, Maria Orlando; Tucker, Joan S; Shadel, William G; Cerully, Jennifer; Kuhfeld, Megan; Hansen, Mark; Cai, Li

    2014-09-01

    Negative psychosocial expectancies of smoking include aspects of social disapproval and disappointment in oneself. This paper describes analyses conducted to develop and evaluate item banks for assessing psychosocial expectancies among daily and nondaily smokers. Using data from a sample of daily (N = 4,201) and nondaily (N = 1,183) smokers, we conducted a series of item factor analyses, item response theory analyses, and differential item functioning analyses (according to gender, age, and race/ethnicity) to arrive at a unidimensional set of psychosocial expectancies items for daily and nondaily smokers. We also evaluated performance of short forms (SFs) and computer adaptive tests (CATs) to efficiently assess psychosocial expectancies. A total of 21 items were included in the Psychosocial Expectancies item banks: 14 items are common across daily and nondaily smokers, 6 are unique to daily, and 1 is unique to nondaily. For both daily and nondaily smokers, the Psychosocial Expectancies item banks are strongly unidimensional, highly reliable (reliability = 0.95 and 0.93, respectively), and perform similarly across gender, age, and race/ethnicity groups. A SF common to daily and nondaily smokers consists of 6 items (reliability = 0.85). Results from simulated CATs showed that, on average, fewer than 8 items are needed to assess psychosocial expectancies with adequate precision when using the item banks. Psychosocial expectancies of smoking can be assessed on the basis of these item banks via the SF, by using CAT, or through a tailored set of items selected for a specific research purpose. © The Author 2014. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. Modeling individualized coefficient alpha to measure quality of test score data.

    PubMed

    Liu, Molei; Hu, Ming; Zhou, Xiao-Hua

    2018-05-23

    Individualized coefficient alpha is defined. It is item and subject specific and is used to measure the quality of test score data with heterogeneity among the subjects and items. A regression model is developed based on 3 sets of generalized estimating equations. The first set of generalized estimating equations models the expectation of the responses, the second set models the response's variance, and the third set is proposed to estimate the individualized coefficient alpha, defined and used to measure individualized internal consistency of the responses. We also use different techniques to extend our method to handle missing data. Asymptotic property of the estimators is discussed, based on which inference on the coefficient alpha is derived. Performance of our method is evaluated through simulation study and real data analysis. The real data application is from a health literacy study in Hunan province of China. Copyright © 2018 John Wiley & Sons, Ltd.

  15. CTTITEM: SAS macro and SPSS syntax for classical item analysis.

    PubMed

    Lei, Pui-Wa; Wu, Qiong

    2007-08-01

    This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
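
    As an illustration of the statistics named in this abstract, the following is a minimal sketch of Cronbach's alpha, the item difficulty index (item mean), and the point-biserial item-total correlation. It is not the CTTITEM macro or its SPSS syntax; the function names and the small response matrix are hypothetical, and the item-total correlation here is the uncorrected one (the item is included in the total).

```python
from math import sqrt
from statistics import mean, variance


def column(scores, j):
    """Return the scores of item j across all examinees."""
    return [row[j] for row in scores]


def cronbach_alpha(scores):
    """Cronbach's alpha from an examinee-by-item score matrix (sample variances)."""
    k = len(scores[0])
    item_vars = [variance(column(scores, j)) for j in range(k)]
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def item_difficulty(scores):
    """Item difficulty index: the mean score on each item (proportion correct for 0/1 items)."""
    return [mean(column(scores, j)) for j in range(len(scores[0]))]


def point_biserial(scores, j):
    """Pearson correlation between item j and the total score (uncorrected)."""
    x = column(scores, j)
    y = [sum(row) for row in scores]
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


# Hypothetical data: 4 examinees (rows) by 3 dichotomous items (columns).
scores = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]

print(cronbach_alpha(scores))     # 0.75 for this matrix
print(item_difficulty(scores))    # [0.75, 0.5, 0.25]
print(point_biserial(scores, 1))  # item 2 against the total score
```

    For dichotomous items the point-biserial correlation is simply the Pearson correlation between the 0/1 item score and the total, which is why a single routine covers it in this sketch.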

  16. Effects of Item Parameter Drift on Vertical Scaling with the Nonequivalent Groups with Anchor Test (NEAT) Design

    ERIC Educational Resources Information Center

    Ye, Meng; Xin, Tao

    2014-01-01

    The authors explored the effects of drifting common items on vertical scaling within the higher order framework of item parameter drift (IPD). The results showed that if IPD occurred between a pair of test levels, the scaling performance started to deviate from the ideal state, as indicated by bias of scaling. When there were two items drifting…

  17. Do the Guideline Violations Influence Test Difficulty of High-Stake Test?: An Investigation on University Entrance Examination in Turkey

    ERIC Educational Resources Information Center

    Atalmis, Erkan Hasan

    2016-01-01

    Multiple-choice (MC) items are commonly used in high-stake tests. Thus, each item of such tests should be meticulously constructed to increase the accuracy of decisions based on test results. Haladyna and his colleagues (2002) addressed the valid item-writing guidelines to construct high quality MC items in order to increase test reliability and…

  18. Assessing cross-cultural validity of scales: a methodological review and illustrative example.

    PubMed

    Beckstead, Jason W; Yang, Chiu-Yueh; Lengacher, Cecile A

    2008-01-01

    In this article, we assessed the cross-cultural validity of the Women's Role Strain Inventory (WRSI), a multi-item instrument that assesses the degree of strain experienced by women who juggle the roles of working professional, student, wife and mother. Cross-cultural validity is evinced by demonstrating the measurement invariance of the WRSI. Measurement invariance is the extent to which items of multi-item scales function in the same way across different samples of respondents. We assessed measurement invariance by comparing a sample of working women in Taiwan with a similar sample from the United States. Structural equation models (SEMs) were employed to determine the invariance of the WRSI and to estimate the unique validity variance of its items. This article also provides nurse-researchers with the necessary underlying measurement theory and illustrates how SEMs may be applied to assess cross-cultural validity of instruments used in nursing research. Overall performance of the WRSI was acceptable but our analysis showed that some items did not display invariance properties across samples. Item analysis is presented and recommendations for improving the instrument are discussed.

  19. Developing an Experiential Definition of Recovery: Participatory Research with Recovering Substance Abusers from Multiple Pathways

    PubMed Central

    Borkman, Thomasina J.; Stunz, Aina; Kaskutas, Lee Ann

    2016-01-01

    Background: The What is Recovery? (WIR) study identified specific elements of a recovery definition that people in substance abuse recovery from multiple pathways would endorse. Objectives: To explain how participatory research contributed to the development of a comprehensive pool of items defining recovery; and to identify the commonality between the specific items endorsed by participants as defining recovery and the abstract components of recovery found in four important broad recovery definitions. Methods: A four-step, mixed-methods, iterative process was used to develop and pretest items (August 2010 to February 2012). Online survey recruitment (n = 238) was done via email lists of individuals in recovery and electronic advertisements; 54 were selected for in-depth telephone interviews. Analyses using experientially-based and survey research criteria resulted in a revised item pool of 47 refined and specific items. The WIR items were matched with the components of four important definitions. Results: Recovering participants (1) proposed and validated new items; (2) developed an alternative response category to the Likert; (3) suggested criteria for eliminating items irrelevant to recovery. The matching of WIR items with the components of important abstract definitions revealed extensive commonality. Conclusions, importance: The WIR items define recovery as ways of being, as a growth and learning process involving internal values and self-awareness with moral dimensions. This is the first wide-scale research identifying specific items defining recovery, which can be used to guide service provision in Recovery-Oriented Systems of Care. PMID:27159851

  20. Developing an Experiential Definition of Recovery: Participatory Research With Recovering Substance Abusers From Multiple Pathways.

    PubMed

    Borkman, Thomasina Jo; Stunz, Aina; Kaskutas, Lee Ann

    2016-07-28

    The What is Recovery? (WIR) study identified specific elements of a recovery definition that people in substance abuse recovery from multiple pathways would endorse. To explain how participatory research contributed to the development of a comprehensive pool of items defining recovery; and to identify the commonality between the specific items endorsed by participants as defining recovery and the abstract components of recovery found in four important broad recovery definitions. A four-step, mixed-methods, iterative process was used to develop and pretest items (August 2010 to February 2012). Online survey recruitment (n = 238) was done via email lists of individuals in recovery and electronic advertisements; 54 were selected for in-depth telephone interviews. Analyses using experientially-based and survey research criteria resulted in a revised item pool of 47 refined and specific items. The WIR items were matched with the components of four important definitions. Recovering participants (1) proposed and validated new items; (2) developed an alternative response category to the Likert; (3) suggested criteria for eliminating items irrelevant to recovery. The matching of WIR items with the components of important abstract definitions revealed extensive commonality. The WIR items define recovery as ways of being, as a growth and learning process involving internal values and self-awareness with moral dimensions. This is the first wide-scale research identifying specific items defining recovery, which can be used to guide service provision in Recovery-Oriented Systems of Care.

  1. Advising on Preferred Reporting Items for patient-reported outcome instrument development: the PRIPROID.

    PubMed

    Hou, Zheng-Kun; Liu, Feng-Bin; Fang, Ji-Qian; Li, Xiao-Ying; Li, Li-Juan; Lin, Chu-Hua

    2013-03-01

    The reporting of patient-reported outcomes (PRO) instrument development is vital for both researchers and clinicians to determine its validity; thus, we propose the Preferred Reporting Items for PRO Instrument Development (PRIPROID) to improve the quality of reports. Abiding by the guidance published by the Enhancing the QUAlity and Transparency Of health Research (EQUATOR) Network, we performed 6 steps for item development: identified the need for a guideline, performed a literature review, obtained funding for the guideline initiative, identified participants, conducted a Delphi exercise and generated a list of PRIPROID items for consideration at the face-to-face meeting. Twenty-three item subheadings under 7 topics were included: title and structured abstract, rationale, objectives, intention, eligibility criteria, conceptual framework, items generation, response options, scoring, times, administrative modes, burden assessment, properties assessment, statistical methods, participants, main results, and additional analysis, summary of evidence, limitations, clinical attentions, and conclusions, item pools or final form, and funding. The PRIPROID contains many elements of PRO research, and it assists researchers in reporting their results more accurately and, to a certain degree, in using this instrument to evaluate the quality of the research methods.

  2. Item Response Modeling of Multivariate Count Data with Zero Inflation, Maximum Inflation, and Heaping

    ERIC Educational Resources Information Center

    Magnus, Brooke E.; Thissen, David

    2017-01-01

    Questionnaires that include items eliciting count responses are becoming increasingly common in psychology. This study proposes methodological techniques to overcome some of the challenges associated with analyzing multivariate item response data that exhibit zero inflation, maximum inflation, and heaping at preferred digits. The modeling…

  3. Practical Guide to Conducting an Item Response Theory Analysis

    ERIC Educational Resources Information Center

    Toland, Michael D.

    2014-01-01

    Item response theory (IRT) is a psychometric technique used in the development, evaluation, improvement, and scoring of multi-item scales. This pedagogical article provides the necessary information needed to understand how to conduct, interpret, and report results from two commonly used ordered polytomous IRT models (Samejima's graded…

  4. 38 CFR 3.1606 - Transportation items.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2010-07-01 2010-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost of...

  5. 38 CFR 3.1606 - Transportation items.

    Code of Federal Regulations, 2011 CFR

    2011-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2011-07-01 2011-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost of...

  6. 38 CFR 3.1606 - Transportation items.

    Code of Federal Regulations, 2014 CFR

    2014-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2014-07-01 2014-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost of...

  7. 38 CFR 3.1606 - Transportation items.

    Code of Federal Regulations, 2013 CFR

    2013-07-01

    ... 38 Pensions, Bonuses, and Veterans' Relief 1 2013-07-01 2013-07-01 false Transportation items. 3... Burial Benefits § 3.1606 Transportation items. The transportation costs of those persons who come within... shipment. (6) Cost of transportation by common carrier including amounts paid as Federal taxes. (7) Cost of...

  8. Analyzing Longitudinal Item Response Data via the Pairwise Fitting Method

    ERIC Educational Resources Information Center

    Fu, Zhi-Hui; Tao, Jian; Shi, Ning-Zhong; Zhang, Ming; Lin, Nan

    2011-01-01

    Multidimensional item response theory (MIRT) models can be applied to longitudinal educational surveys where a group of individuals are administered different tests over time with some common items. However, computational problems typically arise as the dimension of the latent variables increases. This is especially true when the latent variable…

  9. Disruption of Relational Processing Underlies Poor Memory for Order

    ERIC Educational Resources Information Center

    Jonker, Tanya R.; MacLeod, Colin M.

    2015-01-01

    McDaniel and Bugg (2008) proposed that relatively uncommon stimuli and encoding tasks encourage elaborative encoding of individual items (item-specific processing), whereas relatively typical or common encoding tasks encourage encoding of associations among list items (relational processing). It is this relational processing that is thought to…

  10. Development of the parental needs scale for rare diseases: a tool for measuring the supportive care needs of parents caring for a child with a rare disease.

    PubMed

    Pelentsov, Lemuel J; Fielder, Andrea L; Laws, Thomas A; Esterman, Adrian J

    2016-01-01

    Children and families affected by rare diseases have received scant consideration from the medical, scientific, and political communities, with parents' needs especially having received little attention. Affected parents often have limited access to information and support and appropriate health care services. While scales to measure the needs of parents of children with chronic illnesses have been developed, there have been no previous attempts to develop a scale to assess the needs of parents of children with rare diseases. To develop a scale for measuring the supportive care needs of parents of children with rare diseases. A total of 301 responses to our Parental Needs Survey were randomly divided into two halves, one for exploratory factor analysis and the other for confirmatory factor analysis (CFA). After removing unsuitable items, exploratory factor analysis was undertaken to determine the factor structure of the data. CFA using structural equation modeling was then undertaken to confirm the factor structure. Seventy-two items were entered into the CFA, with a scree plot showing a likely four-factor solution. The results provided four independent subscales of parental needs: Understanding the disease (four items); Working with health professionals (four items); Emotional issues (three items); and Financial needs (three items). The structural equation modeling confirmed the suitability of the four-factor solution and demonstrated that the four subscales could be added to provide an overall scale of parental need. This is the first scale developed to measure the supportive care needs of parents of children with rare diseases. The scale is suitable for use in surveys to develop policy, in individual clinical assessments, and, potentially, for evaluating new programs. Measuring the supportive care needs of parents caring for a child with a rare disease will hopefully lead to better physical and psychological health outcomes for parents and their affected children.

  11. The suitability of common compressibility equations for characterizing plasticity of diverse powders.

    PubMed

    Paul, Shubhajit; Sun, Changquan Calvin

    2017-10-30

    The analysis of powder compressibility data yields useful information for characterizing compaction behavior and mechanical properties of powders, especially plasticity. Among the many compressibility equations proposed in powder compaction research, the Heckel equation and the Kawakita equation are the most commonly used, despite their known limitations. Systematic evaluation of the performance in analyzing compressibility data suggested the Kuentz-Leuenberger equation is superior to both the Heckel equation and the Kawakita equation for characterizing plasticity of powders exhibiting a wide range of mechanical properties. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Examining Power and Type 1 Error for Step and Item Level Tests of Invariance: Investigating the Effect of the Number of Item Score Levels

    ERIC Educational Resources Information Center

    Ayodele, Alicia Nicole

    2017-01-01

    Within polytomous items, differential item functioning (DIF) can take on various forms due to the number of response categories. The lack of invariance at this level is referred to as differential step functioning (DSF). The most common DSF methods in the literature are the adjacent category log odds ratio (AC-LOR) estimator and cumulative…

  13. Using the Cumulative Common Log-Odds Ratio to Identify Differential Item Functioning of Rating Scale Items in the Exercise and Sport Sciences

    ERIC Educational Resources Information Center

    Penfield, Randall D.; Giacobbi, Peter R., Jr.; Myers, Nicholas D.

    2007-01-01

    One aspect of construct validity is the extent to which the measurement properties of a rating scale are invariant across the groups being compared. An increasingly used method for assessing between-group differences in the measurement properties of items of a scale is the framework of differential item functioning (DIF). In this paper we…

  14. Assessment of item-writing flaws in multiple-choice questions.

    PubMed

    Nedeau-Cayo, Rosemarie; Laughlin, Deborah; Rus, Linda; Hall, John

    2013-01-01

    This study evaluated the quality of multiple-choice questions used in a hospital's e-learning system. Constructing well-written questions is fraught with difficulty, and item-writing flaws are common. Study results revealed that most items contained flaws and were written at the knowledge/comprehension level. Few items had linked objectives, and no association was found between the presence of objectives and flaws. Recommendations include education for writing test questions.

  15. Evaluating HIV Knowledge Questionnaires Among Men Who Have Sex with Men: A Multi-Study Item Response Theory Analysis.

    PubMed

    Janulis, Patrick; Newcomb, Michael E; Sullivan, Patrick; Mustanski, Brian

    2018-01-01

    Knowledge about the transmission, prevention, and treatment of HIV remains a critical element in psychosocial models of HIV risk behavior and is commonly used as an outcome in HIV prevention interventions. However, most HIV knowledge questions have not undergone rigorous psychometric testing such as using item response theory. The current study used data from six studies of men who have sex with men (MSM; n = 3565) to (1) examine the item properties of HIV knowledge questions, (2) test for differential item functioning on commonly studied characteristics (i.e., age, race/ethnicity, and HIV risk behavior), (3) select items with the optimal item characteristics, and (4) leverage this combined dataset to examine the potential moderating effect of age on the relationship between condomless anal sex (CAS) and HIV knowledge. Findings indicated that existing questions tend to poorly differentiate those with higher levels of HIV knowledge, but items were relatively robust across diverse individuals. Furthermore, age moderated the relationship between CAS and HIV knowledge with older MSM having the strongest association. These findings suggest that additional items are required in order to capture a more nuanced understanding of HIV knowledge and that the association between CAS and HIV knowledge may vary by age.

  16. The precategorical nature of visual short-term memory.

    PubMed

    Quinlan, Philip T; Cohen, Dale J

    2016-11-01

    We conducted a series of recognition experiments that assessed whether visual short-term memory (VSTM) is sensitive to shared category membership of to-be-remembered (tbr) images of common objects. In Experiment 1 some of the tbr items shared the same basic level category (e.g., hand axe): Such items were no better retained than others. In the remaining experiments, displays contained different images of items from the same higher-level category (e.g., food: a bagel, a sandwich, a pizza). Evidence from the later experiments did suggest that participants were sensitive to the categorical relations present in the displays. However, when separate measures of sensitivity and bias were computed, the data revealed no effects on sensitivity, but a greater tendency to respond positively to noncategory items relative to items from the depicted category. Across all experiments, there was no evidence that items from a common category were better remembered than unique items. Previous work has shown that principles of perceptual organization do affect the storage and maintenance of tbr items. The present work shows that there are no corresponding conceptual principles of organization in VSTM. It is concluded that the sort of VSTM tapped by single probe recognition methods is precategorical in nature. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  17. Promoting CPAP adherence in clinical practice: A survey of Swedish and Norwegian CPAP practitioners' beliefs and practices.

    PubMed

    Broström, Anders; Pakpour, Amir H; Nilsen, Per; Gardner, Benjamin; Ulander, Martin

    2018-03-01

    The benefits of continuous positive airway pressure (CPAP) treatment for obstructive sleep apnea are well established, but adherence tends to be low. Research exploring CPAP practitioners' beliefs around determinants of CPAP adherence, and the actions they use in clinical practice to promote CPAP adherence, is lacking. This study aimed to: (i) develop and validate a questionnaire to assess beliefs and current practices among CPAP practitioners; (ii) explore practitioners' beliefs regarding the main determinants of patient adherence, and the actions practitioners most commonly use to promote CPAP adherence; and (iii) explore the associations between perceived determinants and adherence-promotion actions. One hundred and forty-two CPAP practitioners in Sweden and Norway, representing 93% of all Swedish and 62% of all Norwegian CPAP centres, were surveyed via a questionnaire exploring potential determinants (18 items) and adherence-promotion actions (20 items). Confirmatory factor analysis and second-order structural equation modelling were used to identify patterns of beliefs, and potential associations with adherence-promotion actions. Patients' knowledge, motivation and attitudes were perceived by practitioners to be the main determinants of CPAP adherence, and educating patients about effects, management and treatment adjustments were the most common practices. Knowledge was shown to predict educational and informational actions (e.g. education about obstructive sleep apnea and CPAP). Educational and informational actions were associated with medical actions (e.g. treatment adjustment), but knowledge, attitude and support had no association with medical actions. These findings indicate that a wide variety of determinants and actions are considered important, though the only relationship observed between beliefs and actions was found for knowledge and educational and informational actions. © 2018 European Sleep Research Society.

  18. Construct validity and reliability of the Single Checking Administration of Medications Scale.

    PubMed

    O'Connell, Beverly; Hawkins, Mary; Ockerby, Cherene

    2013-06-01

    Research indicates that single checking of medications is as safe as double checking; however, many nurses are averse to independently checking medications. To assist with the introduction and use of single checking, a measure of nurses' attitudes, the thirteen-item Single Checking Administration of Medications Scale (SCAMS) was developed. We examined the psychometric properties of the SCAMS. Secondary analyses were conducted on data collected from 503 nurses across a large Australian health-care service. Analyses using exploratory and confirmatory factor analyses supported by structural equation modelling resulted in a valid twelve-item SCAMS containing two reliable subscales, the nine-item Attitudes towards single checking and three-item Advantages of single checking subscales. The SCAMS is recommended as a valid and reliable measure for monitoring nurses' attitudes to single checking prior to introducing single checking medications and after its implementation. © 2013 Wiley Publishing Asia Pty Ltd.

  19. Online Calibration Methods for the DINA Model with Independent Attributes in CD-CAT

    ERIC Educational Resources Information Center

    Chen, Ping; Xin, Tao; Wang, Chun; Chang, Hua-Hua

    2012-01-01

    Item replenishing is essential for item bank maintenance in cognitive diagnostic computerized adaptive testing (CD-CAT). In regular CAT, online calibration is commonly used to calibrate the new items continuously. However, until now no reference has publicly become available about online calibration for CD-CAT. Thus, this study investigates the…

  20. Conditional Covariance Theory and Detect for Polytomous Items

    ERIC Educational Resources Information Center

    Zhang, Jinming

    2007-01-01

    This paper extends the theory of conditional covariances to polytomous items. It has been proven that under some mild conditions, commonly assumed in the analysis of response data, the conditional covariance of two items, dichotomously or polytomously scored, given an appropriately chosen composite is positive if, and only if, the two items…

  1. Automatically Scoring Short Essays for Content. CRESST Report 836

    ERIC Educational Resources Information Center

    Kerr, Deirdre; Mousavi, Hamid; Iseli, Markus R.

    2013-01-01

    The Common Core assessments emphasize short essay constructed response items over multiple choice items because they are more precise measures of understanding. However, such items are too costly and time consuming to be used in national assessments unless a way is found to score them automatically. Current automatic essay scoring techniques are…

  2. Modeling Booklet Effects for Nonequivalent Group Designs in Large-Scale Assessment

    ERIC Educational Resources Information Center

    Hecht, Martin; Weirich, Sebastian; Siegle, Thilo; Frey, Andreas

    2015-01-01

    Multiple matrix designs are commonly used in large-scale assessments to distribute test items to students. These designs comprise several booklets, each containing a subset of the complete item pool. Besides reducing the test burden of individual students, using various booklets allows aligning the difficulty of the presented items to the assumed…

  3. Regression Effects in Angoff Ratings: Examples from Credentialing Exams

    ERIC Educational Resources Information Center

    Wyse, Adam E.

    2018-01-01

    This article discusses regression effects that are commonly observed in Angoff ratings where panelists tend to think that hard items are easier than they are and easy items are more difficult than they are in comparison to estimated item difficulties. Analyses of data from two credentialing exams illustrate these regression effects and the…

  4. The Health Education Impact Questionnaire (heiQ): an outcomes and evaluation measure for patient education and self-management interventions for people with chronic conditions.

    PubMed

    Osborne, Richard H; Elsworth, Gerald R; Whitfield, Kathryn

    2007-05-01

    This paper describes the development and validation of the Health Education Impact Questionnaire (heiQ). The aim was to develop a user-friendly, relevant, and psychometrically sound instrument for the comprehensive evaluation of patient education programs, which can be applied across a broad range of chronic conditions. Item development for the heiQ was guided by a Program Logic Model, Concept Mapping, interviews with stakeholders and psychometric analyses. Construction (N=591) and confirmatory (N=598) samples were drawn from consumers of patient education programs and hospital outpatients. The properties of the heiQ were investigated using item response theory and structural equation modeling. Over 90 candidate items were generated, with 42 items selected for inclusion in the final scale. Eight independent dimensions were derived: Positive and Active Engagement in Life (five items, Cronbach's alpha (alpha)=0.86); Health Directed Behavior (four items, alpha=0.80); Skill and Technique Acquisition (five items, alpha=0.81); Constructive Attitudes and Approaches (five items, alpha=0.81); Self-Monitoring and Insight (seven items, alpha=0.70); Health Service Navigation (five items, alpha=0.82); Social Integration and Support (five items, alpha=0.86); and Emotional Wellbeing (six items, alpha=0.89). The heiQ has high construct validity and is a reliable measure of a broad range of patient education program benefits. The heiQ will provide valuable information to clinicians, researchers, policymakers and other stakeholders about the value of patient education programs in chronic disease management.

  5. Heterocentric language in commonly used measures of social anxiety: recommended alternate wording.

    PubMed

    Weiss, Brandon J; Hope, Debra A; Capozzoli, Michelle C

    2013-03-01

    A number of self-report measures of social anxiety contain language that appears to assume heterosexuality. It is unclear how such items should be answered by individuals who are not exclusively heterosexual, which may lead to inaccurate measurement of symptoms, perpetuation of stigma, and alienation of respondents. More specific wording could improve measurement accuracy for sexual minorities as well as heterosexual respondents. Gender-neutral wording was developed for items containing the phrase "opposite sex" in commonly used self-report measures of social anxiety (Interaction Anxiousness Scale [Leary, 1983], Social Avoidance and Distress Scale [Watson & Friend, 1969], Social Interaction Anxiety Scale [Mattick & Clarke, 1998], and Social Phobia and Anxiety Inventory [Turner, Beidel, Dancu, & Stanley, 1989]). Undergraduate college students (N=405; mean age=19.88, SD=2.05) completed measures containing original and revised items. Overall, results indicated that the alternate-worded items demonstrated equivalent or slightly stronger psychometric properties compared to original items. Select alternate-worded items are recommended for clinical and research use, and directions for future research are recommended. Copyright © 2012. Published by Elsevier Ltd.

  6. Translating questionnaire items for a multi-lingual worker population: the iterative process of translation and cognitive interviews with English-, Spanish-, and Chinese-speaking workers.

    PubMed

    Fujishiro, Kaori; Gong, Fang; Baron, Sherry; Jacobson, C Jeffery; DeLaney, Sheli; Flynn, Michael; Eggerth, Donald E

    2010-02-01

    The increasing ethnic diversity of the US workforce has created a need for research tools that can be used with multi-lingual worker populations. Developing multi-language questionnaire items is a complex process; however, very little has been documented in the literature. Commonly used English items from the Job Content Questionnaire and Quality of Work Life Questionnaire were translated by two interdisciplinary bilingual teams and cognitively tested in interviews with English-, Spanish-, and Chinese-speaking workers. Common problems across languages mainly concerned response format. Language-specific problems required more conceptual than literal translations. Some items were better understood by non-English speakers than by English speakers. De-centering (i.e., modifying the English original to correspond with translation) produced better understanding for one item. Translating questionnaire items and achieving equivalence across languages require various kinds of expertise. Backward translation itself is not sufficient. More research efforts should be concentrated on qualitative approaches to developing useful research tools. Published 2009 Wiley-Liss, Inc.

  7. Measuring Latent Quantities

    ERIC Educational Resources Information Center

    McDonald, Roderick P.

    2011-01-01

    A distinction is proposed between measures and predictors of latent variables. The discussion addresses the consequences of the distinction for the true-score model, the linear factor model, Structural Equation Models, longitudinal and multilevel models, and item-response models. A distribution-free treatment of calibration and…

  8. Effect of baking and fermentation on the stable carbon and nitrogen isotope ratios of grain-based food.

    PubMed

    Bostic, Joshua N; Palafox, Sherilyn J; Rottmueller, Marina E; Jahren, A Hope

    2015-05-30

    Isotope ratio mass spectrometry (IRMS) is used extensively to reconstruct general attributes of prehistoric and modern diets in both humans and animals. In order to apply these methods to the accurate determination of specific intakes of foods/nutrients of interest, the isotopic signature of individually consumed foods must be constrained. For example, 86% of the calories consumed in the USA are derived from processed and prepared foods, but the relationship between the stable isotope composition of raw ingredients and the resulting products has not been characterized. To examine the effect of common cooking techniques on the stable isotope composition of grain-based food items, we prepared yeast buns and sugar cookies from standardized recipes and measured bulk δ¹³C and δ¹⁵N values of samples collected throughout a 75 min fermentation process (buns) and before and after baking at 190°C (buns and cookies). Simple isotope mixing models were used to determine if the isotopic signatures of 13 multi-ingredient foods could be estimated from the isotopic signatures of their constituent raw ingredients. No variations in δ¹³C or δ¹⁵N values were detected between pre- and post-baked yeast buns (pre: -24.78‰/2.61‰, post: -24.75‰/2.74‰), beet-sugar cookies (pre: -24.48‰/3.84‰, post: -24.47‰/3.57‰), and cane-sugar cookies (pre: -19.07‰/2.97‰, post: -19.02‰/3.21‰), or throughout a 75 min fermentation process in yeast buns. Using isotopic mass balance equations, the δ¹³C/δ¹⁵N values of multi-ingredient foods were estimated from the isotopic composition of constituent raw ingredients to within 0.14 ± 0.13‰/0.24 ± 0.17‰ for gravimetrically measured recipes and 0.40 ± 0.38‰/0.58 ± 0.53‰ for volumetrically measured recipes. Two common food preparation techniques, baking and fermentation, do not substantially affect the carbon or nitrogen isotopic signature of grain-based foods. Mass-balance equations can be used to accurately estimate the isotopic signature of multi-ingredient food items for which quantitative ingredient information is available. Copyright © 2015 John Wiley & Sons, Ltd.
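
    The isotopic mass balance mentioned in the abstract above can be sketched as a weighted mean: each ingredient's δ value is weighted by the mass of the element it contributes. This is a minimal illustration of the general mixing-model idea, not the authors' actual procedure; the function name, the tuple layout, and the ingredient values in the usage note are hypothetical.

```python
def mixture_delta(ingredients):
    """Estimate the delta value (per mil) of a multi-ingredient food.

    `ingredients` is an iterable of (mass_g, element_fraction, delta_permil)
    tuples, where element_fraction is the mass fraction of carbon (or
    nitrogen) in that ingredient. All names here are illustrative assumptions.
    """
    ingredients = list(ingredients)
    # Mass of the element contributed by each ingredient.
    element_mass = [mass * frac for mass, frac, _ in ingredients]
    total = sum(element_mass)
    if total == 0:
        raise ValueError("ingredients contribute no mass of the element")
    # Element-mass-weighted mean of the ingredient delta values.
    weighted = sum(em * delta for em, (_, _, delta) in zip(element_mass, ingredients))
    return weighted / total
```

    For example, two made-up ingredients of equal carbon mass with δ¹³C values of -25‰ and -11‰ give a mixture value midway between them, which is the behaviour a simple two-source mixing model predicts.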

  9. Seasonal trends in abundance and composition of marine debris in selected public beaches in Peninsular Malaysia

    NASA Astrophysics Data System (ADS)

    Mobilik, Julyus-Melvin; Ling, Teck-Yee; Husain, Mohd-Lokman Bin; Hassan, Ruhana

    2015-09-01

    The abundance and composition of marine debris were investigated at Saujana (in the state of Negeri Sembilan) and Batu Rakit (in the state of Terengganu) beaches during surveys conducted in December 2012 (northeast monsoon), May 2013 (intermediate monsoon) and July 2013 (southwest monsoon). A total of 4,682 items of debris weighing 231.4 kg were collected and sorted. Batu Rakit received substantially greater quantities of debris (815±717 items/km or 40.4±13.0 kg/km) compared to Saujana (745±444 items/km or 36.7±18.0 kg/km). Debris was more abundant during the southwest monsoon (SWM) (1,122±737 items/km) than during the northeast monsoon (NEM) (825±593 items/km) and the intermediate monsoon (IM) (394±4 items/km). Plastic (88%) was the most numerous category collected, and object items, including packaging, plastic fragments, cups, plastic shopping bags, plastic food wrappers and clear plastic bottles, contributed 44.18% of the total debris items collected. Items associated with common sources (47%) accounted for the most debris, followed by terrestrial (30%) and marine (23%) sources. The high percentage of debris from common and terrestrial sources during the SWM season requires immediate action by marine environment stakeholders to develop and introduce strategies to reduce, if not totally eliminate, marine debris. Awareness efforts should continue and focus on beach users and vessel crews, alerting them to the alarming accumulation rate of marine debris and its pathways into the marine environment.

  10. A signal detection-item response theory model for evaluating neuropsychological measures.

    PubMed

    Thomas, Michael L; Brown, Gregory G; Gur, Ruben C; Moore, Tyler M; Patt, Virginie M; Risbrough, Victoria B; Baker, Dewleen G

    2018-02-05

    Models from signal detection theory are commonly used to score neuropsychological test data, especially tests of recognition memory. Here we show that certain item response theory models can be formulated as signal detection theory models, thus linking two complementary but distinct methodologies. We then use the approach to evaluate the validity (construct representation) of commonly used research measures, demonstrate the impact of conditional error on neuropsychological outcomes, and evaluate measurement bias. Signal detection-item response theory (SD-IRT) models were fitted to recognition memory data for words, faces, and objects. The sample consisted of U.S. Infantry Marines and Navy Corpsmen participating in the Marine Resiliency Study. Data comprised item responses to the Penn Face Memory Test (PFMT; N = 1,338), Penn Word Memory Test (PWMT; N = 1,331), and Visual Object Learning Test (VOLT; N = 1,249), and self-report of past head injury with loss of consciousness. SD-IRT models adequately fitted recognition memory item data across all modalities. Error varied systematically with ability estimates, and distributions of residuals from the regression of memory discrimination onto self-report of past head injury were positively skewed towards regions of larger measurement error. Analyses of differential item functioning revealed little evidence of systematic bias by level of education. SD-IRT models benefit from the measurement rigor of item response theory (which permits the modeling of item difficulty and examinee ability) and from signal detection theory (which provides an interpretive framework encompassing the experimentally validated constructs of memory discrimination and response bias). We used this approach to validate the construct representation of commonly used research measures and to demonstrate how nonoptimized item parameters can lead to erroneous conclusions when interpreting neuropsychological test data. Future work might include the development of computerized adaptive tests and integration with mixture and random-effects models.

  11. Reporting of suicide in the Australian media.

    PubMed

    Pirkis, Jane; Francis, Catherine; Blood, Richard Warwick; Burgess, Philip; Morley, Belinda; Stewart, Andrew; Putnis, Peter

    2002-04-01

    The media monitoring project aimed to establish a baseline picture of the extent, nature and quality of reporting of suicide by the Australian media, with a view to informing future strategies intended to optimize reporting of suicide. Newspaper, television and radio items on suicide were retrieved over 12 months. Identifying and descriptive information were extracted for each item. Approximately 10% of items were rated for quality, using a rating scale based on criteria from Achieving the Balance, a kit designed to promote awareness among media professionals of issues relating to suicide. The scale ranged from 0 (poor quality) to 100 (good quality). Reporting of suicide was extensive (with 4813 items retrieved). The nature of reporting was variable. Items tended to be about completed suicide (rather than attempted suicide or suicidal ideation), and most commonly involved content related to an individual's experiences, policy/programme initiatives and/or suicide statistics, although there were differences across media types. Items showed variability across dimensions of quality. The majority of suicide items did not have examples of inappropriate language, were not inappropriately located, did not use the word 'suicide' in the headline, and did not use explicit photographs/diagrams or footage. However, around half of the suicide items provided a detailed discussion of the method of self-harm and portrayed suicide as merely a social phenomenon. Where items concerned the suicide of a celebrity, reference was commonly made to that person's celebrity status. Most items failed to provide information on help services. The median total quality score was 57.1%. The reporting of suicide is extensive across all media types, and varies in nature and quality. In general, good items outnumber poorer items. However, there are still opportunities for improving media reporting of suicide.

  12. Using MathCAD to Teach One-Dimensional Graphs

    ERIC Educational Resources Information Center

    Yushau, B.

    2004-01-01

    Topics such as linear and nonlinear equations and inequalities, compound inequalities, linear and nonlinear absolute value equations and inequalities, rational equations and inequalities are commonly found in college algebra and precalculus textbooks. What is common about these topics is the fact that their solutions and graphs lie on the real line…

  13. Assessment of the Item Selection and Weighting in the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis

    PubMed Central

    MAHR, ALFRED D.; NEOGI, TUHINA; LAVALLEY, MICHAEL P.; DAVIS, JOHN C.; HOFFMAN, GARY S.; MCCUNE, W. JOSEPH; SPECKS, ULRICH; SPIERA, ROBERT F.; ST.CLAIR, E. WILLIAM; STONE, JOHN H.; MERKEL, PETER A.

    2013-01-01

    Objective: To assess the Birmingham Vasculitis Activity Score for Wegener's Granulomatosis (BVAS/WG) with respect to its selection and weighting of items. Methods: This study used the BVAS/WG data from the Wegener's Granulomatosis Etanercept Trial. The scoring frequencies of the 34 predefined items and any “other” items added by clinicians were calculated. Using linear regression with generalized estimating equations in which the physician global assessment (PGA) of disease activity was the dependent variable, we computed weights for all predefined items. We also created variables for clinical manifestations frequently added as other items, and computed weights for these as well. We searched for the model that included the items and their generated weights yielding an activity score with the highest R² to predict the PGA. Results: We analyzed 2,044 BVAS/WG assessments from 180 patients; 734 assessments were scored during active disease. The highest R² with the PGA was obtained by scoring WG activity based on the following items: the 25 predefined items rated on ≥5 visits, the 2 newly created fatigue and weight loss variables, the remaining minor other and major other items, and a variable that signified whether new or worse items were present at a specific visit. The weights assigned to the items ranged from 1 to 21. Compared with the original BVAS/WG, this modified score correlated significantly more strongly with the PGA. Conclusion: This study suggests possibilities to enhance the item selection and weighting of the BVAS/WG. These changes may increase this instrument's ability to capture the continuum of disease activity in WG. PMID:18512722

  14. Integrating competing dimensional models of personality: linking the SNAP, TCI, and NEO using Item Response Theory.

    PubMed

    Stepp, Stephanie D; Yu, Lan; Miller, Joshua D; Hallquist, Michael N; Trull, Timothy J; Pilkonis, Paul A

    2012-04-01

    Mounting evidence suggests that several inventories assessing both normal personality and personality disorders measure common dimensional personality traits (i.e., Antagonism, Constraint, Emotional Instability, Extraversion, and Unconventionality), albeit providing unique information along the underlying trait continuum. We used Widiger and Simonsen's (2005) pantheoretical integrative model of dimensional personality assessment as a guide to create item pools. We then used Item Response Theory (IRT) to compare the assessment of these five personality traits across three established dimensional measures of personality: the Schedule for Nonadaptive and Adaptive Personality (SNAP), the Temperament and Character Inventory (TCI), and the Revised NEO Personality Inventory (NEO PI-R). We found that items from each inventory map onto these five common personality traits in predictable ways. The IRT analyses, however, documented considerable variability in the item and test information derived from each inventory. Our findings support the notion that the integration of multiple perspectives will provide greater information about personality while minimizing the weaknesses of any single instrument.

  15. Integrating Competing Dimensional Models of Personality: Linking the SNAP, TCI, and NEO Using Item Response Theory

    PubMed Central

    Stepp, Stephanie D.; Yu, Lan; Miller, Joshua D.; Hallquist, Michael N.; Trull, Timothy J.; Pilkonis, Paul A.

    2013-01-01

    Mounting evidence suggests that several inventories assessing both normal personality and personality disorders measure common dimensional personality traits (i.e., Antagonism, Constraint, Emotional Instability, Extraversion, and Unconventionality), albeit providing unique information along the underlying trait continuum. We used Widiger and Simonsen’s (2005) pantheoretical integrative model of dimensional personality assessment as a guide to create item pools. We then used Item Response Theory (IRT) to compare the assessment of these five personality traits across three established dimensional measures of personality: the Schedule for Nonadaptive and Adaptive Personality (SNAP), the Temperament and Character Inventory (TCI), and the Revised NEO Personality Inventory (NEO PI-R). We found that items from each inventory map onto these five common personality traits in predictable ways. The IRT analyses, however, documented considerable variability in the item and test information derived from each inventory. Our findings support the notion that the integration of multiple perspectives will provide greater information about personality while minimizing the weaknesses of any single instrument. PMID:22452759

  16. Memory for conversation and the development of common ground.

    PubMed

    McKinley, Geoffrey L; Brown-Schmidt, Sarah; Benjamin, Aaron S

    2017-11-01

    Efficient conversation is guided by the mutual knowledge, or common ground, that interlocutors form as a conversation progresses. Characterized from the perspective of commonly used measures of memory, efficient conversation should be closely associated with item memory (what was said) and context memory (who said what to whom). However, few studies have explicitly probed memory to evaluate what type of information is maintained following a communicative exchange. The current study examined how item and context memory relate to the development of common ground over the course of a conversation, and how these forms of memory vary as a function of one's role in a conversation as speaker or listener. The process of developing common ground was positively related to both item and context memory. In addition, content that was spoken was remembered better than content that was heard. Our findings illustrate how memory assessments can complement language measures by revealing the impact that basic conversational processes have on memory for what has been discussed. By taking this approach, we show that not only does the process of forming common ground facilitate communication in the present, but it also promotes an enduring record of that event, facilitating conversation into the future.

  17. Rasch analysis for psychometric improvement of science attitude rating scales

    NASA Astrophysics Data System (ADS)

    Oon, Pey-Tee; Fan, Xitao

    2017-04-01

    Students' attitude towards science (SAS) is often a subject of investigation in science education research. Rating scale surveys are commonly used in the study of SAS. The present study illustrates how Rasch analysis can be used to provide psychometric information about SAS rating scales. The analyses were conducted on a 20-item SAS scale used in an existing dataset of the Trends in International Mathematics and Science Study (TIMSS) (2011). Data of all the eighth-grade participants from Hong Kong and Singapore (N = 9942) were retrieved for analyses. Additional insights from Rasch analysis that are not commonly available from conventional test and item analyses were discussed, such as invariance measurement of SAS, unidimensionality of the SAS construct, optimum utilization of SAS rating categories, and the item difficulty hierarchy in the SAS scale. Recommendations on how TIMSS items on the measurement of SAS can be better designed were discussed. The study also highlights the importance of using Rasch estimates for statistical parametric tests (e.g. ANOVA, t-test) that are common in science education research for group comparisons.

  18. An introduction to Item Response Theory and Rasch Analysis of the Eating Assessment Tool (EAT-10).

    PubMed

    Kean, Jacob; Brodke, Darrel S; Biber, Joshua; Gross, Paul

    2018-03-01

    Item response theory has its origins in educational measurement and is now commonly applied in health-related measurement of latent traits, such as function and symptoms. This application is due in large part to gains in the precision of measurement attributable to item response theory and corresponding decreases in response burden, study costs, and study duration. The purpose of this paper is twofold: introduce basic concepts of item response theory and demonstrate this analytic approach in a worked example, a Rasch model (1PL) analysis of the Eating Assessment Tool (EAT-10), a commonly used measure for oropharyngeal dysphagia. The results of the analysis were largely concordant with previous studies of the EAT-10 and illustrate for brain impairment clinicians and researchers how IRT analysis can yield greater precision of measurement.
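
    As a rough illustration of the Rasch (1PL) model named in the abstract above, the sketch below computes the probability of endorsing a dichotomous item and the corresponding item information. It assumes the simplest dichotomous form of the model; the function names and example values are hypothetical and are not taken from the EAT-10 analysis, which may have used a polytomous variant.

```python
import math

def rasch_probability(theta, b):
    """Probability of a positive response under the dichotomous Rasch model.

    theta: person location on the latent trait (logits)
    b:     item difficulty (logits)
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_information(theta, b):
    """Fisher information of a Rasch item at theta: p * (1 - p)."""
    p = rasch_probability(theta, b)
    return p * (1.0 - p)
```

    Information peaks where the person location equals the item difficulty, which is why items targeted near a respondent's trait level contribute most to measurement precision.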

  19. Exploring the Manifestations of Anxiety in Children with Autism Spectrum Disorders

    ERIC Educational Resources Information Center

    Hallett, Victoria; Lecavalier, Luc; Sukhodolsky, Denis G.; Cipriano, Noreen; Aman, Michael G.; McCracken, James T.; McDougle, Christopher J.; Tierney, Elaine; King, Bryan H.; Hollander, Eric; Sikich, Linmarie; Bregman, Joel; Anagnostou, Evdokia; Donnelly, Craig; Katsovich, Lily; Dukes, Kimberly; Vitiello, Benedetto; Gadow, Kenneth; Scahill, Lawrence

    2013-01-01

    This study explores the manifestation and measurement of anxiety symptoms in 415 children with ASDs on a 20-item, parent-rated, DSM-IV referenced anxiety scale. In both high and low-functioning children (IQ above vs. below 70), commonly endorsed items assessed restlessness, tension and sleep difficulties. Items requiring verbal expression of worry…

  20. The Construction of a Long Variable of Conceptual Development in Social Education.

    ERIC Educational Resources Information Center

    Doig, Brian

    This paper demonstrates a method for constructing long variables using items that elicit partially correct responses across ages. Long variables may be defined by students at different ages (year levels) attempting common items within a test containing other items considered to be appropriate for each age or year level. A developmental model of…

  1. Optimizing the Use of Response Times for Item Selection in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Choe, Edison M.; Kern, Justin L.; Chang, Hua-Hua

    2018-01-01

    Despite common operationalization, measurement efficiency of computerized adaptive testing should not only be assessed in terms of the number of items administered but also the time it takes to complete the test. To this end, a recent study introduced a novel item selection criterion that maximizes Fisher information per unit of expected response…

  2. PROGRAMED INSTRUCTION AS A STRATEGY FOR DEVELOPING CURRICULA FOR CHILDREN FROM DISADVANTAGED BACKGROUNDS.

    ERIC Educational Resources Information Center

    GOTKIN, LASSAR G.

    MATRIX GAMES IS A MODIFIED PROGRAMED-INSTRUCTION APPROACH TO TEACHING AND DEVELOPING LANGUAGE SKILLS. IN THIS STUDY, A BOARD DISPLAYING 16 PICTURES IN A 4 X 4 MATRIX WAS PLACED IN FRONT OF SEVERAL 4- OR 5-YEAR-OLDS. THE PICTURES COMPOSING A ROW CONTAINED A COMMON ITEM, FOR EXAMPLE, A BOY. THE PICTURES OF A COLUMN ALSO CONTAINED A COMMON ITEM, FOR…

  3. The Effects of Small Sample Size on Identifying Polytomous DIF Using the Liu-Agresti Estimator of the Cumulative Common Odds Ratio

    ERIC Educational Resources Information Center

    Carvajal, Jorge; Skorupski, William P.

    2010-01-01

    This study is an evaluation of the behavior of the Liu-Agresti estimator of the cumulative common odds ratio when identifying differential item functioning (DIF) with polytomously scored test items using small samples. The Liu-Agresti estimator has been proposed by Penfield and Algina as a promising approach for the study of polytomous DIF but no…

  4. Item validity vs. item discrimination index: a redundancy?

    NASA Astrophysics Data System (ADS)

    Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

    2018-03-01

    In the literature on evaluation and test analysis, it is common to find calculations of both item validity and the item discrimination index (D), each with a different formula. Meanwhile, other sources state that the item discrimination index can be obtained by calculating the correlation between examinees' scores on a particular item and their scores on the overall test, which is the same concept as item validity. Some research reports, especially undergraduate theses, tend to include both item validity and the item discrimination index in the instrument analysis. These concepts may overlap, since both reflect the quality of the test in measuring examinees' ability. In this paper, example results of data processing on item validity and the item discrimination index are compared. The paper discusses whether one of the two can represent both, or whether it is better to present both calculations for simple test analysis, especially in undergraduate theses that include test analyses.
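
    The two quantities contrasted in the abstract above can be sketched side by side. The code below assumes the common textbook definitions: D as the difference in proportion correct between upper and lower scoring groups (27% each, a conventional but assumed cut), and item validity as the Pearson correlation between a 0/1 item score and the total score. Function names and the sample data in the usage note are hypothetical, not drawn from the paper.

```python
import math

def discrimination_index(item, total, frac=0.27):
    """Upper-lower discrimination index D for one dichotomous item."""
    n = len(item)
    k = max(1, round(n * frac))  # size of each extreme group
    order = sorted(range(n), key=lambda i: total[i])
    lower, upper = order[:k], order[-k:]
    p_upper = sum(item[i] for i in upper) / k
    p_lower = sum(item[i] for i in lower) / k
    return p_upper - p_lower

def item_total_correlation(item, total):
    """Pearson correlation between item scores and total test scores."""
    n = len(item)
    mx, my = sum(item) / n, sum(total) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(item, total))
    sxx = sum((x - mx) ** 2 for x in item)
    syy = sum((y - my) ** 2 for y in total)
    return sxy / math.sqrt(sxx * syy)
```

    With ten made-up examinees whose totals run from 1 to 10 and who answer the item correctly only in the top half, both statistics are strongly positive, which illustrates the kind of overlap the paper questions.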

  5. Effects of Age on Negative Subsequent Memory Effects Associated with the Encoding of Item and Item–Context Information

    PubMed Central

    Mattson, Julia T.; Wang, Tracy H.; de Chastelaine, Marianne; Rugg, Michael D.

    2014-01-01

    It has consistently been reported that “negative” subsequent memory effects—lower study activity for later remembered than later forgotten items—are attenuated in older individuals. The present functional magnetic resonance imaging study investigated whether these findings extend to subsequent memory effects associated with successful encoding of item–context information. Older (n = 25) and young (n = 17) subjects were scanned while making 1 of 2 encoding judgments on a series of pictures. Memory was assessed for the study item and, for items judged old, the item's encoding task. Both memory judgments were made using confidence ratings, permitting item and source memory strength to be unconfounded and source confidence to be equated across age groups. Replicating prior findings, negative item effects in regions of the default mode network in young subjects were reversed in older subjects. Negative source effects, however, were invariant with respect to age and, in both age groups, the magnitude of the effects correlated with source memory performance. It is concluded that negative item effects do not reflect processes necessary for the successful encoding of item–context associations in older subjects. Negative source effects, in contrast, appear to reflect the engagement of processes that are equally important for successful episodic encoding in older and younger individuals. PMID:23904464

  6. Measuring the Diagnostic Features of Social (Pragmatic) Communication Disorder: An Exploratory Study.

    PubMed

    Yuan, Haiying; Dollaghan, Christine

    2018-03-27

    The Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition introduced a new neurodevelopmental disorder, social (pragmatic) communication disorder (SPCD), that is characterized by deficits in 4 areas of communication. Although descriptions of these areas are provided, no assessment tools for SPCD are recommended. The purpose of this study was to examine the extent to which items from measurement tools commonly used in assessing pragmatic language impairment and related disorders might be useful in assessing the characteristics of social communication that define SPCD in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition. Based on a literature search, 594 items from assessment tools commonly used to measure social communication abilities in people with pragmatic language impairment were identified. The first author judged whether each item reflected 1, more than 1, or none of the 4 SPCD diagnostic characteristics. After a brief training process, 5 second raters independently mapped subsets of items to the 6 categories. We calculated the percentage of agreement and Cohen's kappa for each pair of raters in assigning items to categories. Percentages of agreement ranged from 76% to 82%, and Cohen's kappa values ranged from .69 to .76, indicating substantial agreement. Sources and item numbers for the 206 items that both raters assigned to the same SPCD feature are provided. These items may provide guidance in assessing SPCD and in designing standardized screening and diagnostic measures for SPCD.
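
    The agreement statistics reported above (percentage of agreement and Cohen's kappa for a pair of raters) can be computed as in the sketch below. This is an illustrative example only: the category labels and ratings are made up, and the function name is hypothetical; it uses the standard unweighted kappa, (p_o - p_e) / (1 - p_e).

```python
from collections import Counter


def agreement_and_kappa(rater1, rater2):
    """Return (observed agreement, Cohen's kappa) for two raters' category labels."""
    n = len(rater1)
    # Observed agreement: share of items given the same category by both raters.
    p_o = sum(a == b for a, b in zip(rater1, rater2)) / n
    # Chance agreement: product of the raters' marginal proportions, summed over categories.
    c1, c2 = Counter(rater1), Counter(rater2)
    p_e = sum(c1[c] * c2[c] for c in set(c1) | set(c2)) / (n * n)
    return p_o, (p_o - p_e) / (1 - p_e)


# Hypothetical ratings of ten items into three categories.
first = ["A", "A", "B", "B", "C", "C", "A", "B", "C", "A"]
second = ["A", "A", "B", "C", "C", "C", "A", "B", "B", "A"]

p_obs, kappa = agreement_and_kappa(first, second)
print(f"agreement = {p_obs:.0%}, kappa = {kappa:.2f}")
```

    Kappa is lower than raw agreement because it discounts the agreement expected by chance from the raters' marginal category frequencies.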

  7. Substance use avoidance among Iranian male adolescents: a comparison of three versions of the theory of reasoned action.

    PubMed

    Tavousi, Mahmoud; Montazeri, Ali; Hidarnia, Alireza; Hajizadeh, Ebrahim; Taremian, Farhad; Haerimehrizi, Aliasghar

    2015-08-01

    The theory of reasoned action (TRA) is one of the most common models in predicting health-related behaviors and is used more often in health education studies. This study aimed to add two control constructs (perceived behavioral control - PBC and self-efficacy - SE) to the TRA and compare them using the structural equation modeling (SEM) for substance use avoidance among Iranian male adolescents in order to find out which model was a better fit in predicting the intention. This was a cross-sectional study carried out in Tehran, Iran. Data were collected from a random sample of high school male students (15-19 years of age) using a questionnaire containing items related to the TRA plus items reflecting two additional constructs (SE and PBC). In all, 433 students completed the questionnaires. The results obtained from SEM indicated a better fit to the data for the TRA with SE compared to the TPB (TRA with PBC) and TRA (χ2/df=2.55, RMSEA=0.072, CFI=0.96, NFI=0.94, NNFI=0.95, SRMR=0.058). Comparing SE and PBC, the results showed that self-efficacy was a better control construct in improving the TRA and predicting substance use avoidance intention (41%). The TRA with SE had a better model fit than TPB and the original version of the TRA.

  8. Is adaptation of the word accentuation test of premorbid intelligence necessary for use among older, Spanish-speaking immigrants in the United States?

    PubMed

    Schrauf, Robert W; Weintraub, Sandra; Navarro, Ellen

    2006-05-01

    Adapting the National Adult Reading Test (NART) for assessing premorbid intelligence in languages other than English requires (a) generating word-items that are rare and do not follow grapheme-to-phoneme mappings common in that language, and (b) subsequent validation against a cognitive battery normed on the population of interest. Such tests exist for Italy, France, Spain, and Argentina, all normed against national versions of the Wechsler Adult Intelligence Scale. Given the varieties of Spanish spoken in the United States, the adaptation of the Spanish Word Accentuation Test (WAT) requires re-validating the original word list, plus possible new items, against a cognitive battery that has been normed on Spanish-speakers from many countries. This study reports the generation of 55 additional words and revalidation in a sample of 80 older, Spanish-dominant immigrants. The Batería Woodcock-Muñoz Revisada (BWM-R), normed on Spanish speakers from six countries and five U.S. states, was used to establish criterion validity. The original WAT word list accounted for 77% of the variance in the BWM-R and 58% of the variance in Raven's Colored Progressive Matrices, suggesting that the unmodified list possesses adequate predictive validity as an indicator of intelligence. Regression equations are provided for estimating BWM-R and Raven's scores from WAT scores.

  9. HIV-related stigma and health-related quality of life among children living with HIV in Sweden.

    PubMed

    Rydström, Lise-Lott; Wiklander, Maria; Navér, Lars; Ygge, Britt-Marie; Eriksson, Lars E

    2016-01-01

    The relationship between HIV-related stigma and health-related quality of life (HRQoL) among children living with HIV infection is unknown. The objectives of this study were to describe HIV-related stigma and HRQoL among children with perinatal HIV living in Sweden, and to investigate the relationship between these two factors in the same infection group. In a cross-sectional nationwide survey, HIV-related stigma was measured with the 8-item HIV Stigma Scale for Children. HRQoL was measured with the 37-item DISABKIDS Chronic Generic Module. Structural equation modeling was used to explore the relationship between HIV-related stigma and HRQoL. Fifty-eight children participated, age 9-18 years (mean = 13.9). The HIV stigma general scale showed a mean score of 17.6 (SD = 5.0; possible range 8-32). The DISABKIDS Chronic Generic Module general scale showed a mean score of 80.7 (SD = 14.1; possible range 0-100). HIV-related stigma was negatively associated with HRQoL (standardized β = -0.790, p = .017). The results indicate that children's concerns related to disclosure of their HIV infection seem to be common (i.e. 75% agreed) which, together with the negative association between ratings of HIV-related stigma and HRQoL, might indicate that disclosure concerns would be a relevant target for interventions to decrease HIV-related stigma and increase HRQoL.

  10. Quantifying traditional Chinese medicine patterns using modern test theory: an example of functional constipation.

    PubMed

    Shen, Minxue; Cui, Yuanwu; Hu, Ming; Xu, Linyong

    2017-01-13

    The study aimed to validate a scale to assess the severity of "Yin deficiency, intestine heat" pattern of functional constipation based on the modern test theory. Pooled longitudinal data of 237 patients with "Yin deficiency, intestine heat" pattern of constipation from a prospective cohort study were used to validate the scale. Exploratory factor analysis was used to examine the common factors of items. A multidimensional item response model was used to assess the scale with the presence of multidimensionality. The Cronbach's alpha ranged from 0.79 to 0.89, and the split-half reliability ranged from 0.67 to 0.79 at different measurements. Exploratory factor analysis identified two common factors, and all items had cross factor loadings. The bidimensional model had better goodness of fit than the unidimensional model. The multidimensional item response model showed that all items had moderate to high discrimination parameters. Parameters indicated that the first latent trait signified intestine heat, while the second trait characterized Yin deficiency. The information function showed that items demonstrated highest discrimination power among patients with moderate to high level of disease severity. Multidimensional item response theory provides a useful and rational approach in validating scales for assessing the severity of patterns in traditional Chinese medicine.
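
    For readers unfamiliar with the reliability coefficient cited above, Cronbach's alpha can be computed from a respondents-by-items score matrix as sketched here. This is a generic illustration with made-up scores and a hypothetical function name, not the study's data or software; it uses the usual formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores).

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for `scores`: one row per respondent, one column per item."""
    k = len(scores[0])
    # Variance of each item across respondents.
    item_vars = [pvariance([row[j] for row in scores]) for j in range(k)]
    # Variance of respondents' total scores.
    total_var = pvariance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


# Hypothetical responses: five respondents, three items.
responses = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]

alpha = cronbach_alpha(responses)
print(f"Cronbach's alpha = {alpha:.2f}")
```

    Population variances are used throughout; because alpha depends only on a ratio of variances, using sample variances for both terms would give the same value.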

  11. Depictions of mental illness in print media: a prospective national sample.

    PubMed

    Coverdale, John; Nairn, Raymond; Claasen, Donna

    2002-10-01

    Because there are no published reports of depictions of mental illness in print media based on national samples, we set out to prospectively collect and analyse a near complete New Zealand sample of print media. A commercial clipping bureau was contracted to provide cuttings of all items with any mental health or illness aspect over a four week period. These items were analysed for potentially positive and negative depictions and how mental illness was represented within each item. An independent search for additional newspaper items concerning one prominently featured topic indicated that the rate of identification of relevant stories was at least 91%. The collection consisted of six hundred print items which were most commonly news or editorial pieces (n = 562, 93.7%). Negative depictions predominated, with dangerousness to others (n = 368, 61.3%) and criminality (n = 284, 47.3%) being the most common. Positive depictions, including human rights themes, leadership and educational accomplishments occurred in 27% (n = 164) of all items. Generic mental illness terminology without reference to specific diagnostic categories was present in 47% of all items (n = 284). Negative depictions that predominate confirm the stereotypic understanding of mental illness that is stigmatizing. These findings underscore the challenge facing us as mental health professionals attempting to change attitudes towards mental disorders when the stereotypes are so regularly reinforced.

  12. Estimating and Interpreting Latent Variable Interactions: A Tutorial for Applying the Latent Moderated Structural Equations Method

    ERIC Educational Resources Information Center

    Maslowsky, Julie; Jager, Justin; Hemken, Douglas

    2015-01-01

    Latent variables are common in psychological research. Research questions involving the interaction of two variables are likewise quite common. Methods for estimating and interpreting interactions between latent variables within a structural equation modeling framework have recently become available. The latent moderated structural equations (LMS)…

  13. Measurement Invariance and the Five-Factor Model of Personality: Asian International and Euro American Cultural Groups.

    PubMed

    Rollock, David; Lui, P Priscilla

    2016-10-01

    This study examined measurement invariance of the NEO Five-Factor Inventory (NEO-FFI), assessing the five-factor model (FFM) of personality among Euro American (N = 290) and Asian international (N = 301) students (47.8% women, Mage = 19.69 years). The full 60-item NEO-FFI data fit the expected five-factor structure for both groups using exploratory structural equation modeling, and achieved configural invariance. Only 37 items significantly loaded onto the FFM-theorized factors for both groups and demonstrated metric invariance. Threshold invariance was not supported with this reduced item set. Groups differed the most in the item-factor relationships for Extraversion and Agreeableness, as well as in response styles. Asian internationals were more likely to use midpoint responses than Euro Americans. While the FFM can characterize broad nomothetic patterns of personality traits, metric invariance with only the subset of NEO-FFI items identified limits direct group comparisons of correlation coefficients among personality domains and with other constructs, and of mean differences on personality domains. © The Author(s) 2015.

  14. Promising Areas for Psychometric Research.

    ERIC Educational Resources Information Center

    Angoff, William H.

    1988-01-01

    An overview of four papers on useful future directions for psychometric research is provided. The papers were drawn from American Psychological Association symposia; they cover the nature of general intelligence, item bias and selection, cut scores, equating problems, computer-adaptive testing, and individual and group achievement measurement.…

  15. 34 CFR 462.11 - What must an application contain?

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... the methodology and procedures used to measure the reliability of the test. (h) Construct validity... previous test, and results from validity, reliability, and equating or standard-setting studies undertaken... NRS educational functioning levels (content validity). Documentation of the extent to which the items...

  16. New exact solutions for a discrete electrical lattice using the analytical methods

    NASA Astrophysics Data System (ADS)

    Manafian, Jalil; Lakestani, Mehrdad

    2018-03-01

    This paper retrieves soliton solutions to an equation in nonlinear electrical transmission lines using the semi-inverse variational principle method (SIVPM), the exp(-Ω(ξ))-expansion method (EEM) and the improved tan(φ/2)-expansion method (ITEM), with the aid of the symbolic computation package Maple. As a result, the SIVPM, EEM and ITEM methods are successfully employed and some new exact solitary wave solutions are acquired in terms of kink-singular soliton solution, hyperbolic solution, trigonometric solution, dark and bright soliton solutions. All solutions have been verified back into their corresponding equations with the aid of the Maple package program. We depicted the physical explanation of the extracted solutions with the choice of different parameters by plotting some 2D and 3D illustrations. Finally, we show that the used methods are robust and more efficient than other methods. More importantly, the solutions found in this work can have significant applications in telecommunication systems where solitons are used to codify data.

  17. Linking Measures of Adult Nicotine Dependence to a Common Latent Continuum and a Comparison with Adolescent Patterns

    PubMed Central

    Strong, David R.; Schonbrun, Yael Chatav; Schaffran, Christine; Griesler, Pamela C.; Kandel, Denise

    2012-01-01

    Background: An ongoing debate regarding the nature of Nicotine Dependence (ND) is whether the same instrument can be applied to measure ND among adults and adolescents. Using a hierarchical item response model (IRM), we examined evidence for a common continuum underlying ND symptoms among adults and adolescents. Method: The analyses are based on two waves of interviews with subsamples of parents and adolescents from a multi-ethnic longitudinal cohort of 1,039 6th–10th graders from the Chicago Public Schools (CPS). Adults and adolescents who reported smoking cigarettes in the last 30 days prior to waves 3 and 5 completed three common instruments measuring ND symptoms and one item measuring loss of autonomy. Results: A stable continuum of ND, first identified among adolescents, was replicated among adults. However, some symptoms, such as tolerance and withdrawal, differed markedly across adults and adolescents. The majority of mFTQ items were observed within the highest levels of ND, the NDSS items within the lowest levels, and the DSM-IV items were arrayed in the middle and upper third of the continuum of dependence severity. Loss of Autonomy was positioned at the lower end of the continuum. We propose a ten-symptom measure of ND for adolescents and adults. Conclusions: Despite marked differences in the relative severity of specific ND symptoms in each group, common instrumentation of ND can apply to adults and adolescents. The results increase confidence in the ability to describe phenotypic heterogeneity in ND across important developmental periods. PMID:21855236

  18. Comparison of Factor Simplicity Indices for Dichotomous Data: DETECT R, Bentler's Simplicity Index, and the Loading Simplicity Index

    ERIC Educational Resources Information Center

    Finch, Holmes; Stage, Alan Kirk; Monahan, Patrick

    2008-01-01

    A primary assumption underlying several of the common methods for modeling item response data is unidimensionality, that is, test items tap into only one latent trait. This assumption can be assessed several ways, using nonlinear factor analysis and DETECT, a method based on the item conditional covariances. When multidimensionality is identified,…

  19. A Comparison of Different Psychometric Approaches to Modeling Testlet Structures: An Example with C-Tests

    ERIC Educational Resources Information Center

    Schroeders, Ulrich; Robitzsch, Alexander; Schipolowski, Stefan

    2014-01-01

    C-tests are a specific variant of cloze tests that are considered time-efficient, valid indicators of general language proficiency. They are commonly analyzed with models of item response theory assuming local item independence. In this article we estimated local interdependencies for 12 C-tests and compared the changes in item difficulties,…

  20. Faster on Easy Items, More Accurate on Difficult Ones: Cognitive Ability and Performance on a Task of Varying Difficulty

    ERIC Educational Resources Information Center

    Dodonova, Yulia A.; Dodonov, Yury S.

    2013-01-01

    Using more complex items than those commonly employed within the information-processing approach, but still easier than those used in intelligence tests, this study analyzed how the association between processing speed and accuracy level changes as the difficulty of the items increases. The study involved measuring cognitive ability using Raven's…

  1. Evaluating Statistical Targets for Assembling Parallel Mixed-Format Test Forms

    ERIC Educational Resources Information Center

    Debeer, Dries; Ali, Usama S.; van Rijn, Peter W.

    2017-01-01

    Test assembly is the process of selecting items from an item pool to form one or more new test forms. Often new test forms are constructed to be parallel with an existing (or an ideal) test. Within the context of item response theory, the test information function (TIF) or the test characteristic curve (TCC) are commonly used as statistical…

  2. Factor- and Item-Level Analyses of the 38-Item Activities Scale for Kids-Performance

    ERIC Educational Resources Information Center

    Bagley, Anita M.; Gorton, George E.; Bjornson, Kristie; Bevans, Katherine; Stout, Jean L.; Narayanan, Unni; Tucker, Carole A.

    2011-01-01

    Aim: Children and adolescents highly value their ability to participate in relevant daily life and recreational activities. The Activities Scale for Kids-performance (ASKp) instrument measures the frequency of performance of 30 common childhood activities, and has been shown to be valid and reliable. A revised and expanded 38-item ASKp (ASKp38)…

  3. Morphology and dynamics of galaxies; Proceedings of the Twelfth Advanced Course, Saas-Fee, Switzerland, March 29-April 3, 1982

    NASA Astrophysics Data System (ADS)

    Martinet, L.; Mayor, M.

    The basic problems and analysis techniques in examining the morphology, dynamics, and interactions between star systems, galaxies, and galactic clusters are detailed. Attention is devoted to the dynamics of hot stellar systems, with note taken of the derivation and application of the Vlasov equation, Jeans' theorem, and the virial equations. Observations of galactic structure and dynamics are reviewed, and consideration is directed toward environmental influences on galactic structure. For individual items see A84-15503 to A84-15505

  4. Therapist Competence in Global Mental Health: Development of the Enhancing Assessment of Common Therapeutic Factors (ENACT) Rating Scale

    PubMed Central

    Kohrt, Brandon A.; Jordans, Mark J.D.; Rai, Sauharda; Shrestha, Pragya; Luitel, Nagendra P.; Ramaiya, Megan; Singla, Daisy; Patel, Vikram

    2015-01-01

    Lack of reliable and valid measures of therapist competence is a barrier to dissemination and implementation of psychological treatments in global mental health. We developed the ENhancing Assessment of Common Therapeutic factors (ENACT) rating scale for training and supervision across settings varied by culture and access to mental health resources. We employed a four-step process in Nepal: (1) Item generation: We extracted 1,081 items (grouped into 104 domains) from 56 existing tools; role-plays with Nepali therapists generated 11 additional domains. (2) Item relevance: From the 115 domains, Nepali therapists selected 49 domains of therapeutic importance and high comprehensibility. (3) Item utility: We piloted the ENACT scale through rating role-play videotapes, patient session transcripts, and live observations of primary care workers in trainings for psychological treatments and the Mental Health Gap Action Programme (mhGAP). (4) Inter-rater reliability was acceptable for experts (intraclass correlation coefficient, ICC(2,7)=0.88 (95% confidence interval (CI) 0.81–0.93), N=7) and non-specialists (ICC(1,3)=0.67 (95% CI 0.60–0.73), N=34). In sum, the ENACT scale is an 18-item assessment for common factors in psychological treatments, including task-sharing initiatives with non-specialists across cultural settings. Further research is needed to evaluate applications for therapy quality and association with patient outcomes. PMID:25847276

  5. Separating Cognitive and Content Domains in Mathematical Competence

    ERIC Educational Resources Information Center

    Harks, Birgit; Klieme, Eckhard; Hartig, Johannes; Leiss, Dominik

    2014-01-01

    The present study investigates the empirical separability of mathematical (a) content domains, (b) cognitive domains, and (c) content-specific cognitive domains. There were 122 items representing two content domains (linear equations vs. theorem of Pythagoras) combined with two cognitive domains (modeling competence vs. technical competence)…

  6. Harmonizing routinely collected health information for strengthening quality management in health systems: requirements and practice.

    PubMed

    Prodinger, Birgit; Tennant, Alan; Stucki, Gerold; Cieza, Alarcos; Üstün, Tevfik Bedirhan

    2016-10-01

    Our aim was to specify the requirements of an architecture to serve as the foundation for standardized reporting of health information and to provide an exemplary application of this architecture. The World Health Organization's International Classification of Functioning, Disability and Health (ICF) served as the conceptual framework. Methods to establish content comparability were the ICF Linking Rules. The Rasch measurement model, as a special case of additive conjoint measurement, which satisfies the required criteria for fundamental measurement, allowed for the development of a common metric foundation for measurement unit conversion. Secondary analysis of data from the North Yorkshire Survey was used to illustrate these methods. Patients completed three instruments and the items were linked to the ICF. The Rasch measurement model was applied, first to each scale, and then to items across scales which were linked to a common domain. Based on the linking of items to the ICF, the majority of items were grouped into two domains, Mobility and Self-care. Analysis of the individual scales and of items linked to a common domain across scales satisfied the requirements of the Rasch measurement model. The measurement unit conversion between items from the three instruments linked to the Mobility and Self-care domains, respectively, was demonstrated. The realization of an ICF-based architecture for information on patients' functioning enables harmonization of health information while allowing clinicians and researchers to continue using their existing instruments. This architecture will facilitate access to comprehensive and consistently reported health information to serve as the foundation for informed decision-making. © The Author(s) 2016.

  7. Comparison promotes learning and transfer of relational categories.

    PubMed

    Kurtz, Kenneth J; Boukrina, Olga; Gentner, Dedre

    2013-07-01

    We investigated the effect of co-presenting training items during supervised classification learning of novel relational categories. Strong evidence exists that comparison induces a structural alignment process that renders common relational structure more salient. We hypothesized that comparisons between exemplars would facilitate learning and transfer of categories that cohere around a common relational property. The effect of comparison was investigated using learning trials that elicited a separate classification response for each item in presentation pairs that could be drawn from the same or different categories. This methodology ensures consideration of both items and invites comparison through an implicit same-different judgment inherent in making the two responses. In a test phase measuring learning and transfer, the comparison group significantly outperformed a control group receiving an equivalent training session of single-item classification learning. Comparison-based learners also outperformed the control group on a test of far transfer, that is, the ability to accurately classify items from a novel domain that was relationally alike, but surface-dissimilar, to the training materials. Theoretical and applied implications of this comparison advantage are discussed. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  8. Evaluating the Validity of Technology-Enhanced Educational Assessment Items and Tasks: An Empirical Approach to Studying Item Features and Scoring Rubrics

    ERIC Educational Resources Information Center

    Thomas, Ally

    2016-01-01

    With the advent of the newly developed Common Core State Standards and the Next Generation Science Standards, innovative assessments, including technology-enhanced items and tasks, will be needed to meet the challenges of developing valid and reliable assessments in a world of computer-based testing. In a recent critique of the next generation…

  9. Automatic Short Essay Scoring Using Natural Language Processing to Extract Semantic Information in the Form of Propositions. CRESST Report 831

    ERIC Educational Resources Information Center

    Kerr, Deirdre; Mousavi, Hamid; Iseli, Markus R.

    2013-01-01

    The Common Core assessments emphasize short essay constructed-response items over multiple-choice items because they are more precise measures of understanding. However, such items are too costly and time consuming to be used in national assessments unless a way to score them automatically can be found. Current automatic essay-scoring techniques…

  10. A Combined IRT and SEM Approach for Individual-Level Assessment in Test-Retest Studies

    ERIC Educational Resources Information Center

    Ferrando, Pere J.

    2015-01-01

    The standard two-wave multiple-indicator model (2WMIM) commonly used to analyze test-retest data provides information at both the group and item level. Furthermore, when applied to binary and graded item responses, it is related to well-known item response theory (IRT) models. In this article the IRT-2WMIM relations are used to obtain additional…

  11. A Method for Generating Educational Test Items That Are Aligned to the Common Core State Standards

    ERIC Educational Resources Information Center

    Gierl, Mark J.; Lai, Hollis; Hogan, James B.; Matovinovic, Donna

    2015-01-01

    The demand for test items far outstrips the current supply. This increased demand can be attributed, in part, to the transition to computerized testing, but, it is also linked to dramatic changes in how 21st century educational assessments are designed and administered. One way to address this growing demand is with automatic item generation.…

  12. The Relationship between Symptom Relief and Psychosocial Functional Improvement during Acute Electroconvulsive Therapy for Patients with Major Depressive Disorder.

    PubMed

    Lin, Ching-Hua; Yang, Wei-Cheng

    2017-07-01

    We aimed to compare the degree of symptom relief to psychosocial functional (abbreviated as "functional") improvement and explore the relationships between symptom relief and functional improvement during acute electroconvulsive therapy for patients with major depressive disorder. Major depressive disorder inpatients (n=130) requiring electroconvulsive therapy were recruited. Electroconvulsive therapy was generally performed for a maximum of 12 treatments. Symptom severity, using the 17-item Hamilton Depression Rating Scale, and psychosocial functioning (abbreviated as "functioning"), using the Modified Work and Social Adjustment Scale, were assessed before electroconvulsive therapy, after every 3 electroconvulsive therapy treatments, and after the final electroconvulsive therapy. Both 17-item Hamilton Depression Rating Scale and Modified Work and Social Adjustment Scale scores were converted to T-score units to compare the degrees of changes between depressive symptoms and functioning after electroconvulsive therapy. Structural equation modeling was used to test the relationships between 17-item Hamilton Depression Rating Scale and Modified Work and Social Adjustment Scale during acute electroconvulsive therapy. One hundred sixteen patients who completed at least the first 3 electroconvulsive therapy treatments entered the analysis. Reduction of 17-item Hamilton Depression Rating Scale T-scores was significantly greater than that of Modified Work and Social Adjustment Scale T-scores at assessments 2, 3, 4, and 5. The model analyzed by structural equation modeling satisfied all indices of goodness-of-fit (chi-square = 32.882, P =.107, TLI = 0.92, CFI = 0.984, RMSEA = 0.057). The 17-item Hamilton Depression Rating Scale change did not predict subsequent Modified Work and Social Adjustment Scale change. Functioning improved less than depressive symptoms during acute electroconvulsive therapy. Symptom reduction did not predict subsequent functional improvement. 
    Depressive symptoms and functional impairment are distinct domains and should be assessed independently to accurately reflect the effectiveness of electroconvulsive therapy. © The Author 2017. Published by Oxford University Press on behalf of CINP.

  13. The Importance of Isomorphism for Conclusions about Homology: A Bayesian Multilevel Structural Equation Modeling Approach with Ordinal Indicators.

    PubMed

    Guenole, Nigel

    2016-01-01

    We describe a Monte Carlo study examining the impact of assuming item isomorphism (i.e., equivalent construct meaning across levels of analysis) on conclusions about homology (i.e., equivalent structural relations across levels of analysis) under varying degrees of non-isomorphism in the context of ordinal indicator multilevel structural equation models (MSEMs). We focus on the condition where one or more loadings are higher on the between level than on the within level to show that while much past research on homology has ignored the issue of psychometric isomorphism, psychometric isomorphism is in fact critical to valid conclusions about homology. More specifically, when a measurement model with non-isomorphic items occupies an exogenous position in a multilevel structural model and the non-isomorphism of these items is not modeled, the within level exogenous latent variance is under-estimated leading to over-estimation of the within level structural coefficient, while the between level exogenous latent variance is overestimated leading to underestimation of the between structural coefficient. When a measurement model with non-isomorphic items occupies an endogenous position in a multilevel structural model and the non-isomorphism of these items is not modeled, the endogenous within level latent variance is under-estimated leading to under-estimation of the within level structural coefficient while the endogenous between level latent variance is over-estimated leading to over-estimation of the between level structural coefficient. The innovative aspect of this article is demonstrating that even minor violations of psychometric isomorphism render claims of homology untenable. We also show that posterior predictive p-values for ordinal indicator Bayesian MSEMs are insensitive to violations of isomorphism even when they lead to severely biased within and between level structural parameters. 
    We highlight conditions where poor estimation of even correctly specified models rules out empirical examination of isomorphism and homology without taking precautions, for instance, larger Level-2 sample sizes, or using informative priors.

  14. The Importance of Isomorphism for Conclusions about Homology: A Bayesian Multilevel Structural Equation Modeling Approach with Ordinal Indicators

    PubMed Central

    Guenole, Nigel

    2016-01-01

    We describe a Monte Carlo study examining the impact of assuming item isomorphism (i.e., equivalent construct meaning across levels of analysis) on conclusions about homology (i.e., equivalent structural relations across levels of analysis) under varying degrees of non-isomorphism in the context of ordinal indicator multilevel structural equation models (MSEMs). We focus on the condition where one or more loadings are higher on the between level than on the within level to show that while much past research on homology has ignored the issue of psychometric isomorphism, psychometric isomorphism is in fact critical to valid conclusions about homology. More specifically, when a measurement model with non-isomorphic items occupies an exogenous position in a multilevel structural model and the non-isomorphism of these items is not modeled, the within level exogenous latent variance is under-estimated leading to over-estimation of the within level structural coefficient, while the between level exogenous latent variance is overestimated leading to underestimation of the between structural coefficient. When a measurement model with non-isomorphic items occupies an endogenous position in a multilevel structural model and the non-isomorphism of these items is not modeled, the endogenous within level latent variance is under-estimated leading to under-estimation of the within level structural coefficient while the endogenous between level latent variance is over-estimated leading to over-estimation of the between level structural coefficient. The innovative aspect of this article is demonstrating that even minor violations of psychometric isomorphism render claims of homology untenable. We also show that posterior predictive p-values for ordinal indicator Bayesian MSEMs are insensitive to violations of isomorphism even when they lead to severely biased within and between level structural parameters. 
    We highlight conditions where poor estimation of even correctly specified models rules out empirical examination of isomorphism and homology without taking precautions, for instance, larger Level-2 sample sizes, or using informative priors. PMID:26973580

  15. Availability of Vending Machines and School Stores in California Schools.

    PubMed

    Cisse-Egbuonye, Nafissatou; Liles, Sandy; Schmitz, Katharine E; Kassem, Nada; Irvin, Veronica L; Hovell, Melbourne F

    2016-01-01

    This study examined the availability of foods sold in vending machines and school stores in United States public and private schools, and associations of availability with students' food purchases and consumption. Descriptive analyses, chi-square tests, and Spearman product-moment correlations were conducted on data collected from 521 students aged 8 to 15 years recruited from orthodontic offices in California. Vending machines were more common in private schools than in public schools, whereas school stores were common in both private and public schools. The food items most commonly available in both vending machines and school stores in all schools were predominately foods of minimal nutritional value (FMNV). Participant report of availability of food items in vending machines and/or school stores was significantly correlated with (1) participant purchase of each item from those sources, except for energy drinks, milk, fruits, and vegetables; and (2) participants' friends' consumption of items at lunch, for 2 categories of FMNV (candy, cookies, or cake; soda or sports drinks). Despite the Child Nutrition and Women, Infants, and Children (WIC) Reauthorization Act of 2004, FMNV were still available in schools, and may be contributing to unhealthy dietary choices and ultimately to health risks. © 2015, American School Health Association.

  16. Availability of vending machines and school stores in California schools

    PubMed Central

    Liles, Sandy; Schmitz, Katharine E.; Kassem, Nada O.F; Irvin, Veronica L; Hovell, Melbourne F.

    2015-01-01

    Background: This study examined the availability of foods sold in vending machines and school stores in US public and private schools, and associations of availability with students' food purchases and consumption. Methods: Descriptive analyses, chi-square tests, and Spearman product-moment correlations were conducted on data collected from 521 students aged 8 to 15 years recruited from orthodontic offices in California. Results: Vending machines were more common in private schools than in public schools, while school stores were common in both private and public schools. The food items most commonly available in both vending machines and school stores in all schools were predominately foods of minimal nutritional value (FMNV). Participant report of availability of food items in vending machines and/or school stores was significantly correlated with: (1) participant purchase of each item from those sources, except for energy drinks, milk, fruits, and vegetables; and (2) participants' friends' consumption of items at lunch, for two categories of FMNV (candy, cookies, or cake; soda or sports drinks). Conclusions: Despite the Child Nutrition and WIC Reauthorization Act of 2004, FMNV were still available in schools, and may be contributing to unhealthy dietary choices and ultimately to health risks. PMID:26645420

  17. MMPI-2 Item Endorsements in Dissociative Identity Disorder vs. Simulators.

    PubMed

    Brand, Bethany L; Chasson, Gregory S; Palermo, Cori A; Donato, Frank M; Rhodes, Kyle P; Voorhees, Emily F

    2016-03-01

    Elevated scores on some MMPI-2 (Minnesota Multiphasic Personality Inventory-2) validity scales are common among patients with dissociative identity disorder (DID), which raises questions about the validity of their responses. Such patients show elevated scores on atypical answers (F), F-psychopathology (Fp), atypical answers in the second half of the test (FB), schizophrenia (Sc), and depression (D) scales, with Fp showing the greatest utility in distinguishing them from coached and uncoached DID simulators. In the current study, we investigated the items on the MMPI-2 F, Fp, FB, Sc, and D scales that were most and least commonly endorsed by participants with DID in our 2014 study and compared these responses with those of coached and uncoached DID simulators. The comparisons revealed that patients with DID most frequently endorsed items related to dissociation, trauma, depression, fearfulness, conflict within family, and self-destructiveness. The coached group more successfully imitated item endorsements of the DID group than did the uncoached group. However, both simulating groups, especially the uncoached group, frequently endorsed items that were uncommonly endorsed by the DID group. The uncoached group endorsed items consistent with popular media portrayals of people with DID being violent, delusional, and unlawful. These results suggest that item endorsement patterns can provide useful information to clinicians making determinations about whether an individual is presenting with DID or feigning. © 2016 American Academy of Psychiatry and the Law.

  18. Construction and validation of a psychometric scale to measure awareness on consumption of irradiated foods.

    PubMed

    Rusin, Tiago; Araújo, Wilma Maria Coelho; Faiad, Cristiane; Vital, Helio de Carvalho

    2017-01-01

    Although food irradiation has been used to ensure food safety, most consumers are unaware of the basic concepts of irradiation, misinterpreting information and demonstrating a negative attitude toward food items treated with ionizing radiation. This research aimed to develop a tool to assess awareness of the consumption of irradiated food. The sample was composed of employees from different social classes and school levels of Brazilian universities, who reflect the end-users of irradiated foods and are representative of the views of lay consumers. The total number of respondents was 614. In order to assess the Awareness Scale on Consumption of Irradiated Foods (ASCIF), an instrument was developed and submitted to semantic tests and judges' validation. The instrument, which included 32 items, covered four construct factors: concepts (6 items), awareness (10 items), labeling (7 items) and safety of irradiated foods (9 items). The data were collected by electronic means, through the site . Exploratory factor analysis (EFA) yielded 4 factors, which summarize the 31 items included. These factors account for 64.32% of the variance of the items, and the internal consistency of the factors was deemed good. Exploratory Structural Equation Modeling (ESEM) was conducted to evaluate the factor structure of the instrument. The proposed instrument was found to meet consistency criteria as an efficient tool for assessing potential challenges and opportunities for the irradiated food markets.

  19. Construction and validation of a psychometric scale to measure awareness on consumption of irradiated foods

    PubMed Central

    2017-01-01

    Although food irradiation has been used to ensure food safety, most consumers are unaware of the basic concepts of irradiation, misinterpreting information and demonstrating a negative attitude toward food items treated with ionizing radiation. This research aimed to develop a tool to assess awareness of the consumption of irradiated food. The sample was composed of employees from different social classes and school levels of Brazilian universities, who reflect the end-users of irradiated foods and are representative of the views of lay consumers. The total number of respondents was 614. In order to assess the Awareness Scale on Consumption of Irradiated Foods (ASCIF), an instrument was developed and submitted to semantic tests and judges' validation. The instrument, which included 32 items, covered four construct factors: concepts (6 items), awareness (10 items), labeling (7 items) and safety of irradiated foods (9 items). The data were collected by electronic means, through the site . Exploratory factor analysis (EFA) yielded 4 factors, which summarize the 31 items included. These factors account for 64.32% of the variance of the items, and the internal consistency of the factors was deemed good. Exploratory Structural Equation Modeling (ESEM) was conducted to evaluate the factor structure of the instrument. The proposed instrument was found to meet consistency criteria as an efficient tool for assessing potential challenges and opportunities for the irradiated food markets. PMID:29220375

  20. Scale Development for Perceived School Climate for Girls' Physical Activity

    ERIC Educational Resources Information Center

    Birnbaum, Amanda S.; Evenson, Kelly R.; Motl, Robert W.; Dishman, Rod K.; Voorhees, Carolyn C.; Sallis, James F.; Elder, John P.; Dowda, Marsha

    2005-01-01

    Objectives: To test an original scale assessing perceived school climate for girls' physical activity in middle school girls. Methods: Confirmatory factor analysis (CFA) and structural equation modeling (SEM). Results: CFA retained 5 of 14 original items. A model with 2 correlated factors, perceptions about teachers' and boys' behaviors,…

  1. PAN AIR: A Computer Program for Predicting Subsonic or Supersonic Linear Potential Flows About Arbitrary Configurations Using a Higher Order Panel Method. Volume 1; Theory Document (Version 1.1)

    NASA Technical Reports Server (NTRS)

    Magnus, Alfred E.; Epton, Michael A.

    1981-01-01

    An outline of the derivation of the differential equation governing linear subsonic and supersonic potential flow is given. The use of Green's Theorem to obtain an integral equation over the boundary surface is discussed. The engineering techniques incorporated in the PAN AIR (Panel Aerodynamics) program (a discretization method which solves the integral equation for arbitrary first order boundary conditions) are then discussed in detail. Items discussed include the construction of the compressibility transformations, splining techniques, imposition of the boundary conditions, influence coefficient computation (including the concept of the finite part of an integral), computation of pressure coefficients, and computation of forces and moments.

  2. The Montgomery Äsberg and the Hamilton Ratings of Depression

    PubMed Central

    Carmody, Thomas; Rush, A. John; Bernstein, Ira; Warden, Diane; Brannan, Stephen; Burnham, Daniel; Woo, Ada; Trivedi, Madhukar

    2007-01-01

    The 17-item Hamilton Rating Scale for Depression (HRSD17) and the Montgomery Äsberg Depression Rating Scale (MADRS) are two widely used clinician-rated symptom scales. A 6-item version of the HRSD (HRSD6) was created by Bech to address the psychometric limitations of the HRSD17. The psychometric properties of these measures were compared using classical test theory (CTT) and item response theory (IRT) methods. IRT methods were used to equate total scores on any two scales. Data from two distinctly different outpatient studies of nonpsychotic major depression: a 12-month study of highly treatment-resistant patients (n=233) and an 8-week acute phase drug treatment trial (n=985) were used for robustness of results. MADRS and HRSD6 items generally contributed more to the measurement of depression than HRSD17 items as shown by higher item-total correlations and higher IRT slope parameters. The MADRS and HRSD6 were unifactorial while the HRSD17 contained 2 factors. The MADRS showed about twice the precision in estimating depression as either the HRSD17 or HRSD6 for average severity of depression. An HRSD17 of 7 corresponded to an 8 or 9 on the MADRS and 4 on the HRSD6. The MADRS would be superior to the HRSD17 in the conduct of clinical trials. PMID:16769204

  3. [Preliminary study on civil capacity rating scale for mental disabled patients].

    PubMed

    Zhang, Qin-Ting; Pang, Yan-Xia; Cai, Wei-Xiong; Tang, Tao; Huang, Fu-Yin

    2010-10-01

    To create a civil capacity rating scale for mentally disabled patients and explore its feasibility in forensic psychiatric expert assessment. The civil capacity-related items were determined after discussion and consultation. The civil capacity rating scale for mentally disabled patients was established, and the manual was created according to the logical sequence of the assessment. The rating scale was used during civil assessments in four institutes. There were 14 items in the civil capacity rating scale for mentally disabled patients. Two hundred and two subjects were recruited and divided into three groups according to the experts' opinion on their civil capacities: full civil capacity, partial civil capacity and no civil capacity. The mean scores of the three groups were 2.32 ± 2.45, 11.62 ± 4.01 and 25.02 ± 3.90, respectively, and there were statistically significant differences among the groups. The Cronbach alpha of the rating scale was 0.9724, and in the split-half reliability test the two halves of the rating scale were highly correlated (r = 0.9729, P = 0.000). The Spearman correlation coefficient between each item and the score of the rating scale ranged from 0.643 to 0.882 (P = 0.000). There was good agreement between the conclusion according to the rating scale and the experts' opinion (kappa = 0.841, P = 0.000). When discriminant analysis was used, 7 items were included in the discriminant equation, and 92.6% of subjects were assigned to the correct groups using the equation. The civil capacity rating scale for mentally disabled patients shows satisfactory reliability and validity. The rating scale can be used as an effective tool to grade civil capacity during forensic expert assessment.

  4. Identifying predictors of physics item difficulty: A linear regression approach

    NASA Astrophysics Data System (ADS)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. First, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created.
    It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge structures. The identified predictors point to the fundamental cognitive dimensions of student physics achievement at the end of compulsory education in Bosnia and Herzegovina, whose level of development influenced the test results within the conducted assessments.

  5. A study of Korean students' creativity in science using structural equation modeling

    NASA Astrophysics Data System (ADS)

    Jo, Son Mi

    Through the review of creativity research I have found that studies lack certain crucial parts: (a) a theoretical framework for the study of creativity in science, (b) studies considering the unique components related to scientific creativity, and (c) studies of the interactions among key components through simultaneous analyses. The primary purpose of this study is to explore the dynamic interactions among four components (scientific proficiency, intrinsic motivation, creative competence, context supporting creativity) related to scientific creativity under the framework of scientific creativity. A total of 295 Korean middle school students participated. Well-known and commonly used measurements were selected and developed. Two scientific achievement scores and one score measured by performance-based assessment were used to measure student scientific knowledge/inquiry skills. Six items selected from the study of Lederman, Abd-El-Khalick, Bell, and Schwartz (2002) were used to assess how well students understand the nature of science. Five items were selected from the subscale of the scientific attitude inventory version II (Moore & Foy, 1997) to assess student attitude toward science. The Test of Creative Thinking-Drawing Production (Urban & Jellen, 1996) was used to measure creative competence. Eight items chosen from the 15 items of the Work Preference Inventory (1994) were applied to measure students' intrinsic motivation. To assess the level of context supporting creativity, eight items were adapted from measurement of the work environment (Amabile, Conti, Coon, Lazenby, and Herron, 1996). To assess scientific creativity, one open-ended science problem was used and three raters rated the level of scientific creativity through the Consensual Assessment Technique (Amabile, 1996). The results show that scientific proficiency and creative competence correlate with scientific creativity.
    Intrinsic motivation and context components do not predict scientific creativity. The strengths of the relationships between scientific proficiency and scientific creativity (parameter estimate = 0.43) and between creative competence and scientific creativity (parameter estimate = 0.17) are similar [χ²_.05(1) = 0.670, P > .05]. In a specific analysis of the structural model, I found that creative competence and scientific proficiency play the role of partial mediators among three components (general creativity, scientific proficiency, and scientific creativity). The moderating effects of intrinsic motivation and the context component were investigated, but no moderation effects were found.

  6. Empirical Correction to the Likelihood Ratio Statistic for Structural Equation Modeling with Many Variables.

    PubMed

    Yuan, Ke-Hai; Tian, Yubin; Yanagihara, Hirokazu

    2015-06-01

    Survey data typically contain many variables. Structural equation modeling (SEM) is commonly used in analyzing such data. The most widely used statistic for evaluating the adequacy of a SEM model is T_ML, a slight modification to the likelihood ratio statistic. Under the normality assumption, T_ML approximately follows a chi-square distribution when the number of observations (N) is large and the number of items or variables (p) is small. However, in practice, p can be rather large while N is always limited due to not having enough participants. Even with a relatively large N, empirical results show that T_ML rejects the correct model too often when p is not too small. Various corrections to T_ML have been proposed, but they are mostly heuristic. Following the principle of the Bartlett correction, this paper proposes an empirical approach to correct T_ML so that the mean of the resulting statistic approximately equals the degrees of freedom of the nominal chi-square distribution. Results show that empirically corrected statistics follow the nominal chi-square distribution much more closely than previously proposed corrections to T_ML, and they control type I errors reasonably well whenever N ≥ max(50, 2p). The formulations of the empirically corrected statistics are further used to predict type I errors of T_ML as reported in the literature, and they perform well.

  7. Application of Item Response Theory to Tests of Substance-related Associative Memory

    PubMed Central

    Shono, Yusuke; Grenard, Jerry L.; Ames, Susan L.; Stacy, Alan W.

    2015-01-01

    A substance-related word association test (WAT) is one of the commonly used indirect tests of substance-related implicit associative memory and has been shown to predict substance use. This study applied an item response theory (IRT) modeling approach to evaluate psychometric properties of the alcohol- and marijuana-related WATs and their items among 775 ethnically diverse at-risk adolescents. After examining the IRT assumptions, item fit, and differential item functioning (DIF) across gender and age groups, the original 18 WAT items were reduced to 14 and 15 items in the alcohol- and marijuana-related WATs, respectively. Thereafter, unidimensional one- and two-parameter logistic models (1PL and 2PL models) were fitted to the revised WAT items. The results demonstrated that both alcohol- and marijuana-related WATs have good psychometric properties. These results were discussed in light of the framework of a unified concept of construct validity (Messick, 1975, 1989, 1995). PMID:25134051

  8. Linking Parameter Estimates Derived from an Item Response Model through Separate Calibrations. Research Report. ETS RR-09-40

    ERIC Educational Resources Information Center

    Haberman, Shelby J.

    2009-01-01

    A regression procedure is developed to link simultaneously a very large number of item response theory (IRT) parameter estimates obtained from a large number of test forms, where each form has been separately calibrated and where forms can be linked on a pairwise basis by means of common items. An application is made to forms in which a…

  9. Development of a Culture Specific Critical Thinking Ability Test and Using It as a Supportive Diagnostic Test for Giftedness

    ERIC Educational Resources Information Center

    Köksal, Mustafa Serdar

    2016-01-01

    The purposes of this study were to develop a culture specific critical thinking ability test for 6th, 7th, and 8th grade students in Turkey and to use it as an assessment instrument for giftedness. For these purposes, an item pool of 22 items was formed by writing items focusing on the current and common events presented in (Turkish) media from…

  10. Study of the Reliability of CCSS-Aligned Math Measures (2012 Research Version): Grades 6-8. Technical Report #1312

    ERIC Educational Resources Information Center

    Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    In this technical report, we describe the results of a study of mathematics items written to align with the Common Core State Standards (CCSS) in grades 6-8. In each grade, CCSS items were organized into forms, and the reliability of these forms was evaluated along with an experimental form including items aligned with the National Council of…

  11. Development of a prototype commonality analysis tool for use in space programs

    NASA Technical Reports Server (NTRS)

    Yeager, Dorian P.

    1988-01-01

    A software tool to aid in performing commonality analyses, called Commonality Analysis Problem Solver (CAPS), was designed, and a prototype version (CAPS 1.0) was implemented and tested. The CAPS 1.0 runs in an MS-DOS or IBM PC-DOS environment. The CAPS is designed around a simple input language which provides a natural syntax for the description of feasibility constraints. It provides its users with the ability to load a database representing a set of design items, describe the feasibility constraints on items in that database, and do a comprehensive cost analysis to find the most economical substitution pattern.

  12. Item analysis of three Spanish naming tests: a cross-cultural investigation.

    PubMed

    Marquez de la Plata, Carlos; Arango-Lasprilla, Juan Carlos; Alegret, Montse; Moreno, Alexander; Tárraga, Luis; Lara, Mar; Hewlitt, Margaret; Hynan, Linda; Cullum, C Munro

    2009-01-01

    Neuropsychological evaluations conducted in the United States and abroad commonly include the use of tests translated from English to Spanish. The use of translated naming tests for evaluating predominately Spanish-speakers has recently been challenged on the grounds that translating test items may compromise a test's construct validity. The Texas Spanish Naming Test (TNT) has been developed in Spanish specifically for use with Spanish-speakers; however, it is unlikely patients from diverse Spanish-speaking geographical regions will perform uniformly on a naming test. The present study evaluated and compared the internal consistency and patterns of item difficulty and item discrimination for the TNT and two commonly used translated naming tests in three countries (i.e., United States, Colombia, Spain). Two hundred fifty-two subjects (136 demented, 116 nondemented) across three countries were administered the TNT, Modified Boston Naming Test-Spanish (MBNT-S), and the naming subtest from the CERAD. The TNT demonstrated superior internal consistency to its counterparts, an item difficulty pattern superior to that of the CERAD naming test, and an item discrimination pattern superior to that of the MBNT-S across countries. Overall, all three Spanish naming tests differentiated nondemented and moderately demented individuals, but the results suggest the items of the TNT are most appropriate to use with Spanish-speakers. Preliminary normative data for the three tests examined in each country are provided.

  13. Do children with gender dysphoria have intense/obsessional interests?

    PubMed

    VanderLaan, Doug P; Postema, Lori; Wood, Hayley; Singh, Devita; Fantus, Sophia; Hyun, Jessica; Leef, Jonathan; Bradley, Susan J; Zucker, Kenneth J

    2015-01-01

    This study examined whether children clinically referred for gender dysphoria (GD) show increased symptoms of autism spectrum disorder (ASD). Circumscribed preoccupations or intense interests were considered as overlapping symptoms expressed in GD and ASD. In gender-referred children (n = 534; 82.2% male) and their siblings (n = 419; 57.5% male), we examined Items 9 and 66 on the Child Behavior Checklist, which measure obsessions and compulsions, respectively. Non-GD clinic-referred (n = 1,201; 48.5% male) and nonreferred (n = 1,201; 48.5% male) children were also examined. Gender-referred children were elevated compared to all other groups for Item 9, and compared to siblings and nonreferred children for Item 66. A gender-related theme was significantly more common for gender-referred boys than male siblings on Item 9 only. A gender-related theme was not significantly more common for gender-referred girls compared to their female siblings on either item. The findings for Item 9 support the idea that children with GD show an elevation in obsessional interests. For gender-referred boys in particular, gender-related themes constituted more than half of the examples provided by their mothers. Intense/obsessional interests in children with GD may be one of the factors underlying the purported link between GD and ASD.

  14. Psychometric Properties of the Chinese Shortened Version of the Zuckerman–Kuhlman Personality Questionnaire in a Sample of Adolescents and Young Adults

    PubMed Central

    Wang, Daoyang; Hu, Mingming; Zheng, Chanjin; Liu, Zhengguang

    2017-01-01

    Introduction: The original 89-item Zuckerman–Kuhlman Personality Questionnaire (form III Revised, ZKPQ-III-R) is a widely accepted and used self-report measure for personality traits. This study assessed the reliability and construct validity of the Chinese short 46-item version of the ZKPQ-III-R in a sample of adolescents and young adults. Methodology: A total of 1,019 Chinese adolescents and young adults completed the Chinese version of the original 89-item version ZKPQ-III-R and short 46-item version ZKPQ-III-R, self-report measures of depression, life satisfaction, and subjective health complaints (SHC), the Big Five personality traits, and a substance use risk profile. We explored the internal consistency of five dimensions of the short 46-item version ZKPQ-III-R and compared it with observations in previous studies of Chinese and other populations. The structure of the questionnaire was analyzed by confirmatory factor analysis and exploratory structural equation modeling. Results: The short 46-item version ZKPQ-III-R had adequate internal reliability for all five dimensions, with Cronbach’s α coefficients of 0.63 to 0.84. The concurrent validity of the short 46-item version ZKPQ-III-R was supported by significant correlations with depression, life satisfaction, and SHC. The short 46-item version ZKPQ-III-R had better fit, similar reliability coefficients, and slightly better construct and convergent validity than the 89-item version. Conclusion: The Chinese version of the 46-item ZKPQ-III-R presented reliability and validity in measuring personality in Chinese adolescents and young adults. PMID:28326057

  15. Thirty Years of Nonparametric Item Response Theory.

    ERIC Educational Resources Information Center

    Molenaar, Ivo W.

    2001-01-01

    Discusses relationships between a mathematical measurement model and its real-world applications. Makes a distinction between large-scale data matrices commonly found in educational measurement and smaller matrices found in attitude and personality measurement. Also evaluates nonparametric methods for estimating item response functions and…

  16. Age differences in short-term memory binding are related to working memory performance across the lifespan.

    PubMed

    Fandakova, Yana; Sander, Myriam C; Werkle-Bergner, Markus; Shing, Yee Lee

    2014-03-01

    Memory performance increases during childhood and adolescence, and decreases in old age. Among younger adults, better ability to bind items to the context in which they were experienced is associated with higher working memory performance (Oberauer, 2005). Here, we examined the extent to which age differences in binding contribute to life span age differences in short-term memory (STM). Younger children (N = 85; 10 to 12 years), teenagers (N = 41; 13 to 15 years), younger adults (N = 84; 20 to 25 years), and older adults (N = 86; 70 to 75 years) worked on global and local short-term recognition tasks that are assumed to measure item and item-context memory, respectively. Structural equation models showed that item-context bindings are functioning less well in children and older adults compared with younger adults and teenagers. This result suggests protracted development of the ability to form and recollect detailed short-term memories, and decline of this ability in aging. Across all age groups, better item-context binding was associated with higher working memory performance, indicating that developmental differences in binding mechanisms are closely related to working memory development in childhood and old age. (c) 2014 APA, all rights reserved.

  17. Factor structure and clinical correlates of the 61-item Wender Utah Rating Scale (WURS).

    PubMed

    Calamia, Matthew; Hill, Benjamin D; Musso, Mandi W; Pella, Russell D; Gouvier, Wm Drew

    2018-02-09

    The objective of this study was to assess the factor structure and clinical correlates of a 61-item version of the Wender Utah Rating Scale (WURS), a self-report retrospective measure of childhood problems, experiences, and behavior used in ADHD assessment. Given that the currently most widely used form of the WURS was derived via a criterion-keyed approach, the study aimed to use latent variable modeling of the 61-item WURS to potentially identify a more homogeneous set of items reflecting current conceptualizations of ADHD symptoms. Exploratory structural equation modeling was used to generate factor scores which were then correlated with neuropsychological measures of intelligence and executive attention as well as a broad measure of personality and emotional functioning. Support for a modified five-factor model was found: ADHD, disruptive mood and behavior, negative affectivity, social confidence, and academic problems. The ADHD factor differed somewhat from the traditional 25-item WURS short form largely through weaker associations with several measures of personality and psychopathology. This study identified a factor more aligned with DSM-5 conceptualization of ADHD as well as measures of other types of childhood characteristics and symptoms which may prove useful for both research and clinical practice.

  18. Dietary patterns and whole grain cereals in the Scandinavian countries--differences and similarities. The HELGA project.

    PubMed

    Engeset, Dagrun; Hofoss, Dag; Nilsson, Lena M; Olsen, Anja; Tjønneland, Anne; Skeie, Guri

    2015-04-01

    To identify dietary patterns with whole grains as a main focus to see if there is a similar whole grain pattern in the three Scandinavian countries; Denmark, Sweden and Norway. Another objective is to see if items suggested for a Nordic Food Index will form a typical Nordic pattern when using factor analysis. The HELGA study population is based on samples of existing cohorts: the Norwegian Women and Cancer Study, the Swedish Västerbotten cohort and the Danish Diet, Cancer and Health study. The HELGA study aims to generate knowledge about the health effects of whole grain foods. The study included a total of 119 913 participants. The associations among food variables from FFQ were investigated by principal component analysis. Only food groups common for all three cohorts were included. High factor loading of a food item shows high correlation of the item to the specific diet pattern. The main whole grain for Denmark and Sweden was rye, while Norway had highest consumption of wheat. Three similar patterns were found: a cereal pattern, a meat pattern and a bread pattern. However, even if the patterns look similar, the food items belonging to the patterns differ between countries. High loadings on breakfast cereals and whole grain oat were common in the cereal patterns for all three countries. Thus, the cereal pattern may be considered a common Scandinavian whole grain pattern. Food items belonging to a Nordic Food Index were distributed between different patterns.

  19. Reviewing the psychometric properties of contemporary circadian typology measures.

    PubMed

    Di Milia, Lee; Adan, Ana; Natale, Vincenzo; Randler, Christoph

    2013-12-01

    The accurate measurement of circadian typology (CT) is critical because the construct has implications for a number of health disorders. In this review, we focus on the evidence to support the reliability and validity of the more commonly used CT scales: the Morningness-Eveningness Questionnaire (MEQ), reduced Morningness-Eveningness Questionnaire (rMEQ), the Composite Scale of Morningness (CSM), and the Preferences Scale (PS). In addition, we also consider the Munich ChronoType Questionnaire (MCTQ). In terms of reliability, the MEQ, CSM, and PS consistently report high levels of reliability (>0.80), whereas the reliability of the rMEQ is satisfactory. The stability of these scales is sound at follow-up periods up to 13 mos. The MCTQ is not a scale; therefore, its reliability cannot be assessed. Although it is possible to determine the stability of the MCTQ, these data are yet to be reported. Validity must be given equal weight in assessing the measurement properties of CT instruments. Most commonly reported is convergent and construct validity. The MEQ, rMEQ, and CSM are highly correlated and this is to be expected, given that these scales share common items. The level of agreement between the MCTQ and the MEQ is satisfactory, but the correlation between these two constructs decreases in line with the number of "corrections" applied to the MCTQ. The interesting question is whether CT is best represented by a psychological preference for behavior or by using a biomarker such as sleep midpoint. Good-quality subjective and objective data suggest adequate construct validity for each of the CT instruments, but a major limitation of this literature is studies that assess the predictive validity of these instruments. We make a number of recommendations with the aim of advancing science. 
    Future studies need to (1) focus on collecting data from representative samples that consider a number of environmental factors; (2) employ longitudinal designs to allow the predictive validity of CT measures to be assessed and preferably make use of objective data; (3) employ contemporary statistical approaches, including structural equation modeling and item-response models; and (4) provide better information concerning sample selection and a rationale for choosing cutoff points.

  20. A Calibration to Predict the Concentrations of Impurities in Plutonium Oxide by Prompt Gamma Analysis Revision 2

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Narlesky, Joshua Edward; Kelly, Elizabeth J.

    2015-09-10

    This report documents the new PG calibration regression equations. These calibration equations incorporate new data that have become available since revision 1 of “A Calibration to Predict the Concentrations of Impurities in Plutonium Oxide by Prompt Gamma Analysis” was issued [3]. The calibration equations are based on a weighted least squares (WLS) approach for the regression. The WLS method gives each data point its proper amount of influence over the parameter estimates. This gives two big advantages: more precise parameter estimates and better and more defensible estimates of uncertainties. The WLS approach makes sense both statistically and experimentally because the variances increase with concentration, and there are physical reasons that the higher measurements are less reliable and should be less influential. The new magnesium calibration includes a correction for sodium and separate calibration equations for items with and without chlorine. These additional calibration equations allow for better predictions and smaller uncertainties for sodium in materials with and without chlorine. Chlorine and sodium have separate equations for RICH materials. Again, these equations give better predictions and smaller uncertainties for chlorine and sodium for RICH materials.
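
    The weighted least squares idea described above can be sketched for a straight-line calibration. This is only an illustration under stated assumptions: the report's actual model form and variance model are not given here, so the linear form, the function name, and the choice of weights (inverse variance, with variance growing with concentration) are hypothetical.

```python
def wls_line(x, y, weights):
    """Fit y = a + b*x by weighted least squares; returns (a, b).

    Each weight is typically 1 / variance of the corresponding measurement,
    so that noisier (here: higher-concentration) points have less influence.
    """
    sw = sum(weights)
    sx = sum(w * xi for w, xi in zip(weights, x))
    sy = sum(w * yi for w, yi in zip(weights, y))
    sxx = sum(w * xi * xi for w, xi in zip(weights, x))
    sxy = sum(w * xi * yi for w, xi, yi in zip(weights, x, y))
    slope = (sw * sxy - sx * sy) / (sw * sxx - sx * sx)
    intercept = (sy - slope * sx) / sw
    return intercept, slope


# Hypothetical calibration data: concentration vs. detector response.
conc = [1.0, 2.0, 4.0, 8.0]
resp = [5.1, 7.9, 14.2, 25.5]
# Assumed variance model: standard deviation proportional to concentration.
w = [1.0 / (c * c) for c in conc]
a, b = wls_line(conc, resp, w)
```

    With equal weights the function reduces to ordinary least squares, which makes the effect of down-weighting the high-concentration points easy to compare.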

  1. Techniques to evaluate the importance of common cause degradation on reliability and safety of nuclear weapons.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Darby, John L.

    2011-05-01

    As the nuclear weapon stockpile ages, there is increased concern about common degradation ultimately leading to common cause failure of multiple weapons that could significantly impact reliability or safety. Current acceptable limits for the reliability and safety of a weapon are based on upper limits on the probability of failure of an individual item, assuming that failures among items are independent. We expanded the current acceptable limits to apply to situations with common cause failure. Then, we developed a simple screening process to quickly assess the importance of observed common degradation for both reliability and safety to determine if further action is necessary. The screening process conservatively assumes that common degradation is common cause failure. For a population with between 100 and 5000 items we applied the screening process and conclude the following. In general, for a reliability requirement specified in the Military Characteristics (MCs) for a specific weapon system, common degradation is of concern if more than 100(1-x)% of the weapons are susceptible to common degradation, where x is the required reliability expressed as a fraction. Common degradation is of concern for the safety of a weapon subsystem if more than 0.1% of the population is susceptible to common degradation. Common degradation is of concern for the safety of a weapon component or overall weapon system if two or more components/weapons in the population are susceptible to degradation. Finally, we developed a technique for detailed evaluation of common degradation leading to common cause failure for situations that are determined to be of concern using the screening process. The detailed evaluation requires that best estimates of common cause and independent failure probabilities be produced. Using these techniques, observed common degradation can be evaluated for effects on reliability and safety.
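
    The three screening thresholds stated in the abstract can be written down directly. The sketch below only encodes those stated thresholds; the function and argument names are hypothetical, and it does not reproduce the report's detailed evaluation technique.

```python
def screen_common_degradation(susceptible, population, required_reliability):
    """Apply the abstract's three screening thresholds.

    susceptible: number of items susceptible to common degradation
    population: total number of items (the abstract considers 100 to 5000)
    required_reliability: x, the required reliability as a fraction
    Returns a dict of booleans, True meaning "of concern".
    """
    return {
        # More than 100(1-x)% of the population is susceptible.
        "reliability": susceptible > (1.0 - required_reliability) * population,
        # More than 0.1% of the population is susceptible.
        "subsystem_safety": susceptible * 1000 > population,
        # Two or more components/weapons are susceptible.
        "component_or_system_safety": susceptible >= 2,
    }


# Hypothetical case: 20 of 1000 items susceptible, required reliability 0.99.
result = screen_common_degradation(20, 1000, 0.99)
```

    In this hypothetical case all three checks flag a concern, since 2% exceeds both the 1% reliability threshold and the 0.1% subsystem safety threshold.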

  2. Six steps to a successful dose-reduction strategy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bennett, M.

    1995-03-01

    The increased importance of demonstrating achievement of the ALARA principle has helped produce a proliferation of dose-reduction ideas. Across a company there may be many dose-reduction items being pursued in a variety of areas. However, companies have a limited amount of resource and, therefore, to ensure funding is directed to those items which will produce the most benefit and that all areas apply a common policy, requires the presence of a dose-reduction strategy. Six steps were identified in formulating the dose-reduction strategy for Rolls-Royce and Associates (RRA): (1) collating the ideas; (2) quantitatively evaluating them on a common basis; (3) prioritizing the ideas in terms of cost benefit; (4) implementation of the highest priority items; (5) monitoring their success; (6) periodically reviewing the strategy. Inherent in producing the dose-reduction strategy has been a comprehensive dose database and the RRA-developed dose management computer code DOMAIN, which allows prediction of dose rates and dose. The database enabled high task dose items to be identified, assisted in evaluating dose benefits, and monitored dose trends once items had been implemented. The DOMAIN code was used both in quantifying some of the project dose benefits and its results, such as dose contours, used in some of the dose-reduction items themselves. In all, over fifty dose-reduction items were evaluated in the strategy process and the items which will give greatest benefit are being implemented. The strategy has been successful in giving renewed impetus and direction to dose-reduction management.

  3. Maintenance of item and order information in verbal working memory.

    PubMed

    Camos, Valérie; Lagner, Prune; Loaiza, Vanessa M

    2017-09-01

    Although verbal recall of item and order information is well-researched in short-term memory paradigms, there is relatively little research concerning item and order recall from working memory. The following study examined whether manipulating the opportunity for attentional refreshing and articulatory rehearsal in a complex span task differently affected the recall of item- and order-specific information of the memoranda. Five experiments varied the opportunity for articulatory rehearsal and attentional refreshing in a complex span task, but the type of recall was manipulated between experiments (item and order, order only, and item only recall). The results showed that impairing attentional refreshing and articulatory rehearsal similarly affected recall regardless of whether the scoring procedure (Experiments 1 and 4) or recall requirements (Experiments 2, 3, and 5) reflected item- or order-specific recall. This implies that both mechanisms sustain the maintenance of item and order information, and suggests that the common cumulative functioning of these two mechanisms to maintain items could be at the root of order maintenance.

  4. 48 CFR 12.213 - Other commercial practices.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... 48 Federal Acquisition Regulations System 1 2010-10-01 2010-10-01 false Other commercial practices... ACQUISITION PLANNING ACQUISITION OF COMMERCIAL ITEMS Special Requirements for the Acquisition of Commercial Items 12.213 Other commercial practices. It is a common practice in the commercial marketplace for both...

  5. Bayes Factor Covariance Testing in Item Response Models.

    PubMed

    Fox, Jean-Paul; Mulder, Joris; Sinharay, Sandip

    2017-12-01

    Two marginal one-parameter item response theory models are introduced, by integrating out the latent variable or random item parameter. It is shown that both marginal response models are multivariate (probit) models with a compound symmetry covariance structure. Several common hypotheses concerning the underlying covariance structure are evaluated using (fractional) Bayes factor tests. The support for a unidimensional factor (i.e., assumption of local independence) and differential item functioning are evaluated by testing the covariance components. The posterior distribution of common covariance components is obtained in closed form by transforming latent responses with an orthogonal (Helmert) matrix. This posterior distribution is defined as a shifted-inverse-gamma, thereby introducing a default prior and a balanced prior distribution. Based on that, an MCMC algorithm is described to estimate all model parameters and to compute (fractional) Bayes factor tests. Simulation studies are used to show that the (fractional) Bayes factor tests have good properties for testing the underlying covariance structure of binary response data. The method is illustrated with two real data studies.

  6. Teacher role stress, satisfaction, commitment, and intentions to leave: a structural model.

    PubMed

    Conley, Sharon; You, Sukkyung

    2009-12-01

    Structural equation modeling was used to assess the plausibility of a conceptual model specifying hypothesized linkages among teachers' perceptions of the role stresses of role ambiguity, role conflict, and role overload and commitment, satisfaction, and intentions to leave their employing school. 178 teachers in four high schools in a southern coastal region of California responded to survey questions designed to capture the above constructs. Confirmatory factor analysis was used to assess whether the role-stress items fit hypothesized constructs. Structural equation modeling results indicated that satisfaction and commitment are two mediators in the role stresses-intentions to leave relationship.

  7. THE BRIEF PSYCHIATRIC RATING SCALE IN POSITIVE AND NEGATIVE SUBTYPES OF SCHIZOPHRENIA

    PubMed Central

    Kulhara, P.; Mattoo, S.K.; Avasthi, A.; Malhotra, A.

    1987-01-01

    SUMMARY Usefulness of the Brief Psychiatric Rating Scale (BPRS) in distinguishing positive and negative subtypes of schizophrenia is presented. Ninety five schizophrenic patients were assessed on BPRS. Significant differences emerged between positive and negative subtypes of schizophrenia on items like emotional withdrawal, guilt feelings, tension, hallucinatory behaviour, motor retardation, blunted affect and excitement. Discriminant function equation generated by these items had a high rate of prediction of group membership either to positive or negative schizophrenia group. Principal components analysis of BPRS scores yielded factors which favour categorization of patients in positive, negative subtypes. The study provides support for classification of schizophrenia into these subtypes. PMID:21927241

  8. Predictive Equations Are Inaccurate in the Estimation of the Resting Energy Expenditure of Children With End-Stage Liver Disease.

    PubMed

    Carpenter, Andrea; Ng, Vicky Lee; Chapman, Karen; Ling, Simon C; Mouzaki, Marialena

    2017-03-01

    Malnutrition is common in children with end-stage liver disease (ESLD) and is associated with increased morbidity and mortality. The inability to accurately estimate energy needs of these patients may contribute to their poor nutrition status. In clinical practice, predictive equations are used to calculate resting energy expenditure (cREE). The objective of this study is to assess the accuracy of commonly used equations in pediatric patients with ESLD. Retrospective study performed at the Hospital for Sick Children. Clinical, laboratory, and indirect calorimetry data from children listed for liver transplant between February 2013 and December 2014 were reviewed. Calorimetry results were compared with cREE estimated using the Food and Agriculture Organization/World Health Organization/United Nations University (FAO/WHO/UNU), Schofield [weight], and Schofield [weight and height] equations. Forty-five patients were included in this study. The median age was 9 months, and the most common indication for transplantation was biliary atresia (64%). The Schofield [weight and height], FAO/WHO/UNU, and Schofield [weight] equations were compared with indirect calorimetry and found to have a mean (SD) difference of 48.8 (344.0), 59.3 (229.8), and 206.5 (502.6) kcal/d, respectively. The FAO/WHO/UNU, Schofield [weight], and Schofield [weight and height] equations introduced a mean error of 21%, 38%, and 76%, respectively. The FAO/WHO/UNU equation tended to underestimate, whereas the Schofield equations overestimated the REE. Commonly used predictive equations perform poorly in infants and young children with ESLD. Indirect calorimetry should be used when available to guide energy provision, particularly in children who are already malnourished.

  9. A Graphical Approach to Evaluating Equating Using Test Characteristic Curves

    ERIC Educational Resources Information Center

    Wyse, Adam E.; Reckase, Mark D.

    2011-01-01

    An essential concern in the application of any equating procedure is determining whether tests can be considered equated after the tests have been placed onto a common scale. This article clarifies one equating criterion, the first-order equity property of equating, and develops a new method for evaluating equating that is linked to this…

  10. Some Architecture for Embedded-Assessment Systems

    ERIC Educational Resources Information Center

    Kane, Michael T.; Tannenbaum, Richard J.

    2016-01-01

    It is one thing to produce an innovative, construct-based assessment task; it's another to produce 10 a year that are comparable in difficulty, measure the same competencies, are free of differential item functioning, and can be scaled and equated. These challenges contributed to the failure of the performance (or authentic) assessment movement of…

  11. Measuring Experiential Avoidance: A Preliminary Test of a Working Model

    ERIC Educational Resources Information Center

    Hayes, Steven C.; Strosahl, Kirk; Wilson, Kelly G.; Bissett, Richard T.; Pistorello, Jacqueline; Toarmino, Dosheen; Polusny, Melissa A.; Dykstra, Thane A.; Batten, Sonja V.; Bergan, John; Stewart, Sherry H.; Zvolensky, Michael J.; Eifert, Georg H.; Bond, Frank W.; Forsyth, John P.; Karekla, Maria; Mccurry, Susan M.

    2004-01-01

    The present study describes the development of a short, general measure of experiential avoidance, based on a specific theoretical approach to this process. A theoretically driven iterative exploratory analysis using structural equation modeling on data from a clinical sample yielded a single factor comprising 9 items. A fully confirmatory factor…

  12. A Permutation Test for Correlated Errors in Adjacent Questionnaire Items

    ERIC Educational Resources Information Center

    Hildreth, Laura A.; Genschel, Ulrike; Lorenz, Frederick O.; Lesser, Virginia M.

    2013-01-01

    Response patterns are of importance to survey researchers because of the insight they provide into the thought processes respondents use to answer survey questions. In this article we propose the use of structural equation modeling to examine response patterns and develop a permutation test to quantify the likelihood of observing a specific…

  13. Pre-Service Teacher Self-Efficacy for Teaching Students with Disabilities: What Knowledge Matters?

    ERIC Educational Resources Information Center

    Browarnik, Brooke; Bell, Sherry Mee; McCallum, R. Steve; Smyth, Kelly; Martin, Melissa

    2017-01-01

    The relation between items assessing knowledge about educating students with disabilities and the Tschannen-Moran and Hoy's Teachers' Sense of Efficacy Scale (TSES; 2001) was explored for 140 preservice, general education teachers using biserial correlation coefficients and a multiple regression equation. From the data collected, 8 correlations…

  14. Contribution of strategy use to performance on complex and simple span tasks.

    PubMed

    Bailey, Heather; Dunlosky, John; Kane, Michael J

    2011-04-01

    Simple and complex span tasks are widely thought to measure related but separable memory constructs. Recently, however, research has demonstrated that simple and complex span tasks may tap, in part, the same construct because both similarly predict performance on measures of fluid intelligence (Gf) when the number of items retrieved from secondary memory (SM) is equated (Unsworth & Engle, Journal of Memory and Language 54:68-80 2006). Two studies (n = 105 and n = 152) evaluated whether retrieval from SM is influenced by individual differences in the use of encoding strategies during span tasks. Results demonstrated that, after equating the number of items retrieved from SM, simple and complex span performance similarly predicted Gf performance, but rates of effective strategy use did not mediate the span-Gf relationships. Moreover, at the level of individual differences, effective strategy use was more highly related to complex span performance than to simple span performance. Thus, even though individual differences in effective strategy use influenced span performance on trials that required retrieval from SM, strategic behavior at encoding cannot account for the similarities between simple and complex span tasks.

  15. Dimensionality of the 9-item Utrecht Work Engagement Scale revisited: A Bayesian structural equation modeling approach.

    PubMed

    Fong, Ted C T; Ho, Rainbow T H

    2015-01-01

    The aim of this study was to reexamine the dimensionality of the widely used 9-item Utrecht Work Engagement Scale using the maximum likelihood (ML) approach and Bayesian structural equation modeling (BSEM) approach. Three measurement models (1-factor, 3-factor, and bi-factor models) were evaluated in two split samples of 1,112 health-care workers using confirmatory factor analysis and BSEM, which specified small-variance informative priors for cross-loadings and residual covariances. Model fit and comparisons were evaluated by posterior predictive p-value (PPP), deviance information criterion, and Bayesian information criterion (BIC). None of the three ML-based models showed an adequate fit to the data. The use of informative priors for cross-loadings did not improve the PPP for the models. The 1-factor BSEM model with approximately zero residual covariances displayed a good fit (PPP>0.10) to both samples and a substantially lower BIC than its 3-factor and bi-factor counterparts. The BSEM results demonstrate empirical support for the 1-factor model as a parsimonious and reasonable representation of work engagement.

  16. Anorexia/cachexia-related quality of life for children with cancer.

    PubMed

    Lai, Jin-Shei; Cella, David; Peterman, Amy; Barocas, Joshua; Goldman, Stewart

    2005-10-01

    Anorexia is a common symptom in patients with cancer, which can lead to poor tolerance of treatment and can contribute to cachexia in extreme cases. Children with advanced-stage cancer are especially vulnerable to malnutrition resulting from anorexia and cachexia. Currently, there are no instruments that measure common concerns specifically associated with anorexia and cachexia in children with cancer. The purpose of the current article was to test the psychometric properties of a newly developed pediatric Functional Assessment of Anorexia and Cachexia Therapy (peds-FAACT) for children with cancer. Ninety-six patients (ages 7-17 yrs) receiving cancer treatment and their parents were asked to complete the 12-item peds-FAACT. The authors implemented both classical test theory and item response theory to evaluate the agreement between parents and patients, internal consistency and unidimensionality of the scale, and stability of items across subgroups. As a result, a patient-reported six-item scale was recommended as the core measure for all pediatric patients with cancer and four additional peripheral items were recommended for adolescent patients. The peds-FAACT demonstrated good psychometric properties, differentiated patients with different functional performance status, and was determined to be a useful tool for future clinical trials.

  17. Ability of commonly used prediction equations to predict resting energy expenditure in children with inflammatory bowel disease.

    PubMed

    Hill, Rebecca J; Lewindon, Peter J; Withers, Geoffrey D; Connor, Frances L; Ee, Looi C; Cleghorn, Geoffrey J; Davies, Peter S W

    2011-07-01

    Paediatric onset inflammatory bowel disease (IBD) may cause alterations in energy requirements and invalidate the use of standard prediction equations. Our aim was to evaluate four commonly used prediction equations for resting energy expenditure (REE) in children with IBD. Sixty-three children had repeated measurements of REE as part of a longitudinal research study yielding a total of 243 measurements. These were compared with predicted REE from Schofield, Oxford, FAO/WHO/UNU, and Harris-Benedict equations using the Bland-Altman method. Mean (±SD) age of the patients was 14.2 (2.4) years. Mean measured REE was 1566 (336) kcal per day compared with 1491 (236), 1441 (255), 1481 (232), and 1435 (212) kcal per day calculated from Schofield, Oxford, FAO/WHO/UNU, and Harris-Benedict, respectively. While the Schofield equation demonstrated the least difference between measured and predicted REE, it, along with the other equations tested, did not perform uniformly across all subjects, indicating greater errors at either end of the spectrum of energy expenditure. Smaller differences were found for all prediction equations for Crohn's disease compared with ulcerative colitis. Of the commonly used equations, the equation of Schofield should be used in pediatric patients with IBD when measured values are not able to be obtained. Copyright © 2010 Crohn's & Colitis Foundation of America, Inc.
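
    The Bland-Altman comparison mentioned above summarizes agreement between measured and predicted values by the mean difference (bias) and limits of agreement. A minimal sketch, assuming the conventional limits of bias ± 1.96 standard deviations; the function name and the numbers are hypothetical and are not data from the study:

```python
from statistics import mean, stdev


def bland_altman(measured, predicted):
    """Return (bias, lower limit, upper limit) of agreement.

    bias is the mean of (measured - predicted); the limits are
    bias -/+ 1.96 times the sample standard deviation of the differences.
    """
    diffs = [m - p for m, p in zip(measured, predicted)]
    bias = mean(diffs)
    spread = 1.96 * stdev(diffs)
    return bias, bias - spread, bias + spread


# Hypothetical REE values in kcal per day for five children.
measured_ree = [1500.0, 1620.0, 1410.0, 1750.0, 1580.0]
predicted_ree = [1450.0, 1500.0, 1430.0, 1600.0, 1490.0]
bias, lower, upper = bland_altman(measured_ree, predicted_ree)
```

    A positive bias here would mean the equation underestimates measured REE on average; plotting each difference against the pair's mean shows whether the error changes across the range of energy expenditure, which is the non-uniformity the abstract reports.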

  18. Toward a Conceptualization of the Content of Psychosocial Screening in Living Organ Donors: An Ethical Legal Psychological Aspects of Transplantation Consensus.

    PubMed

    Ismail, Sohal Y; Duerinckx, Nathalie; van der Knoop, Marieke M; Timmerman, Lotte; Weimar, Willem; Dobbels, Fabienne; Massey, Emma K; Busschbach, Jan J J V

    2015-11-01

    Across Europe, transplant centers vary in the content of the psychosocial evaluation for eligible living organ donors. To identify whether a common framework underlies this variation in this evaluation, we studied which psychosocial screening items are most commonly used and considered as most important in current psychosocial screening programs of living organ donors. A multivariate analytic method, concept mapping, was used to generate a visual representation of the "psychosocial" screening items of living kidney and liver donors. A list of 75 potential screening items was derived from a systematic literature review and sorted and rated for their importance and commonness by multidisciplinary affiliated health care professionals from across Europe. Results were discussed and fine-tuned during a consensus meeting. The analyses resulted in a 6-cluster solution. The following clusters on psychosocial screening items were identified, listed from most to least important: (1) personal resources, (2) motivation and decision making, (3) psychopathology, (4) social resources, (5) ethical and legal factors, and (6) information and risk processing. We provided a conceptual framework of the essential elements in psychosocial evaluation of living donors which can serve as a uniform basis for the selection of relevant psychosocial evaluation tools, which can be further tested in prospective studies.

  19. Debugging embedded computer programs. [tactical missile computers]

    NASA Technical Reports Server (NTRS)

    Kemp, G. H.

    1980-01-01

    Every embedded computer program must complete its debugging cycle using some system that will allow real time debugging. Many of the common items addressed during debugging are listed. Seven approaches to debugging are analyzed to evaluate how well they treat those items. Cost evaluations are also included in the comparison. The results indicate that the best collection of capabilities to cover the common items present in the debugging task occurs in the approach where a minicomputer handles the environment simulation with an emulation of some kind representing the embedded computer. This approach can be taken at a reasonable cost. The case study chosen is an embedded computer in a tactical missile. Several choices of computer for the environment simulation are discussed as well as different approaches to the embedded emulator.

  20. Prediction Equations Overestimate the Energy Requirements More for Obesity-Susceptible Individuals.

    PubMed

    McLay-Cooke, Rebecca T; Gray, Andrew R; Jones, Lynnette M; Taylor, Rachael W; Skidmore, Paula M L; Brown, Rachel C

    2017-09-13

    Predictive equations to estimate resting metabolic rate (RMR) are often used in dietary counseling and by online apps to set energy intake goals for weight loss. It is critical to know whether such equations are appropriate for those susceptible to obesity. We measured RMR by indirect calorimetry after an overnight fast in 26 obesity susceptible (OSI) and 30 obesity resistant (ORI) individuals, identified using a simple 6-item screening tool. Predicted RMR was calculated using the FAO/WHO/UNU (Food and Agricultural Organisation/World Health Organisation/United Nations University), Oxford and Mifflin-St Jeor equations. Absolute measured RMR did not differ significantly between OSI versus ORI (6339 vs. 5893 kJ·d⁻¹, p = 0.313). All three prediction equations over-estimated RMR for both OSI and ORI when measured RMR was ≤5000 kJ·d⁻¹. For measured RMR ≤7000 kJ·d⁻¹, there was statistically significant evidence that the equations overestimate RMR to a greater extent for those classified as obesity susceptible, with biases ranging from around 10% to nearly 30% depending on the equation. The use of prediction equations may overestimate RMR and energy requirements, particularly in those who self-identify as being susceptible to obesity, which has implications for effective weight management.
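
    The kind of over-estimation described above can be illustrated with a minimal sketch. It assumes the commonly published form of the Mifflin-St Jeor equation (in kcal per day) and a conversion of 4.184 kJ per kcal; the abstract does not state which form the authors used, and the function names and the example person are hypothetical.

```python
KJ_PER_KCAL = 4.184  # assumed conversion factor (thermochemical kilocalorie)


def mifflin_st_jeor_kj(weight_kg, height_cm, age_years, male):
    """Predicted RMR in kJ/day from the commonly published Mifflin-St Jeor form."""
    sex_term = 5.0 if male else -161.0
    kcal = 10.0 * weight_kg + 6.25 * height_cm - 5.0 * age_years + sex_term
    return kcal * KJ_PER_KCAL


def percent_bias(predicted, measured):
    """Positive values mean the equation over-estimates the measured value."""
    return 100.0 * (predicted - measured) / measured


# Hypothetical person and hypothetical measured RMR, not data from the study.
predicted = mifflin_st_jeor_kj(weight_kg=70, height_cm=170, age_years=30, male=False)
measured = 5500.0
print(f"predicted {predicted:.0f} kJ/day, bias {percent_bias(predicted, measured):.1f}%")
```

    A positive bias here corresponds to the direction of error reported in the abstract; the size of the bias depends entirely on the illustrative inputs.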

  1. Treatment of Not-Administered Items on Individually Administered Intelligence Tests

    ERIC Educational Resources Information Center

    He, Wei; Wolfe, Edward W.

    2012-01-01

    In administration of individually administered intelligence tests, items are commonly presented in a sequence of increasing difficulty, and test administration is terminated after a predetermined number of incorrect answers. This practice produces stochastically censored data, a form of nonignorable missing data. By manipulating four factors…

  2. 45 CFR 96.87 - Leveraging incentive program.

    Code of Federal Regulations, 2013 CFR

    2013-10-01

    ... energy, or the purchase of items that help these households meet the cost of home energy, at commonly... fees, application fees, late payment charges, bulk fuel tank rental or purchase costs, and security...; space cooling devices, equipment, and systems; and other tangible items that help low-income households...

  3. 45 CFR 96.87 - Leveraging incentive program.

    Code of Federal Regulations, 2014 CFR

    2014-10-01

    ... energy, or the purchase of items that help these households meet the cost of home energy, at commonly... fees, application fees, late payment charges, bulk fuel tank rental or purchase costs, and security...; space cooling devices, equipment, and systems; and other tangible items that help low-income households...

  4. 45 CFR 96.87 - Leveraging incentive program.

    Code of Federal Regulations, 2012 CFR

    2012-10-01

    ... energy, or the purchase of items that help these households meet the cost of home energy, at commonly... fees, application fees, late payment charges, bulk fuel tank rental or purchase costs, and security...; space cooling devices, equipment, and systems; and other tangible items that help low-income households...

  5. Consistency Check for the Bin Packing Constraint Revisited

    NASA Astrophysics Data System (ADS)

    Dupuis, Julien; Schaus, Pierre; Deville, Yves

    The bin packing problem (BP) consists in finding the minimum number of bins necessary to pack a set of items so that the total size of the items in each bin does not exceed the bin capacity C. The bin capacity is common for all the bins.
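
    The problem definition above can be made concrete with a short sketch. It is not the consistency check studied in the record; it only shows the trivial lower bound (total size divided by capacity, rounded up) and the standard first-fit-decreasing heuristic, which gives an upper bound. The function names and the example instance are illustrative assumptions.

```python
from math import ceil


def lower_bound_l1(sizes, capacity):
    """Trivial lower bound: no packing can use fewer bins than this."""
    return ceil(sum(sizes) / capacity)


def first_fit_decreasing(sizes, capacity):
    """Heuristic packing: place each item, largest first, in the first bin that fits."""
    bins = []
    for size in sorted(sizes, reverse=True):
        if size > capacity:
            raise ValueError("item larger than bin capacity")
        for contents in bins:
            if sum(contents) + size <= capacity:
                contents.append(size)
                break
        else:
            # No existing bin had room, so open a new one.
            bins.append([size])
    return bins


# Hypothetical instance: six items and a common capacity C = 10.
sizes = [4, 8, 1, 4, 2, 1]
C = 10
packing = first_fit_decreasing(sizes, C)
print(lower_bound_l1(sizes, C), len(packing), packing)
```

    When the lower bound equals the number of bins found by the heuristic, as it does for this instance, the heuristic packing is optimal; in general the two values only bracket the optimum.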

  6. Incidental histopathological findings in hearts of control beagle dogs in toxicity studies.

    PubMed

    Bodié, Karen; Decker, Joshua H

    2014-08-01

    In preclinical studies of pharmaceutical agents, the beagle dog is a commonly used model for the detection of cardiotoxicity. Incidental findings, postmortem changes, and artifacts must be distinguished histopathologically from test item-related findings in the heart. In this retrospective analysis, cardiac sections from 88 control beagles (41 male, 47 female; ages 5-18 months) in preclinical studies were examined histopathologically. The most common finding was thickening of the tunica media of intramural coronary arteries, most likely a postmortem change. The second most common finding was the presence of vacuoles within Purkinje fibers. Dilated lymphatic and blood vessels at the insertion of chordae tendineae were noted more commonly in males than in females and were considered a normal anatomic feature. Mesothelial-lined papillary fronds along the epicardial surface of the atria were present in several dogs, as were small infiltrates of inflammatory cells usually within the myocardium. In summary, control beagles' hearts frequently have incidental findings that must be differentiated from test item-related pathologic changes. Historical control data can be useful for the interpretation of incidental and test item-related findings in the beagle heart.

  7. The value of item response theory in clinical assessment: a review.

    PubMed

    Thomas, Michael L

    2011-09-01

    Item response theory (IRT) and related latent variable models represent modern psychometric theory, the successor to classical test theory in psychological assessment. Although IRT has become prevalent in the measurement of ability and achievement, its contributions to clinical domains have been less extensive. Applications of IRT to clinical assessment are reviewed to appraise its current and potential value. Benefits of IRT include comprehensive analyses and reduction of measurement error, creation of computer adaptive tests, meaningful scaling of latent variables, objective calibration and equating, evaluation of test and item bias, greater accuracy in the assessment of change due to therapeutic intervention, and evaluation of model and person fit. The theory may soon reinvent the manner in which tests are selected, developed, and scored. Although challenges remain to the widespread implementation of IRT, its application to clinical assessment holds great promise. Recommendations for research, test development, and clinical practice are provided.

  8. The Effectiveness of Circular Equating as a Criterion for Evaluating Equating.

    ERIC Educational Resources Information Center

    Wang, Tianyou; Hanson, Bradley A.; Harris, Deborah J.

    Equating a test form to itself through a chain of equatings, commonly referred to as circular equating, has been widely used as a criterion to evaluate the adequacy of equating. This paper uses both analytical methods and simulation methods to show that this criterion is in general invalid in serving this purpose. For the random groups design done…

  9. Normative data for the Rappel libre/Rappel indicé à 16 items (16-item Free and Cued Recall) in the elderly Quebec-French population.

    PubMed

    Dion, Mélissa; Potvin, Olivier; Belleville, Sylvie; Ferland, Guylaine; Renaud, Mélanie; Bherer, Louis; Joubert, Sven; Vallet, Guillaume T; Simard, Martine; Rouleau, Isabelle; Lecomte, Sarah; Macoir, Joël; Hudon, Carol

    2015-01-01

    Performance on verbal memory tests is generally associated with socio-demographic variables such as age, sex, and education level. Performance also varies between different cultural groups. The present study aimed to establish normative data for the Rappel libre/Rappel indicé à 16 items (16-item Free and Cued Recall; RL/RI-16), a French adaptation of the Free and Cued Selective Reminding Test (Buschke, 1984; Grober, Buschke, Crystal, Bang, & Dresner, 1988). The sample consisted of 566 healthy French-speaking older adults (50-88 years old) from the province of Quebec, Canada. Normative data for the RL/RI-16 were derived from 80% of the total sample (normative sample) and cross-validated using the remaining participants (20%; validation sample). The effects of participants' age, sex, and education level were assessed on different indices of memory performance. Results indicated that these variables were independently associated with performance. Normative data are presented as regression equations with standard deviations (symmetric distributions) and percentiles (asymmetric distributions).

  10. Leadership: validation of a self-report scale.

    PubMed

    Dussault, Marc; Frenette, Eric; Fernet, Claude

    2013-04-01

    The aim of this paper was to propose and test the factor structure of a new self-report questionnaire on leadership. A sample of 373 school principals in the Province of Quebec, Canada completed the initial 46-item version of the questionnaire. In order to obtain a questionnaire of minimal length, a four-step procedure was retained. First, items analysis was performed using Classical Test Theory. Second, Rasch analysis was used to identify non-fitting or overlapping items. Third, a confirmatory factor analysis (CFA) using structural equation modelling was performed on the 21 remaining items to verify the factor structure of the scale. Results show that the model with a single third-order dimension (leadership), two second-order dimensions (transactional and transformational leadership), and one first-order dimension (laissez-faire leadership) provides a good fit to the data. Finally, invariance of factor structure was assessed with a second sample of 222 vice-principals in the Province of Quebec, Canada. This model is in agreement with the theoretical model developed by Bass (1985), upon which the questionnaire is based.

  11. Australian Defence Force Requirements for a Group-feeding Ration Pack

    DTIC Science & Technology

    2010-04-01

    items were instant noodles and pasta (20% and 18% of respondents, respectively). 3.1.6 Items Commonly Discarded Beef and Pasta, Fruit Pudding...sheet). Although not usually required, there is provision to supplement the CR5M with a cereal adjunct such as bread, rice, pasta or noodles [6]. It is...of all the drink items. The Chocolate Drink Powder had the highest acceptability; its consumption was second to that of the Instant Coffee. The

  12. Competitive foods available in Pennsylvania public high schools.

    PubMed

    Probart, Claudia; McDonnell, Elaine; Weirich, J Elaine; Hartman, Terryl; Bailey-Davis, Lisa; Prabhakher, Vaheedha

    2005-08-01

    This study examined the types and extent of competitive foods available in public high schools in Pennsylvania. We developed, pilot tested, and distributed surveys to school foodservice directors in a random sample of 271 high schools in Pennsylvania. Two hundred twenty-eight surveys were returned, for a response rate of 84%. Statistical analyses were performed: Descriptive statistics were used to examine the extent of competitive food sales in Pennsylvania public high schools. The survey data were analyzed using SPSS software version 11.5.1 (2002, SPSS base 11.0 for Windows, SPSS Inc, Chicago, IL). A la carte sales provide almost $700/day to school foodservice programs, almost 85% of which receive no financial support from their school districts. The top-selling a la carte items are "hamburgers, pizza, and sandwiches." Ninety-four percent of respondents indicated that vending machines are accessible to students. The item most commonly offered in vending machines is bottled water (71.5%). While food items are less often available through school stores and club fund-raisers, candy is the item most commonly offered through these sources. Competitive foods are widely available in high schools. Although many of the items available are low in nutritional value, we found several of the top-selling a la carte options to be nutritious and bottled water the item most often identified as available through vending machines.

  13. ITEM ANALYSIS OF THREE SPANISH NAMING TESTS: A CROSS-CULTURAL INVESTIGATION

    PubMed Central

    de la Plata, Carlos Marquez; Arango-Lasprilla, Juan Carlos; Alegret, Montse; Moreno, Alexander; Tárraga, Luis; Lara, Mar; Hewlitt, Margaret; Hynan, Linda; Cullum, C. Munro

    2009-01-01

    Neuropsychological evaluations conducted in the United States and abroad commonly include the use of tests translated from English to Spanish. The use of translated naming tests for evaluating predominately Spanish-speakers has recently been challenged on the grounds that translating test items may compromise a test’s construct validity. The Texas Spanish Naming Test (TNT) has been developed in Spanish specifically for use with Spanish-speakers; however, it is unlikely patients from diverse Spanish-speaking geographical regions will perform uniformly on a naming test. The present study evaluated and compared the internal consistency and patterns of item-difficulty and -discrimination for the TNT and two commonly used translated naming tests in three countries (i.e., United States, Colombia, Spain). Two hundred fifty two subjects (126 demented, 116 nondemented) across three countries were administered the TNT, Modified Boston Naming Test-Spanish, and the naming subtest from the CERAD. The TNT demonstrated superior internal consistency to its counterparts, a superior item difficulty pattern than the CERAD naming test, and a superior item discrimination pattern than the MBNT-S across countries. Overall, all three Spanish naming tests differentiated nondemented and moderately demented individuals, but the results suggest the items of the TNT are most appropriate to use with Spanish-speakers. Preliminary normative data for the three tests examined in each country are provided. PMID:19208960

  14. Guide to Mathematics Released Items: Understanding Scoring

    ERIC Educational Resources Information Center

    Partnership for Assessment of Readiness for College and Careers, 2017

    2017-01-01

    The Partnership for Assessment of Readiness for College and Careers (PARCC) mathematics items measure critical thinking, mathematical reasoning, and the ability to apply skills and knowledge to real-world problems. Students are asked to solve problems involving the key knowledge and skills for their grade level as identified by the Common Core…

  15. Item Screening in Graphical Loglinear Rasch Models

    ERIC Educational Resources Information Center

    Kreiner, Svend; Christensen, Karl Bang

    2011-01-01

    In behavioural sciences, local dependence and DIF are common, and purification procedures that eliminate items with these weaknesses often result in short scales with poor reliability. Graphical loglinear Rasch models (Kreiner & Christensen, in "Statistical Methods for Quality of Life Studies," ed. by M. Mesbah, F.C. Cole & M.T.…

  16. Model Selection Indices for Polytomous Items

    ERIC Educational Resources Information Center

    Kang, Taehoon; Cohen, Allan S.; Sung, Hyun-Jung

    2009-01-01

    This study examines the utility of four indices for use in model selection with nested and nonnested polytomous item response theory (IRT) models: a cross-validation index and three information-based indices. Four commonly used polytomous IRT models are considered: the graded response model, the generalized partial credit model, the partial credit…

  17. Recent advances in analysis of differential item functioning in health research using the Rasch model.

    PubMed

    Hagquist, Curt; Andrich, David

    2017-09-19

    Rasch analysis with a focus on Differential Item Functioning (DIF) is increasingly used for examination of psychometric properties of health outcome measures. To take account of DIF in order to retain precision of measurement, splitting DIF items into separate sample-specific items has become a frequently used technique. The purpose of the paper is to present and summarise recent advances in the analysis of DIF in a unified methodology. In particular, the paper focuses on the use of analysis of variance (ANOVA) as a method to simultaneously detect uniform and non-uniform DIF, the need to distinguish between real and artificial DIF and the trade-off between reliability and validity. An illustrative example from health research is used to demonstrate how DIF, in this case between genders, can be identified, quantified and under specific circumstances accounted for using the Rasch model. Rasch analyses of DIF were conducted on a composite measure of psychosomatic problems using Swedish data from the Health Behaviour in School-aged Children study for grade 9 students collected during the period 1985-2014. The procedures demonstrate how DIF can be identified efficiently by ANOVA of residuals, and how the magnitude of DIF can be quantified and potentially accounted for by resolving items according to identifiable groups and using principles of test equating on the resolved items. The results of the analysis also show that the real DIF in some items does affect person measurement estimates. Firstly, in order to distinguish between real and artificial DIF, the items showing DIF initially should not be resolved simultaneously but sequentially. Secondly, while resolving instead of deleting a DIF item may retain reliability, both options may affect the content validity negatively. Resolving items with DIF is not justified if the source of the DIF is relevant for the content of the variable; then resolving DIF may deteriorate the validity of the instrument. Generally, decisions on resolving items to deal with DIF should also rely on external information.

  18. Single-Item Screening for Agoraphobic Symptoms: Validation of a Web-Based Audiovisual Screening Instrument

    PubMed Central

    van Ballegooijen, Wouter; Riper, Heleen; Donker, Tara; Martin Abello, Katherina; Marks, Isaac; Cuijpers, Pim

    2012-01-01

    The advent of web-based treatments for anxiety disorders creates a need for quick and valid online screening instruments, suitable for a range of social groups. This study validates a single-item multimedia screening instrument for agoraphobia, part of the Visual Screener for Common Mental Disorders (VS-CMD), and compares it with the text-based agoraphobia items of the PDSS-SR. The study concerned 85 subjects in an RCT of the effects of web-based therapy for panic symptoms. The VS-CMD item and items 4 and 5 of the PDSS-SR were validated by comparing scores to the outcomes of the CIDI diagnostic interview. Screening for agoraphobia was found moderately valid for both the multimedia item (sensitivity .81, specificity .66, AUC .734) and the text-based items (AUC .607–.697). Single-item multimedia screening for anxiety disorders should be further developed and tested in the general population and in patient, illiterate and immigrant samples. PMID:22844391
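
    For readers unfamiliar with the validity indices quoted above, the following minimal sketch shows how sensitivity and specificity are computed when a screening result is compared with a diagnostic reference. The counts are hypothetical and chosen for round numbers; they are not the study's data, and the function name is an assumption.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) from the four cells of a 2x2 table.

    tp: screen positive, reference positive   fn: screen negative, reference positive
    tn: screen negative, reference negative   fp: screen positive, reference negative
    """
    sensitivity = tp / (tp + fn)  # share of true cases the screen detects
    specificity = tn / (tn + fp)  # share of non-cases the screen clears
    return sensitivity, specificity


# Hypothetical counts, not taken from the study.
sens, spec = sensitivity_specificity(tp=40, fn=10, tn=30, fp=20)
print(f"sensitivity {sens:.2f}, specificity {spec:.2f}")
```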

  19. Cross-sectional time trends in psychological and somatic health complaints among adolescents: a structural equation modelling analysis of 'Health Behaviour in School-aged Children' data from Switzerland.

    PubMed

    Dey, Michelle; Jorm, Anthony F; Mackinnon, Andrew J

    2015-08-01

    This study examined cross-sectional time trends in health complaints among adolescents living in Switzerland, including differences between population subgroups and sources of differential response to items. Swiss data were analysed from the Health Behaviour in School-aged Children (HBSC; including 11-15 years old) from 1994 (n = 7008), 1998 (n = 8296), 2002 (n = 9066) and 2006 (n = 9255). Structural equation modelling was used to assess (1) the structure of the HBSC Symptom Checklist (HBSC-SCL; questionnaire, which asks about the frequency of eight health complaints) and (2) associations between the HBSC-SCL with year of data collection and demographic characteristics of the participants. Two correlated factors fitted the data better than a single factor. The psychological factor included the items 'feeling low,' 'irritability and bad temper,' 'nervousness' and 'difficulties in getting to sleep,' and the somatic factor the items 'headache', 'backache', 'stomach ache' and 'dizziness'. Relative to 1994, lower levels of psychological health complaints were experienced in 1998, 2002 and 2006. However, the changes were only minor. In contrast, somatic health complaints increased monotonically over the years of the survey. Experiencing psychological and somatic health complaints was more pronounced with age among females relative to males and was associated with living in particular language regions of Switzerland. Different cross-sectional time trends were identified for the psychological and somatic latent variables, indicating that both factors should be investigated when studying period effects.

  20. Why do participants initiate free recall of short lists of words with the first list item? Toward a general episodic memory explanation.

    PubMed

    Spurgeon, Jessica; Ward, Geoff; Matthews, William J

    2014-11-01

    Participants who are presented with a short list of words for immediate free recall (IFR) show a strong tendency to initiate their recall with the 1st list item and then proceed in forward serial order. We report 2 experiments that examined whether this tendency was underpinned by a short-term memory store, of the type that is argued by some to underpin recency effects in IFR. In Experiment 1, we presented 3 groups of participants with lists of between 2 and 12 words for IFR, delayed free recall, and continuous-distractor free recall. The to-be-remembered words were simultaneously spoken and presented visually, and the distractor task involved silently solving a series of self-paced, visually presented mathematical equations (e.g., 3 + 2 + 4 = ?). The tendency to initiate recall at the start of short lists was greatest in IFR but was also present in the 2 other recall conditions. This finding was replicated in Experiment 2, where the to-be-remembered items were presented visually in silence and the participants spoke aloud their answers to computer-paced mathematical equations. Our results necessitate that a short-term buffer cannot be fully responsible for the tendency to initiate recall from the beginning of a short list; rather, they suggest that the tendency represents a general property of episodic memory that occurs across a range of time scales.

  1. Screening for elevated levels of fear-avoidance beliefs regarding work or physical activities in people receiving outpatient therapy.

    PubMed

    Hart, Dennis L; Werneke, Mark W; George, Steven Z; Matheson, James W; Wang, Ying-Chih; Cook, Karon F; Mioduski, Jerome E; Choi, Seung W

    2009-08-01

    Screening people for elevated levels of fear-avoidance beliefs is uncommon, but elevated levels of fear could worsen outcomes. Developing short screening tools might reduce the data collection burden and facilitate screening, which could prompt further testing or management strategy modifications to improve outcomes. The purpose of this study was to develop efficient yet accurate screening methods for identifying elevated levels of fear-avoidance beliefs regarding work or physical activities in people receiving outpatient rehabilitation. A secondary analysis of data collected prospectively from people with a variety of common neuromusculoskeletal diagnoses was conducted. Intake Fear-Avoidance Beliefs Questionnaire (FABQ) data were collected from 17,804 people who had common neuromusculoskeletal conditions and were receiving outpatient rehabilitation in 121 clinics in 26 states (in the United States). Item response theory (IRT) methods were used to analyze the FABQ data, with particular emphasis on differential item functioning among clinically logical groups of subjects, and to identify screening items. The accuracy of screening items for identifying subjects with elevated levels of fear was assessed with receiver operating characteristic analyses. Three items for fear of physical activities and 10 items for fear of work activities represented unidimensional scales with adequate IRT model fit. Differential item functioning was negligible for variables known to affect functional status outcomes: sex, age, symptom acuity, surgical history, pain intensity, condition severity, and impairment. Items that provided maximum information at the median for the FABQ scales were selected as screening items to dichotomize subjects by high versus low levels of fear. The accuracy of the screening items was supported for both scales. This study represents a retrospective analysis, which should be replicated using prospective designs. Future prospective studies should assess the reliability and validity of using one FABQ item to screen people for high levels of fear-avoidance beliefs. The lack of differential item functioning in the FABQ scales in the sample tested in this study suggested that FABQ screening could be useful in routine clinical practice and allowed the development of single-item screening for fear-avoidance beliefs that accurately identified subjects with elevated levels of fear. Because screening was accurate and efficient, single IRT-based FABQ screening items are recommended to facilitate improved evaluation and care of heterogeneous populations of people receiving outpatient rehabilitation.

  2. Raters Interpret Positively and Negatively Worded Items Similarly in a Quality of Life Instrument for Children

    PubMed Central

    Lin, Chung-Ying; Strong, Carol; Tsai, Meng-Che; Lee, Chih-Ting

    2017-01-01

    Measurement invariance is an important assumption to meaningfully compare children’s quality of life (QoL) between different raters (eg, children and parents) and across genders. Moreover, QoL instruments may combine using negatively and positively worded items—a common method to reduce response bias. However, the wording effects may have different levels of impact on different raters and genders. Our aim was to investigate the measurement invariance of Kid-KINDL, a commonly used QoL instrument, across genders and raters and to consider the wording effects simultaneously. Third to sixth graders (208 boys and 235 girls) completed the self-rated Kid-KINDL, and 1 parent each of 241 children completed the parent-rated Kid-KINDL. The wording effects were accounted for by correlated traits-uncorrelated methods model. The measurement invariance was examined using multigroup confirmatory factor analysis. Item loadings and item intercepts were invariant across gender and rater when we simultaneously accounted for the wording effects of Kid-KINDL. Our results suggest that Kid-KINDL could be used to compare QoL across gender and that parent-rated Kid-KINDL could be used to measure children’s QoL. Specifically, the invariant factor loadings across child-rated and parent-rated Kid-KINDL suggest that the score weights in each item were the same for both children and parents (ie, the important items identified by the children are the same items identified by the parents). The invariant item intercepts suggest that both children and parents share the same threshold for each item. Based on the results, we tentatively recommend that each score of a parent-rated Kid-KINDL can stand for each child’s QoL. PMID:28292193

  3. A cross "ethnical" comparison of the Driver Behaviour Questionnaire (DBQ) in an economically fast developing country.

    PubMed

    Bener, Abdulbari; Verjee, Mohamud; Dafeeah, Elnour E; Yousafzai, Mohammad T; Mari, Sundus; Hassib, Ahmed; Al-Khatib, Hamza; Choi, Min Kyung; Nema, Noor; Ozkan, Türker; Lajunen, Timo

    2013-05-12

    The aim of this study was to compare the driving behaviours of four ethnic groups and to investigate the relationship between violations, errors and lapses of DBQ and accident involvement in Qatar. The Driver Behaviour Questionnaire (DBQ) was used to measure the aberrant driving behaviours leading to accidents. Of 2400 drivers approached, 1824 drivers agreed to participate (76%) and completed the driver behaviour questionnaire and background information. The study revealed that the majority of the Qatari (35.9%) and Jordanian drivers (37.5%) were below 30 years of age, whereas Filipino (42.3%) and Indian subcontinent (34.1%) drivers were in the age group of 30-39 years. Qatari drivers (52%) were involved in most accidents, followed by Jordanians (48.3%). The most common type of collision was a head-on collision, which was similar in all four ethnic groups. The Qatari drivers scored higher on almost all items of violations, errors and lapses compared to other ethnic groups, while Filipino drivers were lower on all the items. The most common violation was the same in all four ethnic groups: "Disregard the speed limits on a motorway". The most common error item observed was "Queuing to turn right/left on to a main road". "Forget where you left your car" and "Hit something when reversing" were the two lapses identified in factor analysis. The present study identified that Qatari drivers scored higher on most of the items of violations, errors and lapses of DBQ compared to other countries, whereas Filipino drivers scored lower in DBQ items.

  4. 17 CFR 229.201 - (Item 201) Market price of and dividends on the registrant's common equity and related...

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... dividends on the registrant's common equity and related stockholder matters. 229.201 Section 229.201... the registrant's common equity and related stockholder matters. (a) Market information. (1)(i) Identify the principal United States market or markets in which each class of the registrant's common...

  5. 17 CFR 229.201 - (Item 201) Market price of and dividends on the registrant's common equity and related...

    Code of Federal Regulations, 2011 CFR

    2011-04-01

    ... dividends on the registrant's common equity and related stockholder matters. 229.201 Section 229.201... the registrant's common equity and related stockholder matters. (a) Market information. (1)(i) Identify the principal United States market or markets in which each class of the registrant's common...

  6. Smoothing and Equating Methods Applied to Different Types of Test Score Distributions and Evaluated with Respect to Multiple Equating Criteria. Research Report. ETS RR-11-20

    ERIC Educational Resources Information Center

    Moses, Tim; Liu, Jinghua

    2011-01-01

    In equating research and practice, equating functions that are smooth are typically assumed to be more accurate than equating functions with irregularities. This assumption presumes that population test score distributions are relatively smooth. In this study, two examples were used to reconsider common beliefs about smoothing and equating. The…

  7. Development of an Item Bank for the Assessment of Knowledge on Biology in Argentine University Students.

    PubMed

    Cupani, Marcos; Zamparella, Tatiana Castro; Piumatti, Gisella; Vinculado, Grupo

    The calibration of item banks provides the basis for computerized adaptive testing that ensures high diagnostic precision and minimizes participants' test burden. This study aims to develop a bank of items to measure the level of Knowledge on Biology using the Rasch model. The sample consisted of 1219 participants that studied in different faculties of the National University of Cordoba (mean age = 21.85 years, SD = 4.66; 66.9% are women). The items were organized in different forms and into separate subtests, with some common items across subtests. The students were told they had to answer 60 questions of knowledge on biology. Evaluation of Rasch model fit (Zstd >|2.0|), differential item functioning, dimensionality, local independence, item and person separation (>2.0), and reliability (>.80) resulted in a bank of 180 items with good psychometric properties. The bank provides items with a wide range of content coverage and may serve as a sound basis for computerized adaptive testing applications. The contribution of this work is significant in the field of educational assessment in Argentina.
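
    The role of common items across forms can be sketched as follows. Under the Rasch model, the probability of a correct response depends only on the difference between person ability and item difficulty, so separately calibrated forms differ by a constant shift that the common items can estimate. The mean/mean shift shown here is a standard textbook approach and is an assumption; the abstract does not say which linking procedure the authors used, and all names and difficulty values are hypothetical.

```python
from math import exp
from statistics import mean


def rasch_probability(theta, b):
    """Probability of a correct response for ability theta and item difficulty b."""
    return 1.0 / (1.0 + exp(-(theta - b)))


def linking_shift(common_b_base, common_b_new):
    """Constant that moves new-form difficulties onto the base-form scale."""
    return mean(common_b_base) - mean(common_b_new)


# Hypothetical difficulties of the same three common items in two calibrations.
base_form = [-1.0, 0.0, 1.0]
new_form = [-0.5, 0.5, 1.5]

shift = linking_shift(base_form, new_form)
rescaled = [b + shift for b in new_form]  # new-form items on the base scale
print(shift, rescaled, rasch_probability(0.0, rescaled[1]))
```

    After the shift, the common items have the same mean difficulty in both calibrations, which is what allows items from different forms to be placed in one bank.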

  8. Microplastic Contamination of Wild and Captive Flathead Grey Mullet (Mugil cephalus)

    PubMed Central

    Lui, Ching Yee

    2018-01-01

    A total of 60 flathead grey mullets were examined for microplastic ingestion. Thirty wild mullets were captured from the eastern coast of Hong Kong and 30 captive mullets were obtained from fish farms. Microplastic ingestion was detected in 60% of the wild mullets, with an average of 4.3 plastic items per mullet, while only 16.7% of captive mullets were found to have ingested microplastics, with an average of 0.2 items per mullet. The results suggested that wild mullets have a higher risk of microplastic ingestion than their captive counterparts. The most common plastic items were fibres that were green in colour and small in size (<2 mm). Polypropylene was the most common polymer (42%), followed by polyethylene (25%). In addition, the abundance of microplastics was positively correlated with larger body size among the mullets. PMID:29587444

  9. Microplastic Contamination of Wild and Captive Flathead Grey Mullet (Mugil cephalus).

    PubMed

    Cheung, Lewis T O; Lui, Ching Yee; Fok, Lincoln

    2018-03-26

    A total of 60 flathead grey mullets were examined for microplastic ingestion. Thirty wild mullets were captured from the eastern coast of Hong Kong and 30 captive mullets were obtained from fish farms. Microplastic ingestion was detected in 60% of the wild mullets, with an average of 4.3 plastic items per mullet, while only 16.7% of captive mullets were found to have ingested microplastics, with an average of 0.2 items per mullet. The results suggested that wild mullets have a higher risk of microplastic ingestion than their captive counterparts. The most common plastic items were fibres that were green in colour and small in size (<2 mm). Polypropylene was the most common polymer (42%), followed by polyethylene (25%). In addition, the abundance of microplastics was positively correlated with larger body size among the mullets.

  10. Re-Examining the Relationship between Need for Cognition and Creativity: Predicting Creative Problem Solving across Multiple Domains

    ERIC Educational Resources Information Center

    Watts, Logan L.; Steele, Logan M.; Song, Hairong

    2017-01-01

    Prior studies have demonstrated inconsistent findings with regard to the relationship between need for cognition and creativity. In our study, measurement issues were explored as a potential source of these inconsistencies. Structural equation modeling techniques were used to examine the factor structure underlying the 18-item need for cognition…

  11. Data Collection Design for Equivalent Groups Equating: Using a Matrix Stratification Framework for Mixed-Format Assessment

    ERIC Educational Resources Information Center

    Mbella, Kinge Keka

    2012-01-01

    Mixed-format assessments are increasingly being used in large scale standardized assessments to measure a continuum of skills ranging from basic recall to higher order thinking skills. These assessments are usually comprised of a combination of (a) multiple-choice items which can be efficiently scored, have stable psychometric properties, and…

  12. Factors Affecting Perceived Learning, Satisfaction, and Quality in the Online MBA: A Structural Equation Modeling Approach

    ERIC Educational Resources Information Center

    Sebastianelli, Rose; Swift, Caroline; Tamimi, Nabil

    2015-01-01

    The authors examined how six factors related to content and interaction affect students' perceptions of learning, satisfaction, and quality in online master of business administration (MBA) courses. They developed three scale items to measure each factor. Using survey data from MBA students at a private university, the authors estimated structural…

  13. To Parcel or Not To Parcel: Exploring the Question, Weighing the Merits.

    ERIC Educational Resources Information Center

    Little, Todd D.; Cunningham, William A.; Shahar, Golan; Widaman, Keith F.

    2002-01-01

    Studied the evidence for the practice of using parcels of items as manifest variables in structural equation modeling procedures. Findings suggest that the unconsidered use of parcels is never warranted, but the considered use of parcels cannot be dismissed out of hand. Describes a number of parceling techniques and their strengths and weaknesses.…

  14. Construct Validity of the Multidimensional Structure of Bullying and Victimization: An Application of Exploratory Structural Equation Modeling

    ERIC Educational Resources Information Center

    Marsh, Herbert W.; Nagengast, Benjamin; Morin, Alexandre J. S.; Parada, Roberto H.; Craven, Rhonda G.; Hamilton, Linda R.

    2011-01-01

    Existing research posits multiple dimensions of bullying and victimization but has not identified well-differentiated facets of these constructs that meet standards of good measurement: goodness of fit, measurement invariance, lack of differential item functioning, and well-differentiated factors that are not so highly correlated as to detract…

  15. The 1980-81 AFOSR-HTTM-Stanford Conference on Complex Turbulent Flows: Comparison of Computation & Experiment.

    DTIC Science & Technology

    1982-02-01

    1968, 1969 and 1972 Conferences. Certain items at the list delineate problems needing research (reattachment zones, *iv/boundary layer interactions...viscous energy equation--each in unaveraged form. As Peter Bradshaw has put it, God gave us one good model. Why should there be another model that is

  16. Comprehensiveness of care from the patient perspective: comparison of primary healthcare evaluation instruments.

    PubMed

    Haggerty, Jeannie L; Beaulieu, Marie-Dominique; Pineault, Raynald; Burge, Frederick; Lévesque, Jean-Frédéric; Santor, Darcy A; Bouharaoui, Fatima; Beaulieu, Christine

    2011-12-01

    Comprehensiveness relates both to scope of services offered and to a whole-person clinical approach. Comprehensive services are defined as "the provision, either directly or indirectly, of a full range of services to meet most patients' healthcare needs"; whole-person care is "the extent to which a provider elicits and considers the physical, emotional and social aspects of a patient's health and considers the community context in their care." Among instruments that evaluate primary healthcare, two had subscales that mapped to comprehensive services and to the community component of whole-person care: the Primary Care Assessment Tool - Short Form (PCAT-S) and the Components of Primary Care Index (CPCI, a limited measure of whole-person care). To examine how well comprehensiveness is captured in validated instruments that evaluate primary healthcare from the patient's perspective. 645 adults with at least one healthcare contact in the previous 12 months responded to six instruments that evaluate primary healthcare. Scores were normalized for descriptive comparison. Exploratory and confirmatory (structural equation modelling) factor analysis examined fit to operational definition, and item response theory analysis examined item performance on common constructs. Over one-quarter of respondents had missing responses on services offered or doctor's knowledge of the community. The subscales did not load on a single factor; comprehensive services and community orientation were examined separately. The community orientation subscales did not perform satisfactorily. The three comprehensive services subscales fit very modestly onto two factors: (1) most healthcare needs (from one provider) (CPCI Comprehensive Care, PCAT-S First-Contact Utilization) and (2) range of services (PCAT-S Comprehensive Services Available). Individual item performance revealed several problems. 
    Measurement of comprehensiveness is problematic, making this attribute a priority for measure development. Range of services offered is best obtained from providers. Whole-person care is not addressed as a separate construct, but some dimensions are covered by attributes such as interpersonal communication and relational continuity.

  17. An Illustration of the Exploratory Structural Equation Modeling (ESEM) Framework on the Passion Scale

    PubMed Central

    Tóth-Király, István; Bõthe, Beáta; Rigó, Adrien; Orosz, Gábor

    2017-01-01

    While exploratory factor analysis (EFA) provides a more realistic presentation of the data with the allowance of item cross-loadings, confirmatory factor analysis (CFA) includes many methodological advances that the former does not. To create a synergy of the two, exploratory structural equation modeling (ESEM) was proposed as an alternative solution, incorporating the advantages of EFA and CFA. The present investigation is thus an illustrative demonstration of the applicability and flexibility of ESEM. To achieve this goal, we compared CFA and ESEM models, then thoroughly tested measurement invariance and differential item functioning through multiple-indicators-multiple-causes (MIMIC) models on the Passion Scale, the only measure of the Dualistic Model of Passion (DMP) which differentiates between harmonious and obsessive forms of passion. Moreover, a hybrid model was also created to overcome the drawbacks of the two methods. Analyses of the first large community sample (N = 7,466; 67.7% females; Mage = 26.01) revealed the superiority of the ESEM model relative to CFA in terms of improved goodness-of-fit and less correlated factors, while at the same time retaining the high definition of the factors. However, this fit was only achieved with the inclusion of three correlated uniquenesses, two of which appeared in previous studies and one of which was specific to the current investigation. These findings were replicated on a second, comprehensive sample (N = 504; 51.8% females; Mage = 39.59). After combining the two samples, complete measurement invariance (factor loadings, item intercepts, item uniquenesses, factor variances-covariances, and latent means) was achieved across gender and partial invariance across age groups and their combination. Only one item intercept was non-invariant across both multigroup and MIMIC approaches, an observation that was further corroborated by the hybrid model. 
    While obsessive passion showed a slight decline in the hybrid model, harmonious passion did not. Overall, the ESEM framework is a viable alternative of CFA that could be used and even extended to address substantially important questions and researchers should systematically compare these two approaches to identify the most suitable one. PMID:29163325

  18. An Illustration of the Exploratory Structural Equation Modeling (ESEM) Framework on the Passion Scale.

    PubMed

    Tóth-Király, István; Bõthe, Beáta; Rigó, Adrien; Orosz, Gábor

    2017-01-01

    While exploratory factor analysis (EFA) provides a more realistic presentation of the data with the allowance of item cross-loadings, confirmatory factor analysis (CFA) includes many methodological advances that the former does not. To create a synergy of the two, exploratory structural equation modeling (ESEM) was proposed as an alternative solution, incorporating the advantages of EFA and CFA. The present investigation is thus an illustrative demonstration of the applicability and flexibility of ESEM. To achieve this goal, we compared CFA and ESEM models, then thoroughly tested measurement invariance and differential item functioning through multiple-indicators-multiple-causes (MIMIC) models on the Passion Scale, the only measure of the Dualistic Model of Passion (DMP) which differentiates between harmonious and obsessive forms of passion. Moreover, a hybrid model was also created to overcome the drawbacks of the two methods. Analyses of the first large community sample (N = 7,466; 67.7% females; Mage = 26.01) revealed the superiority of the ESEM model relative to CFA in terms of improved goodness-of-fit and less correlated factors, while at the same time retaining the high definition of the factors. However, this fit was only achieved with the inclusion of three correlated uniquenesses, two of which appeared in previous studies and one of which was specific to the current investigation. These findings were replicated on a second, comprehensive sample (N = 504; 51.8% females; Mage = 39.59). After combining the two samples, complete measurement invariance (factor loadings, item intercepts, item uniquenesses, factor variances-covariances, and latent means) was achieved across gender and partial invariance across age groups and their combination. Only one item intercept was non-invariant across both multigroup and MIMIC approaches, an observation that was further corroborated by the hybrid model.
    While obsessive passion showed a slight decline in the hybrid model, harmonious passion did not. Overall, the ESEM framework is a viable alternative of CFA that could be used and even extended to address substantially important questions and researchers should systematically compare these two approaches to identify the most suitable one.

  19. Item response theory and the measurement of motor behavior.

    PubMed

    Safrit, M J; Cohen, A S; Costa, M G

    1989-12-01

    Item response theory (IRT) has been the focus of intense research and development activity in educational and psychological measurement during the past decade. Because this theory can provide more precise information about test items than other theories usually used in measuring motor behavior, the application of IRT in physical education and exercise science merits investigation. In IRT, the difficulty level of each item (e.g., trial or task) can be estimated and placed on the same scale as the ability of the examinee. Using this information, the test developer can determine the ability levels at which the test functions best. Equating the scores of individuals on two or more items or tests can be handled efficiently by applying IRT. The precision of the identification of performance standards in a mastery test context can be enhanced, as can adaptive testing procedures. In this tutorial, several potential benefits of applying IRT to the measurement of motor behavior were described. An example is provided using bowling data and applying the graded-response form of the Rasch IRT model. The data were calibrated and the goodness of fit was examined. This analysis is described in a step-by-step approach. Limitations to using an IRT model with a test consisting of repeated measures were noted.

  20. Non-Volatile Residue (NVR) Contamination from Dry Handling and Solvent Cleaning

    NASA Technical Reports Server (NTRS)

    Sovinski, Marjorie F.

    2009-01-01

    This slide presentation reviews the testing for Non-Volatile Residue contamination transferred to surfaces from handling and solvent cleaning. Included in the presentation are a list of the items tested and formal work instructions dealing with NVR. There is an explanation of the Gravimetric determination method used to test the NVR in a variety of items, i.e., Gloves, Swabs, Garments, Bagging material, film and Wipes. Another method to test for contamination from NVR is the contact transfer method. The use of this method for testing gloves, garments, bagging material and film is explained. Certain equations used in NVR analysis and the use of a database for testing of NVR in consumables are reviewed.

  1. Testing to the Top: Everything But the Kitchen Sink?

    ERIC Educational Resources Information Center

    Dietel, Ron

    2011-01-01

    Two tests intended to measure student achievement of the Common Core State Standards will face intense scrutiny, but the test makers say they will include performance assessments and other items that are not multiple-choice questions. Incorporating performance items on these tests will bring up issues over scoring, costs, and validity.

  2. A Comparison of Three Test Formats to Assess Word Difficulty

    ERIC Educational Resources Information Center

    Culligan, Brent

    2015-01-01

    This study compared three common vocabulary test formats, the Yes/No test, the Vocabulary Knowledge Scale (VKS), and the Vocabulary Levels Test (VLT), as measures of vocabulary difficulty. Vocabulary difficulty was defined as the item difficulty estimated through Item Response Theory (IRT) analysis. Three tests were given to 165 Japanese students,…

  3. Probing University Students' Pre-Knowledge in Quantum Physics with QPCS Survey

    ERIC Educational Resources Information Center

    Asikainen, Mervi A.

    2017-01-01

    The study investigated the use of the Quantum Physics Conceptual Survey (QPCS) in probing student understanding of quantum physics. Altogether 103 Finnish university students responded to QPCS. The mean scores of the student responses were calculated and the test was evaluated using five common indices: Item difficulty index, Item discrimination…

  4. Exploring Alternative Characteristic Curve Approaches to Linking Parameter Estimates from the Generalized Partial Credit Model.

    ERIC Educational Resources Information Center

    Roberts, James S.; Bao, Han; Huang, Chun-Wei; Gagne, Phill

    Characteristic curve approaches for linking parameters from the generalized partial credit model were examined for cases in which common (anchor) items are calibrated separately in two groups. Three of these approaches are simple extensions of the test characteristic curve (TCC), item characteristic curve (ICC), and operating characteristic curve…

  5. Acquiescent Responding in Balanced Multidimensional Scales and Exploratory Factor Analysis

    ERIC Educational Resources Information Center

    Lorenzo-Seva, Urbano; Rodriguez-Fornells, Antoni

    2006-01-01

    Personality tests often consist of a set of dichotomous or Likert items. These response formats are known to be susceptible to an agreeing-response bias called acquiescence. The common assumption in balanced scales is that the sum of appropriately reversed responses should be reasonably free of acquiescence. However, inter-item correlation (or…

  6. Population Invariance of Vertical Scaling Results

    ERIC Educational Resources Information Center

    Powers, Sonya; Turhan, Ahmet; Binici, Salih

    2012-01-01

    The population sensitivity of vertical scaling results was evaluated for a state reading assessment spanning grades 3-10 and a state mathematics test spanning grades 3-8. Subpopulations considered included males and females. The 3-parameter logistic model was used to calibrate math and reading items and a common item design was used to construct…

  7. A Substantive Process Analysis of Responses to Items from the Multistate Bar Examination

    ERIC Educational Resources Information Center

    Bonner, Sarah M.; D'Agostino, Jerome V.

    2012-01-01

    We investigated examinees' cognitive processes while they solved selected items from the Multistate Bar Exam (MBE), a high-stakes professional certification examination. We focused on ascertaining those mental processes most frequently used by examinees, and the most common types of errors in their thinking. We compared the relationships between…

  8. The Sequential Probability Ratio Test and Binary Item Response Models

    ERIC Educational Resources Information Center

    Nydick, Steven W.

    2014-01-01

    The sequential probability ratio test (SPRT) is a common method for terminating item response theory (IRT)-based adaptive classification tests. To decide whether a classification test should stop, the SPRT compares a simple log-likelihood ratio, based on the classification bound separating two categories, to prespecified critical values. As has…
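
    The stopping rule described above can be sketched as follows. This is a minimal illustration, not the procedure from the article: it assumes a 2PL response model, an indifference region of half-width delta around the classification bound, and Wald's usual critical values; the function names and the (a, b) item format are hypothetical.

```python
import math

def p_correct(theta, a, b):
    """2PL probability of a correct response (a = 1 gives the Rasch model)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def sprt_decision(responses, items, theta_cut, delta=0.5, alpha=0.05, beta=0.05):
    """Return 'above', 'below', or 'continue' given the responses so far.

    responses: list of 0/1 item scores.
    items: list of (a, b) parameter tuples, one per response (assumed format).
    """
    theta_lo, theta_hi = theta_cut - delta, theta_cut + delta
    log_lr = 0.0
    for u, (a, b) in zip(responses, items):
        p_hi = p_correct(theta_hi, a, b)
        p_lo = p_correct(theta_lo, a, b)
        # Log-likelihood ratio contribution of one scored response.
        log_lr += u * math.log(p_hi / p_lo) + (1 - u) * math.log((1 - p_hi) / (1 - p_lo))
    upper = math.log((1 - beta) / alpha)   # classify above the bound
    lower = math.log(beta / (1 - alpha))   # classify below the bound
    if log_lr >= upper:
        return "above"
    if log_lr <= lower:
        return "below"
    return "continue"
```

    For example, with twenty hypothetical items of a = 1 and b = 0 and a bound at 0, an all-correct response string crosses the upper critical value, while a single correct and a single incorrect response leave the test running.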

  9. Differing Levels of Superstitious Beliefs among Three Groups: Psychiatric Inpatients, Churchgoers, and Students.

    ERIC Educational Resources Information Center

    Robinson, Sheryl L.

    This study investigated the level of superstitious belief among 175 persons in three categories: persons undergoing inpatient psychiatric treatment, churchgoers, and college students. A 50-item inventory consisting of positive and negative common superstitions, including a 5-item invalidity subscale, was administered. Using a 2 (male, female) x 3…

  10. 78 FR 22282 - Notice of Intent To Repatriate a Cultural Item: U.S. Department of the Interior, National Park...

    Federal Register 2010, 2011, 2012, 2013, 2014

    2013-04-15

    ... item described above. The medicine bundle is needed by Mr. Whitedirt to continue traditional ceremonies... of the Northern Cheyenne Tribe. The sacred object is a medicine bundle containing multiple objects... the Northern Cheyenne traditional kinship system and common law system of descendance. Determinations...

  11. Strategy Execution in Cognitive Skill Learning: An Item-Level Test of Candidate Models

    ERIC Educational Resources Information Center

    Rickard, Timothy C.

    2004-01-01

    This article investigates the transition to memory-based performance that commonly occurs with practice on tasks that initially require use of a multistep algorithm. In an alphabet arithmetic task, item response times exhibited pronounced step-function decreases after moderate practice that were uniquely predicted by T. C. Rickard's (1997)…

  12. 26 CFR 1.732-2 - Special partnership basis of distributed property.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ...). The basis of the unrealized receivables in C's hands would be $100 (zero plus $100, one-half of C's...,300 for the inventory items ($500 plus $800) and $200 for the unrealized receivables (zero plus $200... basis adjustments, plus $500 common partnership basis, the amount allocated to inventory items and...

  13. DETERMINATION OF A STANDARD FOOD ITEM FOR ANALYSIS OF PESTICIDE CONSUMPTION IN THE DIETARY INTAKE OF YOUNG CHILDREN

    EPA Science Inventory

    The objective of this study was to establish a standard food item for the collection of residential use pesticides from household surfaces commonly encountered by young children while eating. The amount of a pesticide that young children ingest during eating is influenced by the ...

  14. Science Shorts: Sort It out

    ERIC Educational Resources Information Center

    Adams, Barbara

    2007-01-01

    Many children enjoy collecting items such as seashells, state quarters, and trading cards. Asking students to think about the ways in which similar items differ, how objects can be grouped by a common characteristic, and how groups can be subsets of a larger category leads to an understanding of fundamental mathematics and science concepts: sets,…

  15. Symptom Frequency Characteristics of the Hamilton Depression Rating Scale of Major Depressive Disorder in Epilepsy.

    PubMed

    Wiglusz, Mariusz S; Landowski, Jerzy; Michalak, Lidia; Cubała, Wiesław J

    2015-09-01

    Depressive disorders are common among patients with epilepsy (PWE). The aim of this study was to explore symptom frequencies of 17-item Hamilton Depression Rating Scale (HDRS-17) and recognize the clinical characteristics of Major Depressive Disorder in PWE. A sample of 40 adult outpatients with epilepsy and depression was diagnosed using SCID-I for DSM-IV-TR and HDRS-17. The total HDRS-17 score was analysed followed by the exploratory analysis based on the hierarchical model. The frequencies of HDRS-17 items varied widely in this study. Insomnia related items and general somatic symptoms items as well as insomnia and somatic factors exhibited constant and higher frequency. Feeling guilty, suicide, psychomotor retardation and depressed mood showed relatively lower frequencies. Other symptoms had variable frequencies across the study population. Depressive disorders are common among PWE. In the study group insomnia and somatic symptoms displayed the highest values, which could represent atypical clinical features of mood disorders in PWE. There is a need for more studies using a standardized approach to the problem.

  16. Interference and memory capacity limitations.

    PubMed

    Endress, Ansgar D; Szabó, Szilárd

    2017-10-01

    Working memory (WM) is thought to have a fixed and limited capacity. However, the origins of these capacity limitations are debated, and generally attributed to active, attentional processes. Here, we show that the existence of interference among items in memory mathematically guarantees fixed and limited capacity limits under very general conditions, irrespective of any processing assumptions. Assuming that interference (a) increases with the number of interfering items and (b) brings memory performance to chance levels for large numbers of interfering items, capacity limits are a simple function of the relative influence of memorization and interference. In contrast, we show that time-based memory limitations do not lead to fixed memory capacity limitations that are independent of the timing properties of an experiment. We show that interference can mimic both slot-like and continuous resource-like memory limitations, suggesting that these types of memory performance might not be as different as commonly believed. We speculate that slot-like WM limitations might arise from crowding-like phenomena in memory when participants have to retrieve items. Further, based on earlier research on parallel attention and enumeration, we suggest that crowding-like phenomena might be a common reason for the 3 major cognitive capacity limitations. As suggested by Miller (1956) and Cowan (2001), these capacity limitations might arise because of a common reason, even though they likely rely on distinct processes. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  17. Solving Nonlinear Coupled Differential Equations

    NASA Technical Reports Server (NTRS)

    Mitchell, L.; David, J.

    1986-01-01

    Harmonic balance method developed to obtain approximate steady-state solutions for nonlinear coupled ordinary differential equations. Method usable with transfer matrices commonly used to analyze shaft systems. Solution to nonlinear equation, with periodic forcing function represented as sum of series similar to Fourier series but with form of terms suggested by equation itself.

  18. 26 CFR 1.58-5 - Common trust funds.

    Code of Federal Regulations, 2010 CFR

    2010-04-01

    ... 26 Internal Revenue 1 2010-04-01 2010-04-01 true Common trust funds. 1.58-5 Section 1.58-5... Preference Regulations § 1.58-5 Common trust funds. Section 58(e) provides that each participant in a common trust fund (as defined in section 584 and the regulations thereunder) is to treat as items of tax...

  19. The intergenerational transmission of conduct problems.

    PubMed

    Raudino, Alessandra; Fergusson, David M; Woodward, Lianne J; Horwood, L John

    2013-03-01

    Drawing on prospective longitudinal data, this paper examines the intergenerational transmission of childhood conduct problems in a sample of 209 parents and their 331 biological offspring studied as part of the Christchurch Health and Developmental Study. The aims were to estimate the association between parental and offspring conduct problems and to examine the extent to which this association could be explained by (a) confounding social/family factors from the parent's childhood and (b) intervening factors reflecting parental behaviours and family functioning. The same item set was used to assess childhood conduct problems in parents and offspring. Two approaches to data analysis (generalised estimating equation regression methods and latent variable structural equation modelling) were used to examine possible explanations of the intergenerational continuity in behaviour. Regression analysis suggested that there was moderate intergenerational continuity (r = 0.23, p < 0.001) between parental and offspring conduct problems. This continuity was not explained by confounding factors but was partially mediated by parenting behaviours, particularly parental over-reactivity. Latent variable modelling designed to take account of non-observed common genetic and environmental factors underlying the continuities in problem behaviours across generations also suggested that parenting behaviour played a role in mediating the intergenerational transmission of conduct problems. There is clear evidence of intergenerational continuity in conduct problems. In part this association reflects a causal chain process in which parental conduct problems are associated (directly or indirectly) with impaired parenting behaviours that in turn influence risks of conduct problems in offspring.

  20. Study on the initial value for the exterior orientation of the mobile version

    NASA Astrophysics Data System (ADS)

    Yu, Zhi-jing; Li, Shi-liang

    2011-10-01

    A single mobile vision coordinate measurement system uses a single camera body and a notebook computer at the measurement site to obtain three-dimensional coordinates. Obtaining more accurate approximate values of the exterior orientation for the follow-up calculation is very important in the measurement process. The problem is a typical space resection problem, and it has been widely studied. Single-phase space resection mainly takes two approaches: one is based on the co-angular constraint, represented by the camera co-angular constraint pose estimation algorithm and the cone angle law; the other is the direct linear transformation (DLT). One common drawback of both methods is that the CCD lens distortion is not considered. When the initial value is calculated with the direct linear transformation method, the requirements on the distribution and number of control points are relatively high: the control points must not all lie in the same plane, and there must be at least six non-coplanar control points. Its usefulness is therefore limited. The initial value directly influences whether and how quickly the calculation converges. This paper linearizes the nonlinear total linear equations containing distorted items by Taylor series expansion and calculates the initial value of the camera exterior orientation. Finally, the initial value is shown to be better through experiments.

  1. The case for an international patient-reported outcomes measurement information system (PROMIS®) initiative

    PubMed Central

    2013-01-01

    Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network. PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making. The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. 
International item calibration using nationally representative samples of adults and children within countries is essential to demonstrate that all items possess expected strong measurement properties. Finally, it is important to demonstrate that the PROMIS measures are valid, reliable and responsive to change when used in an international context. IRT item banking will allow for tailoring within countries and facilitate growth and evolution of PROs through contributions from the international measurement community. A number of opportunities and challenges of international development of PROs item banks are discussed. PMID:24359143

  2. The case for an international patient-reported outcomes measurement information system (PROMIS®) initiative.

    PubMed

    Alonso, Jordi; Bartlett, Susan J; Rose, Matthias; Aaronson, Neil K; Chaplin, John E; Efficace, Fabio; Leplège, Alain; Lu, Aiping; Tulsky, David S; Raat, Hein; Ravens-Sieberer, Ulrike; Revicki, Dennis; Terwee, Caroline B; Valderas, Jose M; Cella, David; Forrest, Christopher B

    2013-12-20

    Patient-reported outcomes (PROs) play an increasingly important role in clinical practice and research. Modern psychometric methods such as item response theory (IRT) enable the creation of item banks that support fixed-length forms as well as computerized adaptive testing (CAT), often resulting in improved measurement precision and responsiveness. Here we describe and discuss the case for developing an international core set of PROs building from the US PROMIS® network. PROMIS is a U.S.-based cooperative group of research sites and centers of excellence convened to develop and standardize PRO measures across studies and settings. If extended to a global collaboration, PROMIS has the potential to transform PRO measurement by creating a shared, unifying terminology and metric for reporting of common symptoms and functional life domains. Extending a common set of standardized PRO measures to the international community offers great potential for improving patient-centered research, clinical trials reporting, population monitoring, and health care worldwide. Benefits of such standardization include the possibility of: international syntheses (such as meta-analyses) of research findings; international population monitoring and policy development; health services administrators and planners access to relevant information on the populations they serve; better assessment and monitoring of patients by providers; and improved shared decision making. The goal of the current PROMIS International initiative is to ensure that item banks are translated and culturally adapted for use in adults and children in as many countries as possible. The process includes 3 key steps: translation/cultural adaptation, calibration, and validation. A universal translation, an approach focusing on commonalities, rather than differences across versions developed in regions or countries speaking the same language, is proposed to ensure conceptual equivalence for all items. 
International item calibration using nationally representative samples of adults and children within countries is essential to demonstrate that all items possess expected strong measurement properties. Finally, it is important to demonstrate that the PROMIS measures are valid, reliable and responsive to change when used in an international context. IRT item banking will allow for tailoring within countries and facilitate growth and evolution of PROs through contributions from the international measurement community. A number of opportunities and challenges of international development of PROs item banks are discussed.

  3. Clusters of cultures: diversity in meaning of family value and gender role items across Europe.

    PubMed

    van Vlimmeren, Eva; Moors, Guy B D; Gelissen, John P T M

    2017-01-01

    Survey data are often used to map cultural diversity by aggregating scores of attitude and value items across countries. However, this procedure only makes sense if the same concept is measured in all countries. In this study we argue that when (co)variances among sets of items are similar across countries, these countries share a common way of assigning meaning to the items. Clusters of cultures can then be observed by doing a cluster analysis on the (co)variance matrices of sets of related items. This study focuses on family values and gender role attitudes. We find four clusters of cultures that assign a distinct meaning to these items, especially in the case of gender roles. Some of these differences reflect response style behavior in the form of acquiescence. Adjusting for this style effect affects country comparisons, which demonstrates the usefulness of investigating the patterns of meaning given to sets of items before aggregating scores into cultural characteristics.

  4. Measurement equivalence of the KINDL questionnaire across child self-reports and parent proxy-reports: a comparison between item response theory and ordinal logistic regression.

    PubMed

    Jafari, Peyman; Sharafi, Zahra; Bagheri, Zahra; Shalileh, Sara

    2014-06-01

    Measurement equivalence is a necessary assumption for meaningful comparison of pediatric quality of life rated by children and parents. In this study, differential item functioning (DIF) analysis is used to examine whether children and their parents respond consistently to the items in the KINDer Lebensqualitätsfragebogen (KINDL; in German, Children Quality of Life Questionnaire). Two DIF detection methods, graded response model (GRM) and ordinal logistic regression (OLR), were applied for comparability. The KINDL was completed by 1,086 school children and 1,061 of their parents. While the GRM revealed that 12 out of the 24 items were flagged with DIF, the OLR identified 14 out of the 24 items with DIF. Seven items with DIF and five items without DIF were common across the two methods, yielding a total agreement rate of 50%. This study revealed that parent proxy-reports cannot be used as a substitute for a child's ratings in the KINDL.
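
    The agreement figure in this abstract can be reproduced from the reported counts. The following is a minimal Python sketch, not the authors' analysis: the 2x2 layout is derived from the counts given above, and Cohen's kappa is an extra, chance-corrected summary that the abstract does not report. All function names are illustrative.

```python
# Counts taken from the abstract; everything else is derived from them.
TOTAL_ITEMS = 24
GRM_FLAGGED = 12      # items flagged with DIF by the graded response model
OLR_FLAGGED = 14      # items flagged with DIF by ordinal logistic regression
BOTH_FLAGGED = 7      # flagged by both methods
NEITHER_FLAGGED = 5   # flagged by neither method


def agreement_table(total, a_flagged, b_flagged, both, neither):
    """Return (both, only_a, only_b, neither) and check the counts add up."""
    only_a = a_flagged - both
    only_b = b_flagged - both
    if both + only_a + only_b + neither != total:
        raise ValueError("counts are not consistent with the item total")
    return both, only_a, only_b, neither


def observed_agreement(both, only_a, only_b, neither):
    """Share of items on which the two methods give the same verdict."""
    total = both + only_a + only_b + neither
    return (both + neither) / total


def cohens_kappa(both, only_a, only_b, neither):
    """Chance-corrected agreement (not reported in the abstract)."""
    total = both + only_a + only_b + neither
    p_observed = (both + neither) / total
    p_a = (both + only_a) / total
    p_b = (both + only_b) / total
    p_expected = p_a * p_b + (1 - p_a) * (1 - p_b)
    return (p_observed - p_expected) / (1 - p_expected)


table = agreement_table(TOTAL_ITEMS, GRM_FLAGGED, OLR_FLAGGED,
                        BOTH_FLAGGED, NEITHER_FLAGGED)
print(table)                      # (7, 5, 7, 5)
print(observed_agreement(*table)) # 0.5
print(cohens_kappa(*table))       # approximately 0
```

    With these counts the observed agreement is 12 of 24 items, matching the 50% stated above, and the chance-corrected value is close to zero.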

  5. You look familiar, but I don’t care: Lure rejection in hybrid visual and memory search is not based on familiarity

    PubMed Central

    Wolfe, Jeremy M.; Boettcher, Sage E. P.; Josephs, Emilie L.; Cunningham, Corbin A.; Drew, Trafton

    2015-01-01

    In “hybrid” search tasks, observers hold multiple possible targets in memory while searching for those targets amongst distractor items in visual displays. Wolfe (2012) found that, if the target set is held constant over a block of trials, RTs in such tasks were a linear function of the number of items in the visual display and a linear function of the log of the number of items held in memory. However, in such tasks, the targets can become far more familiar than the distractors. Does this “familiarity” – operationalized here as the frequency and recency with which an item has appeared – influence performance in hybrid tasks? In Experiment 1, we compared searches where distractors appeared with the same frequency as the targets to searches where all distractors were novel. Distractor familiarity did not have any reliable effect on search. In Experiment 2, most distractors were novel but some critical distractors were as common as the targets while others were 4× more common. Familiar distractors did not produce false alarm errors, though they did slightly increase response times (RTs). In Experiment 3, observers successfully searched for the new, unfamiliar item among distractors that, in many cases, had been seen only once before. We conclude that when the memory set is held constant for many trials, item familiarity alone does not cause observers to mistakenly confuse target with distractors. PMID:26191615
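
    The RT relationship attributed to Wolfe (2012) above can be illustrated with a small Python sketch. The additive form and every coefficient below are assumptions made for illustration only; the abstract says RTs are linear in visual set size and linear in the log of memory set size, but does not say how the two terms combine or what the slopes are.

```python
import math


def predicted_rt_ms(visual_set_size, memory_set_size,
                    intercept_ms=500.0,      # hypothetical baseline RT
                    visual_slope_ms=40.0,    # hypothetical cost per displayed item
                    memory_slope_ms=100.0):  # hypothetical cost per doubling of memory set
    """Toy RT model: linear in display size, linear in log2 of memory set size."""
    if visual_set_size < 1 or memory_set_size < 1:
        raise ValueError("set sizes must be at least 1")
    return (intercept_ms
            + visual_slope_ms * visual_set_size
            + memory_slope_ms * math.log2(memory_set_size))


# Each doubling of the memory set adds the same amount of time.
for memory_set_size in (1, 2, 4, 8, 16):
    print(memory_set_size, predicted_rt_ms(8, memory_set_size))
```

    Under this sketch, going from 2 to 4 memorized targets costs the same as going from 8 to 16, which is what a logarithmic dependence on memory set size implies.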

  6. Preferred Reporting Items for a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies: The PRISMA-DTA Statement.

    PubMed

    McInnes, Matthew D F; Moher, David; Thombs, Brett D; McGrath, Trevor A; Bossuyt, Patrick M; Clifford, Tammy; Cohen, Jérémie F; Deeks, Jonathan J; Gatsonis, Constantine; Hooft, Lotty; Hunt, Harriet A; Hyde, Christopher J; Korevaar, Daniël A; Leeflang, Mariska M G; Macaskill, Petra; Reitsma, Johannes B; Rodin, Rachel; Rutjes, Anne W S; Salameh, Jean-Paul; Stevens, Adrienne; Takwoingi, Yemisi; Tonelli, Marcello; Weeks, Laura; Whiting, Penny; Willis, Brian H

    2018-01-23

    Systematic reviews of diagnostic test accuracy synthesize data from primary diagnostic studies that have evaluated the accuracy of 1 or more index tests against a reference standard, provide estimates of test performance, allow comparisons of the accuracy of different tests, and facilitate the identification of sources of variability in test accuracy. To develop the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) diagnostic test accuracy guideline as a stand-alone extension of the PRISMA statement. Modifications to the PRISMA statement reflect the specific requirements for reporting of systematic reviews and meta-analyses of diagnostic test accuracy studies and the abstracts for these reviews. Established standards from the Enhancing the Quality and Transparency of Health Research (EQUATOR) Network were followed for the development of the guideline. The original PRISMA statement was used as a framework on which to modify and add items. A group of 24 multidisciplinary experts used a systematic review of articles on existing reporting guidelines and methods, a 3-round Delphi process, a consensus meeting, pilot testing, and iterative refinement to develop the PRISMA diagnostic test accuracy guideline. The final version of the PRISMA diagnostic test accuracy guideline checklist was approved by the group. The systematic review (produced 64 items) and the Delphi process (provided feedback on 7 proposed items; 1 item was later split into 2 items) identified 71 potentially relevant items for consideration. The Delphi process reduced these to 60 items that were discussed at the consensus meeting. Following the meeting, pilot testing and iterative feedback were used to generate the 27-item PRISMA diagnostic test accuracy checklist. To reflect specific or optimal contemporary systematic review methods for diagnostic test accuracy, 8 of the 27 original PRISMA items were left unchanged, 17 were modified, 2 were added, and 2 were omitted. 
The 27-item PRISMA diagnostic test accuracy checklist provides specific guidance for reporting of systematic reviews. The PRISMA diagnostic test accuracy guideline can facilitate the transparent reporting of reviews, and may assist in the evaluation of validity and applicability, enhance replicability of reviews, and make the results from systematic reviews of diagnostic test accuracy studies more useful.

  7. Influence of item distribution pattern and abundance on efficiency of benthic core sampling

    USGS Publications Warehouse

    Behney, Adam C.; O'Shaughnessy, Ryan; Eichholz, Michael W.; Stafford, Joshua D.

    2014-01-01

    Core sampling is a commonly used method to estimate benthic item density, but little information exists about factors influencing the accuracy and time-efficiency of this method. We simulated core sampling in a Geographic Information System framework by generating points (benthic items) and polygons (core samplers) to assess how sample size (number of core samples), core sampler size (cm²), distribution of benthic items, and item density affected the bias and precision of estimates of density, the detection probability of items, and the time-costs. When items were distributed randomly versus clumped, bias decreased and precision increased with increasing sample size and increased slightly with increasing core sampler size. Bias and precision were only affected by benthic item density at very low values (500–1,000 items/m²). Detection probability (the probability of capturing ≥ 1 item in a core sample if it is available for sampling) was substantially greater when items were distributed randomly as opposed to clumped. Taking more small diameter core samples was always more time-efficient than taking fewer large diameter samples. We are unable to present a single, optimal sample size, but provide information for researchers and managers to derive optimal sample sizes dependent on their research goals and environmental conditions.
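
    The contrast between random and clumped items reported above can be illustrated analytically. This Python sketch does not reproduce the authors' GIS simulation; it swaps in two standard count models as an assumption: Poisson counts for randomly distributed items and negative binomial counts (with a hypothetical dispersion parameter k) for clumped items. The densities and core areas used are illustrative.

```python
import math


def expected_items_per_core(density_per_m2, core_area_cm2):
    """Mean number of items under one core (1 m^2 = 10,000 cm^2)."""
    return density_per_m2 * core_area_cm2 / 10_000.0


def detection_prob_random(density_per_m2, core_area_cm2):
    """P(at least 1 item in a core) when counts are Poisson (random pattern)."""
    mean_count = expected_items_per_core(density_per_m2, core_area_cm2)
    return 1.0 - math.exp(-mean_count)


def detection_prob_clumped(density_per_m2, core_area_cm2, k):
    """P(at least 1 item in a core) when counts are negative binomial.

    k is a hypothetical dispersion parameter; smaller k means stronger clumping.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    mean_count = expected_items_per_core(density_per_m2, core_area_cm2)
    return 1.0 - (1.0 + mean_count / k) ** (-k)


# Same mean density and core size, different spatial patterns.
print(detection_prob_random(500, 20))        # about 0.63
print(detection_prob_clumped(500, 20, 0.5))  # about 0.42
```

    For the same mean density, the clumped model leaves more cores empty, so the chance of capturing at least one item is lower, in line with the direction of the result described in the abstract.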

  8. Strategic assessment of the availability of pediatric trauma care equipment, technology and supplies in Ghana.

    PubMed

    Ankomah, James; Stewart, Barclay T; Oppong-Nketia, Victor; Koranteng, Adofo; Gyedu, Adam; Quansah, Robert; Donkor, Peter; Abantanga, Francis; Mock, Charles

    2015-11-01

    This study aimed to assess the availability of pediatric trauma care items (i.e. equipment, supplies, technology) and factors contributing to deficiencies in Ghana. Ten universal and 9 pediatric-sized items were selected from the World Health Organization's Guidelines for Essential Trauma Care. Direct inspection and structured interviews with administrative, clinical and biomedical engineering staff were used to assess item availability at 40 purposively sampled district, regional and tertiary hospitals in Ghana. Hospital assessments demonstrated marked deficiencies for a number of essential items (e.g. basic airway supplies, chest tubes, blood pressure cuffs, electrolyte determination, portable X-ray). Lack of pediatric-sized items resulting from equipment absence, lack of training, frequent stock-outs and technology breakage were common. Pediatric items were consistently less available than adult-sized items at each hospital level. This study identified several successes and problems with pediatric trauma care item availability in Ghana. Item availability could be improved, both affordably and reliably, by better organization and planning (e.g. regular assessment of demand and inventory, reliable financing for essential trauma care items). In addition, technology items were often broken. Developing local service and biomedical engineering capability was highlighted as a priority to avoid long periods of equipment breakage.

  9. Development of the Contact Lens User Experience: CLUE Scales

    PubMed Central

    Wirth, R. J.; Edwards, Michael C.; Henderson, Michael; Henderson, Terri; Olivares, Giovanna; Houts, Carrie R.

    2016-01-01

    Purpose: The field of optometry has become increasingly interested in patient-reported outcomes, reflecting a common trend occurring across the spectrum of healthcare. This article reviews the development of the Contact Lens User Experience: CLUE system designed to assess patient evaluations of contact lenses. CLUE was built using modern psychometric methods such as factor analysis and item response theory. Methods: The qualitative process through which relevant domains were identified is outlined as well as the process of creating initial item banks. Psychometric analyses were conducted on the initial item banks and refinements were made to the domains and items. Following this data-driven refinement phase, a second round of data was collected to further refine the items and obtain final item response theory item parameter estimates. Results: Extensive qualitative work identified three key areas patients consider important when describing their experience with contact lenses. Based on item content and psychometric dimensionality assessments, the developing CLUE instruments were ultimately focused around four domains: comfort, vision, handling, and packaging. Item response theory parameters were estimated for the CLUE item banks (377 items), and the resulting scales were found to provide precise and reliable assignment of scores detailing users’ subjective experiences with contact lenses. Conclusions: The CLUE family of instruments, as it currently exists, exhibits excellent psychometric properties. PMID:27383257

  10. Strategic assessment of the availability of pediatric trauma care equipment, technology and supplies in Ghana

    PubMed Central

    Ankomah, James; Stewart, Barclay T; Oppong-Nketia, Victor; Koranteng, Adofo; Gyedu, Adam; Quansah, Robert; Donkor, Peter; Abantanga, Francis; Mock, Charles

    2015-01-01

    Background: This study aimed to assess the availability of pediatric trauma care items (i.e. equipment, supplies, technology) and factors contributing to deficiencies in Ghana. Methods: Ten universal and 9 pediatric-sized items were selected from the World Health Organization’s Guidelines for Essential Trauma Care. Direct inspection and structured interviews with administrative, clinical and biomedical engineering staff were used to assess item availability at 40 purposively sampled district, regional and tertiary hospitals in Ghana. Results: Hospital assessments demonstrated marked deficiencies for a number of essential items (e.g. basic airway supplies, chest tubes, blood pressure cuffs, electrolyte determination, portable X-ray). Lack of pediatric-sized items resulting from equipment absence, lack of training, frequent stock-outs and technology breakage were common. Pediatric items were consistently less available than adult-sized items at each hospital level. Conclusion: This study identified several successes and problems with pediatric trauma care item availability in Ghana. Item availability could be improved, both affordably and reliably, by better organization and planning (e.g. regular assessment of demand and inventory, reliable financing for essential trauma care items). In addition, technology items were often broken. Developing local service and biomedical engineering capability was highlighted as a priority to avoid long periods of equipment breakage. PMID:25841284

  11. Pediatric airway foreign bodies.

    PubMed

    Fitzpatrick, P C; Guarisco, J L

    1998-04-01

    Foreign body aspiration (FBA) is a leading cause of accidental death in children less than one year old and is the cause of death in 7% of children less than four. Food items, especially peanuts, are the most common items aspirated in infants and toddlers, whereas older children are more likely to aspirate non-food items such as pen caps, pins, and paper clips. A high degree of suspicion is required to diagnose FBA. A history of a witnessed choking episode is most important in early diagnosis. An asymptomatic period is common after aspiration and contributes to a delay in diagnosis of greater than one week in 12% to 26% of patients. This delay in diagnosis causes increased morbidity from bronchial inflammation, obstruction, and pneumonia which is resistant to treatment. Prompt endoscopic removal of the foreign body with an open rigid bronchoscope under general anesthesia is the mainstay of therapy.

  12. A shared, flexible neural map architecture reflects capacity limits in both visual short-term memory and enumeration.

    PubMed

    Knops, André; Piazza, Manuela; Sengupta, Rakesh; Eger, Evelyn; Melcher, David

    2014-07-23

    Human cognition is characterized by severe capacity limits: we can accurately track, enumerate, or hold in mind only a small number of items at a time. It remains debated whether capacity limitations across tasks are determined by a common system. Here we measure brain activation of adult subjects performing either a visual short-term memory (vSTM) task consisting of holding in mind precise information about the orientation and position of a variable number of items, or an enumeration task consisting of assessing the number of items in those sets. We show that task-specific capacity limits (three to four items in enumeration and two to three in vSTM) are neurally reflected in the activity of the posterior parietal cortex (PPC): an identical set of voxels in this region, commonly activated during the two tasks, changed its overall response profile reflecting task-specific capacity limitations. These results, replicated in a second experiment, were further supported by multivariate pattern analysis in which we could decode the number of items presented over a larger range during enumeration than during vSTM. Finally, we simulated our results with a computational model of PPC using a saliency map architecture in which the level of mutual inhibition between nodes gives rise to capacity limitations and reflects the task-dependent precision with which objects need to be encoded (high precision for vSTM, lower precision for enumeration). Together, our work supports the existence of a common, flexible system underlying capacity limits across tasks in PPC that may take the form of a saliency map.

  13. Old and New Ideas for Data Screening and Assumption Testing for Exploratory and Confirmatory Factor Analysis

    PubMed Central

    Flora, David B.; LaBrish, Cathy; Chalmers, R. Philip

    2011-01-01

    We provide a basic review of the data screening and assumption testing issues relevant to exploratory and confirmatory factor analysis along with practical advice for conducting analyses that are sensitive to these concerns. Historically, factor analysis was developed for explaining the relationships among many continuous test scores, which led to the expression of the common factor model as a multivariate linear regression model with observed, continuous variables serving as dependent variables, and unobserved factors as the independent, explanatory variables. Thus, we begin our paper with a review of the assumptions for the common factor model and data screening issues as they pertain to the factor analysis of continuous observed variables. In particular, we describe how principles from regression diagnostics also apply to factor analysis. Next, because modern applications of factor analysis frequently involve the analysis of the individual items from a single test or questionnaire, an important focus of this paper is the factor analysis of items. Although the traditional linear factor model is well-suited to the analysis of continuously distributed variables, commonly used item types, including Likert-type items, almost always produce dichotomous or ordered categorical variables. We describe how relationships among such items are often not well described by product-moment correlations, which has clear ramifications for the traditional linear factor analysis. An alternative, non-linear factor analysis using polychoric correlations has become more readily available to applied researchers and thus more popular. Consequently, we also review the assumptions and data-screening issues involved in this method. Throughout the paper, we demonstrate these procedures using an historic data set of nine cognitive ability variables. PMID:22403561

  14. A Cross “Ethnical” Comparison of the Driver Behaviour Questionnaire (DBQ) in an Economically Fast Developing Country

    PubMed Central

    Bener, Abdulbari; Verjee, Mohamud; Dafeeah, Elnour E.; Yousafzai, Mohammad T.; Mari, Sundus; Hassib, Ahmed; Al-Khatib, Hamza; Choi, Min Kyung; Nema, Noor; Özkan, Türker; Lajunen, Timo

    2013-01-01

    Aim: The aim of this study was to compare the driving behaviours of four ethnic groups and to investigate the relationship between violations, errors and lapses of DBQ and accident involvement in Qatar. Subjects and Methods: The Driver Behaviour Questionnaire (DBQ) was used to measure the aberrant driving behaviours leading to accidents. Of 2400 drivers approached, 1824 drivers agreed to participate (76%) and completed the driver behaviour questionnaire and background information. Results: The study revealed that the majority of the Qatari (35.9%) and Jordanian drivers (37.5%) were below 30 years of age, whereas Filipino (42.3%) and Indian subcontinent (34.1%) drivers were in the age group of 30-39 years. Qatari drivers (52%) were involved in most accidents, followed by Jordanians (48.3%). The most common type of collision was a head on collision, which was similar in all four ethnic groups. The Qatari drivers scored higher on almost all items of violations, errors and lapses compared to other ethnic groups, while Filipino drivers were lower on all the items. The most common violation was the same in all four ethnic groups: “Disregard the speed limits on a motorway”. The most common error item observed was “Queuing to turn right/left on to a main road”. “Forget where you left your car” and “Hit something when reversing” were the two lapses identified in factor analysis. Conclusion: The present study identified that Qatari drivers scored higher on most of the items of violations, errors and lapses of DBQ compared to other countries, whereas Filipino drivers scored lower in DBQ items. PMID:23777732

  15. Calibrating well-being, quality of life and common mental disorder items: psychometric epidemiology in public mental health research.

    PubMed

    Böhnke, Jan R; Croudace, Tim J

    2016-08-01

    The assessment of 'general health and well-being' in public mental health research stimulates debates around relative merits of questionnaire instruments and their items. Little evidence regarding alignment or differential advantages of instruments or items has appeared to date. Population-based psychometric study of items employed in public mental health narratives. Multidimensional item response theory was applied to General Health Questionnaire (GHQ-12), Warwick-Edinburgh Mental Well-being Scale (WEMWBS) and EQ-5D items (Health Survey for England, 2010-2012; n = 19 290). A bifactor model provided the best account of the data and showed that the GHQ-12 and WEMWBS items assess mainly the same construct. Only one item of the EQ-5D showed relevant overlap with this dimension (anxiety/depression). Findings were corroborated by comparisons with alternative models and cross-validation analyses. The consequences of this lack of differentiation (GHQ-12 v. WEMWBS) for mental health and well-being narratives deserve discussion to enrich debates on priorities in public mental health and its assessment.

  16. Is the picture bizarreness effect a generation effect?

    PubMed

    Marchal, A; Nicolas, S

    2000-08-01

    Bizarre stimuli usually facilitate recall compared to common stimuli. This investigation explored the so-called bizarreness effect in free recall by using 80 simple line drawings of common objects (common vs bizarre). 64 subjects participated with 16 subjects in each group. Half of the subjects received learning instructions and the other half rated the bizarreness of each drawing. Moreover, drawings were presented either alone or with the name of the object under mixed-list encoding conditions. After the free recall task, subjects had to make metamemory judgments about how many items of each format they had seen and recalled. The key result was that a superiority of bizarre pictures over common ones was found in all conditions although performance was better when the pictures were presented alone than with their corresponding label. Subsequent metamemory judgments, however, showed that subjects underestimated the number of bizarre items actually recalled.

  17. Local Discontinuous Galerkin Methods for the Cahn-Hilliard Type Equations

    DTIC Science & Technology

    2007-01-01

    Kuramoto-Sivashinsky equations, the Ito-type coupled KdV equations, the Kadomtsev-Petviashvili equation, and the Zakharov-Kuznetsov equation. A common... Local discontinuous Galerkin methods for the Cahn-Hilliard type equations. Yinhua Xia, Yan Xu and Chi-Wang Shu. Abstract: In this paper we develop... local discontinuous Galerkin (LDG) methods for the fourth-order nonlinear Cahn-Hilliard equation and system. The energy stability of the LDG methods is

  18. CRANS - CONFIGURABLE REAL-TIME ANALYSIS SYSTEM

    NASA Technical Reports Server (NTRS)

    Mccluney, K.

    1994-01-01

    In a real-time environment, the results of changes or failures in a complex, interconnected system need evaluation quickly. Tabulations showing the effects of changes and/or failures of a given item in the system are generally only useful for a single input, and only with regard to that item. Subsequent changes become harder to evaluate as combinations of failures produce a cascade effect. When confronted by multiple indicated failures in the system, it becomes necessary to determine a single cause. In this case, failure tables are not very helpful. CRANS, the Configurable Real-time ANalysis System, can interpret a logic tree, constructed by the user, describing a complex system and determine the effects of changes and failures in it. Items in the tree are related to each other by Boolean operators. The user is then able to change the state of these items (ON/OFF, FAILED/UNFAILED). The program then evaluates the logic tree based on these changes and determines any resultant changes to other items in the tree. CRANS can also search for a common cause for multiple item failures, and allow the user to explore the logic tree from within the program. A "help" mode and a reference check provide the user with a means of exploring an item's underlying logic from within the program. A commonality check determines single point failures for an item or group of items. Output is in the form of a user-defined matrix or matrices of colored boxes, each box representing an item or set of items from the logic tree. Input is via mouse selection of the matrix boxes, using the mouse buttons to toggle the state of the item. CRANS is written in C and requires the MIT X Window System, Version 11 Revision 4 or Revision 5. It requires 78K of RAM for execution and a three-button mouse. It has been successfully implemented on Sun4 workstations running SunOS, HP9000 workstations running HP-UX, and DECstations running ULTRIX. 
No executable is provided on the distribution medium; however, a sample makefile is included. Sample input files are also included. The standard distribution medium is a .25 inch streaming magnetic tape cartridge (Sun QIC-24) in UNIX tar format. Alternate distribution media and formats are available upon request. This program was developed in 1992.
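
    The logic-tree evaluation, single point failure check, and common-cause search described in this record can be sketched in a few lines of Python. This is an illustration of the general idea only, not CRANS itself (which is written in C and whose data formats are not described here); the tree layout, item names, and function names are all hypothetical.

```python
# Hypothetical logic tree: each derived item maps to (operator, input names).
# Names that do not appear as keys are leaf items with a directly set state.
TREE = {
    "main_pump": ("AND", ["power_bus", "main_valve"]),
    "backup_pump": ("AND", ["power_bus", "backup_valve"]),
    "cooling": ("OR", ["main_pump", "backup_pump"]),
}
LEAF_STATES = {"power_bus": True, "main_valve": True, "backup_valve": True}


def evaluate(item, tree, leaf_states):
    """Return True if the item is working, given the current leaf states."""
    if item not in tree:
        return leaf_states[item]
    operator, inputs = tree[item]
    values = [evaluate(name, tree, leaf_states) for name in inputs]
    if operator == "AND":
        return all(values)
    if operator == "OR":
        return any(values)
    if operator == "NOT":
        return not values[0]
    raise ValueError(f"unknown operator: {operator}")


def single_point_failures(target, tree, leaf_states):
    """Leaves whose failure alone takes a currently working target down."""
    if not evaluate(target, tree, leaf_states):
        return []
    return [leaf for leaf in leaf_states
            if not evaluate(target, tree, {**leaf_states, leaf: False})]


def common_causes(failed_items, tree, leaf_states):
    """Leaves whose failure alone would explain every listed item failure."""
    return [leaf for leaf in leaf_states
            if all(not evaluate(item, tree, {**leaf_states, leaf: False})
                   for item in failed_items)]


print(evaluate("cooling", TREE, LEAF_STATES))                          # True
print(single_point_failures("cooling", TREE, LEAF_STATES))             # ['power_bus']
print(common_causes(["main_pump", "backup_pump"], TREE, LEAF_STATES))  # ['power_bus']
```

    In this toy tree, losing either valve leaves cooling available through the other pump, while losing the shared power bus fails both pumps at once, which is the kind of single cause a commonality check is meant to surface.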

  19. Testing enhances both encoding and retrieval for both tested and untested items.

    PubMed

    Cho, Kit W; Neely, James H; Crocco, Stephanie; Vitrano, Deana

    2017-07-01

    In forward testing effects, taking a test enhances memory for subsequently studied material. These effects have been observed for previously studied and tested items, a potentially item-specific testing effect, and newly studied untested items, a purely generalized testing effect. We directly compared item-specific and generalized forward testing effects using procedures to separate testing benefits due to encoding versus retrieval. Participants studied two lists of Swahili-English word pairs, with the second study list containing "new" pairs intermixed with the previously studied "old" pairs. Participants completed a review phase in which they took a cued-recall test on only the "old" pairs or restudied them. In Experiments 1a, 1b, and 2, the review phase was given either before or after the second study list. Testing benefited memory to the same degree for both "new" and "old" pairs, suggesting that there were no pair-specific benefits of testing. The larger benefit from testing when review was given before rather than after the second study list suggests that the memory enhancement was due to both testing-enhanced encoding and testing-enhanced retrieval. To better equate generalized testing effects for "new" and "old" pairs, Experiment 3 intermixed them in the review phase. A statistically significant pair-specific testing effect for "old" items was now observed. Overall, these results show that forward testing effects are due to both testing-enhanced encoding and retrieval effects and that direct, pair-specific forward testing benefits are considerably smaller than indirect, generalized forward testing benefits.

  20. Development of Two-Tier Diagnostic Test Pictorial-Based for Identifying High School Students Misconceptions on the Mole Concept

    NASA Astrophysics Data System (ADS)

    Siswaningsih, W.; Firman, H.; Zackiyah; Khoirunnisa, A.

    2017-02-01

    The aim of this study was to develop a two-tier, pictorial-based diagnostic test for identifying student misconceptions about the mole concept. The study used a development and validation method. The test was developed in four phases: item development, validation, key determination, and test application. The test is pictorial and consists of two tiers: the first tier consists of four possible answers and the second tier consists of four possible reasons. Based on the content validity results for 20 items using the CVR (Content Validity Ratio), 18 items were declared valid. Based on the results of the reliability test using SPSS, 17 items were obtained with a Cronbach’s Alpha value of 0.703, which means that the items were accepted. A total of 10 items were administered to 35 senior high school students who had studied the mole concept at one of the high schools in Cimahi. Based on the results of the application test, student misconceptions were identified for each concept label within the mole concept, with the following percentages of misconceptions: mole (60.15%), Avogadro’s number (34.28%), relative atomic mass (62.84%), relative molecule mass (77.08%), molar mass (68.53%), molar volume of gas (57.11%), molarity (71.32%), chemical equation (82.77%), limiting reactants (91.40%), and molecular formula (77.13%).
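
    The abstract above screens items with the Content Validity Ratio. As a minimal sketch, assuming the study used the standard CVR formula usually attributed to Lawshe, CVR = (n_e - N/2) / (N/2), where n_e is the number of panelists rating an item "essential" and N is the panel size, the calculation could look like this. The panel size and ratings below are hypothetical and are not taken from the study.

```python
def content_validity_ratio(n_essential: int, n_panelists: int) -> float:
    """Return CVR = (n_e - N/2) / (N/2), which ranges from -1 to +1."""
    if n_panelists <= 0:
        raise ValueError("n_panelists must be positive")
    if not 0 <= n_essential <= n_panelists:
        raise ValueError("n_essential must lie between 0 and n_panelists")
    half = n_panelists / 2
    return (n_essential - half) / half


# Hypothetical ratings: number of panelists (out of 10) who rated
# each item "essential". These values are illustrative only.
HYPOTHETICAL_PANEL_SIZE = 10
hypothetical_ratings = {"item_01": 9, "item_02": 5, "item_03": 10}

cvr_by_item = {
    item: content_validity_ratio(n_essential, HYPOTHETICAL_PANEL_SIZE)
    for item, n_essential in hypothetical_ratings.items()
}
```

    An item is normally retained when its CVR exceeds a critical value that depends on the panel size; that cutoff is not given in the abstract, so none is assumed here.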

  1. Standardized reporting of functioning information on ICF-based common metrics.

    PubMed

    Prodinger, Birgit; Tennant, Alan; Stucki, Gerold

    2018-02-01

    In clinical practice and research a variety of clinical data collection tools are used to collect information on people's functioning for clinical practice and research and national health information systems. Reporting on ICF-based common metrics enables standardized documentation of functioning information in national health information systems. The objective of this methodological note on applying the ICF in rehabilitation is to demonstrate how to report functioning information collected with a data collection tool on ICF-based common metrics. We first specify the requirements for the standardized reporting of functioning information. Secondly, we introduce the methods needed for transforming functioning data to ICF-based common metrics. Finally, we provide an example. The requirements for standardized reporting are as follows: 1) having a common conceptual framework to enable content comparability between any health information; and 2) a measurement framework so that scores between two or more clinical data collection tools can be directly compared. The methods needed to achieve these requirements are the ICF Linking Rules and the Rasch measurement model. Using data collected incorporating the 36-item Short Form Health Survey (SF-36), the World Health Organization Disability Assessment Schedule 2.0 (WHODAS 2.0), and the Stroke Impact Scale 3.0 (SIS 3.0), the application of the standardized reporting based on common metrics is demonstrated. A subset of items from the three tools linked to common chapters of the ICF (d4 Mobility, d5 Self-care and d6 Domestic life), were entered as "super items" into the Rasch model. Good fit was achieved with no residual local dependency and a unidimensional metric. A transformation table allows for comparison between scales, and between a scale and the reporting common metric. Being able to report functioning information collected with commonly used clinical data collection tools with ICF-based common metrics enables clinicians and researchers to continue using their tools while still being able to compare and aggregate the information within and across tools.
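
    The transformation table mentioned above maps raw scores onto the Rasch metric. The following is only a simplified sketch of that idea: it assumes dichotomous items (the study used polytomous "super items"), and the item difficulties are hypothetical. For each non-extreme raw score it finds, by bisection, the ability value at which the expected raw score under the Rasch model equals that score.

```python
import math


def rasch_probability(theta: float, difficulty: float) -> float:
    """Probability of a positive response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


def expected_raw_score(theta: float, difficulties: list[float]) -> float:
    """Expected number of positive responses at ability theta."""
    return sum(rasch_probability(theta, b) for b in difficulties)


def theta_for_raw_score(raw: int, difficulties: list[float],
                        lo: float = -10.0, hi: float = 10.0,
                        tol: float = 1e-8) -> float:
    """Bisection search for the theta whose expected raw score equals `raw`."""
    if not 0 < raw < len(difficulties):
        raise ValueError("extreme raw scores have no finite estimate")
    while hi - lo > tol:
        mid = (lo + hi) / 2
        # The expected score increases with theta, so move the bracket
        # toward the side that still contains the target score.
        if expected_raw_score(mid, difficulties) < raw:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


def transformation_table(difficulties: list[float]) -> dict[int, float]:
    """Map each non-extreme raw score to its location on the logit metric."""
    return {raw: theta_for_raw_score(raw, difficulties)
            for raw in range(1, len(difficulties))}


# Hypothetical item difficulties in logits (not from the study).
HYPOTHETICAL_DIFFICULTIES = [-1.5, -0.5, 0.0, 0.5, 1.5]
table = transformation_table(HYPOTHETICAL_DIFFICULTIES)
```

    Building one such table per instrument, all calibrated on the same metric, is what would let scores from different tools be compared; the logit values could then be rescaled linearly to any reporting range.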

  2. Stability of Thin-Walled Tubes Under Torsion

    NASA Technical Reports Server (NTRS)

    Donnell, L H

    1935-01-01

    In this report a theoretical solution is developed for the torsion on a round thin-walled tube for which the walls become unstable. The results of this theory are given by a few simple formulas and curves which cover all cases. The differential equations of equilibrium are derived in a simpler form than previously found, it being shown that many items can be neglected.

  3. Methodological Measurement Fruitfulness of Exploratory Structural Equation Modeling (ESEM): New Approaches to Key Substantive Issues in Motivation and Engagement

    ERIC Educational Resources Information Center

    Marsh, Herbert W.; Liem, Gregory Arief D.; Martin, Andrew J.; Morin, Alexandre J. S.; Nagengast, Benjamin

    2011-01-01

    The most popular measures of multidimensional constructs typically fail to meet standards of good measurement: goodness of fit, measurement invariance, lack of differential item functioning, and well-differentiated factors that are not so highly correlated as to detract from their discriminant validity. Part of the problem, the authors argue, is…

  4. Modeling the Psychometric Properties of Complex Performance Assessment Tasks Using Confirmatory Factor Analysis: A Multistage Model for Calibrating Tasks

    ERIC Educational Resources Information Center

    Kahraman, Nilufer; De Champlain, Andre; Raymond, Mark

    2012-01-01

    Item-level information, such as difficulty and discrimination are invaluable to the test assembly, equating, and scoring practices. Estimating these parameters within the context of large-scale performance assessments is often hindered by the use of unbalanced designs for assigning examinees to tasks and raters because such designs result in very…

  5. Software Cost Estimating,

    DTIC Science & Technology

    1982-05-13

    Size Of The Software. A favourite measure for software system size is lines of operational code, or deliverable code (operational code plus...regression models, these conversions are either derived from productivity measures using the "cost per instruction" type of equation or they are...appropriate to different development organisations, different project types, different sets of units for measuring e and s, and different items

  6. The efficacy of self-paced study in multitrial learning.

    PubMed

    de Jonge, Mario; Tabbers, Huib K; Pecher, Diane; Jang, Yoonhee; Zeelenberg, René

    2015-05-01

    In 2 experiments we investigated the efficacy of self-paced study in multitrial learning. In Experiment 1, native speakers of English studied lists of Dutch-English word pairs under 1 of 4 imposed fixed presentation rate conditions (24 × 1 s, 12 × 2 s, 6 × 4 s, or 3 × 8 s) and a self-paced study condition. Total study time per list was equated for all conditions. We found that self-paced study resulted in better recall performance than did most of the fixed presentation rates, with the exception of the 12 × 2 s condition, which did not differ from the self-paced condition. Additional correlational analyses suggested that the allocation of more study time to difficult pairs than to easy pairs might be a beneficial strategy for self-paced learning. Experiment 2 was designed to test this hypothesis. In 1 condition, participants studied word pairs in a self-paced fashion without any restrictions. In the other condition, participants studied word pairs in a self-paced fashion but total study time per item was equated. The results showed that allowing self-paced learners to freely allocate study time over items resulted in better recall performance. (c) 2015 APA, all rights reserved.

  7. Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

    ERIC Educational Resources Information Center

    Topczewski, Anna Marie

    2013-01-01

    Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

  8. La Vente promotionnelle: Vocabulaire general de la vente en magasin (Vocabulary Used for the Promotional Sale).

    ERIC Educational Resources Information Center

    de Villers-Sidani, Marie-Eva, Comp.; And Others

    This vocabulary list consists of 84 commonly used terms and expressions pertaining to the sale of store merchandise. The vocabulary items are listed alphabetically in English, with the French equivalent given opposite the English. In many cases, explanatory notes and examples illustrating the use of individual items are included. An alphabetical…

  9. "Homemade" Equipment That Can Be Used In Teaching Physical Education Classes.

    ERIC Educational Resources Information Center

    Davis, Kermit R.

    This manual is designed to help elementary school teachers create games and equipment for use in physical education activities. It suggests items to acquire (cartons, string, plastic jugs, cardboard tubes) and places to look for them. It describes how such items can be used and how to construct some common gym class accessories. There are also…

  10. Optimal and Most Exact Confidence Intervals for Person Parameters in Item Response Theory Models

    ERIC Educational Resources Information Center

    Doebler, Anna; Doebler, Philipp; Holling, Heinz

    2013-01-01

    The common way to calculate confidence intervals for item response theory models is to assume that the standardized maximum likelihood estimator for the person parameter [theta] is normally distributed. However, this approximation is often inadequate for short and medium test lengths. As a result, the coverage probabilities fall below the given…
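
    The "common way" described above is the normal-approximation (Wald) interval around the maximum likelihood estimate. A minimal sketch follows, assuming a dichotomous Rasch model for concreteness (the abstract speaks of item response theory models in general) and hypothetical item difficulties. This illustrates the standard approach the authors criticize, not their proposed intervals.

```python
import math


def rasch_information(theta: float, difficulties: list[float]) -> float:
    """Test information at theta: the sum of p * (1 - p) over items."""
    total = 0.0
    for b in difficulties:
        p = 1.0 / (1.0 + math.exp(-(theta - b)))
        total += p * (1.0 - p)
    return total


def wald_interval(theta_hat: float, difficulties: list[float],
                  z: float = 1.96) -> tuple[float, float]:
    """Normal-approximation interval: theta_hat +/- z / sqrt(information)."""
    standard_error = 1.0 / math.sqrt(rasch_information(theta_hat, difficulties))
    return theta_hat - z * standard_error, theta_hat + z * standard_error


# Hypothetical short test: four items, all with difficulty 0 logits.
HYPOTHETICAL_SHORT_TEST = [0.0, 0.0, 0.0, 0.0]
lower, upper = wald_interval(0.0, HYPOTHETICAL_SHORT_TEST)
```

    With few items the information is small, so the interval is wide and the normal approximation itself becomes questionable, which is the situation the abstract highlights.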

  11. A Comparison of the Rasch Separate Calibration and Between-Fit Methods of Detecting Item Bias.

    ERIC Educational Resources Information Center

    Smith, Richard M.

    1996-01-01

    The separate calibration t-test approach of B. Wright and M. Stone (1979) and the common calibration between-fit approach of B. Wright, R. Mead, and R. Draba (1976) appeared to have similar Type I error rates and similar power to detect item bias within a Rasch framework. (SLD)
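
    As a rough illustration of the separate calibration approach, the statistic is often written as the difference between an item's two difficulty estimates (one per group) divided by the square root of the sum of their squared standard errors. The sketch below assumes that form; the estimates, standard errors, and function name are hypothetical and are not taken from the study.

```python
import math


def separate_calibration_t(difficulty_1: float, se_1: float,
                           difficulty_2: float, se_2: float) -> float:
    """Standardized difference between two separately calibrated difficulties."""
    if se_1 <= 0 or se_2 <= 0:
        raise ValueError("standard errors must be positive")
    return (difficulty_1 - difficulty_2) / math.sqrt(se_1 ** 2 + se_2 ** 2)


# Hypothetical estimates for one item calibrated separately in two groups.
t_statistic = separate_calibration_t(1.0, 0.3, 0.0, 0.4)
```

    A large absolute value would flag the item for closer review; the cutoff to use is a design decision that the abstract does not specify.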

  12. Subitizing Reflects Visuo-Spatial Object Individuation Capacity

    ERIC Educational Resources Information Center

    Piazza, Manuela; Fumarola, Antonia; Chinello, Alessandro; Melcher, David

    2011-01-01

    Subitizing is the immediate apprehension of the exact number of items in small sets. Despite more than 100 years of research around this phenomenon, its nature and origin are still unknown. One view posits that it reflects a number estimation process common for small and large sets, whose precision decreases as the number of items increases,…

  13. Multiple Maximum Exposure Rates in Computerized Adaptive Testing

    ERIC Educational Resources Information Center

    Ramon Barrada, Juan; Veldkamp, Bernard P.; Olea, Julio

    2009-01-01

    Computerized adaptive testing is subject to security problems, as the item bank content remains operative over long periods and administration time is flexible for examinees. Spreading the content of a part of the item bank could lead to an overestimation of the examinees' trait level. The most common way of reducing this risk is to impose a…

  14. Eating Well While Dining Out: Collaborating with Local Restaurants to Promote Heart Healthy Menu Items

    ERIC Educational Resources Information Center

    Thayer, Linden M.; Pimentel, Daniela C.; Smith, Janice C.; Garcia, Beverly A.; Sylvester, Laura Lee; Kelly, Tammy; Johnston, Larry F.; Ammerman, Alice S.; Keyserling, Thomas C.

    2017-01-01

    Background: Because Americans commonly consume restaurant foods with poor dietary quality, effective interventions are needed to improve food choices at restaurants. Purpose: The purpose of this study was to design and evaluate a restaurant-based intervention to help customers select and restaurants promote heart healthy menu items with healthful…

  15. Using Data Augmentation and Markov Chain Monte Carlo for the Estimation of Unfolding Response Models

    ERIC Educational Resources Information Center

    Johnson, Matthew S.; Junker, Brian W.

    2003-01-01

    Unfolding response models, a class of item response theory (IRT) models that assume a unimodal item response function (IRF), are often used for the measurement of attitudes. Verhelst and Verstralen (1993) and Andrich and Luo (1993) independently developed unfolding response models by relating the observed responses to a more common monotone IRT…

  16. 42 CFR 414.412 - Submission of bids under a competitive bidding program.

    Code of Federal Regulations, 2010 CFR

    2010-10-01

    ... D of this part. (c) Furnishing of items. A bid must include all costs related to furnishing an item... capital, stock or profits of another supplier; (ii) A controlling interest exists if one or more of owners... controlling interest and each supplier which has an ownership or controlling interest in it. (3) Commonly...

  17. Proximity Analysis and the Structure of Organization in Free Recall.

    ERIC Educational Resources Information Center

    Friendly, Michael L.

    A method for assessing the structure of organization was developed on the basis of the ordinal separation, or proximity, between pairs of items in recall protocols over a series of trials. The proximity measure is based on the assumption, common to all indices of organization, that items which are coded together in subjective memory units will…

  18. The Precategorical Nature of Visual Short-Term Memory

    ERIC Educational Resources Information Center

    Quinlan, Philip T.; Cohen, Dale J.

    2016-01-01

    We conducted a series of recognition experiments that assessed whether visual short-term memory (VSTM) is sensitive to shared category membership of to-be-remembered (tbr) images of common objects. In Experiment 1 some of the tbr items shared the same basic level category (e.g., hand axe): Such items were no better retained than others. In the…

  19. An Introduction to Item Response Theory and Rasch Models for Speech-Language Pathologists

    ERIC Educational Resources Information Center

    Baylor, Carolyn; Hula, William; Donovan, Neila J.; Doyle, Patrick J.; Kendall, Diane; Yorkston, Kathryn

    2011-01-01

    Purpose: To present a primarily conceptual introduction to item response theory (IRT) and Rasch models for speech-language pathologists (SLPs). Method: This tutorial introduces SLPs to basic concepts and terminology related to IRT as well as the most common IRT models. The article then continues with an overview of how instruments are developed…

  20. Linking Outcomes from Peabody Picture Vocabulary Test Forms Using Item Response Models

    ERIC Educational Resources Information Center

    Hoffman, Lesa; Templin, Jonathan; Rice, Mabel L.

    2012-01-01

    Purpose: The present work describes how vocabulary ability as assessed by 3 different forms of the Peabody Picture Vocabulary Test (PPVT; Dunn & Dunn, 1997) can be placed on a common latent metric through item response theory (IRT) modeling, by which valid comparisons of ability between samples or over time can then be made. Method: Responses…

  1. A Factor Analytic Study of the Items in the Personal Report of Communication Apprehension and the Rathus Assertiveness Schedule.

    ERIC Educational Resources Information Center

    Pearson, Judy C.

    A study was undertaken to determine the relationship between assertiveness and communication apprehension by examining common factors that exist between the items on the Rathus Assertiveness Schedule and the Personal Report of Communication Apprehension. The two instruments were administered to students at a large midwestern university. Responses…

  2. Higher Order Testlet Response Models for Hierarchical Latent Traits and Testlet-Based Items

    ERIC Educational Resources Information Center

    Huang, Hung-Yu; Wang, Wen-Chung

    2013-01-01

    Both testlet design and hierarchical latent traits are fairly common in educational and psychological measurements. This study aimed to develop a new class of higher order testlet response models that consider both local item dependence within testlets and a hierarchy of latent traits. Due to high dimensionality, the authors adopted the Bayesian…

  3. 40 CFR 2.207 - Class determinations.

    Code of Federal Regulations, 2010 CFR

    2010-07-01

    ... Confidentiality of Business Information § 2.207 Class determinations. (a) The General Counsel may make and issue a... items of business information; (2) One or more characteristics common to all such items of information... § 2.204(b)(1), § 2.204(d), § 2.205(d), or § 2.206. However, the existence of a class determination...

  4. Solving Graphics Problems: Student Performance in Junior Grades

    ERIC Educational Resources Information Center

    Lowrie, Tom; Diezmann, Carmel M.

    2007-01-01

    The authors investigated the performance of 172 Grade 4 students (9 to 10 years) over 12 months on a 36-item test that comprised items from 6 distinct graphical languages (e.g., maps) commonly used to convey mathematical information. Results revealed (a) difficulties in Grade 4 students' capacity to decode a variety of graphics, (b) significant…

  5. Method of mechanical quadratures for solving singular integral equations of various types

    NASA Astrophysics Data System (ADS)

    Sahakyan, A. V.; Amirjanyan, H. A.

    2018-04-01

    The method of mechanical quadratures is proposed as a common approach intended for solving the integral equations defined on finite intervals and containing Cauchy-type singular integrals. This method can be used to solve singular integral equations of the first and second kind, equations with generalized kernel, weakly singular equations, and integro-differential equations. The quadrature rules for several different integrals represented through the same coefficients are presented. This allows one to reduce the integral equations containing integrals of different types to a system of linear algebraic equations.

  6. Back to the Future: Past and Future Era-Based Schematic Support and Associative Memory for Prices in Younger and Older Adults

    PubMed Central

    Castel, Alan D.; McGillivray, Shannon; Worden, Kendell M.

    2014-01-01

    Older adults typically display various associative memory deficits, but these deficits can be reduced when conditions allow for the use of prior knowledge or schematic support. To determine how era-specific schematic support and future simulation might influence associative memory, we examined how younger and older adults remember prices from the past as well as the future. Younger and older adults were asked to imagine the past or future, and then studied items and prices from approximately 40 years ago (market value prices from the 1970s) or 40 years in the future. In Experiment 1, all items were common items (e.g., movie ticket, coffee) and the associated prices reflected the era in question, whereas in Experiment 2, some item-price pairs were specific to the time period (e.g., typewriter, robot maid), to test different degrees of schematic support. After studying the pairs, participants were shown each item and asked to recall the associated price. In both experiments, older adults showed similar performance as younger adults in the past condition for the common items, whereas age-related differences were greater in the future condition and for the era-specific items. The findings suggest that in order for schematic support to be effective, recent (and not simply remote) experience is needed in order to enhance memory. Thus, whereas older adults can benefit from “turning back the clock,” younger adults better remember future-oriented information compared with older adults, outlining age-related similarities and differences in associative memory and the efficient use of past and future-based schematic support. PMID:24128073

  7. Refinement of the distress management problem list as the basis for a holistic therapeutic conversation among UK patients with cancer.

    PubMed

    Brennan, James; Gingell, Polly; Brant, Heather; Hollingworth, William

    2012-12-01

    Originally devised in the USA, the Distress Thermometer is being deployed in many cancer settings in the UK. It is commonly used with a Problem List (PL), which has never been validated with a UK population. This study aimed to refine the PL items based upon the concerns of a sample of UK patients attending a regional cancer centre. Existing versions of the PL were scrutinised by a focus group comprising five ex-patients, six health care staff and two academics. This group considered the intelligibility, ambiguity and redundancy of items, sometimes making alternative suggestions or pooling items. The resulting 46 candidate items were sent to 735 patients with mixed cancer, asking them to endorse items that had been 'a source of concern or distress' during their recently finished treatment. We used multivariate logistic regression to evaluate the association between the prevalence of problems and patient characteristics. In this study, 395 (53%) people responded. 'Fatigue, exhaustion or extreme tiredness' (70%), 'worry, fear or anxiety' (45%) and 'sleep problems' (38%) were the most frequently endorsed items. Items not appearing on the original PL were commonly endorsed such as 'memory or concentration' (30%) and 'loneliness or isolation' (15%), suggesting that they should be routinely included in the Distress Thermometer Problem List. The current study offers a more comprehensive PL, on the basis of actual patients' concerns, using words that are understood by UK patients. The reluctance of some patients to volunteer their concerns suggests that screening for distress should be undertaken within the context of a structured conversation. Copyright © 2011 John Wiley & Sons, Ltd.

  8. Back to the future: past and future era-based schematic support and associative memory for prices in younger and older adults.

    PubMed

    Castel, Alan D; McGillivray, Shannon; Worden, Kendell M

    2013-12-01

    Older adults typically display various associative memory deficits, but these deficits can be reduced when conditions allow for the use of prior knowledge or schematic support. To determine how era-specific schematic support and future simulation might influence associative memory, we examined how younger and older adults remember prices from the past as well as the future. Younger and older adults were asked to imagine the past or future, and then studied items and prices from approximately 40 years ago (market value prices from the 1970s) or 40 years in the future. In Experiment 1, all items were common items (e.g., movie ticket, coffee) and the associated prices reflected the era in question, whereas in Experiment 2, some item-price pairs were specific to the time period (e.g., typewriter, robot maid), to test different degrees of schematic support. After studying the pairs, participants were shown each item and asked to recall the associated price. In both experiments, older adults showed similar performance as younger adults in the past condition for the common items, whereas age-related differences were greater in the future condition and for the era-specific items. The findings suggest that in order for schematic support to be effective, recent (and not simply remote) experience is needed in order to enhance memory. Thus, whereas older adults can benefit from "turning back the clock," younger adults better remember future-oriented information compared with older adults, outlining age-related similarities and differences in associative memory and the efficient use of past and future-based schematic support. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  9. Response Mixture Modeling: Accounting for Heterogeneity in Item Characteristics across Response Times.

    PubMed

    Molenaar, Dylan; de Boeck, Paul

    2018-06-01

    In item response theory modeling of responses and response times, it is commonly assumed that the item responses have the same characteristics across the response times. However, heterogeneity might arise in the data if subjects resort to different response processes when solving the test items. These differences may be within-subject effects, that is, a subject might use a certain process on some of the items and a different process with different item characteristics on the other items. If the probability of using one process over the other process depends on the subject's response time, within-subject heterogeneity of the item characteristics across the response times arises. In this paper, the method of response mixture modeling is presented to account for such heterogeneity. Contrary to traditional mixture modeling where the full response vectors are classified, response mixture modeling involves classification of the individual elements in the response vector. In a simulation study, the response mixture model is shown to be viable in terms of parameter recovery. In addition, the response mixture model is applied to a real dataset to illustrate its use in investigating within-subject heterogeneity in the item characteristics across response times.

  10. The PROMIS Physical Function item bank was calibrated to a standardized metric and shown to improve measurement efficiency.

    PubMed

    Rose, Matthias; Bjorner, Jakob B; Gandek, Barbara; Bruce, Bonnie; Fries, James F; Ware, John E

    2014-05-01

    To document the development and psychometric evaluation of the Patient-Reported Outcomes Measurement Information System (PROMIS) Physical Function (PF) item bank and static instruments. The items were evaluated using qualitative and quantitative methods. A total of 16,065 adults answered item subsets (n>2,200/item) on the Internet, with oversampling of the chronically ill. Classical test and item response theory methods were used to evaluate 149 PROMIS PF items plus 10 Short Form-36 and 20 Health Assessment Questionnaire-Disability Index items. A graded response model was used to estimate item parameters, which were normed to a mean of 50 (standard deviation [SD]=10) in a US general population sample. The final bank consists of 124 PROMIS items covering upper, central, and lower extremity functions and instrumental activities of daily living. In simulations, a 10-item computerized adaptive test (CAT) eliminated floor and decreased ceiling effects, achieving higher measurement precision than any comparable length static tool across four SDs of the measurement range. Improved psychometric properties were transferred to the CAT's superior ability to identify differences between age and disease groups. The item bank provides a common metric and can improve the measurement of PF by facilitating the standardization of patient-reported outcome measures and implementation of CATs for more efficient PF assessments over a larger range. Copyright © 2014. Published by Elsevier Inc.
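
    The norming described above places scores on a metric with mean 50 and standard deviation 10 in a reference population. A minimal sketch of that linear rescaling follows, assuming the latent scores are first standardized against the reference sample; the function name and default reference values are hypothetical.

```python
def to_t_score(theta: float, reference_mean: float = 0.0,
               reference_sd: float = 1.0) -> float:
    """Rescale a latent score so the reference population has mean 50, SD 10."""
    if reference_sd <= 0:
        raise ValueError("reference_sd must be positive")
    return 50.0 + 10.0 * (theta - reference_mean) / reference_sd


# A respondent half a standard deviation below the reference mean.
example_t_score = to_t_score(-0.5)
```

    On this metric a score of 60 sits one reference standard deviation above the reference mean, which is what allows different instruments calibrated to the same bank to be compared directly.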

  11. Reliability and Validity of Wisconsin Upper Respiratory Symptom Survey, Korean Version

    PubMed Central

    Yang, Su-Young; Kang, Weechang; Yeo, Yoon; Park, Yang-Chun

    2011-01-01

    Background: The Wisconsin Upper Respiratory Symptom Survey (WURSS) is a self-administered questionnaire developed in the United States to evaluate the severity of the common cold and its reliability has been validated. We developed a Korean language version of this questionnaire by using a sequential forward and backward translation approach. The purpose of this study was to validate the Korean version of the Wisconsin Upper Respiratory Symptom Survey (WURSS-K) in Korean patients with common cold. Methods: This multicenter prospective study enrolled 107 participants who were diagnosed with common cold and consented to participate in the study. The WURSS-K includes 1 global illness severity item, 32 symptom-based items, 10 functional quality-of-life (QOL) items, and 1 item assessing global change. The SF-8 was used as an external comparator. Results: The participants were 54 women and 53 men aged 18 to 42 years. The WURSS-K showed good reliability in 10 domains, with Cronbach’s alphas ranging from 0.67 to 0.96 (mean: 0.84). Comparison of the reliability coefficients of the WURSS-K and WURSS yielded a Pearson correlation coefficient of 0.71 (P = 0.02). Validity of the WURSS-K was evaluated by comparing it with the SF-8, which yielded a Pearson correlation coefficient of −0.267 (P < 0.001). The Guyatt’s responsiveness index of the WURSS-K ranged from 0.13 to 0.46, and the correlation coefficient with the WURSS was 0.534 (P < 0.001), indicating that there was close correlation between the WURSS-K and WURSS. Conclusions: The WURSS-K is a reliable, valid, and responsive disease-specific questionnaire for assessing symptoms and QOL in Korean patients with common cold. PMID:21691034
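
    The reliability figures above are Cronbach's alphas. As a minimal sketch of the usual formula, alpha = k / (k - 1) * (1 - sum of item variances / variance of total scores), the coefficient could be computed as follows. The response matrix is hypothetical and is not data from the study.

```python
from statistics import variance


def cronbach_alpha(responses: list[list[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items matrix of scores."""
    n_items = len(responses[0])
    if n_items < 2:
        raise ValueError("at least two items are required")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*responses)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in responses])
    return n_items / (n_items - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical data: five respondents answering three items.
hypothetical_responses = [
    [3, 4, 3],
    [2, 2, 3],
    [4, 5, 5],
    [1, 2, 1],
    [3, 3, 4],
]
alpha = cronbach_alpha(hypothetical_responses)
```

    Each domain of a questionnaire would get its own matrix and its own alpha, which is how a range of values across 10 domains can arise.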

  12. Development of common metrics for donation attitude, subjective norm, perceived behavioral control, and intention for the blood donation context.

    PubMed

    France, Janis L; Kowalsky, Jennifer M; France, Christopher R; McGlone, Sarah T; Himawan, Lina K; Kessler, Debra A; Shaz, Beth H

    2014-03-01

    The Theory of Planned Behavior has been widely used in blood donation research, but the lack of uniform, psychometrically sound measures makes it difficult to draw firm conclusions or compare results across studies. Accordingly, the goal of this study was to develop such measures of donation attitude, subjective norm, perceived behavioral control, and intention. Exploratory and confirmatory factor analyses (CFAs) were conducted on survey responses collected from college students (n = 1080). The resulting scales were then administered to an independent sample of experienced donors (n = 433) for additional CFAs and to test whether the Theory of Planned Behavior model provided a good fit to the data. CFAs conducted on both samples support the use of six-item scales, with two factors each, to measure donation attitude, subjective norm, and perceived behavioral control and a single-factor three-item scale to measure donation intention. Further, structural equation modeling of these measures revealed that the Theory of Planned Behavior provided a strong fit to the data (comparative fit index, 0.976; root mean square error of approximation, 0.041; standardized root mean square residual, 0.055) and accounted for 73.7% of the variance in donation intention. The present findings confirm the applicability of the Theory of Planned Behavior to the blood donation context and more importantly provide psychometric support for the future use of four brief measures of donation attitude, subjective norm, perceived behavioral control, and intention. © 2013 American Association of Blood Banks.

  13. Associations between loneliness, depressive symptoms and perceived togetherness in older people.

    PubMed

    Tiikkainen, P; Heikkinen, R-L

    2005-11-01

    This study explores the associations of loneliness with depressive symptoms in a five-year follow-up and describes how the six dimensions of perceived togetherness explain loneliness and depressive symptoms at baseline. The data were collected on 207 residents of Jyväskylä, central Finland, who at baseline in 1990 were aged 80; and 133 residents who at follow-up in 1995 were aged 85. Loneliness was assessed using a questionnaire item with four preset response options, perceived togetherness using the Social Provisions Scale, and depressive symptoms using the CES-D scale. A recursive structural equation model showed that in women but not in men, depressive symptoms predicted more experiences of loneliness. Those who were lonely were more depressed (CES-D score 16 or over) and experienced less togetherness than those who were not. Loneliness was explained by reliable alliance, social integration and attachment; and depressive symptoms were explained by guidance, reassurance of worth, reliable alliance and attachment. A common feature in both loneliness and depressive symptoms was a lower level of perceived emotional togetherness in social interaction.

  14. Identification of measurement differences between English and Spanish language versions of the Mini-Mental State Examination. Detecting differential item functioning using MIMIC modeling.

    PubMed

    Jones, Richard N

    2006-11-01

    Knowledge of the extent to which measurement of adult cognitive functioning differs between Spanish and English language administrations of the Mini-Mental State Examination (MMSE) is critical for inclusive, representative, and valid research of older adults in the United States. We sought to demonstrate the use of an item response theory (IRT) based structural equation model, that is, the MIMIC model (multiple indicators, multiple causes), to evaluate MMSE responses for evidence of differential item functioning (DIF) attributable to language of administration. We studied participants in a dementia case registry study (n = 1546), 42% of whom were examined with the Spanish language MMSE. Twelve of 21 items were identified as having significant uniform DIF. The 4 most discrepant included orientation to season, orientation to state, repeat phrase, and follow command. DIF accounted for two-thirds of the observed difference in underlying level of cognitive functioning between Spanish- and English-language administration groups. Failing to account for measurement differences may lead to spurious inferences regarding language group differences in underlying level of cognitive functioning. The MIMIC model can be used to detect and adjust for such measurement differences in substantive research.

  15. [Has the translation process impact on the psychometric structure of a questionnaire?].

    PubMed

    Pook, Martin; Tuschen-Caffier, Brunna; Kaufmann, Ulrike

    2006-01-01

    Little is known about the impact of item translation on the psychometric structure of questionnaires. The analysis of different translation versions within the same language provides an opportunity to address this question. Therefore, in the present study, two of the six available German translations of Eating Disorder Inventory (EDI) were compared with respect to their psychometric structure. A total of 449 female students completed the short forms of the EDI (consisting of the subscales drive for thinness, bulimia and body dissatisfaction). Structural equation modeling revealed that the item contents in both translations had been interpreted equivalently by the participants. In addition, the structural relations among the factors were equivalent across both versions. Whereas invariance of item-pair reliability was not tenable, the distribution of raw scores of the scales was similar. All in all, the findings suggest a very large degree of similarity in the psychometric structure of the alternative translations of the EDI versions. The results are discussed with respect to the lack of standards for the translation of questionnaires.

  16. Age differences in implicit memory: more apparent than real.

    PubMed

    Russo, R; Parkin, A J

    1993-01-01

    Elderly subjects and a group of young subjects identified fragmented picture sequences under conditions of focused attention. Two other groups of young subjects carried out this task under divided-attention conditions. Implicit memory, as measured by item-specific savings, was found in all groups, but this effect was smaller in the elderly group. The young subjects, but not elderly subjects, performed better on new items. The divided-attention conditions equated recall and recognition by the young and the elderly, but only the young subjects showed greater savings for recalled items. The elderly subjects' reduced implicit memory therefore stemmed from their inability to facilitate implicit memory with explicit memory. A second experiment, involving only young subjects tested after delay, produced findings similar to those for the young divided-attention subjects. Implicit memory, as measured by savings in picture completion, does not show an age-related change when the role of explicit memory is considered. Age does, however, reduce skill learning.

  17. [Evaluation of the factorial and metric equivalence of the Sexual Assertiveness Scale (SAS) by sex].

    PubMed

    Sierra, Juan Carlos; Santos-Iglesias, Pablo; Vallejo-Medina, Pablo

    2012-05-01

    Sexual assertiveness refers to the ability to initiate sexual activity, refuse unwanted sexual activity, and use contraceptive methods to avoid sexually transmitted diseases, developing healthy sexual behaviors. The Sexual Assertiveness Scale (SAS) assesses these three dimensions. The purpose of this study is to evaluate, using structural equation modeling and differential item functioning, the equivalence of the scale between men and women. Standard scores are also provided. A total of 4,034 participants from 21 Spanish provinces took part in the study. A quota sampling method was used. Results indicate a strictly equivalent dimensionality of the Sexual Assertiveness Scale across sexes. One item was flagged by differential item functioning, although it does not affect the scale. Therefore, there is no significant bias in the scale when comparing across sexes. Standard scores show similar Initiation assertiveness scores for men and women, and higher scores on Refusal and Sexually Transmitted Disease Prevention for women. This scale can be used on men and women with sufficient psychometric guarantees.

  18. Development of item bank to measure deliberate self-harm behaviours: facilitating tailored scales and computer adaptive testing for specific research and clinical purposes.

    PubMed

    Latimer, Shane; Meade, Tanya; Tennant, Alan

    2014-07-30

    The purpose of this study was to investigate the application of item banking to questionnaire items intended to measure Deliberate Self-Harm (DSH) behaviours. The Rasch measurement model was used to evaluate behavioural items extracted from seven published DSH scales administered to 568 Australians aged 18-30 years (62% university students, 21% mental health patients, and 17% community members). Ninety-four items were calibrated in the item bank (including 12 items with differential item functioning for gender and age). Tailored scale construction was demonstrated by extracting scales covering different combinations of DSH methods but with the same raw score for each person location on the latent DSH construct. A simulated computer adaptive test (starting with common self-harm methods to minimise presentation of extreme behaviours) demonstrated that 11 items (on average) were needed to achieve a standard error of measurement of 0.387 (corresponding to a Cronbach's Alpha of 0.85). This study lays the groundwork for advancing DSH measurement to an item bank approach with the flexibility to measure a specific definitional orientation (e.g., non-suicidal self-injury) or a broad continuum of self-harmful acts, as appropriate to a particular research/clinical purpose. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
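
    The stated correspondence between a standard error of measurement of 0.387 and a reliability of 0.85 can be checked with the classical relation reliability = 1 - (SEM / SD)^2. The sketch below assumes person locations with unit standard deviation, which the abstract does not state; the function name is hypothetical.

```python
def reliability_from_sem(sem: float, sd: float = 1.0) -> float:
    """Reliability implied by a standard error of measurement.

    Uses the classical relation reliability = 1 - (SEM / SD)^2.
    sd = 1.0 is an assumption, not a value reported in the abstract.
    """
    if sd <= 0:
        raise ValueError("sd must be positive")
    return 1.0 - (sem / sd) ** 2


# SEM reported in the abstract; the SD of 1.0 is assumed.
print(round(reliability_from_sem(0.387), 2))  # 0.85
```

    Under that assumption, 1 - 0.387^2 is about 0.850, consistent with the reported value.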

  19. Cross-validation of generalised body composition equations with diverse young men and women: the Training Intervention and Genetics of Exercise Response (TIGER) Study

    USDA-ARS's Scientific Manuscript database

    Generalised skinfold equations developed in the 1970s are commonly used to estimate laboratory-measured percentage fat (BF%). The equations were developed on predominately white individuals using Siri's two-component percentage fat equation (BF%-GEN). We cross-validated the Jackson-Pollock (JP) gene...

  20. Evaluating Students' Conceptual Understanding of Balanced Equations and Stoichiometric Ratios Using a Particulate Drawing

    ERIC Educational Resources Information Center

    Sanger, Michael J.

    2005-01-01

    A total of 156 students were asked to provide free-response balanced chemical equations for a classic multiple-choice particulate-drawing question first used by Nurrenbern and Pickering. The balanced equations and the number of students providing each equation are reported in this study. The most common student errors included a confusion between…

  1. Optimal Bandwidth Selection in Observed-Score Kernel Equating

    ERIC Educational Resources Information Center

    Häggström, Jenny; Wiberg, Marie

    2014-01-01

    The selection of bandwidth in kernel equating is important because it has a direct impact on the equated test scores. The aim of this article is to examine the use of double smoothing when selecting bandwidths in kernel equating and to compare double smoothing with the commonly used penalty method. This comparison was made using both an equivalent…

  2. The Covariant Formulation of Maxwell's Equations Expressed in a Form Independent of Specific Units

    ERIC Educational Resources Information Center

    Heras, Jose A.; Baez, G.

    2009-01-01

    The covariant formulation of Maxwell's equations can be expressed in a form independent of the usual systems of units by introducing the constants alpha, beta and gamma into these equations. Maxwell's equations involving these constants are then specialized to the most commonly used systems of units: Gaussian, SI and Heaviside-Lorentz by giving…

  3. Vorticity imbalance and stability in relation to convection

    NASA Technical Reports Server (NTRS)

    Read, W. L.; Scoggins, J. R.

    1977-01-01

    A complete synoptic-scale vorticity budget was related to convective storm development in the eastern two-thirds of the United States. The 3-h sounding interval permitted a study of time changes of the vorticity budget in areas of convective storms. Results of analyses revealed significant changes in values of terms in the vorticity equation at different stages of squall line development. Average budgets for all areas of convection indicate systematic imbalance in the terms in the vorticity equation. This imbalance resulted primarily from sub-grid scale processes. Potential instability in the lower troposphere was analyzed in relation to the development of convective activity. Instability was related to areas of convection; however, instability alone was inadequate for forecast purposes. Combinations of stability and terms in the vorticity equation in the form of indices succeeded in depicting areas of convection better than any one item separately.

  4. The Physics of Polarization

    NASA Astrophysics Data System (ADS)

    Landi Degl'Innocenti, Egidio

    2015-10-01

    The introductory lecture that has been delivered at this Symposium is a condensed version of an extended course held by the author at the XII Canary Island Winter School from November 13 to November 21, 2000. The full series of lectures can be found in Landi Degl'Innocenti (2002). The original reference is organized in 20 Sections that are here itemized: 1. Introduction, 2. Description of polarized radiation, 3. Polarization and optical devices: Jones calculus and Mueller matrices, 4. The Fresnel equations, 5. Dichroism and anomalous dispersion, 6. Polarization in everyday life, 7. Polarization due to radiating charges, 8. The linear antenna, 9. Thomson scattering, 10. Rayleigh scattering, 11. A digression on Mie scattering, 12. Bremsstrahlung radiation, 13. Cyclotron radiation, 14. Synchrotron radiation, 15. Polarization in spectral lines, 16. Density matrix and atomic polarization, 17. Radiative transfer and statistical equilibrium equations, 18. The amplification condition in polarized radiative transfer, and 19. Coupling radiative transfer and statistical equilibrium equations.

  5. Initial value problem of space dynamics in universal Stumpff anomaly

    NASA Astrophysics Data System (ADS)

    Sharaf, M. A.; Dwidar, H. R.

    2018-05-01

    In this paper, the initial value problem of space dynamics in universal Stumpff anomaly ψ is set up and developed using analytical and computational approaches. For the analytical expansions, the linear independence of the functions U_{j}(ψ;σ), j = 0, 1, 2, 3, is proved. The differential and recurrence equations satisfied by them and their relations with the elementary functions are given. The universal Kepler equation and its validations for different conic orbits are established together with the Lagrangian coefficients. Efficient representations of these functions are developed in terms of continued fractions. For the computational developments we consider the following items: 1. Top-down algorithm for continued fraction evaluation. 2. One-point iteration formulae. 3. Determination of the coefficients of Kepler's equation. 4. Derivatives of Kepler's equation of any integer order. 5. Determination of the initial guess for the solution of the universal Kepler equation. Finally, we give a summary of the computational design for the initial value problem of space dynamics in universal Stumpff anomaly. This design is based on the solution of the universal Kepler equation by iterative schemes of order from quadratic up to any desired order ℓ.
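
    As a rough illustration of the functions U_j(ψ;σ) named above, the sketch below evaluates the Stumpff functions by their power series and forms U_j(ψ;σ) = ψ^j c_j(σψ²). Both choices are assumptions: the abstract does not define the convention for σ, and the paper itself uses continued fractions rather than a truncated series. The function names are hypothetical.

```python
import math


def stumpff(k: int, x: float, terms: int = 30) -> float:
    """Stumpff function c_k(x) = sum over i >= 0 of (-x)^i / (k + 2i)!.

    Evaluated as a truncated power series (not the continued-fraction
    representation described in the abstract).
    """
    total = 0.0
    term = 1.0 / math.factorial(k)  # i = 0 term
    for i in range(terms):
        total += term
        # Ratio of consecutive terms: -x / ((k + 2i + 1)(k + 2i + 2))
        term *= -x / ((k + 2 * i + 1) * (k + 2 * i + 2))
    return total


def universal_u(j: int, psi: float, sigma: float) -> float:
    """U_j(psi; sigma) = psi^j * c_j(sigma * psi^2) (assumed convention)."""
    return psi ** j * stumpff(j, sigma * psi * psi)


psi = 0.7
# With sigma = 1 the four functions reduce to elementary circular functions.
print(universal_u(0, psi, 1.0), math.cos(psi))
print(universal_u(1, psi, 1.0), math.sin(psi))
print(universal_u(2, psi, 1.0), 1.0 - math.cos(psi))
print(universal_u(3, psi, 1.0), psi - math.sin(psi))
```

    With this convention, σ = 1, σ = 0, and σ = -1 give circular, polynomial, and hyperbolic forms respectively, which is how a single set of functions can cover the different conic orbits.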

  6. Feasibility of Using Qualitative Interviews to Explore Patients' Treatment Goals: Experience from Dermatology.

    PubMed

    Blome, Christine; von Usslar, Kathrin; Augustin, Matthias

    2016-06-01

    Qualitative interviews are used to assess understandability and content validity of patient-reported outcomes. However, the common approach of asking patients to paraphrase items may not be sufficient to completely reveal item content as understood by patients. We used qualitative interviews to elicit more detailed information about patients' understanding of treatment goal items for the Patient Benefit Index 2.0 (PBI 2.0). This questionnaire measures patient-relevant benefit from treatments for skin diseases by assessing goal importance prior to and goal attainment after treatment. We interviewed 16 patients with psoriasis, atopic dermatitis, leg ulcers, and vitiligo. Patients were asked to elaborate in detail on their understanding of 15 treatment goal items. Subsequently, they were asked to suggest changes in item wording and to name missing treatment goals. Interview transcripts were analyzed according to an adapted approach of content analysis. The task was easy for the patients to understand, and they shared detailed information on what each goal meant to them. Results of the content analysis induced a range of revisions of the PBI 2.0 items, including changes in wording (four items) and item order (two items). Four items were deleted because they were found to be redundant or irrelevant, and one item was added to the list of treatment goals. Asking patients to elaborate on their item understanding in qualitative interviews provided detailed insight into item content and understandability. This method has helped considerably to improve feasibility and content validity of the PBI 2.0.

  7. Adjusting for cross-cultural differences in computer-adaptive tests of quality of life.

    PubMed

    Gibbons, C J; Skevington, S M

    2018-04-01

    Previous studies using the WHOQOL measures have demonstrated that the relationship between individual items and the underlying quality of life (QoL) construct may differ between cultures. If unaccounted for, these differing relationships can lead to measurement bias which, in turn, can undermine the reliability of results. We used item response theory (IRT) to assess differential item functioning (DIF) in WHOQOL data from diverse language versions collected in the UK, Zimbabwe, Russia, and India (total N = 1332). Data were fitted to the partial credit 'Rasch' model. We used four item banks previously derived from the WHOQOL-100 measure, which provided excellent measurement for physical, psychological, social, and environmental quality of life domains (40 items overall). Cross-cultural differential item functioning was assessed using analysis of variance for item residuals and post hoc Tukey tests. Simulated computer-adaptive tests (CATs) were conducted to assess the efficiency and precision of the four item banks. Splitting item parameters by DIF results in four linked item banks without DIF or other breaches of IRT model assumptions. Simulated CATs were more precise and efficient than longer paper-based alternatives. Assessing differential item functioning using item response theory can identify measurement invariance between cultures which, if uncontrolled, may undermine accurate comparisons in computer-adaptive testing assessments of QoL. We demonstrate how compensating for DIF using item anchoring allowed data from all four countries to be compared on a common metric, thus facilitating assessments which were both sensitive to cultural nuance and comparable between countries.

  8. In defense of causal-formative indicators: A minority report.

    PubMed

    Bollen, Kenneth A; Diamantopoulos, Adamantios

    2017-09-01

    Causal-formative indicators directly affect their corresponding latent variable. They run counter to the predominant view that indicators depend on latent variables and are thus often controversial. If present, such indicators have serious implications for factor analysis, reliability theory, item response theory, structural equation models, and most measurement approaches that are based on reflective or effect indicators. Psychological Methods has published a number of influential articles on causal and formative indicators as well as launching the first major backlash against them. This article examines 7 common criticisms of these indicators distilled from the literature: (a) A construct measured with "formative" indicators does not exist independently of its indicators; (b) Such indicators are causes rather than measures; (c) They imply multiple dimensions to a construct and this is a liability; (d) They are assumed to be error-free, which is unrealistic; (e) They are inherently subject to interpretational confounding; (f) They fail proportionality constraints; and (g) Their coefficients should be set in advance and not estimated. We summarize each of these criticisms and point out the flaws in the logic and evidence marshaled in their support. The most common problems are not distinguishing between what we call causal-formative and composite-formative indicators, tautological fallacies, and highlighting issues that are common to all indicators, but presenting them as special problems of causal-formative indicators. We conclude that measurement theory needs (a) to incorporate these types of indicators, and (b) to better understand their similarities to and differences from traditional indicators. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  9. Limited genetic covariance between autistic traits and intelligence: findings from a longitudinal twin study.

    PubMed

    Hoekstra, Rosa A; Happé, Francesca; Baron-Cohen, Simon; Ronald, Angelica

    2010-07-01

    Intellectual disability is common in individuals with autism spectrum conditions. However, the strength of the association between both conditions and its relevance to finding the underlying (genetic) causes of autism is unclear. This study aimed to investigate the longitudinal association between autistic traits and intelligence in a general population twin sample and to examine the etiology of this association. Parental ratings of autistic traits and performance on intelligence tests were collected in a sample of 8,848 twin pairs when the children were 7/8, 9, and 12 years old. Phenotypic and longitudinal correlations in the sample as a whole were compared to the associations in the most extreme scoring 5% of the population. The genetic and environmental influences on the overlap between autistic traits and IQ and on the stability of this relationship over time were estimated using structural equation modeling. Autistic traits were modestly negatively correlated to intellectual ability, both in the extreme scoring groups and among the full-range scores. The correlation was stable over time and was mainly explained by autistic trait items assessing communication difficulties. Genetic model fitting showed that autistic traits and IQ were influenced by a common set of genes and a common set of environmental influences that continuously affect these traits throughout childhood. The genetic correlation between autistic traits and IQ was only modest. These findings suggest that individual differences in autistic traits are substantially genetically independent of intellectual functioning. The relevance of these findings to future studies is discussed. (c) 2010 Wiley-Liss, Inc.

  10. Review of Measures of Worksite Environmental and Policy Supports for Physical Activity and Healthy Eating

    PubMed Central

    Reeds, Dominic N.; van Bakergem, Margaret A.; Marx, Christine M.; Brownson, Ross C.; Pamulapati, Surya C.; Hoehner, Christine M.

    2015-01-01

    Introduction: Obesity prevention strategies are needed that target multiple settings, including the worksite. The objective of this study was to assess the state of science concerning available measures of worksite environmental and policy supports for physical activity (PA) and healthy eating (HE). Methods: We searched multiple databases for instruments used to assess worksite environments and policies. Two commonly cited instruments developed by state public health departments were also included. Studies that were published from 1991 through 2013 in peer-reviewed publications and gray literature that discussed the development or use of these instruments were analyzed. Instrument administration mode and measurement properties were documented. Items were classified by general health topic, 5 domains of general worksite strategy, and 19 subdomains of worksite strategy specific to PA or HE. Characteristics of worksite measures were described including measurement properties, length, and administration mode, as well as frequencies of items by domain and subdomain. Results: Seventeen instruments met inclusion criteria (9 employee surveys, 5 manager surveys, 1 observational assessment, and 2 studies that used multiple administration modes). Fourteen instruments included reliability testing. More items were related to PA than HE. Most instruments (n = 10) lacked items in the internal social environment domain. The most common PA subdomains were exercise facilities and lockers/showers; the most common HE subdomain was healthy options/vending. Conclusion: This review highlights gaps in measurement of the worksite social environment. The findings provide a useful resource for researchers and practitioners and should inform future instrument development. PMID:25950572

  11. Review of measures of worksite environmental and policy supports for physical activity and healthy eating.

    PubMed

    Hipp, J Aaron; Reeds, Dominic N; van Bakergem, Margaret A; Marx, Christine M; Brownson, Ross C; Pamulapati, Surya C; Hoehner, Christine M

    2015-05-07

    Obesity prevention strategies are needed that target multiple settings, including the worksite. The objective of this study was to assess the state of science concerning available measures of worksite environmental and policy supports for physical activity (PA) and healthy eating (HE). We searched multiple databases for instruments used to assess worksite environments and policies. Two commonly cited instruments developed by state public health departments were also included. Studies that were published from 1991 through 2013 in peer-reviewed publications and gray literature that discussed the development or use of these instruments were analyzed. Instrument administration mode and measurement properties were documented. Items were classified by general health topic, 5 domains of general worksite strategy, and 19 subdomains of worksite strategy specific to PA or HE. Characteristics of worksite measures were described including measurement properties, length, and administration mode, as well as frequencies of items by domain and subdomain. Seventeen instruments met inclusion criteria (9 employee surveys, 5 manager surveys, 1 observational assessment, and 2 studies that used multiple administration modes). Fourteen instruments included reliability testing. More items were related to PA than HE. Most instruments (n = 10) lacked items in the internal social environment domain. The most common PA subdomains were exercise facilities and lockers/showers; the most common HE subdomain was healthy options/vending. This review highlights gaps in measurement of the worksite social environment. The findings provide a useful resource for researchers and practitioners and should inform future instrument development.

  12. Distribution of Total Depressive Symptoms Scores and Each Depressive Symptom Item in a Sample of Japanese Employees.

    PubMed

    Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Yamada, Hiroshi; Miyake, Hirotsugu; Furukawa, Toshiaki A

    2016-01-01

    In a previous study, we reported that the distribution of total depressive symptoms scores according to the Center for Epidemiologic Studies Depression Scale (CES-D) in a general population is stable throughout middle adulthood and follows an exponential pattern except for at the lowest end of the symptom score. Furthermore, the individual distributions of 16 negative symptom items of the CES-D exhibit a common mathematical pattern. To confirm the reproducibility of these findings, we investigated the distribution of total depressive symptoms scores and 16 negative symptom items in a sample of Japanese employees. We analyzed 7624 employees aged 20-59 years who had participated in the Northern Japan Occupational Health Promotion Centers Collaboration Study for Mental Health. Depressive symptoms were assessed using the CES-D. The CES-D contains 20 items, each of which is scored in four grades: "rarely," "some," "much," and "most of the time." The descriptive statistics and frequency curves of the distributions were then compared according to age group. The distribution of total depressive symptoms scores appeared to be stable from 30-59 years. The right tail of the distribution for ages 30-59 years exhibited a linear pattern with a log-normal scale. The distributions of the 16 individual negative symptom items of the CES-D exhibited a common mathematical pattern which displayed different distributions with a boundary at "some." The distributions of the 16 negative symptom items from "some" to "most" followed a linear pattern with a log-normal scale. The distributions of the total depressive symptoms scores and individual negative symptom items in a Japanese occupational setting show the same patterns as those observed in a general population. These results show that the specific mathematical patterns of the distributions of total depressive symptoms scores and individual negative symptom items can be reproduced in an occupational population.

  13. Measuring Psychobiosocial States in Sport: Initial Validation of a Trait Measure

    PubMed Central

    Bertollo, Maurizio; Ruiz, Montse C.; Bortoli, Laura

    2016-01-01

    We examined the item characteristics, the factor structure, and the concurrent validity of a trait measure of psychobiosocial states. In Study 1, Italian athletes (N = 342, 228 men, 114 women, M_age = 23.93, SD = 6.64) rated the intensity, the frequency, and the perceived impact dimensions of a psychobiosocial states scale, trait version (PBS-ST), which is composed of 20 items (10 functional and 10 dysfunctional) referring to how they usually felt before an important competition. In Study 2, the scale was cross-validated in an independent sample (N = 251, 181 men, 70 women, M_age = 24.35, SD = 7.25). The concurrent validity of the PBS-ST scale scores was also examined in comparison with two sport-specific emotion-related measures and a general measure of affect. Exploratory structural equation modeling and confirmatory factor analysis of the data of Study 1 showed that a 2-factor, 15-item solution of the PBS-ST scale (8 functional items and 7 dysfunctional items) reached satisfactory fit indices for the three dimensions (i.e., intensity, frequency, and perceived impact). Results of Study 2 provided evidence of substantial measurement and structural invariance of all dimensions across samples. The low association of the PBS-ST scale with other measures suggests that the scale taps unique constructs. Findings of the two studies offer initial validity evidence for a sport-specific tool to measure psychobiosocial states. PMID:27907111

  14. The SF-8 Spanish Version for Health-Related Quality of Life Assessment: Psychometric Study with IRT and CFA Models.

    PubMed

    Tomás, José M; Galiana, Laura; Fernández, Irene

    2018-03-22

    The aim of the current research is to analyze the psychometric properties of the Spanish version of the SF-8, overcoming previous shortcomings. A double line of analyses was used: competing structural equation models to establish factorial validity, and Item Response Theory to analyze item psychometric characteristics and information. A total of 593 people aged 60 years or older, attending lifelong learning programs at the university, were surveyed. Their ages ranged from 60 to 92 years; 67.6% were women. The survey included scales on personality dimensions, attitudes, perceptions, and behaviors related to aging. Competing confirmatory models pointed to two factors (physical and mental health) as the best representation of the data: χ2(13) = 72.37 (p < .01); CFI = .99; TLI = .98; RMSEA = .08 (.06, .10). Item 5 was removed because of unreliability and cross-loading. Graded response models showed appropriate fit for the two-parameter logistic model in both the physical and the mental dimensions. Item Information Curves and Test Information Functions indicated that the SF-8 was more informative for low levels of health. The Spanish SF-8 has adequate psychometric properties, being better represented by two dimensions, once Item 5 is removed. Gathering evidence on patient-reported outcome measures is of crucial importance, as this type of measurement instrument is increasingly used in the clinical arena.

  15. Erratum: "FUSE and STIS Observations of the Warm-hot Intergalactic Medium toward PG 1259+593" (ApJS, 153, 165 [2004])

    NASA Astrophysics Data System (ADS)

    Richter, Philipp; Savage, Blair D.; Tripp, Todd M.; Sembach, Kenneth R.

    2004-12-01

    There was a minor error in the form of equation (4) in the original paper; the first bracketed term on the right-hand side is missing a -1. The correct equation is: ΔX = 0.5{[(1+z_max)^2 - 1] - [(1+z_min)^2 - 1]}. (4) Another error also occurred in the calculation of Ω_b(BL) in the last paragraph of § 3.5 (p. 198). The correct limit is Ω_b(BL) ≤ 0.0035 h_75^-1 [instead of Ω_b(BL) ≤ 0.0031 h_75^-1]. Note the wrong value is cited a second time in list item 5 of the Summary (§ 5; p. 204).
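
    The corrected equation (4) can be evaluated directly. The sketch below reads it as half the difference of the two bracketed terms, which is an assumption about grouping that the extracted text does not show explicitly; the function names and the redshift values are hypothetical and are not taken from the paper.

```python
def absorption_distance(z: float) -> float:
    """X(z) = 0.5 * [(1 + z)^2 - 1], one bracketed term of equation (4)."""
    return 0.5 * ((1.0 + z) ** 2 - 1.0)


def delta_x(z_min: float, z_max: float) -> float:
    """Delta X = X(z_max) - X(z_min), the assumed reading of equation (4)."""
    if z_max < z_min:
        raise ValueError("z_max must not be smaller than z_min")
    return absorption_distance(z_max) - absorption_distance(z_min)


# Illustrative redshift range only (not values from the erratum).
print(delta_x(0.0, 0.5))  # 0.625
```

    Dropping the -1 from only the first bracketed term, as in the originally printed form, would shift the result by a constant 0.5 under this reading.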

  16. Mutagenic activity of south Indian food items.

    PubMed

    Sivaswamy, S N; Balachandran, B; Balanehru, S; Sivaramakrishnan, V M

    1991-08-01

    Dietary components and food dishes commonly consumed in South India were screened for their mutagenic activity. Kesari powder, calamus oil, palm drink, toddy and Kewra essence were found to be strongly mutagenic; garlic, palm oil, arrack, onion and pyrolysed portions of bread toast, chicory powder were weakly mutagenic, while tamarind and turmeric were not. Certain salted, sundried and oil fried food items were also mutagenic. Cissus quadrangularis was mutagenic, while 'decoctions' of cumin seeds, aniseeds and ginger were not. Several perfumes, essential oils and colouring agents, which are commonly used were also screened and many of them exhibited their mutagenic potential by inducing the 'reverse mutation' in Salmonella typhimurium tester strains.

  17. The big five personality traits: psychological entities or statistical constructs?

    PubMed

    Franić, Sanja; Borsboom, Denny; Dolan, Conor V; Boomsma, Dorret I

    2014-11-01

    The present study employed multivariate genetic item-level analyses to examine the ontology and the genetic and environmental etiology of the Big Five personality dimensions, as measured by the NEO Five Factor Inventory (NEO-FFI) [Costa and McCrae, Revised NEO personality inventory (NEO PI-R) and NEO five-factor inventory (NEO-FFI) professional manual, 1992; Hoekstra et al., NEO personality questionnaires NEO-PI-R, NEO-FFI: manual, 1996]. Common and independent pathway model comparison was used to test whether the five personality dimensions fully mediate the genetic and environmental effects on the items, as would be expected under the realist interpretation of the Big Five. In addition, the dimensionalities of the latent genetic and environmental structures were examined. Item scores of a population-based sample of 7,900 adult twins (including 2,805 complete twin pairs; 1,528 MZ and 1,277 DZ) on the Dutch version of the NEO-FFI were analyzed. Although both the genetic and the environmental covariance components display a 5-factor structure, applications of common and independent pathway modeling showed that they do not comply with the collinearity constraints entailed in the common pathway model. Implications for the substantive interpretation of the Big Five are discussed.

  18. Importance Has Been Considered in Satisfaction Evaluation: An Experimental Examination of Locke's Range-of-Affect Hypothesis

    ERIC Educational Resources Information Center

    Wu, Chia-huei; Yao, Grace

    2007-01-01

    Importance weighting is a common practice in quality of life (QOL) measurement research. Based on the widespread idea that important domains should make a greater contribution to individuals' QOL total score, the weighting procedure of multiplying item satisfaction by an item's importance has been adopted in many QOL instruments. Locke's [1969,…
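
    The multiplicative weighting procedure mentioned in this abstract can be illustrated with a minimal sketch. The function name, the rating scales, and the sample ratings below are hypothetical and are not taken from the study; the sketch only shows what multiplying each item's satisfaction by its importance does to a total score.

```python
def importance_weighted_score(satisfaction, importance):
    """Return the sum over items of satisfaction * importance."""
    if len(satisfaction) != len(importance):
        raise ValueError("satisfaction and importance must have the same length")
    return sum(s * w for s, w in zip(satisfaction, importance))


# Hypothetical ratings for three QOL items (illustrative values only).
satisfaction = [4, 2, 5]
importance = [5, 1, 3]

unweighted = sum(satisfaction)  # plain total of satisfaction ratings
weighted = importance_weighted_score(satisfaction, importance)
print(unweighted, weighted)
```

    In this toy case the item rated least important contributes little to the weighted total, which is the intuition behind the weighting practice that the abstract goes on to examine.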

  19. Confidence Bounds and Power for the Reliability of Observational Measures on the Quality of a Social Setting

    ERIC Educational Resources Information Center

    Shin, Yongyun; Raudenbush, Stephen W.

    2012-01-01

    Social scientists are frequently interested in assessing the qualities of social settings such as classrooms, schools, neighborhoods, or day care centers. The most common procedure requires observers to rate social interactions within these settings on multiple items and then to combine the item responses to obtain a summary measure of setting…

  20. Reduced OSM for Long Duration Targets: Individuation or Items Loaded into VSTM?

    ERIC Educational Resources Information Center

    Guest, Duncan; Gellatly, Angus; Pilling, Michael

    2012-01-01

    Typical studies of object substitution masking (OSM) employ a briefly presented search array. The target item is indicated by a cue/mask that surrounds but does not overlap the target and, compared to a common offset control condition, report of the target is reduced when the mask remains present after target offset. Given how little observers are…
